Explain agent types better in Guided tour docs (#1426)

albertvillanova · web-flow · commit 757e334963d9 · 2025-06-10T15:43:22.000+02:00
diff --git a/docs/source/en/guided_tour.mdx b/docs/source/en/guided_tour.mdx
@@ -4,6 +4,106 @@
 
 In this guided visit, you will learn how to build an agent, how to run it, and how to customize it to make it work better for your use-case.
 
+## Choosing an agent type: CodeAgent or ToolCallingAgent
+
+`smolagents` comes with two agent classes: [`CodeAgent`] and [`ToolCallingAgent`], which represent two different paradigms for how agents interact with tools.
+The key difference lies in how actions are specified and executed: code generation vs structured tool calling.
+
+- [`CodeAgent`] generates tool calls as Python code snippets.
+  - The code is executed either locally (potentially unsecure) or in a secure sandbox.
+  - Tools are exposed as Python functions (via bindings).
+  - Example of tool call:
+    ```py
+    result = search_docs("What is the capital of France?")
+    print(result)
+    ```
+  - Strengths:
+    - Highly expressive: Allows for complex logic and control flow and can combine tools, loop, transform, reason.
+    - Flexible: No need to predefine every possible action, can dynamically generate new actions/tools.
+    - Emergent reasoning: Ideal for multi-step problems or dynamic logic.
+  - Limitations
+    - Risk of errors: Must handle syntax errors, exceptions.
+    - Less predictable: More prone to unexpected or unsafe outputs.
+    - Requires secure execution environment.
+
+- [`ToolCallingAgent`] writes tool calls as structured JSON.
+  - This is the common format used in many frameworks (OpenAI API), allowing for structured tool interactions without code execution.
+  - Tools are defined with a JSON schema: name, description, parameter types, etc.
+  - Example of tool call:
+    ```json
+    {
+      "tool_call": {
+        "name": "search_docs",
+        "arguments": {
+          "query": "What is the capital of France?"
+        }
+      }
+    }
+    ```
+  - Strengths:
+    - Reliable: Less prone to hallucination, outputs are structured and validated.
+    - Safe: Arguments are strictly validated, no risk of arbitrary code running.
+    - Interoperable: Easy to map to external APIs or services.
+  - Limitations:
+    - Low expressivity: Can't easily combine or transform results dynamically, or perform complex logic or control flow.
+    - Inflexible: Must define all possible actions in advance, limited to predefined tools.
+    - No code synthesis: Limited to tool capabilities.
+
+When to use which agent type:
+- Use [`CodeAgent`] when:
+  - You need reasoning, chaining, or dynamic composition.
+  - Tools are functions that can be combined (e.g., parsing + math + querying).
+  - Your agent is a problem solver or programmer.
+
+- Use [`ToolCallingAgent`] when:
+  - You have simple, atomic tools (e.g., call an API, fetch a document).
+  - You want high reliability and clear validation.
+  - Your agent is like a dispatcher or controller.
+
+## CodeAgent
+
+[`CodeAgent`] generates Python code snippets to perform actions and solve tasks.
+
+By default, the Python code execution is done in your local environment.
+This should be safe because the only functions that can be called are the tools you provided (especially if it's only tools by Hugging Face) and a set of predefined safe functions like `print` or functions from the `math` module, so you're already limited in what can be executed.
+
+The Python interpreter also doesn't allow imports by default outside of a safe list, so all the most obvious attacks shouldn't be an issue.
+You can authorize additional imports by passing the authorized modules as a list of strings in argument `additional_authorized_imports` upon initialization of your [`CodeAgent`]:
+
+```py
+model = InferenceClientModel()
+agent = CodeAgent(tools=[], model=model, additional_authorized_imports=['requests', 'bs4'])
+agent.run("Could you get me the title of the page at url 'https://huggingface.co/blog'?")
+```
+
+Additionally, as an extra security layer, access to submodule is forbidden by default, unless explicitly authorized within the import list.
+For instance, to access the `numpy.random` submodule, you need to add `'numpy.random'` to the `additional_authorized_imports` list.
+This could also be authorized by using `numpy.*`, which will allow `numpy` as well as any subpackage like `numpy.random` and its own subpackages.
+
+> [!WARNING]
+> The LLM can generate arbitrary code that will then be executed: do not add any unsafe imports!
+
+The execution will stop at any code trying to perform an illegal operation or if there is a regular Python error with the code generated by the agent.
+
+You can also use [E2B code executor](https://e2b.dev/docs#what-is-e2-b) or Docker instead of a local Python interpreter. For E2B, first [set the `E2B_API_KEY` environment variable](https://e2b.dev/dashboard?tab=keys) and then pass `executor_type="e2b"` upon agent initialization. For Docker, pass `executor_type="docker"` during initialization.
+
+
+> [!TIP]
+> Learn more about code execution [in this tutorial](tutorials/secure_code_execution).
+
+### ToolCallingAgent
+
+[`ToolCallingAgent`] outputs JSON tool calls, which is the common format used in many frameworks (OpenAI API), allowing for structured tool interactions without code execution.
+
+It works much in the same way like [`CodeAgent`], of course without `additional_authorized_imports` since it doesn't execute code:
+
+```py
+from smolagents import ToolCallingAgent
+
+agent = ToolCallingAgent(tools=[], model=model)
+agent.run("Could you get me the title of the page at url 'https://huggingface.co/blog'?")
+```
+
 ## Building your agent
 
 To initialize a minimal agent, you need at least these two arguments:
@@ -260,46 +360,6 @@ This validation mechanism enables:
 - Implementing domain-specific validation rules
 - Creating more robust agents that validate their own outputs
 
-## CodeAgent and ToolCallingAgent
-
-`smolagents` comes with two agent classes: [`CodeAgent`] and [`ToolCallingAgent`]. `CodeAgent` is the default and writes Python code snippets that are then executed, while `ToolCallingAgent` outputs JSON tool calls. Both share the same interface so you can pick whichever style you prefer.
-
-By default, the execution is done in your local environment.
-This should be safe because the only functions that can be called are the tools you provided (especially if it's only tools by Hugging Face) and a set of predefined safe functions like `print` or functions from the `math` module, so you're already limited in what can be executed.
-
-The Python interpreter also doesn't allow imports by default outside of a safe list, so all the most obvious attacks shouldn't be an issue.
-You can authorize additional imports by passing the authorized modules as a list of strings in argument `additional_authorized_imports` upon initialization of your [`CodeAgent`]:
-
-```py
-model = InferenceClientModel()
-agent = CodeAgent(tools=[], model=model, additional_authorized_imports=['requests', 'bs4'])
-agent.run("Could you get me the title of the page at url 'https://huggingface.co/blog'?")
-```
-
-Additionally, as an extra security layer, access to submodule is forbidden by default, unless explicitly authorized within the import list.
-For instance, to access the `numpy.random` submodule, you need to add `'numpy.random'` to the `additional_authorized_imports` list.
-This could also be authorized by using `numpy.*`, which will allow `numpy` as well as any subpackage like `numpy.random` and its own subpackages.
-
-> [!WARNING]
-> The LLM can generate arbitrary code that will then be executed: do not add any unsafe imports!
-
-The execution will stop at any code trying to perform an illegal operation or if there is a regular Python error with the code generated by the agent.
-
-You can also use [E2B code executor](https://e2b.dev/docs#what-is-e2-b) or Docker instead of a local Python interpreter. For E2B, first [set the `E2B_API_KEY` environment variable](https://e2b.dev/dashboard?tab=keys) and then pass `executor_type="e2b"` upon agent initialization. For Docker, pass `executor_type="docker"` during initialization.
-
-
-> [!TIP]
-> Learn more about code execution [in this tutorial](tutorials/secure_code_execution).
-
-We also support the widely-used way of writing actions as JSON-like blobs: this is [`ToolCallingAgent`], it works much in the same way like [`CodeAgent`], of course without `additional_authorized_imports` since it doesn't execute code:
-
-```py
-from smolagents import ToolCallingAgent
-
-agent = ToolCallingAgent(tools=[], model=model)
-agent.run("Could you get me the title of the page at url 'https://huggingface.co/blog'?")
-```
-
 ## Inspecting an agent run
 
 Here are a few useful attributes to inspect what happened after a run: