Update close tool + add output to agent result #1505

tkattkat · 2026-01-06T18:47:45Z

Why

Previously, an agent execution could end without a structured final response, either because it hit maxSteps or because the LLM broke out of its loop without calling the close tool. This made it difficult to:

- Reliably determine whether a task was completed successfully
- Extract structured data from the agent's execution

What Changed

Ensured Close Tool is Always Called

- Added a handleCloseToolCall utility that forces a close tool call via a separate generateText call when the main agent loop ends without explicitly closing
- Integrated via a new private ensureClosed method in v3AgentHandler.ts
- Works for both execute() and stream() modes
- Triggers when maxSteps is reached or the LLM stops (completes its task)
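
The control flow described in this list can be sketched as follows. This is a minimal, synchronous illustration and not the code in v3AgentHandler.ts: `AgentState`, `StepFn`, `ForceCloseFn`, and `runWithEnsuredClose` are hypothetical names, and the forced close is a separate asynchronous generateText call in the PR, where this sketch simply calls an injected function.

```typescript
// Hypothetical shapes for illustration; the real types live in the Stagehand codebase.
interface CloseResult {
  reasoning: string;
  taskComplete: boolean;
}

interface AgentState {
  stepsTaken: number;
  closeResult?: CloseResult; // set only once a close call has been recorded
}

// One step of the main loop. Returns a CloseResult if the model closed explicitly,
// "stop" if the model stopped without closing, or "continue" to keep going.
type StepFn = (step: number) => CloseResult | "stop" | "continue";

// Stand-in for the separate, close-only model call (asynchronous in practice).
type ForceCloseFn = () => CloseResult;

function runWithEnsuredClose(
  step: StepFn,
  forceClose: ForceCloseFn,
  maxSteps: number,
): AgentState {
  const state: AgentState = { stepsTaken: 0 };

  for (let i = 0; i < maxSteps; i++) {
    state.stepsTaken = i + 1;
    const outcome = step(i);
    if (outcome === "continue") continue;
    if (outcome !== "stop") state.closeResult = outcome; // explicit close
    break; // the model either closed or stopped on its own
  }

  // Assumed equivalent of ensureClosed: force a close only if none was recorded.
  if (state.closeResult === undefined) {
    state.closeResult = forceClose();
  }
  return state;
}
```

Whichever way the loop ends (explicit close, early stop, or maxSteps), the returned state carries a close result, which is the guarantee this PR is after.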

Added Output Schema Support (Experimental)

- Users can now pass a Zod schema to agent.execute({ output: z.object({...}) }) to return structured data at the end of execution
- The schema dynamically extends the close tool's input schema
- Extracted data is returned in result.output
- Added validation:
  - CUA mode: Throws StagehandInvalidArgumentError (not supported)
  - Non-CUA without experimental: true: Throws ExperimentalNotConfiguredError
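
The two validation rules above amount to a small guard. The sketch below is a hedged illustration only: the error classes are local stand-ins that reuse the names from this PR, and `OutputValidationInput` and `validateOutputOption` are hypothetical names, not Stagehand APIs. It also assumes the CUA check takes precedence over the experimental check.

```typescript
// Local stand-ins for the error classes named above; the real ones come from Stagehand.
class StagehandInvalidArgumentError extends Error {
  constructor(message: string) {
    super(message);
    this.name = "StagehandInvalidArgumentError";
  }
}

class ExperimentalNotConfiguredError extends Error {
  constructor(message: string) {
    super(message);
    this.name = "ExperimentalNotConfiguredError";
  }
}

// Hypothetical input shape for this sketch.
interface OutputValidationInput {
  isCua: boolean; // the agent is running in CUA mode
  experimental: boolean; // experimental: true was configured
  hasOutputSchema: boolean; // the caller passed an `output` schema
}

function validateOutputOption(input: OutputValidationInput): void {
  if (!input.hasOutputSchema) return; // nothing to validate
  if (input.isCua) {
    throw new StagehandInvalidArgumentError("output is not supported in CUA mode");
  }
  if (!input.experimental) {
    throw new ExperimentalNotConfiguredError("output requires experimental: true");
  }
}
```

Calls that do not pass an output schema are unaffected, so existing callers see no change in behavior.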

Example Usage

const result = await agent.execute({
  instruction: "search for a shampoo on amazon and click into one of the results",
  maxSteps: 20,
  output: z.object({
    productName: z.string().describe("The name of the shampoo product"),
    price: z.string().describe("The price of the product"),
    rating: z.string().describe("The star rating of the product"),
  }),
});

console.log(result.output);
// { productName: "...", price: "$12.99", rating: "4.5 out of 5 stars" }
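
A caller will probably want to guard against a run that did not succeed before reading the extracted fields. The sketch below assumes a result shape with `success`, `message`, and an optional `output`, based on the fields named in this PR; `AgentResultLike`, `ProductOutput`, and `describeProduct` are hypothetical names used only for illustration.

```typescript
// Hypothetical shapes; field names follow the example above and the result fields in this PR.
interface ProductOutput {
  productName: string;
  price: string;
  rating: string;
}

interface AgentResultLike<T> {
  success: boolean;
  message: string;
  output?: T; // assumed to be absent when no output schema was passed
}

function describeProduct(result: AgentResultLike<ProductOutput>): string {
  // Fall back to the agent's message when the run failed or produced no output.
  if (!result.success || result.output === undefined) {
    return `No product extracted: ${result.message}`;
  }
  const { productName, price, rating } = result.output;
  return `${productName} (${price}, ${rating})`;
}
```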

Test Plan

- Verify close tool is called when agent naturally completes (no change in behavior)
- Verify close tool is forced when maxSteps is reached
- Verify output schema extracts data correctly in execute() mode
- Verify output schema extracts data correctly in stream() mode
- Verify output throws StagehandInvalidArgumentError when used with CUA mode
- Verify output throws ExperimentalNotConfiguredError when used without experimental: true

Summary by cubic

Ensures every agent run ends with a structured final response and adds optional structured output via a Zod schema. Improves reliability by always setting completion status and final reasoning.

New Features
- Always triggers a "close" tool call at the end of a run (LLM stops or maxSteps), for both execute() and stream().
- Optional output schema: pass output: z.object({...}) to return typed data in result.output.
- Validation: output schema is not supported in CUA (throws StagehandInvalidArgumentError). In non-CUA, requires experimental: true (throws ExperimentalNotConfiguredError otherwise).
- Removed "close" from the main tool list and system prompt; closing is handled automatically post-run.

Written for commit dfb703a. Summary will update on new commits.

changeset-bot · 2026-01-06T18:47:49Z

🦋 Changeset detected

Latest commit: dfb703a

The changes in this PR will be included in the next version bump.

This PR includes changesets to release 3 packages

| Name | Type |
| --- | --- |
| @browserbasehq/stagehand | Patch |
| @browserbasehq/stagehand-evals | Patch |
| @browserbasehq/stagehand-server | Patch |

cubic-dev-ai

No issues found across 8 files

greptile-apps · 2026-01-06T18:55:34Z

Greptile Summary

- Ensures agent executions always end with a structured close tool call and adds optional Zod schema-based output extraction to improve reliability and data extraction capabilities
- Removes close tool from the manual agent toolkit and automatically handles closing via handleCloseToolCall utility when agents complete or reach maxSteps
- Adds experimental output schema feature allowing users to extract structured data from agent results via result.output with proper validation for CUA mode restrictions

Important Files Changed

| Filename | Overview |
| --- | --- |
| `packages/core/lib/v3/agent/utils/handleCloseToolCall.ts` | New utility that forces close tool calls via separate LLM inference to ensure structured agent completion |
| `packages/core/lib/v3/handlers/v3AgentHandler.ts` | Integrates forced close handling and output schema support into agent execution flow |
| `packages/core/lib/v3/types/public/agent.ts` | Adds `output` field to agent options and result types for structured data extraction |

Confidence score: 4/5

- This PR requires careful review due to significant architectural changes in agent completion handling
- Score reflects potential for agent behavior changes and the introduction of experimental features that alter the fundamental execution flow
- Pay close attention to handleCloseToolCall.ts and the close handling logic in v3AgentHandler.ts to ensure reliable forced closing

Sequence Diagram

sequenceDiagram
    participant User
    participant AgentHandler as V3AgentHandler
    participant LLMClient
    participant CloseHandler as handleCloseToolCall
    participant Model as LanguageModel
    
    User->>AgentHandler: "execute(instruction, options)"
    AgentHandler->>AgentHandler: "prepareAgent()"
    AgentHandler->>LLMClient: "generateText(systemPrompt, messages, tools)"
    
    loop Agent Steps (up to maxSteps)
        LLMClient->>Model: "Generate next step"
        Model-->>LLMClient: "Tool calls + reasoning"
        LLMClient->>AgentHandler: "onStepFinish(toolCalls, results)"
        AgentHandler->>AgentHandler: "Process tool results and update state"
        
        alt Tool call is "close"
            AgentHandler->>AgentHandler: "Mark state.completed = true"
        end
    end
    
    LLMClient-->>AgentHandler: "Generation result"
    
    alt state.completed == false
        Note over AgentHandler,CloseHandler: Force close tool call
        AgentHandler->>CloseHandler: "handleCloseToolCall(model, messages, instruction, outputSchema)"
        CloseHandler->>Model: "generateText() with close tool only"
        Model-->>CloseHandler: "Close tool call with reasoning + output"
        CloseHandler-->>AgentHandler: "closeResult(reasoning, taskComplete, output)"
        AgentHandler->>AgentHandler: "Update state with close result"
    end
    
    AgentHandler->>AgentHandler: "consolidateMetricsAndResult()"
    AgentHandler-->>User: "AgentResult(success, message, actions, output)"
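
The diagram passes an outputSchema into handleCloseToolCall, and the PR description says the schema dynamically extends the close tool's input schema. One way to picture that extension is sketched below. To stay dependency-free, plain JSON-Schema-style objects stand in for Zod schemas; `ObjectSchema`, `baseCloseSchema`, and `buildCloseSchema` are hypothetical names, and the base fields are assumed from the CloseResult shape discussed in this PR.

```typescript
// Minimal JSON-Schema-style object type, used here in place of a Zod schema.
interface ObjectSchema {
  type: "object";
  properties: Record<string, unknown>;
  required: string[];
}

// Assumed base fields of the close tool, taken from the CloseResult shape in this PR.
const baseCloseSchema: ObjectSchema = {
  type: "object",
  properties: {
    reasoning: { type: "string" },
    taskComplete: { type: "boolean" },
  },
  required: ["reasoning", "taskComplete"],
};

// Returns the close tool's input schema, extended with `output` when a schema is given.
function buildCloseSchema(outputSchema?: ObjectSchema): ObjectSchema {
  if (outputSchema === undefined) return baseCloseSchema;
  return {
    type: "object",
    // Copy the base fields so the shared base schema is never mutated.
    properties: { ...baseCloseSchema.properties, output: outputSchema },
    required: [...baseCloseSchema.required, "output"],
  };
}
```

Because the extended schema is built per call, runs without an output schema keep using the unmodified base schema.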

pirate · 2026-01-07T01:16:03Z

packages/core/lib/v3/agent/prompts/agentSystemPrompt.ts

    { name: "wait", description: "Wait for a specified time" },
    { name: "navback", description: "Navigate back in browser history" },
    { name: "scroll", description: "Scroll the page x pixels up or down" },
-    { name: "close", description: "Mark the task as complete or failed" },


Suggested change

{ name: "close", description: "Mark the task as complete or failed" },

{ name: "close", description: "Mark the task as complete or failed" }, // TODO: consider renaming this tool to "done"

pirate · 2026-01-07T01:18:13Z

packages/core/lib/v3/agent/utils/handleCloseToolCall.ts

+import { StagehandZodObject } from "../../zodCompat";
+interface CloseResult {
+  reasoning: string;
+  taskComplete: boolean;


I think success: true | false may be better

packages/core/lib/v3/tests/agent-hybrid-mode.spec.ts

Co-authored-by: Nick Sweeting <[email protected]>

pirate · 2026-01-07T01:20:38Z

packages/core/lib/v3/agent/utils/handleCloseToolCall.ts

+  reasoning: string;
+  taskComplete: boolean;
+  messages: ModelMessage[];
+  output?: Record<string, unknown>;


I recommend making output required; LLMs are really good at inferring what the ideal output should be for a task.

E.g. if the user is researching something, it often nails it and puts the exact data they were looking for in output.

tkattkat added 4 commits January 5, 2026 14:06

update close tool

7a4ffc7

add structured output

060dd41

format + update comments

1c10b11

experimental + changeset

2db6f84

Merge remote-tracking branch 'origin/main' into update-close-tool

a0cdf3b

tkattkat changed the title ~~Update close tool~~ Update close tool + add output to agent result Jan 6, 2026

cubic-dev-ai bot reviewed Jan 6, 2026

tkattkat added 3 commits January 6, 2026 10:57

update public types test

1212046

remove unused import

050b846

update specs

8555ada

pirate reviewed Jan 7, 2026

packages/core/lib/v3/tests/agent-hybrid-mode.spec.ts (outdated, resolved)

Update packages/core/lib/v3/tests/agent-hybrid-mode.spec.ts

d0082c6

Co-authored-by: Nick Sweeting <[email protected]>

pirate reviewed Jan 7, 2026

pirate approved these changes Jan 7, 2026

update message handling

dfb703a

pirate approved these changes Jan 8, 2026

tkattkat merged commit 6fbf5fc into main Jan 8, 2026
19 checks passed

github-actions bot mentioned this pull request Jan 8, 2026

Version Packages #1479

Open

tkattkat mentioned this pull request Jan 8, 2026

Stagehand.agent.execute() returns typed object output #1494

Closed

3 tasks

This was referenced Jan 9, 2026

Version Packages SValanukonda/stagehand#1

Open

Version Packages CloudEngineHub/stagehand#1

Open

Version Packages MasterReb00t/stagehand#1

Open

Version Packages filip-michalsky/stagehand#2

Closed
