Fix LLM callback isolation without serializing requests #4252
VedantMadane wants to merge 3 commits into crewAIInc:main from
Conversation
Not covered in this PR description:
If you prefer, I can add a follow-up commit that documents these options or adds a concurrency-focused test.
# Conflicts: # lib/crewai/src/crewai/llm.py
31fdc55 to 35483b6
Cursor Bugbot has reviewed your changes and found 1 potential issue.
```
call_id=get_current_call_id(),
    ),
)
raise
```
set_callbacks is now dead code, never called
Low Severity
The set_callbacks static method mutates LiteLLM's global callback lists, which is exactly the pattern this PR removes. Its only call site (self.set_callbacks(callbacks or []) in __init__) was deleted, and a grep confirms no other callers exist anywhere in the codebase or tests. Leaving it around risks someone re-introducing the global-mutation pattern unintentionally.
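To make the risk concrete, here is a minimal, self-contained sketch of the global-mutation pattern that `set_callbacks` embodies. The names (`GLOBAL_CALLBACKS`, the simplified `set_callbacks`) are stand-ins for illustration, not the actual crewai or LiteLLM code; the point is only that module-level callback state lets one request's configuration clobber another's:

```python
# Hypothetical sketch of the racy pattern: GLOBAL_CALLBACKS stands in for
# LiteLLM's module-level callback lists; this is NOT the real implementation.
GLOBAL_CALLBACKS: list = []

def set_callbacks(callbacks: list) -> None:
    # The removed pattern: replace shared global state in place.
    GLOBAL_CALLBACKS.clear()
    GLOBAL_CALLBACKS.extend(callbacks)

# Interleave two "requests": B installs its handler before A's call runs.
set_callbacks(["cb_a"])   # request A configures its callback
set_callbacks(["cb_b"])   # request B overwrites the global state
# Whatever is global at call time is what fires, so A would get B's callback.
assert GLOBAL_CALLBACKS == ["cb_b"]
```

Deleting the dead method removes the last on-ramp back into this pattern.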


This is a follow-up to #4218 (auto-closed by bot) addressing the same race in LLM callback handling without holding a global lock across the network call.
What changed
Makes `test_llm_callback_replacement` deterministic by mocking `litellm.completion` (removes sleep/heisenbug).
Why
The approach in #4218 used a class-level lock held across the entire LLM request, which can serialize all concurrent agent calls. This PR keeps concurrency while still ensuring callback isolation.
Fixes #4214.
Note
Medium Risk
Touches the core `LLM.call`/`LLM.acall` execution path by changing how callbacks are wired into LiteLLM, which could affect observability/token-usage integrations and streaming/non-streaming parity under concurrency.
Overview
Stops mutating LiteLLM's global callback state for per-request handlers, and instead computes `effective_callbacks` and passes them via `params["callbacks"]` on each `completion`/`acompletion` call to avoid cross-request races. Updates tests to make callback isolation deterministic: rewrites `test_llm_callback_replacement` to mock `litellm.completion` (removing the sleep-based flakiness) and adds a new threaded concurrency test asserting that each request receives only its own callback and that token usage is not mixed.
Written by Cursor Bugbot for commit 8dfc1f4.
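The threaded isolation test described above can be sketched roughly as follows. This is an illustrative reconstruction under stated assumptions, not the PR's actual test: `fake_completion` stands in for the mocked `litellm.completion`, and `llm_call` stands in for the per-request wiring in `LLM.call`:

```python
import threading

def fake_completion(**params):
    # Stand-in for the mocked litellm.completion: report which callbacks
    # this particular call received in its parameters.
    return {"callbacks": params.get("callbacks", [])}

def llm_call(my_callback):
    # Per-request wiring (illustrative): effective callbacks are computed
    # for this call and passed via params, never written to global state.
    effective_callbacks = [my_callback]
    return fake_completion(model="some-model", callbacks=effective_callbacks)

results = {}

def worker(name: str) -> None:
    # Each thread plays one concurrent agent request with its own callback.
    results[name] = llm_call(f"cb_{name}")

threads = [threading.Thread(target=worker, args=(f"t{i}",)) for i in range(8)]
for t in threads:
    t.start()
for t in threads:
    t.join()

# Isolation: every request saw exactly its own callback and no one else's.
for name, res in results.items():
    assert res["callbacks"] == [f"cb_{name}"]
```

With the old global-mutation wiring, the final assertions would fail intermittently because a concurrently started request could overwrite the shared callback list before this one fired.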