
Conversation

@chaosyaan
Contributor

Related Issues or Context

  • Incorrect Token Counting for Gemini

The code accumulates completion_tokens for every chunk when processing Gemini's candidates_token_count:

if chunk.usage_metadata:
    completion_tokens += (
        chunk.usage_metadata.candidates_token_count or 0
    )

However, Gemini's candidates_token_count is a running total that already increases with each chunk. Accumulating it again double-counts tokens, causing the reported completion token count to explode.
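A minimal sketch of the corrected logic. Since Gemini reports candidates_token_count as a running total, the counter should be overwritten with the latest value rather than incremented; the dataclasses below are stand-ins for the SDK's streaming chunk objects, not the real types.

```python
from dataclasses import dataclass
from typing import Optional

# Stand-ins for the SDK's streaming chunk objects (hypothetical names;
# the real types come from the Gemini SDK).
@dataclass
class UsageMetadata:
    candidates_token_count: Optional[int]

@dataclass
class Chunk:
    usage_metadata: Optional[UsageMetadata]

def count_completion_tokens(chunks) -> int:
    """Gemini reports a cumulative candidates_token_count per chunk,
    so keep the latest value instead of summing across chunks."""
    completion_tokens = 0
    for chunk in chunks:
        if chunk.usage_metadata:
            completion_tokens = (
                chunk.usage_metadata.candidates_token_count or completion_tokens
            )
    return completion_tokens

# Cumulative totals 10 -> 25 -> 40 across three chunks:
stream = [
    Chunk(UsageMetadata(10)),
    Chunk(UsageMetadata(25)),
    Chunk(UsageMetadata(40)),
]
print(count_completion_tokens(stream))  # 40, not the buggy sum of 75
```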

This PR contains Changes to Non-Plugin

  • Documentation
  • Other

This PR contains Changes to Non-LLM Models Plugin

  • I have Run Comprehensive Tests Relevant to My Changes

This PR contains Changes to LLM Models Plugin

  • My Changes Affect Message Flow Handling (System Messages and User→Assistant Turn-Taking)
  • My Changes Affect Tool Interaction Flow (Multi-Round Usage and Output Handling, for both Agent App and Agent Node)
  • My Changes Affect Multimodal Input Handling (Images, PDFs, Audio, Video, etc.)
  • My Changes Affect Multimodal Output Generation (Images, Audio, Video, etc.)
  • My Changes Affect Structured Output Format (JSON, XML, etc.)
  • My Changes Affect Token Consumption Metrics
  • My Changes Affect Other LLM Functionalities (Reasoning Process, Grounding, Prompt Caching, etc.)
  • Other Changes (Add New Models, Fix Model Parameters etc.)

Version Control (Any Changes to the Plugin Will Require Bumping the Version)

  • I have Bumped Up the Version in Manifest.yaml (Top-Level Version Field, Not in Meta Section)

Dify Plugin SDK Version

  • I have Ensured dify_plugin>=0.3.0,<0.5.0 is in requirements.txt (SDK docs)

Environment Verification (If Any Code Changes)

Local Deployment Environment

  • Dify Version is: , I have Tested My Changes on Local Deployment Dify with a Clean Environment That Matches the Production Configuration.

SaaS Environment

  • I have Tested My Changes on cloud.dify.ai with a Clean Environment That Matches the Production Configuration

@fdb02983rhy
Contributor

fdb02983rhy commented Jul 27, 2025

Please fill in the template and bump the plugin version.
It would also help to provide evidence, like the test example below, showing that the token count in Dify matches the one in GCP.

@chaosyaan
Contributor Author

chaosyaan commented Jul 28, 2025

gemini plugin version: 0.2.9
prompt: "Output arbitrary questions, but make sure the result is 1000 tokens."

  • official plugin, gemini-2.5-flash and 0 thinking budget: [screenshot]
  • modified local plugin, gemini-2.5-flash and 128 thinking budget: [screenshot]

@fdb02983rhy
Contributor

Could you compare them with the results on your GCP console? The LLM itself has no reliable way to count words or tokens.

https://console.cloud.google.com/apis/api/generativelanguage.googleapis.com


@chaosyaan
Contributor Author

I can't find any token data on that page. It may be on the billing page, but I can't access that with my personal account, and it isn't convenient to check with the company account.

I can show the results of direct API calls using the same prompt; the token count is a reasonable ~1000. The problematic version appears to be summing all of the candidates_token_count values.
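To make the failure mode concrete, here is a toy illustration (the numbers are invented, not taken from the screenshots): when each chunk carries a cumulative total, summing the per-chunk values inflates the count, while taking the last value gives the true total.

```python
# Hypothetical running totals emitted across a 4-chunk stream whose
# true completion length is 1000 tokens.
running_totals = [250, 500, 750, 1000]

buggy_count = sum(running_totals)   # the accumulation bug: 2500
fixed_count = running_totals[-1]    # latest cumulative value: 1000

print(buggy_count, fixed_count)  # 2500 1000
```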

@QIN2DIM mentioned this pull request Aug 12, 2025
