fix(embedder): replace text-embedding-004 with gemini-embedding-001 #98
Conversation
text-embedding-004 is not the correct Gemini embedding model to use. Replace all references with gemini-embedding-001 throughout the codebase.

- embedder.go: update the default Gemini fallback model from text-embedding-004 to gemini-embedding-001
- inferEmbeddingDimensions: remove the text-embedding-004/005 case; the gemini-embedding-001 → 3072 entry already handles the correct model
- isLikelyGeminiEmbeddingModel: remove the redundant text-embedding-004 check; gemini-embedding-001 already matches via strings.Contains(m, "gemini")
- DOCS/examples (single-repo, multi-repo): update model and dimensions (768 → 3072 to match the gemini-embedding-001 output size)
- cmd/simili-web/README.md: update the example config snippet
- README.md: update the default model reference and dimension table

Signed-off-by: Kavirubc <hapuarachchikaviru@gmail.com>
Sequence Diagram(s): Skipped — changes are focused on configuration/defaults, detection logic, tests, and a Qdrant helper; no new multi-component runtime flow requiring visualization.

Estimated code review effort: 🎯 3 (Moderate) | ⏱️ ~20 minutes
🚥 Pre-merge checks: ✅ 2 passed | ❌ 1 failed (1 warning)
Simili Triage Report — Quality Score: 9.5/10 (Excellent)

Generated by Simili Bot
🧹 Nitpick comments (1)
internal/integrations/ai/embedder.go (1)
205-208: The narrowed detection for legacy Google embedding models is technically valid but has minimal practical impact.

The function correctly checks only for the `gemini` substring, meaning legacy Google models like `text-embedding-004` and `text-embedding-005` won't be detected. However, `text-embedding-004` was shut down on the Gemini API on January 14, 2026, and `text-embedding-005` is a Vertex AI-specific model, not a Gemini API model. If a Gemini provider is accidentally configured with these legacy models, the auto-correction to `gemini-embedding-001` is actually the desired migration path. Documenting this behavior change in migration materials remains a good practice for users transitioning from older models.

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@internal/integrations/ai/embedder.go` around lines 205 - 208, isLikelyGeminiEmbeddingModel currently only matches the substring "gemini", which misses legacy Google embedding model names; update the function isLikelyGeminiEmbeddingModel to also detect legacy tokens (e.g., "text-embedding-004" and "text-embedding-005") by normalizing the input (strings.ToLower/TrimSpace) and returning true if the model string contains "gemini" OR contains any of those legacy identifiers so the auto-migration to "gemini-embedding-001" still triggers for those older names.
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Nitpick comments:
In `@internal/integrations/ai/embedder.go`:
- Around line 205-208: isLikelyGeminiEmbeddingModel currently only matches the
substring "gemini", which misses legacy Google embedding model names; update the
function isLikelyGeminiEmbeddingModel to also detect legacy tokens (e.g.,
"text-embedding-004" and "text-embedding-005") by normalizing the input
(strings.ToLower/TrimSpace) and returning true if the model string contains
"gemini" OR contains any of those legacy identifiers so the auto-migration to
"gemini-embedding-001" still triggers for those older names.
ℹ️ Review info
⚙️ Run configuration
Configuration used: Organization UI
Review profile: CHILL
Plan: Pro
Run ID: e2efd6e5-65c7-4caa-a78d-87f9ffb44e16
📒 Files selected for processing (5)
- DOCS/examples/multi-repo/simili.yaml
- DOCS/examples/single-repo/simili.yaml
- README.md
- cmd/simili-web/README.md
- internal/integrations/ai/embedder.go
Pull request overview
Updates Simili’s Gemini embedding defaults to use the correct model and aligns documented/vector dimensions with the model’s 3072-d output.
Changes:
- Switch the Gemini default embedding model to `gemini-embedding-001`.
- Update embedding dimension inference to return 3072 for `gemini-embedding-001` and remove stale cases.
- Refresh docs/examples to reflect the new model name and 3072 dimensions.
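For reference, the documented example configuration after this change would look roughly like the following sketch (the exact key layout of the simili.yaml examples is assumed here, not confirmed from the diff):

```yaml
embedding:
  provider: gemini
  model: gemini-embedding-001   # was text-embedding-004
  dimensions: 3072              # was 768
```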
Reviewed changes
Copilot reviewed 5 out of 5 changed files in this pull request and generated 4 comments.
Show a summary per file
| File | Description |
|---|---|
| internal/integrations/ai/embedder.go | Updates Gemini default model and adjusts dimension inference / model-detection heuristics. |
| DOCS/examples/single-repo/simili.yaml | Updates example embedding model and dimensions to 3072. |
| DOCS/examples/multi-repo/simili.yaml | Updates example embedding model and dimensions to 3072. |
| cmd/simili-web/README.md | Updates config snippet to the new model/dimensions. |
| README.md | Updates default model reference and the model→dimensions mapping table. |
```diff
 if strings.TrimSpace(model) == "" || isLikelyOpenAIEmbeddingModel(model) {
-	model = "text-embedding-004"
+	model = "gemini-embedding-001"
 }
```
NewEmbedder will now leave an explicitly configured legacy Gemini model like text-embedding-004/text-embedding-005 unchanged when provider == ProviderGemini, which is likely to result in a runtime API error given this PR’s premise that those aren’t valid Gemini embedding models. Consider mapping those legacy names to gemini-embedding-001 (optionally with a warning) or failing fast with a clear error so misconfigurations are caught at startup rather than during embedding.
```go
	}
}
trimmedModel := strings.TrimSpace(model)
if trimmedModel == "text-embedding-004" || trimmedModel == "text-embedding-005" {
	return nil, fmt.Errorf(
		"model %q is not a valid Gemini embedding model; use %q instead or update configuration",
		trimmedModel, "gemini-embedding-001",
	)
}
model = trimmedModel
```
internal/integrations/ai/embedder.go (Outdated)
```diff
-case strings.Contains(m, "text-embedding-004"), strings.Contains(m, "text-embedding-005"):
-	return 768
 default:
 	return 768
```
inferEmbeddingDimensions now defaults Gemini to 768 for any model string other than gemini-embedding-001. If the model name is provided in a different but equivalent form (e.g., fully-qualified) or if additional Gemini embedding models are supported later, this will silently produce a wrong dimension and can break vector DB collection sizing. Consider broadening the match (or using the returned embedding length from a probe call) and/or changing the Gemini default to the current default model’s dimension (3072).
```diff
-	return 768
+	// Default to the current Gemini embedding model dimension (3072)
+	return 3072
```
internal/integrations/ai/embedder.go (Outdated)
```diff
 func isLikelyGeminiEmbeddingModel(model string) bool {
 	m := strings.ToLower(strings.TrimSpace(model))
-	return strings.Contains(m, "gemini") || strings.Contains(m, "text-embedding-004") || strings.Contains(m, "text-embedding-005")
+	return strings.Contains(m, "gemini")
```
isLikelyGeminiEmbeddingModel was narrowed to only match strings containing gemini. This removes the previous safeguard that auto-corrected legacy Gemini embedding model names (e.g., text-embedding-004/text-embedding-005) when provider == ProviderOpenAI, which can lead to sending invalid model names to OpenAI. Consider keeping legacy checks (even if only for migration) or adding explicit per-provider model validation with a clear startup error.
```diff
-	return strings.Contains(m, "gemini")
+	// Primary check: explicit Gemini model identifiers.
+	if strings.Contains(m, "gemini") {
+		return true
+	}
+	// Legacy Gemini embedding model identifiers that may still be used or misconfigured
+	// under other providers (e.g., ProviderOpenAI). Keep recognizing them here to allow
+	// migration logic to auto-correct and avoid sending invalid models to OpenAI.
+	switch m {
+	case "text-embedding-004", "text-embedding-005":
+		return true
+	}
+	return false
```
internal/integrations/ai/embedder.go (Outdated)
```diff
 if strings.TrimSpace(model) == "" || isLikelyOpenAIEmbeddingModel(model) {
-	model = "text-embedding-004"
+	model = "gemini-embedding-001"
 }
```
The default model/dimension selection logic changed here, but internal/integrations/ai has unit tests for related selection logic (provider_test.go, retry_test.go) while embedder.go currently has no coverage. Adding a small unit test around model fallback and dimension inference (Gemini default -> 3072; OpenAI default -> 1536; cross-provider mismatches) would help prevent regressions.
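A self-contained sketch of the coverage this comment asks for, with a hypothetical re-implementation of the dimension-inference helper so the example runs standalone (the real signature of `inferEmbeddingDimensions` and the exact model-to-size table in embedder.go are assumptions, not confirmed source):

```go
package main

import (
	"fmt"
	"strings"
)

// inferEmbeddingDimensions is a hypothetical sketch of the helper under review:
// map known model names to their vector sizes, and default unknown Gemini
// variants to the current default model's dimension (3072) per the suggestion.
func inferEmbeddingDimensions(model string) int {
	m := strings.ToLower(strings.TrimSpace(model))
	switch {
	case strings.Contains(m, "gemini-embedding-001"):
		return 3072
	case strings.Contains(m, "text-embedding-3-large"):
		return 3072
	case strings.Contains(m, "text-embedding-3-small"),
		strings.Contains(m, "text-embedding-ada-002"):
		return 1536
	case strings.Contains(m, "gemini"):
		// Unknown Gemini variant: assume the default model's size.
		return 3072
	default:
		// OpenAI-style fallback.
		return 1536
	}
}

func main() {
	// Table-driven spot checks, mirroring the cases the review comment names.
	cases := []struct {
		model string
		want  int
	}{
		{"gemini-embedding-001", 3072},
		{"text-embedding-3-small", 1536},
		{"models/gemini-embedding-exp", 3072}, // fully-qualified / unknown Gemini form
	}
	for _, c := range cases {
		got := inferEmbeddingDimensions(c.model)
		fmt.Printf("%-30s -> %d (want %d)\n", c.model, got, c.want)
	}
}
```

In the real package this would live in an embedder_test.go alongside provider_test.go, using `testing.T` rather than a `main` function.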
Copilot review fixes:

1. Fail fast with a clear error when a legacy Gemini model name (text-embedding-004 / text-embedding-005) is passed at startup, guiding users to gemini-embedding-001 immediately.
2. inferEmbeddingDimensions: default unknown Gemini models to 3072 (the current default model's dimension) instead of the old 768.
3. isLikelyGeminiEmbeddingModel: restore legacy name recognition for text-embedding-004/005 so they are never forwarded to OpenAI.
4. Add embedder_test.go with full coverage of dimension inference, Gemini model detection, and legacy model rejection.

Stale 768-dimension references replaced with 3072 across all files:

- internal/core/config/config.go (applyDefaults)
- .simili.yaml
- .github/workflows/e2e-test.yml (two inline config blocks)
- DOCS/0.0.2v/plan.md
- .claude/sessions/2026-02-02-0941-v0.0.2v-foundation.md

Signed-off-by: Kavirubc <hapuarachchikaviru@gmail.com>
Actionable comments posted: 1
🧹 Nitpick comments (1)
.github/workflows/e2e-test.yml (1)
108-108: Reduce config drift by centralizing embedding constants in this workflow.

Lines 108 and 208 are consistent now, but this model/dimension pair is duplicated in multiple heredocs. Consider templating these values once (e.g., placeholders + a single substitution step) so future model upgrades don’t miss one path.
Also applies to: 208-208
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In @.github/workflows/e2e-test.yml at line 108, Hardcoded embedding constants (e.g., the "dimensions: 3072" entries repeated in multiple heredocs) should be centralized: add workflow-level variables (e.g., EMBEDDING_DIMENSIONS and EMBEDDING_MODEL) in the job/env section and replace each literal "dimensions: 3072" and model string inside the heredocs with a placeholder that references those variables, then ensure the heredocs are rendered with those env vars (use GitHub Actions env interpolation or a single substitution step like envsubst) so future model/dimension updates only change the centralized EMBEDDING_* values.
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Inline comments:
In `@internal/core/config/config.go`:
- Around line 283-285: The config default change to 3072 can hide mismatches
with existing Qdrant collections (e.g., 768), so update the Qdrant startup flow
to detect and fail on dimension mismatches: in
internal/integrations/qdrant/client.go, inside CreateCollection (the branch that
treats existing collections as success), fetch the existing collection metadata
(collection info/describe), compare its vector size to
config.Embedding.Dimensions (from internal/core/config/config.go), and if they
differ return a clear fatal error describing the expected vs actual dimensions
and a remediation (e.g., recreate collection or set correct embedding dimension)
so the service fails at startup rather than deferring to runtime; include the
dimension numbers and next steps in the error message.
---
Nitpick comments:
In @.github/workflows/e2e-test.yml:
- Line 108: Hardcoded embedding constants (e.g., the "dimensions: 3072" entries
repeated in multiple heredocs) should be centralized: add workflow-level
variables (e.g., EMBEDDING_DIMENSIONS and EMBEDDING_MODEL) in the job/env
section and replace each literal "dimensions: 3072" and model string inside the
heredocs with a placeholder that references those variables, then ensure the
heredocs are rendered with those env vars (use GitHub Actions env interpolation
or a single substitution step like envsubst) so future model/dimension updates
only change the centralized EMBEDDING_* values.
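The centralization described above could be sketched roughly as follows (the `EMBEDDING_*` variable names come from the review comment; the step name and heredoc structure of the actual workflow are assumptions):

```yaml
env:
  EMBEDDING_MODEL: gemini-embedding-001
  EMBEDDING_DIMENSIONS: "3072"

jobs:
  e2e:
    runs-on: ubuntu-latest
    steps:
      - name: Write test config
        # Unquoted EOF lets the shell expand the centralized env vars,
        # so every heredoc picks up the same model/dimension pair.
        run: |
          cat > simili.yaml <<EOF
          embedding:
            model: ${EMBEDDING_MODEL}
            dimensions: ${EMBEDDING_DIMENSIONS}
          EOF
```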
ℹ️ Review info — Run ID: 0a3ddf6f-7ecf-4f98-ab3f-ef41f51b0720
📒 Files selected for processing (7)
- .claude/sessions/2026-02-02-0941-v0.0.2v-foundation.md
- .github/workflows/e2e-test.yml
- .simili.yaml
- DOCS/0.0.2v/plan.md
- internal/core/config/config.go
- internal/integrations/ai/embedder.go
- internal/integrations/ai/embedder_test.go
🚧 Files skipped from review as they are similar to previous changes (1)
- internal/integrations/ai/embedder.go
🧪 E2E Test — ✅ Bot responded: yes | Auto-closer (dry-run) | processed: 0, closed: 0, grace: 0, human: 0 | Test repo → gh-simili-bot/simili-e2e-22705431212. Auto-generated by E2E pipeline.
…nt mismatches
When CreateCollection finds an existing collection it previously returned
nil immediately, meaning a 768-dim collection would survive startup and
fail only at first write when a 3072-dim vector was upserted.
Add validateCollectionDimension: fetches the existing collection's
VectorParams.Size via GetCollectionInfoRequest and compares against the
requested dimension. Returns a clear fatal-ready error describing the
mismatch and the remediation steps (recreate collection or align
embedding.dimensions in config).
Path: CollectionInfo → CollectionConfig → CollectionParams
→ VectorsConfig → VectorParams.GetSize()
If collection info is unavailable the check is skipped gracefully so
read-only or restricted deployments are not broken.
Signed-off-by: Kavirubc <hapuarachchikaviru@gmail.com>
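The comparison logic the commit describes can be modeled in isolation (this sketch replaces the real Qdrant lookup via GetCollectionInfoRequest with plain integer arguments so it runs standalone; the function name and error wording here are illustrative, not the actual client code):

```go
package main

import "fmt"

// validateDimension mirrors the startup check described above: compare the
// existing collection's vector size against the configured embedding
// dimension and return a remediation-oriented error on mismatch. The real
// implementation obtains `existing` from VectorParams.GetSize() on the
// collection info response.
func validateDimension(name string, existing, configured int) error {
	if existing == configured {
		return nil
	}
	return fmt.Errorf(
		"collection %q has vector size %d but embedding.dimensions is %d; "+
			"recreate the collection or set embedding.dimensions to %d",
		name, existing, configured, existing,
	)
}

func main() {
	// Matching sizes pass silently at startup.
	fmt.Println(validateDimension("issues", 3072, 3072)) // prints <nil>
	// A stale 768-dim collection now fails fast instead of at first upsert.
	fmt.Println(validateDimension("issues", 768, 3072))
}
```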
Caution
Some comments are outside the diff and can’t be posted inline due to platform limitations.
⚠️ Outside diff range comments (1)
internal/integrations/qdrant/client.go (1)
95-115: ⚠️ Potential issue | 🟠 Major — Reject non-positive dimensions before issuing collection requests.

`CreateCollection` should fail fast for invalid `dimension` values instead of proceeding into existence/creation logic. This prevents invalid vector-size requests from reaching Qdrant.

Proposed fix

```diff
 func (c *Client) CreateCollection(ctx context.Context, name string, dimension int) error {
+	if dimension <= 0 {
+		return fmt.Errorf("invalid embedding dimension %d: must be > 0", dimension)
+	}
+
 	// Check if exists first
 	exists, err := c.CollectionExists(ctx, name)
 	if err != nil {
 		return err
 	}
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@internal/integrations/qdrant/client.go` around lines 95 - 115, In Client.CreateCollection, validate the incoming dimension argument and return an error immediately for non-positive values (<= 0) before calling CollectionExists or making any collection requests; update the method so it checks dimension at the top of the function (in CreateCollection) and returns a clear error instead of proceeding, keeping the rest of the flow (CollectionExists call and subsequent Create flow and call to validateCollectionDimension) unchanged.
🧹 Nitpick comments (1)
internal/integrations/qdrant/client.go (1)
137-140: Avoid silently skipping validation on collection-info fetch errors.

On lines 138-139, returning `nil` for any metadata lookup failure weakens the startup check and can reintroduce write-time failures. Consider returning an error (or only skipping on explicitly tolerated cases).

Proposed fix

```diff
 resp, err := c.collections.Get(authCtx, &pb.GetCollectionInfoRequest{
 	CollectionName: name,
 })
 if err != nil {
-	// Cannot confirm — proceed and let Qdrant reject mismatched vectors at write time.
-	return nil
+	return fmt.Errorf("failed to inspect collection %q dimension: %w", name, err)
 }
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@internal/integrations/qdrant/client.go` around lines 137 - 140, The code currently swallows any error from the collection-info fetch (the block where `if err != nil { return nil }`), which weakens startup validation; change this to propagate the error instead of returning nil (i.e., return the `err` or wrap it with context) so callers can fail fast, but if you intentionally want to tolerate specific cases only allow skipping on explicit, documented conditions (e.g., a NotFound/404 from Qdrant or a known transient error type such as `qdrant.ErrCollectionNotFound`) by checking `err` for those cases and continuing only then; update the collection-info fetch error handling to either return `err` (or a wrapped error) or explicitly handle tolerated error types.
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Outside diff comments:
In `@internal/integrations/qdrant/client.go`:
- Around line 95-115: In Client.CreateCollection, validate the incoming
dimension argument and return an error immediately for non-positive values (<=
0) before calling CollectionExists or making any collection requests; update the
method so it checks dimension at the top of the function (in CreateCollection)
and returns a clear error instead of proceeding, keeping the rest of the flow
(CollectionExists call and subsequent Create flow and call to
validateCollectionDimension) unchanged.
---
Nitpick comments:
In `@internal/integrations/qdrant/client.go`:
- Around line 137-140: The code currently swallows any error from the
collection-info fetch (the block where `if err != nil { return nil }`), which
weakens startup validation; change this to propagate the error instead of
returning nil (i.e., return the `err` or wrap it with context) so callers can
fail fast, but if you intentionally want to tolerate specific cases only allow
skipping on explicit, documented conditions (e.g., a NotFound/404 from Qdrant or
a known transient error type such as `qdrant.ErrCollectionNotFound`) by checking
`err` for those cases and continuing only then; update the collection-info fetch
error handling to either return `err` (or a wrapped error) or explicitly handle
tolerated error types.
ℹ️ Review info — Run ID: 14d79580-7d9d-41b2-9116-44b676a1e093
📒 Files selected for processing (1)
internal/integrations/qdrant/client.go
🧪 E2E Test — ✅ Bot responded: yes | Auto-closer (dry-run) | processed: 0, closed: 0, grace: 0, human: 0 | Test repo → gh-simili-bot/simili-e2e-22706877355. Auto-generated by E2E pipeline.
Problem
`text-embedding-004` is not the correct Gemini embedding model. The right model is `gemini-embedding-001`, which also outputs 3072 dimensions rather than 768.

Changes

- internal/integrations/ai/embedder.go: text-embedding-004 → gemini-embedding-001; remove stale dimension case and detection check
- DOCS/examples/single-repo/simili.yaml: dimensions 768 → 3072
- DOCS/examples/multi-repo/simili.yaml: dimensions 768 → 3072
- cmd/simili-web/README.md: update example config snippet
- README.md: update default model reference and dimension table

Test plan

- `go test ./internal/integrations/ai/...` — all pass
- `go test ./internal/core/config/...` — all pass
- `grep -r "text-embedding-004"` returns no results

Summary by CodeRabbit
Documentation
Configuration Updates
Behavior Change
Tests