fix: project offline state into vocab space #297

timmy-feng · 2025-11-13T05:01:49Z

Motivation

PR #293 changed offline hidden-state generation to save only the last-layer hidden states from the target model. This avoids loading full vocab-size logits into CPU RAM.

That change breaks the old OfflineEagle3Model class, which passes target to OnlineEagle3Model without projecting it from hidden_size to vocab_size.
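To make the memory argument concrete, here is a rough back-of-the-envelope sketch. The sizes below (hidden_size, vocab_size, sequence length, and a 2-byte dtype) are hypothetical values chosen for illustration; they are not taken from this PR or from any specific model configuration.

```python
# Hypothetical sizes, for illustration only (not taken from the PR).
hidden_size = 4096       # width of the last-layer hidden state
vocab_size = 128_256     # number of entries in the vocabulary
seq_len = 2048           # tokens in one stored sample
bytes_per_value = 2      # assuming a 16-bit dtype such as bf16

# Storage needed for one sample if we keep hidden states vs. full logits.
hidden_bytes = seq_len * hidden_size * bytes_per_value
logits_bytes = seq_len * vocab_size * bytes_per_value

ratio = logits_bytes / hidden_bytes  # equals vocab_size / hidden_size
print(f"hidden states: {hidden_bytes / 2**20:.1f} MiB per sample")
print(f"full logits:   {logits_bytes / 2**20:.1f} MiB per sample")
print(f"logits are {ratio:.1f}x larger")
```

Under these assumed sizes the saving is simply the ratio vocab_size / hidden_size, which is why storing the hidden state and projecting later is attractive.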

Modifications

Project the offline hidden state into the correct shape using the target LM head in OfflineEagle3Model.forward().
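The projection described above can be sketched roughly as follows. This is a minimal illustration in plain PyTorch, not the actual OfflineEagle3Model.forward() code: the helper name project_to_vocab, the tensor sizes, and the assumption that the LM head is a bias-free linear layer are all hypothetical.

```python
import torch
import torch.nn as nn

# Hypothetical sizes, kept small so the example runs quickly.
hidden_size, vocab_size = 16, 100
batch_size, seq_len = 2, 5

# Stand-in for the target model's LM head (assumed here to be a
# bias-free linear layer mapping hidden_size -> vocab_size).
lm_head = nn.Linear(hidden_size, vocab_size, bias=False)


def project_to_vocab(last_hidden_state: torch.Tensor, head: nn.Linear) -> torch.Tensor:
    """Map stored hidden states (..., hidden_size) to logits (..., vocab_size)."""
    # Assumption: the target is not trained here, so no gradients are tracked.
    with torch.no_grad():
        return head(last_hidden_state)


# The offline data holds only the last-layer hidden state, not full logits.
offline_hidden = torch.randn(batch_size, seq_len, hidden_size)

# Project into vocab space before handing the target to downstream code.
target_logits = project_to_vocab(offline_hidden, lm_head)
print(target_logits.shape)  # torch.Size([2, 5, 100])
```

Doing the projection at training time trades a small amount of compute per batch for not having to store or load vocab-size tensors.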

Related Issues

Accuracy Test

Benchmark & Profiling

Checklist

Format your code according to the Code Formatting with Pre-Commit.
Add unit tests as outlined in the Running Unit Tests.
Update documentation / docstrings / example tutorials as needed, according to Writing Documentation.
Provide throughput / latency benchmark results and accuracy evaluation results as needed, according to Benchmark and Profiling and Accuracy Results.
For reviewers: If you haven't made any contributions to this PR and are only assisting with merging the main branch, please remove yourself as a co-author when merging the PR.
Please feel free to join our Slack channel at https://sgl-fru7574.slack.com/archives/C09784E3EN6 to discuss your PR.

gemini-code-assist · 2025-11-13T05:02:25Z

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

project offline state into vocab space

a984b3d

timmy-feng requested a review from FrankLeeeee as a code owner November 13, 2025 05:01

zhyncs approved these changes Nov 15, 2025

View reviewed changes

zhyncs merged commit a4453bf into sgl-project:main Nov 15, 2025
2 checks passed
