Skip to content
Closed
Changes from all commits
Commits
Show all changes
26 commits
Select commit Hold shift + click to select a range
1fe4d04
added and tested: OLMo-1B,OLMo-7B
jonasrohw Dec 12, 2024
0f3e3b3
fixed: numpy do not do a major upgrade!
jonasrohw Dec 13, 2024
3a101f4
fixed: dimensions of 7b to be correct
jonasrohw Dec 13, 2024
1b34ccd
tested: Loading checkpoints & model variations
jonasrohw Dec 13, 2024
f0a0a68
Reimplement OLMoE changes.
joelburget Dec 14, 2024
8c094e5
Implement TODO (norm_topk_prob)
joelburget Dec 14, 2024
7565c06
Disable bos token for OLMoE.
joelburget Dec 14, 2024
04cd309
Add q and k norm.
joelburget Dec 15, 2024
68d6961
Correct normalization type for OLMoE.
joelburget Dec 15, 2024
9afd032
Merge pull request #1 from joelburget/olmoe
jonasrohw Dec 15, 2024
96c1fbb
Merge branch 'dev' into OLMo
jonasrohw Dec 15, 2024
72fb903
ran formatting
jonasrohw Dec 15, 2024
9d3a85e
Merge branch 'dev' into OLMo
bryce13950 Feb 4, 2025
d4519b2
Merge branch 'dev' into OLMo
bryce13950 Feb 5, 2025
064310f
tmp update for olmo2
Ja1Zhou Feb 1, 2025
b1fd04b
Fix: Olmo2 uses normalization after the attention/mlp
jonasrohw Feb 15, 2025
871ba03
Merge branch 'dev' into OLMo
bryce13950 Jun 16, 2025
7939e8d
ran format
bryce13950 Jun 16, 2025
97fd1e7
fixed some type issues
bryce13950 Jun 19, 2025
9032fe7
Merge branch 'dev' into OLMo
bryce13950 Jun 24, 2025
39703c4
OLMo 2 RMS
jleechung Jul 22, 2025
1c283c1
OLMo 2 RMS
jleechung Jul 22, 2025
688a421
Tested Instruct models
jleechung Jul 22, 2025
9febc5c
Merge pull request #3 from jleechung/OLMo
jonasrohw Jul 23, 2025
1f01ef6
Merge remote-tracking branch 'origin/dev-3.x' into pr-816
jlarson4 Feb 14, 2026
374034e
Resolve duplicate code & irrelevant changes
jlarson4 Feb 14, 2026
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view

These merge commits were added into this branch cleanly.

There are no new changes to show.