[Model] Support to train glm5 by heavyrain-lzy · Pull Request #1227 · flagos-ai/FlagScale

heavyrain-lzy · 2026-06-17T09:42:37Z

Train

Others

Add GLM5 training support with new example configs under examples/glm5/, including both a small demo config and a 744B/40B Active model config.
Add GLM5 checkpoint conversion support in tools/checkpoint/, but not verified enoughly.
Update Megatron GPT builder to preserve experimental attention layer specs for MTP and print model structure/basic parameter-memory information after model construction.
Fix checkpoint save flow so energy monitor pause/resume is only triggered when log_energy is enabled.

support glm5

f52d446

heavyrain-lzy requested review from aoyulong and zhaoyinglia as code owners June 17, 2026 09:42

heavyrain-lzy added 3 commits June 17, 2026 17:43

Merge remote-tracking branch 'upstream/main' into support_glm5

873f2b3

support cached indexer

806b359

add flagscale tag

0dd27b7

Provide feedback