Open, Jun 30, 2026
Due by July 14, 2026
The v0.9.0 architectural release promotes CodeWhale from a turn/subagent workbench into a WhaleFlow workflow workbench: typed branch-and-leaf workflows, pod-style background workflow monitoring, shared ARMH/RLM memoization, deterministic replay, external-memory evaluation, and a GEPA-style teacher/student promotion loop that turns validated lessons into a cached-main overlay.
Primary tracker: #2981 EPIC: v0.9.0 WhaleFlow branch/leaf workflow mode (re-established after #2667 was deleted)
In scope
- WhaleFlow workflow mode: background workflow runs, /workflows-style monitoring, done/total progress, longest-running item peek, inspect/replay/report surfaces.
- Typed Workflow IR as the source of truth: Starlark/YAML/generated plans compile to Rust-owned IR before execution.
- Rust async executor: bounded branches, bounded leaves, cancellation, budgets, permissions, LoopUntil, Cond, Expand, BranchTournament, and Pareto reducers.
- Branch/leaf semantics: isolated speculative branches, bounded leaves, losing-branch fruit harvesting, typed results.
- ARMH/RLM integration: exact-context shared memo across branches with visible hit/miss/cost telemetry.
- External-memory evaluation: decide whether Aleph-style memory belongs in core, optional plugins, or explicit workflow nodes, with visible state and clear/export controls.
- TraceStore and deterministic replay: replay from recorded leaf/control outputs, not live model calls, unless explicitly allowed.
- Teacher harness: TeacherReview proposes reusable lessons; StudentReplay and PromotionGate validate before promotion.
- Cached-main overlay: promoted notes, workflows, tests, branch heuristics, model/cache policies, and prompt patches warm future runs without mutating Git main.
- Janitor: stale invalidation, memo cleanup, candidate demotion, trace compaction, capacity enforcement.
- Model-provider abstraction: workflow roles map to capabilities and configured providers; no workflow logic hardcodes Arcee, DeepSeek, Claude, tool calls, JSON mode, or large context.
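The typed-IR and deterministic-replay items above can be sketched together. The following is a minimal illustration only, not CodeWhale's actual IR: the Node, Trace, ReplayError, and replay names are hypothetical, and the real IR has more node kinds (LoopUntil, Expand, BranchTournament, and so on). It shows a replay that reads recorded leaf outputs and control decisions from a trace and fails loudly instead of falling back to a live model call.

```rust
use std::collections::HashMap;

/// Hypothetical, heavily simplified workflow IR (not CodeWhale's real types).
#[derive(Debug, Clone, PartialEq)]
pub enum Node {
    /// A bounded unit of work identified by a stable id.
    Leaf { id: String },
    /// Run children in order and collect their results.
    Seq(Vec<Node>),
    /// Pick one child based on a recorded control decision.
    Cond { id: String, then_node: Box<Node>, else_node: Box<Node> },
}

/// Recorded outputs keyed by node id; stands in for a TraceStore.
#[derive(Debug, Default)]
pub struct Trace {
    pub leaf_outputs: HashMap<String, String>,
    pub cond_decisions: HashMap<String, bool>,
}

#[derive(Debug, PartialEq)]
pub enum ReplayError {
    MissingLeaf(String),
    MissingDecision(String),
}

/// Replays a workflow strictly from the trace; it never calls a model.
pub fn replay(node: &Node, trace: &Trace) -> Result<Vec<String>, ReplayError> {
    match node {
        Node::Leaf { id } => trace
            .leaf_outputs
            .get(id)
            .map(|out| vec![out.clone()])
            .ok_or_else(|| ReplayError::MissingLeaf(id.clone())),
        Node::Seq(children) => {
            let mut results = Vec::new();
            for child in children {
                results.extend(replay(child, trace)?);
            }
            Ok(results)
        }
        Node::Cond { id, then_node, else_node } => {
            let taken = trace
                .cond_decisions
                .get(id)
                .copied()
                .ok_or_else(|| ReplayError::MissingDecision(id.clone()))?;
            // Follow only the branch that was recorded as taken.
            let next: &Node = if taken { &**then_node } else { &**else_node };
            replay(next, trace)
        }
    }
}
```

In this sketch a missing record is an error rather than a trigger for a live call, which matches the "unless explicitly allowed" wording: an opt-in fallback would have to be a separate, visible code path.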
Non-goals
- No model-weight RL in v0.9.0.
- No arbitrary JS/Python as workflow source of truth.
- No script-level async/await. Starlark is a pure graph builder; Rust executes IR.
- No hidden external-memory dependency for normal CodeWhale operation.
- No uncontrolled self-modifying agent. Teacher output is inspectable, replayed, and reversible.
- No public performance claims until evals are reproducible.
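The "inspectable, replayed, and reversible" constraint on teacher output can be made concrete with a small sketch. Everything here is an assumption for illustration: the Lesson, ReplayReport, PromotionGate, and Overlay types, the pass-percentage threshold, and the method names are hypothetical and do not describe the planned implementation. The sketch shows a lesson entering an overlay only after a replay report clears the gate, and being removable afterwards.

```rust
/// A lesson proposed by a teacher review (hypothetical shape).
#[derive(Debug, Clone, PartialEq)]
pub struct Lesson {
    pub id: String,
    pub note: String,
}

/// Outcome of replaying student runs with the lesson applied.
#[derive(Debug, Clone, Copy)]
pub struct ReplayReport {
    pub passed: u32,
    pub total: u32,
}

/// Gate that admits a lesson only when enough replays pass.
pub struct PromotionGate {
    /// Minimum pass percentage, 0..=100 (an assumed policy knob).
    pub min_pass_percent: u32,
}

impl PromotionGate {
    pub fn admits(&self, report: ReplayReport) -> bool {
        // An empty replay set proves nothing, so it never promotes.
        report.total > 0 && report.passed * 100 >= self.min_pass_percent * report.total
    }
}

/// Promoted lessons held outside the repository; demotion undoes a promotion.
#[derive(Debug, Default)]
pub struct Overlay {
    lessons: Vec<Lesson>,
}

impl Overlay {
    /// Adds the lesson only if the gate admits it; returns whether it was promoted.
    pub fn promote(&mut self, lesson: Lesson, report: ReplayReport, gate: &PromotionGate) -> bool {
        if gate.admits(report) {
            self.lessons.push(lesson);
            true
        } else {
            false
        }
    }

    /// Reverses a promotion by id; returns the removed lesson, if any.
    pub fn demote(&mut self, id: &str) -> Option<Lesson> {
        let index = self.lessons.iter().position(|l| l.id == id)?;
        Some(self.lessons.remove(index))
    }

    /// Read-only view so promoted lessons stay inspectable.
    pub fn lessons(&self) -> &[Lesson] {
        &self.lessons
    }
}
```

Keeping the overlay as plain data with an explicit demote path is one way to make promotions reversible without touching Git main.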
Definition of done
- workflows/rlm_cache_change.star runs with mock provider in CI and can dogfood CodeWhale RLM/ARMH/provider changes.
- Branch/leaf engine, control flow, TraceStore, replay, ARMH shared memo, TeacherReview, StudentReplay, PromotionGate, overlay, and janitor have focused tests.
- Workflow mode can run, inspect, and replay a workflow from CLI and TUI.
- ARMH savings, provider costs, and any external-memory use are visible in workflow telemetry.
- All behavior is behind config/feature flags until stable.
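The telemetry requirement for ARMH savings can be illustrated with an exact-context memo that counts hits and misses. This is a single-threaded sketch under stated assumptions: SharedMemo, MemoStats, and get_or_compute are hypothetical names, the context is modeled as a plain string, and sharing across concurrent branches would additionally need synchronization that is omitted here.

```rust
use std::collections::HashMap;

/// Hit/miss counters that a telemetry surface could display (illustrative fields).
#[derive(Debug, Default, Clone, Copy, PartialEq)]
pub struct MemoStats {
    pub hits: u64,
    pub misses: u64,
}

/// Exact-match memo: a result is reused only when the full context is identical.
#[derive(Debug, Default)]
pub struct SharedMemo {
    entries: HashMap<String, String>,
    stats: MemoStats,
}

impl SharedMemo {
    /// Returns the cached result for `context`, or computes and stores it.
    pub fn get_or_compute<F>(&mut self, context: &str, compute: F) -> String
    where
        F: FnOnce() -> String,
    {
        if let Some(cached) = self.entries.get(context) {
            self.stats.hits += 1;
            return cached.clone();
        }
        // Miss: run the (possibly expensive) computation once and record it.
        self.stats.misses += 1;
        let value = compute();
        self.entries.insert(context.to_string(), value.clone());
        value
    }

    /// Snapshot of the counters for reporting.
    pub fn stats(&self) -> MemoStats {
        self.stats
    }
}
```

Two branches that issue the same context would produce one miss and one hit, so the saved call shows up directly in the counters.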
Release gate
- Parity gates green on the v0.9.0 integration branch.
- CHANGELOG [0.9.0] frames the release as WhaleFlow branch/leaf workflows and validated cached-main learning.
- Docs explain the Claude-workflow-inspired UX while preserving CodeWhale's typed IR/Rust executor safety model.
55% complete
Open issues in Hmbown/CodeWhale
- #2094 /huntjurisdiction system: configurable LLM-as-judge with strict/evidentiary/permissive policies (labels: bug, enhancement, rust)
- #1977 Model Lab: connect CodeWhale traces to open-weight fine-tuning/eval services (labels: documentation, enhancement)
- #1891 Slash commands: tool studio wiring for tools, MCP, skills, providers, and previews (labels: bug, documentation, enhancement)
- #2693 v0.9.0 HarnessPosture: model-specific context and subagent policy (labels: cache-maximalism, compaction, context, documentation, enhancement, v0.9.0, whaleflow)
- #2721 v0.9.0 Stabilization gate: Windows, large-repo, subagent, and live-state blockers (labels: bug, enhancement, release-blocker, v0.9.0)
- #2970 HarmonyOS/OpenHarmony tier-2 target: CI cargo-check job + remaining sandbox/clipboard gating (labels: documentation, enhancement)
- #2973 whaleflow: real async executor — replace MockWorkflowExecutor (labels: enhancement)
- #2974 whaleflow: wire codewhale-whaleflow into tui/cli (currently zero dependents) (labels: enhancement)
- #2975 whaleflow: ARMH/RLM shared-memo integration with live engine + telemetry (labels: enhancement)
- #2976 whaleflow: TeacherReview → StudentReplay → PromotionGate end-to-end (labels: enhancement)
- #2977 whaleflow: cached-main overlay (labels: enhancement)
- #2978 whaleflow: janitor — stale invalidation, memo cleanup, demotion, trace compaction (labels: enhancement)
- #2979 whaleflow: TUI /workflows monitoring surfaces — run/inspect/replay (labels: enhancement)
- #2980 whaleflow: CI — run workflows/rlm_cache_change.star against mock provider (labels: enhancement)
- #2981 EPIC: v0.9.0 WhaleFlow branch/leaf workflow mode (tracker, replaces deleted #2667) (labels: enhancement)
- #2752 [feat] Run Trace Export System for WhaleFlow/Model Lab (labels: bug, enhancement)
- #2535 "-" (no labels)
- #2904 Feature request: Persistent agent state and signed compressed KV cache capsules for long-running coding tasks (labels: bug, enhancement)
- #1917 Proposal: universal PreToolUse/PostToolUse hook layer for Cancel/Pause/Resume across all action types (labels: bug, documentation, enhancement, v0.9.0)
- #2791 Refactor command dispatch from monolithic match to modular strategy pattern (labels: enhancement, v0.9.0)
- #2870 EPIC: staged command-boundary refactor for #2791 (labels: cleanup, documentation, tui, v0.9.0)
- #2743 FR: Adapt to Claude Code's skills ecosystem (labels: enhancement, v0.9.0)
- #3209 v0.9.0 EPIC: Chat-native CodeWhale workrooms for threaded, shareable agent work (labels: documentation, enhancement, external-memory, pod-workflows, reliability, subagents, ux, v0.9.0, workflow-runtime)
- #3210 v0.9.0: Always-on stateful agent identity across workrooms, repos, and fleet runs (labels: documentation, enhancement, external-memory, pod-workflows, reliability, subagents, ux, v0.9.0, workflow-runtime)
- #3229 WhaleFlow coordination substrate: Fleet ledger as shared task list + consume the whaleflow IR (labels: bug, documentation, enhancement, subagents, whaleflow)