Port turbostream GIGAFLOW policy + transition-PPO train loop#405

Open
eugenevinitsky wants to merge 6 commits into 3.0 from ev/merge_turbostream

Conversation

@eugenevinitsky

Summary

Ports the turbostream network architecture, PPO training loop, and hparams onto 3.0 — without touching 3.0's eval/render pipeline.

What's in

Policy (pufferlib/ocean/torch.py) — replaces the current `Drive` with a GIGAFLOW-style backbone:

  • Per-group encoders (ego, partner, road): Linear → LayerNorm → Tanh → Dropout → Linear
  • Max-pool over set dims (deep-sets style)
  • 4-layer GELU MLP backbone (hidden=512)
  • Linear actor + critic heads (split_network optional)
  • Preserves 3.0's road one-hot expansion (last categorical feature, 7 classes)
  • ~910k params
  • LSTM disabled (`rnn_name = None`)
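
A minimal sketch of that backbone shape (layer names, dims, and the action count are illustrative, not the exact `Drive` implementation in `pufferlib/ocean/torch.py`):

```python
import torch
import torch.nn as nn

def group_encoder(in_dim, hidden=512, p_drop=0.1):
    # Per-group encoder: Linear -> LayerNorm -> Tanh -> Dropout -> Linear
    return nn.Sequential(
        nn.Linear(in_dim, hidden), nn.LayerNorm(hidden),
        nn.Tanh(), nn.Dropout(p_drop), nn.Linear(hidden, hidden),
    )

class GigaflowStylePolicy(nn.Module):
    def __init__(self, ego_dim, partner_dim, road_dim, hidden=512, n_actions=15):
        super().__init__()
        self.ego_enc = group_encoder(ego_dim, hidden)
        self.partner_enc = group_encoder(partner_dim, hidden)
        self.road_enc = group_encoder(road_dim, hidden)
        # 4-layer GELU MLP backbone over the concatenated group features
        layers, in_dim = [], 3 * hidden
        for _ in range(4):
            layers += [nn.Linear(in_dim, hidden), nn.GELU()]
            in_dim = hidden
        self.backbone = nn.Sequential(*layers)
        self.actor = nn.Linear(hidden, n_actions)   # split_network would
        self.critic = nn.Linear(hidden, 1)          # duplicate the backbone

    def forward(self, ego, partners, road):
        # partners/road are sets (B, N, D): max-pool over N, deep-sets style
        e = self.ego_enc(ego)
        p = self.partner_enc(partners).max(dim=1).values
        r = self.road_enc(road).max(dim=1).values
        h = self.backbone(torch.cat([e, p, r], dim=-1))
        return self.actor(h), self.critic(h)
```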

Training loop (`pufferlib/pufferl.py`) — replaces 3.0's single `train()` with turbostream's dispatcher:

  • `_ppo_loss` — extracted loss helper
  • `_train_ppo_trajectory` — prioritized-replay V-trace path (unchanged from turbostream)
  • `_train_ppo_transition` — new flat-PPO path with EWMA-thresholded |adv| filter (selected when use_rnn=False)
  • Dispatch via new `ppo_granularity` config (`auto` picks transition without LSTM)
  • Bridges 3.0's `self.is_invalid_step` to turbostream's `self.masks` via `masks = ~self.is_invalid_step.bool()`, so PR #353's stopped-agent reward and learning masking is preserved
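
The EWMA-thresholded |adv| filter can be sketched roughly like this (names such as `ema_max` follow the metrics this PR logs, but the exact update rule in `_train_ppo_transition` may differ; treat this as an assumption):

```python
import torch

def ewma_adv_filter(advantages, ema_max, beta=0.25, threshold_scale=0.01):
    """Keep transitions whose |advantage| exceeds an EWMA-based threshold.

    ema_max tracks the per-batch max of |adv| with an exponential moving
    average; the keep threshold is a small fraction of that running max.
    Sketch only -- not the literal pufferl.py code.
    """
    batch_max = advantages.abs().max()
    ema_max = (1 - beta) * ema_max + beta * batch_max
    threshold = threshold_scale * ema_max
    keep = advantages.abs() > threshold
    return keep, ema_max, threshold
```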

Hyperparams (`pufferlib/config/ocean/drive.ini [train]`) — adopts turbostream's values:

  • `total_timesteps = 10B`, `batch_size = auto`, `minibatch_size = 65536`, `bptt_horizon = 128`
  • `learning_rate = 5e-4`, `gamma = 0.99`, `gae_lambda = 0.95`
  • `ent_coef = 0.01`, `vf_coef = 0.5`, `max_grad_norm = 0.5`

Config plumbing (`pufferlib/config/default.ini`) — new defaults:

  • `ppo_granularity = auto`
  • `adv_filter_ewma_beta = 0.25`
  • `adv_filter_threshold_scale = 0.01`
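
The `auto` dispatch amounts to roughly the following (the function name is illustrative; the real dispatch lives inside `pufferlib/pufferl.py`):

```python
def resolve_ppo_granularity(ppo_granularity: str, use_rnn: bool) -> str:
    """'auto' picks the transition path when there is no LSTM, otherwise
    the trajectory (prioritized-replay V-trace) path."""
    if ppo_granularity != "auto":
        return ppo_granularity
    return "trajectory" if use_rnn else "transition"
```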

Infra (`scripts/rebuild_on_cluster.py`) — hardcodes `TORCH_CUDA_ARCH_LIST="8.0 8.9 9.0"` inside the container shell script so multi-arch builds cover A100/L40S/H100/H200 regardless of which node runs the rebuild.

Plan doc (`docs/merge_turbostream_plan.md`) — 9-phase merge plan for the remaining turbostream features (OBB collisions, TTC, waypoints, multi-scenario eval, etc.). Not load-bearing on this PR.

What's not in (intentionally preserved from 3.0)

Caveats

  • Existing .pt checkpoints from 3.0 will NOT load — the policy parameter names change completely (`actor_backbone.ego_encoder.` vs old `ego_encoder.`). This is an unavoidable consequence of the architecture swap.
  • `drivenet.h` is stale — the visualize binary's hardcoded MLP layout doesn't match the new `Drive` class. Safe eval GIFs will either crash or produce garbage until drivenet.h is ported. Training itself is unaffected.
  • The transition training path is a different algorithm from 3.0's — it uses an EWMA advantage-magnitude filter instead of prioritized replay, and does not use V-trace (rho/c clips forced to 1.0). Expect different training dynamics, not identical results.
  • Reward conditioning coefs pass through the ego encoder instead of a dedicated conditioning encoder as turbostream does — 3.0's obs layout packs coefs into the ego tail rather than as a separate slice, and the port didn't add the separation.

Validation

  • Syntax/import OK on macOS
  • End-to-end smoke test on emerge2: policy loads, forward pass yields correct shapes, `PuffeRL.train()` runs multiple epochs without error. Transition-path metrics (`filter_threshold`, `ema_max`, `kept_fraction`, `masked_fraction`) populate correctly.
  • 3 jobs running on NYU Greene (`turbostream-port` wandb project, group `gigaflow-net-v1` = this PR's net + 3.0 hparams, group `gigaflow-hparams` = this PR's net + this PR's hparams)

Test plan

  • Verify 3 training runs converge (completion_rate > 0.5, explained_variance > 0.5 by ~1B steps)
  • Confirm training logs show transition-path metrics, no NaN losses
  • Spot-check that safe_eval failure does not crash training (expected to fail gracefully on stale drivenet.h)
  • If porting is revisited, update drivenet.h to match the new `Drive` class so safe_eval GIFs render

Commit note (scripts/rebuild_on_cluster.py): The local TORCH_CUDA_ARCH_LIST env var on the user's machine never made it into the SLURM build job, so the CUDA extension was built only for the build node's GPU arch (compute 9.0 if it landed on an H100), and training jobs that landed on an A100 (8.0) crashed at the first compute_puff_advantage call with "no kernel image is available for execution on the device". The fix sets TORCH_CUDA_ARCH_LIST="8.0 8.9 9.0" inside the container shell script directly, so every rebuild covers A100 + L40S + H100/H200.
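
In shell form, the fix amounts to pinning the arch list inside the generated container script rather than relying on the caller's environment (the surrounding script contents are illustrative):

```shell
#!/bin/sh
# Inside the container rebuild script: pin the arch list so the CUDA
# extension is compiled for every GPU family the cluster can schedule,
# not just the arch of whichever node happens to run the build.
export TORCH_CUDA_ARCH_LIST="8.0 8.9 9.0"   # A100, L40S, H100/H200
echo "building for: ${TORCH_CUDA_ARCH_LIST}"
```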
Commit note (docs/merge_turbostream_plan.md): Piecemeal port plan for integrating vcha/turbostream features onto 3.0 via the ev/merge_turbostream branch. Covers strategy, features to preserve from 3.0 (GPU renderer, libx264 threads cap, active_step_count fix, 2D rel-v partner obs, variable-agent spawning, current reward ranges), a 9-phase ordered port plan with dependencies and risks, an explicit list of features NOT to port, open questions, and a merge hazard map.
Copilot AI review requested due to automatic review settings April 14, 2026 04:02

Copilot AI left a comment


Pull request overview

Ports turbostream’s GIGAFLOW-style Drive policy and its PPO training-loop refactor onto the 3.0 branch while aiming to preserve the existing eval/render/safe-eval pipelines and stopped-agent masking semantics.

Changes:

  • Replaces the Drive torch policy with a GIGAFLOW-style per-group encoder + deep-sets max-pool + GELU MLP backbone.
  • Refactors PPO training into a dispatcher with shared _ppo_loss, adding a transition-level PPO path with EWMA advantage filtering and keeping a trajectory-level prioritized-replay/V-trace path.
  • Updates Ocean Drive training hyperparameters and adds new default config knobs (ppo_granularity, advantage filter params); also hardens cluster rebuilds by exporting TORCH_CUDA_ARCH_LIST.

Reviewed changes

Copilot reviewed 6 out of 6 changed files in this pull request and generated 6 comments.

Summary per file:
  • scripts/rebuild_on_cluster.py: Exports TORCH_CUDA_ARCH_LIST in the container rebuild script for multi-arch CUDA extension builds.
  • pufferlib/pufferl.py: Replaces monolithic train() with trajectory/transition PPO dispatch + shared loss helper; adds EWMA advantage filter state/metrics.
  • pufferlib/ocean/torch.py: Implements new GIGAFLOW-style Drive backbone/heads and keeps 3.0's road one-hot expansion.
  • pufferlib/config/ocean/drive.ini: Switches to new policy hparams and turbostream-like training hyperparameters; disables RNN by default.
  • pufferlib/config/default.ini: Adds defaults for PPO granularity dispatch and transition-path advantage filter knobs.
  • docs/merge_turbostream_plan.md: Adds a detailed merge plan document for future turbostream feature ports.


Comment thread pufferlib/pufferl.py
Comment on lines +519 to +534
masks = ~self.is_invalid_step.bool()
terminals = torch.maximum(self.terminals, (~masks).float())
advantages = torch.zeros_like(self.values, device=device)
advantages = compute_puff_advantage(
    self.values,
    self.rewards,
    terminals,
    self.ratio,
    advantages,
    config["gamma"],
    config["gae_lambda"],
    config["vtrace_rho_clip"],
    config["vtrace_c_clip"],
)
advantages.masked_fill_(~masks, 0.0)


Copilot AI Apr 14, 2026


In trajectory PPO, invalid timesteps are only handled by zeroing advantages, but _ppo_loss still includes those steps in advantage normalization and in the value loss (and potentially other loss terms). This regresses the stopped-agent / invalid-step gradient masking behavior (e.g., critic still gets gradients on invalid steps). Consider passing a mask into _ppo_loss and applying it to advantage mean/std and to pg_loss/v_loss reductions (and decide explicitly whether entropy should be masked).
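
A hedged sketch of what this comment suggests — computing advantage statistics and loss reductions over valid steps only (the helper names are illustrative, not existing `_ppo_loss` API):

```python
import torch

def masked_normalize(adv, mask, eps=1e-8):
    """Normalize advantages using mean/std over valid steps only, so
    invalid (stopped-agent) steps don't skew the statistics."""
    valid = adv[mask]
    return (adv - valid.mean()) / (valid.std() + eps)

def masked_mean(per_step_loss, mask):
    """Reduce a per-step loss over valid steps only, so invalid steps
    contribute no gradient (e.g. to the value loss)."""
    m = mask.float()
    return (per_step_loss * m).sum() / m.sum().clamp(min=1)
```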

Comment thread pufferlib/pufferl.py
Comment on lines +656 to +661
mb_obs = flat_obs[mb_idx]
mb_actions = flat_actions[mb_idx]
mb_logprobs = flat_logprobs[mb_idx]
mb_values = flat_values[mb_idx]
mb_returns = flat_returns[mb_idx]
mb_adv = flat_advantages[mb_idx]

Copilot AI Apr 14, 2026


flat_obs comes from self.observations, which is allocated on CPU when cpu_offload=True, but keep_idx/mb_idx are CUDA indices. Indexing a CPU tensor with CUDA indices will raise at runtime. To keep cpu_offload working, index observations with mb_idx.cpu() (and then move mb_obs to device), or move flat_obs onto the same device as the indices before slicing.

Suggested change (the removed lines match the excerpt above):

mb_actions = flat_actions[mb_idx]
mb_logprobs = flat_logprobs[mb_idx]
mb_values = flat_values[mb_idx]
mb_returns = flat_returns[mb_idx]
mb_adv = flat_advantages[mb_idx]
mb_obs_idx = mb_idx.cpu() if flat_obs.device.type == "cpu" and mb_idx.device.type != "cpu" else mb_idx
mb_obs = flat_obs[mb_obs_idx]
if mb_obs.device != mb_actions.device:
    mb_obs = mb_obs.to(mb_actions.device, non_blocking=True)
Comment thread pufferlib/pufferl.py
Comment on lines +517 to +569
self.amp_context.__enter__()

masks = ~self.is_invalid_step.bool()
terminals = torch.maximum(self.terminals, (~masks).float())
advantages = torch.zeros_like(self.values, device=device)
advantages = compute_puff_advantage(
    self.values,
    self.rewards,
    terminals,
    self.ratio,
    advantages,
    config["gamma"],
    config["gae_lambda"],
    config["vtrace_rho_clip"],
    config["vtrace_c_clip"],
)
advantages.masked_fill_(~masks, 0.0)

adv = advantages.abs().sum(axis=1)
prio_weights = torch.nan_to_num(adv**a, 0, 0, 0)
prio_probs = (prio_weights + 1e-6) / (prio_weights.sum() + 1e-6)
idx = torch.multinomial(prio_probs, self.minibatch_segments)
mb_prio = (self.segments * prio_probs[idx, None]) ** -anneal_beta

profile("train_copy", epoch)
mb_obs = self.observations[idx]
mb_actions = self.actions[idx]
mb_logprobs = self.logprobs[idx]
mb_values = self.values[idx]
mb_returns = advantages[idx] + mb_values
mb_adv = advantages[idx]

if not config["use_rnn"]:
    mb_obs = mb_obs.reshape(-1, *self.vecenv.single_observation_space.shape)

profile("train_forward", epoch)
loss, newvalue, ratio, stats = self._ppo_loss(
    mb_obs,
    mb_actions,
    mb_logprobs,
    mb_values,
    mb_returns,
    mb_adv,
    clip_coef,
    vf_clip,
    adv_weights=mb_prio,
    unbiased_std=True,
)
self.ratio[idx] = ratio.detach()
self.amp_context.__enter__()  # TODO: AMP needs some debugging

self.values[idx] = newvalue.detach().float()

Copilot AI Apr 14, 2026


self.amp_context.__enter__() is called without a matching __exit__(). For torch.amp.autocast this leaves autocast enabled (and increases nesting depth) beyond the intended scope. Use with self.amp_context: around the forward/loss section, or ensure __exit__ is called in a finally block.

Suggested change (the removed lines match the excerpt above):

with self.amp_context:
    masks = ~self.is_invalid_step.bool()
    terminals = torch.maximum(self.terminals, (~masks).float())
    advantages = torch.zeros_like(self.values, device=device)
    advantages = compute_puff_advantage(
        self.values,
        self.rewards,
        terminals,
        self.ratio,
        advantages,
        config["gamma"],
        config["gae_lambda"],
        config["vtrace_rho_clip"],
        config["vtrace_c_clip"],
    )
    advantages.masked_fill_(~masks, 0.0)
    adv = advantages.abs().sum(axis=1)
    prio_weights = torch.nan_to_num(adv**a, 0, 0, 0)
    prio_probs = (prio_weights + 1e-6) / (prio_weights.sum() + 1e-6)
    idx = torch.multinomial(prio_probs, self.minibatch_segments)
    mb_prio = (self.segments * prio_probs[idx, None]) ** -anneal_beta
    profile("train_copy", epoch)
    mb_obs = self.observations[idx]
    mb_actions = self.actions[idx]
    mb_logprobs = self.logprobs[idx]
    mb_values = self.values[idx]
    mb_returns = advantages[idx] + mb_values
    mb_adv = advantages[idx]
    if not config["use_rnn"]:
        mb_obs = mb_obs.reshape(-1, *self.vecenv.single_observation_space.shape)
    profile("train_forward", epoch)
    loss, newvalue, ratio, stats = self._ppo_loss(
        mb_obs,
        mb_actions,
        mb_logprobs,
        mb_values,
        mb_returns,
        mb_adv,
        clip_coef,
        vf_clip,
        adv_weights=mb_prio,
        unbiased_std=True,
    )
    self.ratio[idx] = ratio.detach()
    self.values[idx] = newvalue.detach().float()
Comment thread pufferlib/pufferl.py
Comment on lines +516 to +576
profile("train_misc", epoch)
self.amp_context.__enter__()

masks = ~self.is_invalid_step.bool()
terminals = torch.maximum(self.terminals, (~masks).float())
advantages = torch.zeros_like(self.values, device=device)
advantages = compute_puff_advantage(
    self.values,
    self.rewards,
    terminals,
    self.ratio,
    advantages,
    config["gamma"],
    config["gae_lambda"],
    config["vtrace_rho_clip"],
    config["vtrace_c_clip"],
)
advantages.masked_fill_(~masks, 0.0)

adv = advantages.abs().sum(axis=1)
prio_weights = torch.nan_to_num(adv**a, 0, 0, 0)
prio_probs = (prio_weights + 1e-6) / (prio_weights.sum() + 1e-6)
idx = torch.multinomial(prio_probs, self.minibatch_segments)
mb_prio = (self.segments * prio_probs[idx, None]) ** -anneal_beta

profile("train_copy", epoch)
mb_obs = self.observations[idx]
mb_actions = self.actions[idx]
mb_logprobs = self.logprobs[idx]
mb_values = self.values[idx]
mb_returns = advantages[idx] + mb_values
mb_adv = advantages[idx]

if not config["use_rnn"]:
    mb_obs = mb_obs.reshape(-1, *self.vecenv.single_observation_space.shape)

profile("train_forward", epoch)
loss, newvalue, ratio, stats = self._ppo_loss(
    mb_obs,
    mb_actions,
    mb_logprobs,
    mb_values,
    mb_returns,
    mb_adv,
    clip_coef,
    vf_clip,
    adv_weights=mb_prio,
    unbiased_std=True,
)
self.ratio[idx] = ratio.detach()
self.amp_context.__enter__()  # TODO: AMP needs some debugging

self.values[idx] = newvalue.detach().float()

profile("train_misc", epoch)
for key, value in stats.items():
    losses[key] += value / self.total_minibatches
losses["importance"] += ratio.mean().item() / self.total_minibatches

profile("learn", epoch)
loss.backward()

Copilot AI Apr 14, 2026


This second self.amp_context.__enter__() call is also unbalanced (no __exit__). If the intent is to enable autocast during training, wrap the whole minibatch forward/backward in with self.amp_context: rather than manually entering multiple times.

Suggested change (the removed lines match the excerpt above):

with self.amp_context:
    profile("train_misc", epoch)
    masks = ~self.is_invalid_step.bool()
    terminals = torch.maximum(self.terminals, (~masks).float())
    advantages = torch.zeros_like(self.values, device=device)
    advantages = compute_puff_advantage(
        self.values,
        self.rewards,
        terminals,
        self.ratio,
        advantages,
        config["gamma"],
        config["gae_lambda"],
        config["vtrace_rho_clip"],
        config["vtrace_c_clip"],
    )
    advantages.masked_fill_(~masks, 0.0)
    adv = advantages.abs().sum(axis=1)
    prio_weights = torch.nan_to_num(adv**a, 0, 0, 0)
    prio_probs = (prio_weights + 1e-6) / (prio_weights.sum() + 1e-6)
    idx = torch.multinomial(prio_probs, self.minibatch_segments)
    mb_prio = (self.segments * prio_probs[idx, None]) ** -anneal_beta
    profile("train_copy", epoch)
    mb_obs = self.observations[idx]
    mb_actions = self.actions[idx]
    mb_logprobs = self.logprobs[idx]
    mb_values = self.values[idx]
    mb_returns = advantages[idx] + mb_values
    mb_adv = advantages[idx]
    if not config["use_rnn"]:
        mb_obs = mb_obs.reshape(-1, *self.vecenv.single_observation_space.shape)
    profile("train_forward", epoch)
    loss, newvalue, ratio, stats = self._ppo_loss(
        mb_obs,
        mb_actions,
        mb_logprobs,
        mb_values,
        mb_returns,
        mb_adv,
        clip_coef,
        vf_clip,
        adv_weights=mb_prio,
        unbiased_std=True,
    )
    self.ratio[idx] = ratio.detach()
    self.values[idx] = newvalue.detach().float()
    profile("train_misc", epoch)
    for key, value in stats.items():
        losses[key] += value / self.total_minibatches
    losses["importance"] += ratio.mean().item() / self.total_minibatches
    profile("learn", epoch)
    loss.backward()
Comment thread pufferlib/pufferl.py
Comment on lines +664 to +675
loss, _, _, stats = self._ppo_loss(
    mb_obs,
    mb_actions,
    mb_logprobs,
    mb_values,
    mb_returns,
    mb_adv,
    clip_coef,
    vf_clip,
    unbiased_std=False,
)
self.amp_context.__enter__()  # TODO: AMP needs some debugging

Copilot AI Apr 14, 2026


This self.amp_context.__enter__() inside transition PPO is unbalanced (no __exit__) and can leave autocast enabled outside the intended scope. Prefer with self.amp_context: around the forward/loss computation (or explicitly call __exit__ in finally).

Suggested change (the removed lines match the excerpt above):

with self.amp_context:
    loss, _, _, stats = self._ppo_loss(
        mb_obs,
        mb_actions,
        mb_logprobs,
        mb_values,
        mb_returns,
        mb_adv,
        clip_coef,
        vf_clip,
        unbiased_std=False,
    )
Comment thread pufferlib/pufferl.py
Comment on lines +582 to +585
y_pred = self.values.flatten()
y_true = advantages.flatten() + self.values.flatten()
var_y = y_true.var()
return float("nan") if var_y == 0 else (1 - (y_true - y_pred).var() / var_y).item()

Copilot AI Apr 14, 2026


In _train_ppo_trajectory, self.values[idx] is mutated during training, but explained variance is computed as advantages + self.values after those updates. That mixes advantages computed from the pre-update values with post-update predictions and can make explained_variance misleading. Consider computing a returns tensor once from a snapshot of the rollout values (or store old_values = self.values.clone() before updates) and use that for the metric.
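
A sketch of the fix this comment describes — snapshotting values before the update loop and computing the metric against that snapshot (variable names are illustrative):

```python
import torch

def explained_variance(old_values, advantages):
    """Compute explained variance against returns formed from the
    pre-update value snapshot (old_values = self.values.clone() taken
    before the epoch), not from values mutated during training."""
    y_pred = old_values.flatten()
    y_true = (advantages + old_values).flatten()
    var_y = y_true.var()
    if var_y == 0:
        return float("nan")
    return (1 - (y_true - y_pred).var() / var_y).item()
```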
