Skip to content

fix: preserve MDP integrity in PPO mini-batching#98

Open
lqzxt wants to merge 1 commit into
AgentR1:mainfrom
lqzxt:fix-preserve-trajectory-mini-batches
Open

fix: preserve MDP integrity in PPO mini-batching#98
lqzxt wants to merge 1 commit into
AgentR1:mainfrom
lqzxt:fix-preserve-trajectory-mini-batches

Commits

Commits on Jun 4, 2026