feat: add trajectory-level batching, loss aggregation, and dynamic batching support#85

Open
lqzxt wants to merge 2 commits into AgentR1:main from lqzxt:feat/add_traj_batching
Conversation

@lqzxt lqzxt commented Apr 21, 2026

Summary

This PR introduces trajectory-aware training support by batching samples at the trajectory level, aggregating loss across entire trajectories, and adding dynamic batching to better handle variable-length trajectories. These changes ensure that all multi-turn completions from the same trajectory are processed within the same forward/backward pass while improving training efficiency and reducing memory fragmentation.
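A minimal sketch of what trajectory-level batch construction could look like: samples are grouped by trajectory id so that all multi-turn completions from one trajectory land in the same mini-batch. Names (`traj_id`, `build_trajectory_batches`) are hypothetical, not taken from the PR:

```python
from collections import defaultdict

def build_trajectory_batches(samples, batch_size):
    """Group samples by trajectory id, then emit mini-batches of whole
    trajectories so no trajectory is split across a batch boundary."""
    by_traj = defaultdict(list)
    for sample in samples:
        by_traj[sample["traj_id"]].append(sample)

    batches, current = [], []
    for traj in by_traj.values():
        # Start a new batch if adding this whole trajectory would overflow it.
        if current and len(current) + len(traj) > batch_size:
            batches.append(current)
            current = []
        current.extend(traj)
    if current:
        batches.append(current)
    return batches
```

Keeping a trajectory intact within one batch is what allows its loss to be aggregated in a single forward/backward pass, at the cost of batches whose sample count may slightly exceed `batch_size` when a single trajectory is longer than the target.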

Changes

  • Implement trajectory-level mini-batch construction so that all completions from the same trajectory stay in the same batch
  • Refactor loss computation to support trajectory-level loss aggregation
  • Add dynamic batching for variable-length trajectories to improve GPU memory utilization and reduce fragmentation
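One way the trajectory-level loss aggregation could be expressed: average per-token losses within each trajectory first, then average across trajectories, so long trajectories do not dominate short ones. This is a sketch under assumed tensor shapes, not the PR's actual implementation:

```python
import torch

def trajectory_loss(token_losses, token_mask, traj_ids):
    """Aggregate per-token losses into one scalar loss per batch.

    token_losses: (B, T) per-token loss for each sample in the batch
    token_mask:   (B, T) 1.0 for completion tokens, 0.0 for prompt/padding
    traj_ids:     (B,)   long tensor, trajectory index of each sample

    Returns the mean over trajectories of each trajectory's mean
    completion-token loss.
    """
    masked = token_losses * token_mask
    num_traj = int(traj_ids.max().item()) + 1
    # Sum losses and token counts per trajectory with a scatter-add.
    loss_sum = torch.zeros(num_traj).index_add_(0, traj_ids, masked.sum(dim=1))
    tok_count = torch.zeros(num_traj).index_add_(0, traj_ids, token_mask.sum(dim=1))
    per_traj = loss_sum / tok_count.clamp(min=1.0)
    return per_traj.mean()
```

Because all of a trajectory's completions sit in the same batch (per the first change above), this reduction can run inside a single backward pass with no cross-batch bookkeeping.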
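Dynamic batching for variable-length trajectories might pack whole trajectories under a total token budget instead of a fixed sample count, keeping per-batch memory roughly constant. A sketch with hypothetical names (`max_tokens` is an assumed knob, not confirmed by the PR):

```python
def dynamic_batches(trajectories, max_tokens):
    """Pack variable-length trajectories into batches capped by a token
    budget, so GPU memory per batch stays roughly constant."""
    batches, current, current_tokens = [], [], 0
    # Sorting by length keeps similar-length trajectories together,
    # which reduces padding waste within each batch.
    for traj in sorted(trajectories, key=len, reverse=True):
        if current and current_tokens + len(traj) > max_tokens:
            batches.append(current)
            current, current_tokens = [], 0
        current.append(traj)
        current_tokens += len(traj)
    if current:
        batches.append(current)
    return batches
```

An over-budget trajectory still gets a batch of its own (the budget check is skipped when the batch is empty), so nothing is dropped; whether the actual PR truncates or splits such outliers is not stated here.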

@lqzxt lqzxt requested a review from 0russwest0 April 21, 2026 13:00