Skip to content

Pull requests: NVIDIA-NeMo/RL

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Remove policy offload for async grpo dtensor
#1608 opened Dec 7, 2025 by smahdavi4 Loading…
train on transitions
#1606 opened Dec 6, 2025 by cmunley1 Draft
4 tasks
feat: add support from building images using vllm from private repos CI:L1 Run doctests, unit tests, and functional tests documentation Improvements or additions to documentation
#1605 opened Dec 6, 2025 by terrykong Loading…
4 tasks
Megatron refactor POC
#1592 opened Dec 2, 2025 by ashors1 Draft
4 tasks
docs: get started section documentation Improvements or additions to documentation
#1582 opened Dec 1, 2025 by lbliii Loading…
feat: add SGLang rollout backend, part1 community-request
#1580 opened Nov 30, 2025 by PrinsYin Loading…
4 tasks
fix: Fix Fp8 sequence padding for PP>1 case CI:L2 Run doctests, unit tests, functional tests, and convergence tests
#1579 opened Nov 29, 2025 by guyueh1 Loading…
4 tasks
feat: Support top-p and top-k CI:L1 Run doctests, unit tests, and functional tests
#1578 opened Nov 27, 2025 by zhandaz Loading…
3 of 4 tasks
feat: genrm rlhf
#1576 opened Nov 27, 2025 by yfw Draft
4 tasks
chore: Bump vllm to 0.11.2, torch to 2.9, transformers to 4.57.1 CI:L1 Run doctests, unit tests, and functional tests CI Relating to CI
#1563 opened Nov 24, 2025 by yfw Loading…
4 tasks
feat: LoRA SFT support for DTensorV2 path CI:L1 Run doctests, unit tests, and functional tests documentation Improvements or additions to documentation
#1556 opened Nov 21, 2025 by samodi-nv Loading…
2 tasks done
fix: remove sft-qwen2.5-fsdp2tp8sp from nighlies CI:L0 Run doctests and unit tests
#1555 opened Nov 20, 2025 by ahmadki Loading…
fix: add H200 TFLOPS CI:L0 Run doctests and unit tests community-request
#1543 opened Nov 19, 2025 by clumsy Loading…
4 tasks done
feat: Support qwen3-next, mcore path
#1530 opened Nov 17, 2025 by ahmadki Loading…
1 task
feat: RL sampler [WIP]
#1522 opened Nov 14, 2025 by pjin-nvidia Draft
4 tasks
feat: Automodel init for DTensorPolicyV2 CI:L2 Run doctests, unit tests, functional tests, and convergence tests
#1509 opened Nov 12, 2025 by adil-a Loading…
refactor: refactor env and data processor & add nemotron super 49b recipes CI:L1 Run doctests, unit tests, and functional tests documentation Improvements or additions to documentation
#1506 opened Nov 11, 2025 by yuki-97 Loading…
feat: pipeline-rl style # of inflight prompt regulation CI:L1 Run doctests, unit tests, and functional tests documentation Improvements or additions to documentation
#1499 opened Nov 10, 2025 by youngeunkwon0405 Loading…
4 tasks
fix: Megatron static inference and adapt to mcore engine API changes CI:L1 Run doctests, unit tests, and functional tests r0.4.0
#1488 opened Nov 7, 2025 by shanmugamr1992 Loading…
4 tasks
feat: Add AceMathRL recipe
#1484 opened Nov 6, 2025 by ffrujeri Draft
4 tasks
ProTip! What’s not been updated in a month: updated:<2025-11-07.