Skip to content

Pull requests: NVIDIA-NeMo/RL

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

fix(distillation): persist validation samples as JSONL
#2858 opened Jun 17, 2026 by jinglinglingling Contributor Loading…
fix: disable draft config in Nemo Gym Nano v3 recipe default setting
#2857 opened Jun 17, 2026 by snowmanwwg Contributor Loading…
4 tasks
fix(data): stabilize multi-turn chat chunking and tokenization CI:Lfast Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
#2856 opened Jun 17, 2026 by jinglinglingling Contributor Loading…
ci: Add super nightly tests Documentation Improvements or additions to documentation
#2855 opened Jun 16, 2026 by ashors1 Contributor Draft
4 tasks
docs(xtoken): X-Token distillation guide and README updates Documentation Improvements or additions to documentation
#2854 opened Jun 16, 2026 by avenkateshha Contributor Loading…
fix: missing validation logging in distillation community-request
#2847 opened Jun 16, 2026 by odedovadia Contributor Loading…
2 of 4 tasks
test: add vLLM HTTP logprobs contract test for NeMo-Gym capture CI:Lfast Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
#2845 opened Jun 16, 2026 by ananthsub Contributor Loading…
test(data_plane): session-scope mooncake fixtures CI:Lfast Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
#2838 opened Jun 16, 2026 by ZhiyuLi-Nvidia Contributor Loading…
feat: Support for dtensor ppo CI:Lfast Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
#2837 opened Jun 16, 2026 by fujial-code Draft
ci: Bump Megatron-Bridge to e9529c3 CI:L1 Run doctests, unit tests, and functional tests
#2835 opened Jun 16, 2026 by svcnvidia-nemo-ci Contributor Loading…
feat: Support Linear CE Loss Fusion for GRPO community-request Documentation Improvements or additions to documentation waiting-on-customer Waiting on the original author to respond
#2833 opened Jun 16, 2026 by pengdurice Contributor Loading…
4 tasks done
ci: forward SANDBOX_CONTAINER/COMMAND/ENV_VARS to ray.sub
#2832 opened Jun 16, 2026 by kajalj22 Contributor Draft
3 tasks
Asyncrl/sc sync weights
#2831 opened Jun 16, 2026 by mehraakash Loading…
4 tasks
ci: List bundled codecs
#2830 opened Jun 15, 2026 by kajalj22 Contributor Draft
1 task
feat: super-v3 recipe and docs CI:L0 Run doctests and unit tests Documentation Improvements or additions to documentation super-v3
#2829 opened Jun 15, 2026 by macandro96 Contributor Loading…
4 tasks
feat: async checkpointing for Megatron policy workers CI:Lfast Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
#2828 opened Jun 15, 2026 by ananthsub Contributor Loading…
2 of 4 tasks
perf: reduce srun overhead in ray.sub and gate driver on sandbox readiness CI:Lfast Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
#2827 opened Jun 15, 2026 by ananthsub Contributor Loading…
4 tasks
feat(ppo): in-model value head for Megatron PPO CI:L1 Run doctests, unit tests, and functional tests
#2825 opened Jun 15, 2026 by bg51717 Contributor Loading…
3 of 4 tasks
feat: video + audio understanding GRPO training recipe CI:L1 Run doctests, unit tests, and functional tests Documentation Improvements or additions to documentation
#2823 opened Jun 15, 2026 by yuekaizhang Contributor Loading…
feat: single controller (w/o sync_weight)
#2819 opened Jun 15, 2026 by yuki-97 Contributor Draft
feat: full Kubernetes multi-node support (Kubeflow PyTorchJob + KubeRay) community-request Documentation Improvements or additions to documentation
#2818 opened Jun 15, 2026 by dafu-wu Loading…
feat: NCCL-Xfer refit merge PR CI:Lfast Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version) Performance Related to improving performance
#2808 opened Jun 15, 2026 by youngeunkwon0405 Contributor Loading…
4 tasks
ProTip! Add no:assignee to see everything that’s not assigned.