-
Notifications
You must be signed in to change notification settings - Fork 425
Pull requests: NVIDIA-NeMo/RL
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
fix(distillation): persist validation samples as JSONL
#2858
opened Jun 17, 2026 by
jinglinglingling
Contributor
Loading…
fix: disable draft config in Nemo Gym Nano v3 recipe default setting
#2857
opened Jun 17, 2026 by
snowmanwwg
Contributor
Loading…
4 tasks
fix(data): stabilize multi-turn chat chunking and tokenization
CI:Lfast
Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
#2856
opened Jun 17, 2026 by
jinglinglingling
Contributor
Loading…
docs(xtoken): X-Token distillation guide and README updates
Documentation
Improvements or additions to documentation
#2854
opened Jun 16, 2026 by
avenkateshha
Contributor
Loading…
fix: missing validation logging in distillation
community-request
#2847
opened Jun 16, 2026 by
odedovadia
Contributor
Loading…
2 of 4 tasks
test: add vLLM HTTP logprobs contract test for NeMo-Gym capture
CI:Lfast
Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
#2845
opened Jun 16, 2026 by
ananthsub
Contributor
Loading…
feat: add vLLM prefix cache and preemption metrics
community-request
#2843
opened Jun 16, 2026 by
puneeshkhanna
Loading…
1 of 4 tasks
test(data_plane): session-scope mooncake fixtures
CI:Lfast
Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
#2838
opened Jun 16, 2026 by
ZhiyuLi-Nvidia
Contributor
Loading…
feat: Support for dtensor ppo
CI:Lfast
Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
#2837
opened Jun 16, 2026 by
fujial-code
•
Draft
ci: Bump Megatron-Bridge to e9529c3
CI:L1
Run doctests, unit tests, and functional tests
#2835
opened Jun 16, 2026 by
svcnvidia-nemo-ci
Contributor
Loading…
feat: Support Linear CE Loss Fusion for GRPO
community-request
Documentation
Improvements or additions to documentation
waiting-on-customer
Waiting on the original author to respond
#2833
opened Jun 16, 2026 by
pengdurice
Contributor
Loading…
4 tasks done
feat: super-v3 recipe and docs
CI:L0
Run doctests and unit tests
Documentation
Improvements or additions to documentation
super-v3
#2829
opened Jun 15, 2026 by
macandro96
Contributor
Loading…
4 tasks
feat: async checkpointing for Megatron policy workers
CI:Lfast
Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
#2828
opened Jun 15, 2026 by
ananthsub
Contributor
Loading…
2 of 4 tasks
perf: reduce srun overhead in ray.sub and gate driver on sandbox readiness
CI:Lfast
Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
#2827
opened Jun 15, 2026 by
ananthsub
Contributor
Loading…
4 tasks
feat(ppo): in-model value head for Megatron PPO
CI:L1
Run doctests, unit tests, and functional tests
#2825
opened Jun 15, 2026 by
bg51717
Contributor
Loading…
3 of 4 tasks
feat: video + audio understanding GRPO training recipe
CI:L1
Run doctests, unit tests, and functional tests
Documentation
Improvements or additions to documentation
#2823
opened Jun 15, 2026 by
yuekaizhang
Contributor
Loading…
fix: avoid duplicating assistant content in multi-turn reasoning templates
community-request
#2822
opened Jun 15, 2026 by
bzantium
Loading…
3 of 4 tasks
feat: full Kubernetes multi-node support (Kubeflow PyTorchJob + KubeRay)
community-request
Documentation
Improvements or additions to documentation
#2818
opened Jun 15, 2026 by
dafu-wu
Loading…
feat: NCCL-Xfer refit merge PR
CI:Lfast
Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
Performance
Related to improving performance
#2808
opened Jun 15, 2026 by
youngeunkwon0405
Contributor
Loading…
4 tasks
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.