Skip to content

Pull requests: NVIDIA/Megatron-LM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Multimodal: add load weights only
#3452 opened Feb 17, 2026 by faradawn Loading…
6 tasks
Multimodal: fix argument checking
#3449 opened Feb 17, 2026 by faradawn Loading…
6 tasks
Core 0.16
Multimodal: Limit transformer version in Dockerfile
#3448 opened Feb 17, 2026 by faradawn Loading…
6 tasks
Log RL metrics per environment
#3446 opened Feb 16, 2026 by yobibyte Loading…
Fix slowdown in inference flask server
#3445 opened Feb 16, 2026 by tdene Loading…
6 tasks
Core 0.16
ci(fix): Weekly GPT tests
#3443 opened Feb 16, 2026 by ko3n1g Loading…
6 tasks
Core 0.16
update HF tokenizer defaults
#3440 opened Feb 16, 2026 by dimapihtar Loading…
6 tasks
Core 0.16
fix traceback when interrupting run
#3439 opened Feb 16, 2026 by dimapihtar Draft
6 tasks
ci: Update release-docs workflow to use FW-CI-templates v0.72.0 core_r0.16.0 Cherry-pick label for core_r0.16.0 release branch
#3438 opened Feb 15, 2026 by chtruong814 Loading…
6 tasks
Core 0.16
Clean up logging inside inference flask server
#3437 opened Feb 15, 2026 by tdene Loading…
6 tasks
Core 0.16
Create a Protocol for the MLP layer of TransformerLayer community-request complexity: medium Expert Review Apply this label to indicate that your PR is ready for expert review.
#3435 opened Feb 15, 2026 by nschank Loading…
2 of 6 tasks
Core 0.16
Use Protocols to type-check linear_proj submodules of Attention community-request complexity: medium Expert Review Apply this label to indicate that your PR is ready for expert review.
#3434 opened Feb 15, 2026 by nschank Loading…
2 of 6 tasks
Core 0.16
Enable CUDA graph for ADAM optimizer
#3429 opened Feb 14, 2026 by vasunvidia Loading…
6 tasks
Migrate MoeLayer submodules from ModuleSpec to Protocols community-request complexity: medium Expert Review Apply this label to indicate that your PR is ready for expert review.
#3426 opened Feb 14, 2026 by nschank Loading…
2 of 6 tasks
Core 0.16
Ensure type-checker understands use of Submodules in unit tests community-request needs-follow-up Issue needs follow-up
#3425 opened Feb 13, 2026 by nschank Loading…
2 of 6 tasks
Add single-process checkpoint save to avoid forked multiprocessing
#3424 opened Feb 13, 2026 by sbak5 Loading…
6 tasks
Fix --tokenizer-hf-include-special-tokens Expert Review Apply this label to indicate that your PR is ready for expert review. Run functional tests Run tests
#3422 opened Feb 13, 2026 by jon-barker Loading…
2 of 6 tasks
Core 0.16
Use copy_signature to preserve typing of pass-through methods community-request complexity: low Expert Review Apply this label to indicate that your PR is ready for expert review.
#3419 opened Feb 13, 2026 by nschank Loading…
2 of 6 tasks
Core 0.16
Improve Multimodal README: add pointer to Qwen3 VL example
#3418 opened Feb 13, 2026 by faradawn Loading…
1 of 6 tasks
Bug fix: add missing packages to Multimodal Dockerfile
#3417 opened Feb 13, 2026 by faradawn Loading…
1 of 6 tasks
ProTip! What’s not been updated in a month: updated:<2026-01-17.