Skip to content

Pull requests: NVIDIA/TransformerEngine

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[Common] Persistent Grouped NVFP4 quantization kernel
#2743 opened Mar 6, 2026 by Oleg-Goncharov Loading…
8 of 13 tasks
[JAX] CGEMM + FP8
#2740 opened Mar 5, 2026 by phu0ngng Draft
13 tasks
[JAX] GEMM tex and FFI cleanup
#2739 opened Mar 5, 2026 by phu0ngng Loading…
6 of 13 tasks
[Common] Persistent Grouped MXFP8 quantization kernel enhancement New feature or request MoE
#2738 opened Mar 5, 2026 by Oleg-Goncharov Loading…
9 of 13 tasks
Feat/cp nvshmem enhanced community-contribution PRs from external contributor outside the core maintainers, representing community-driven work.
#2737 opened Mar 5, 2026 by Knight-of-Thunder Loading…
13 tasks
[PyTorch debug] Fix issue with tp_group=None
#2733 opened Mar 4, 2026 by pggPL Loading…
8 of 13 tasks
Feature/unswizzle community-contribution PRs from external contributor outside the core maintainers, representing community-driven work.
#2732 opened Mar 4, 2026 by int-smart Loading…
9 of 13 tasks
fix: scope get_full_cu_seqlens cache key by device and inference mode
#2728 opened Mar 3, 2026 by DmCarpe93 Loading…
8 of 13 tasks
[CI] Refactor CI build on GitHub
#2723 opened Mar 2, 2026 by ptrendx Draft
1 of 13 tasks
[Common, pyTorch] Grouped MXFP8 dequantize support
#2722 opened Mar 2, 2026 by ptrendx Draft
1 of 13 tasks
Add MXFP8 attention
#2719 opened Mar 1, 2026 by cyanguwa Draft
13 tasks
Add DCP compatibility for FSDP2-TP sharding in TransformerEngine.
#2713 opened Feb 26, 2026 by cspades Loading…
3 of 13 tasks
Enable dequantization from MXFP8 tensor with only columnwise data
#2712 opened Feb 26, 2026 by ptrendx Loading…
13 tasks
[Draft] Newton-Schulz via cuSOLVERMp
#2706 opened Feb 25, 2026 by vcherepanov-nv Loading…
6 of 13 tasks
[All] Added better error messages
#2705 opened Feb 25, 2026 by ptrendx Loading…
[Draft][PyTorch] torch.compile support for TE Linear
#2701 opened Feb 24, 2026 by pggPL Draft
13 tasks
[PyTorch] Zero-initialize learnable softmax_offset in DotProductAttention
#2694 opened Feb 20, 2026 by fjosw Loading…
7 of 13 tasks
[PyTorch] Error out if constructing LayerNormLinear with row tensor parallelism bug Something isn't working
#2688 opened Feb 17, 2026 by timmoon10 Loading…
6 of 13 tasks
ProTip! Filter pull requests by the default branch with base:main.