Pull requests: NVIDIA/TensorRT-LLM

Pull requests list

[None][feat] Add bf16 trtllm moe through flashinfer.
#13689 opened May 1, 2026 by nv-guomingz Collaborator Draft
1 task done
[TRTLLM-12432][perf] ltx2: drop redundant pe all-gather in AV cross-attention
#13687 opened May 1, 2026 by luyiyun1021 Collaborator
1 task done
[None][chore] KV Cache Transceiver Profiling Configs
#13681 opened Apr 30, 2026 by ekou24 Collaborator
1 task
[https://nvbugs/5911304][fix] Add URL validation for media input loading
#13680 opened Apr 30, 2026 by yibinl-nvidia Collaborator
1 task done
Eg/ad mla chunked prefill loop
#13677 opened Apr 30, 2026 by MrGeva Collaborator Draft
1 task
[None][fix] Fix GPT-OSS KV-aware router hashing
#13675 opened Apr 30, 2026 by SimengLiu-nv Collaborator
1 task done
[None][perf] Improve TRTLLM MoE autotune in DEP
#13667 opened Apr 30, 2026 by rosenrodt Collaborator
1 task done
[None][chore] Refactor attention forward context
#13662 opened Apr 30, 2026 by yuxianq Collaborator Draft
1 task done
Disagg KV transfer hardening (rebased onto v1.3.0rc13) (label: Community want to contribute)
#13661 opened Apr 30, 2026 by yifjiang Contributor Draft
1 of 3 tasks
[None][test] add Nemotron Ultra V3 AutoDeploy accuracy test
#13658 opened Apr 30, 2026 by tcherckez-nvidia Collaborator
1 task done
[None][feat] Add more disagg conversation ID headers support
#13656 opened Apr 30, 2026 by reasonsolo Collaborator
1 task done
Disagg KV transfer hardening (rebased onto v1.3.0rc14)
#13655 opened Apr 30, 2026 by yifjiang Contributor Draft
1 of 3 tasks