Skip to content

[gpt-oss-120b] Add --enable_speculative_decode flag for Interactive scenario#2582

Open
viraatc wants to merge 1 commit into
mlcommons:masterfrom
viraatc:feat/viraatc-gptoss-mtp
Open

[gpt-oss-120b] Add --enable_speculative_decode flag for Interactive scenario#2582
viraatc wants to merge 1 commit into
mlcommons:masterfrom
viraatc:feat/viraatc-gptoss-mtp

Conversation

@viraatc

@viraatc viraatc commented Apr 22, 2026

Copy link
Copy Markdown
Contributor

Summary

  • Adds an opt-in --enable_speculative_decode flag to language/gpt-oss-120b/sglang/run_server.sh, mirroring the flag already present on the DeepSeek-R1 SGLang backend.
  • Wires the flag up to EAGLE3 speculative decoding using defaults tuned for the gpt-oss-120b Interactive scenario on GB200: draft model nvidia/gpt-oss-120b-Eagle3-long-context, num_steps=3, topk=1. Each of these can be overridden (--speculative_draft_model_path, --speculative_num_steps, --speculative_topk).
  • Documents the new flag under the SGLang "Run the server" section in language/gpt-oss-120b/README.md.

Backward compatible: the pre-existing --eagle_path path takes precedence over the new defaults, and when --enable_speculative_decode is not passed the server command is unchanged.

@viraatc viraatc requested a review from a team as a code owner April 22, 2026 16:22
@github-actions

github-actions Bot commented Apr 22, 2026

Copy link
Copy Markdown
Contributor

MLCommons CLA bot All contributors have signed the MLCommons CLA ✍️ ✅

…cenario

Adds an opt-in EAGLE3 speculative decoding config path to run_server.sh,
mirroring the --enable_speculative_decode flag in DeepSeek-R1. Defaults are
tuned for the gpt-oss-120b Interactive scenario on GB200 using the
nvidia/gpt-oss-120b-Eagle3-long-context head (num_steps=3, topk=1).

Preserves backward compatibility: existing --eagle_path callers are
unchanged, and the default server command still matches the previous
behavior when the flag is not passed.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
@viraatc viraatc force-pushed the feat/viraatc-gptoss-mtp branch from 40d927c to d2f48dc Compare April 22, 2026 16:23
@viraatc

viraatc commented May 13, 2026

Copy link
Copy Markdown
Contributor Author

@viraatc

viraatc commented May 19, 2026

Copy link
Copy Markdown
Contributor Author

@viraatc

viraatc commented May 19, 2026

Copy link
Copy Markdown
Contributor Author

looks like the gitleaks license is missing? repo setup issue?

missing gitleaks license. Go grab one at gitleaks.io and store it as a GitHub Secret named GITLEAKS_LICENSE. For more info about the recent breaking update, see [here](https://github.com/gitleaks/gitleaks-action#-announcement).

@hanyunfan

Copy link
Copy Markdown
Contributor

looks like the gitleaks license is missing? repo setup issue?

missing gitleaks license. Go grab one at gitleaks.io and store it as a GitHub Secret named GITLEAKS_LICENSE. For more info about the recent breaking update, see [here](https://github.com/gitleaks/gitleaks-action#-announcement).

Is this PR ready for review and approval? Also, what are the next steps to resolve the Gitleaks license issue mentioned above? @viraatc

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants