Update MiniMax M2.5 H200 recipe by anish-shanbhag · Pull Request #474 · vllm-project/recipes

anish-shanbhag · 2026-05-18T19:36:39Z

Summary: Mark MiniMax-M2.5 verified on H200 and align the recipe with SemiAnalysisAI/InferenceX#1354. Pin vLLM image/min version to v0.20.2 and add the FP8 KV cache, FlashInfer attention/autotune, Triton MoE, and MiniMax QK norm fusion settings.

vercel · 2026-05-18T19:36:44Z

The latest updates on your projects. Learn more about Vercel for GitHub.

Project	Deployment	Actions	Updated (UTC)
vllm-recipes	Ready	Preview, Comment	May 18, 2026 7:47pm

gemini-code-assist

Code Review

This pull request updates the MiniMax-M2.5 model configuration, bumping the vLLM version to 0.20.2, adding H200 hardware support, and introducing performance-optimizing environment variables and hardware overrides for Hopper. Feedback indicates that the Docker usage example in the guide should be updated to include the new environment variables via '-e' flags to ensure consistency with the manual execution instructions.

Signed-off-by: Anish Shanbhag <ashanbhag@nvidia.com>

esmeetu · 2026-05-20T02:02:13Z

LGTM. Thanks!

vercel Bot deployed to Preview May 18, 2026 19:37 View deployment

gemini-code-assist Bot reviewed May 18, 2026

View reviewed changes

Comment thread models/MiniMaxAI/MiniMax-M2.5.yaml

Update MiniMax M2.5 H200 recipe

0bd348c

Signed-off-by: Anish Shanbhag <ashanbhag@nvidia.com>

anish-shanbhag force-pushed the codex/minimax-m25-h200-recipe branch from f3ab194 to 0bd348c Compare May 18, 2026 19:46

vercel Bot deployed to Preview May 18, 2026 19:47 View deployment

anish-shanbhag marked this pull request as ready for review May 18, 2026 20:37

anish-shanbhag mentioned this pull request May 18, 2026

Update MiniMax M2.5 FP8 H200 vLLM agg recipes SemiAnalysisAI/InferenceX#1354

Merged

esmeetu merged commit b3f013c into vllm-project:main May 20, 2026
4 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update MiniMax M2.5 H200 recipe#474

Update MiniMax M2.5 H200 recipe#474
esmeetu merged 1 commit into
vllm-project:mainfrom
anish-shanbhag:codex/minimax-m25-h200-recipe

anish-shanbhag commented May 18, 2026 •

edited

Loading

Uh oh!

vercel Bot commented May 18, 2026 •

edited

Loading

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

Uh oh!

esmeetu commented May 20, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

anish-shanbhag commented May 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

vercel Bot commented May 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

esmeetu commented May 20, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

anish-shanbhag commented May 18, 2026 •

edited

Loading

vercel Bot commented May 18, 2026 •

edited

Loading