[AMD] add mori blog by billishyahao · Pull Request #336 · lm-sys/lm-sys.github.io

billishyahao · 2026-05-26T08:10:00Z

Add mori sglang blog
cc @HaiShaw @Duyi-Wang

HaiShaw

LGTM

HaiShaw · 2026-05-26T08:20:09Z

@merrymercy and @wisclmy0611 please help review.

functionstackx · 2026-05-26T15:24:22Z

Nice blog, though one thing is missing: the setup fails simple grade-school math. @billishyahao and @HaiShaw have root-caused this to the use of a direct-cast FP8 EP combine instead of a quantization-aware FP8 EP combine. You probably want to fix that before you publish.

functionstackx · 2026-05-26T16:03:40Z

I would recommend shipping the fix into the production InferenceX repo first, and then redoing the screenshots before publishing.

billishyahao · 2026-05-26T16:04:59Z

Hi @functionstackx, thanks for the reminder. We fixed it by adding FP8 blockwise support to mori (ROCm/mori#311) together with an sglang-side change (sgl-project/sglang#24879), and we use this new function to resolve the accuracy issue in SemiAnalysisAI/InferenceX#1566.
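For readers following the thread, here is a minimal sketch of why a direct cast to FP8 can lose accuracy where a blockwise-scaled cast does not. It is not the mori or sglang implementation: `round_e4m3`, `direct_cast`, `blockwise_cast`, the block size of 128, the saturating behavior, and the sample data are all assumptions made for illustration, and the FP8 format is simulated in plain Python as E4M3 with a maximum of 448.

```python
import math

E4M3_MAX = 448.0    # largest finite value assumed for FP8 E4M3
E4M3_MIN_EXP = -6   # exponent of the smallest normal value
MANTISSA_BITS = 3


def round_e4m3(x: float) -> float:
    """Simulate a saturating cast of x to FP8 E4M3 and back to float."""
    if x == 0.0:
        return 0.0
    a = min(abs(x), E4M3_MAX)
    _, e = math.frexp(a)                  # a = m * 2**e with 0.5 <= m < 1
    exp = max(e - 1, E4M3_MIN_EXP)        # values below 2**-6 are subnormal
    step = 2.0 ** (exp - MANTISSA_BITS)   # spacing of representable values
    q = min(round(a / step) * step, E4M3_MAX)
    return math.copysign(q, x)


def direct_cast(values):
    """Cast each value to FP8 with no scaling (hypothetical helper)."""
    return [round_e4m3(v) for v in values]


def blockwise_cast(values, block_size=128):
    """Scale each block so its max maps to E4M3_MAX, cast, then rescale."""
    out = []
    for start in range(0, len(values), block_size):
        block = values[start:start + block_size]
        amax = max(abs(v) for v in block)
        scale = amax / E4M3_MAX if amax > 0 else 1.0
        out.extend(round_e4m3(v / scale) * scale for v in block)
    return out


def mean_abs_error(ref, approx):
    return sum(abs(r - a) for r, a in zip(ref, approx)) / len(ref)


# Illustrative small-magnitude data: 1e-4 up to 1.28e-2.
values = [(i + 1) * 1e-4 for i in range(128)]
direct_err = mean_abs_error(values, direct_cast(values))
blockwise_err = mean_abs_error(values, blockwise_cast(values))
print(f"direct cast mean abs error:    {direct_err:.3e}")
print(f"blockwise cast mean abs error: {blockwise_err:.3e}")
```

In this toy setup the small values fall into the FP8 subnormal range under a direct cast, so they are rounded on a coarse fixed grid or flushed to zero, while the per-block scale moves them into the normal range first. The cost is the extra scale computation and the scale factors that must travel with the data.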

functionstackx · 2026-05-26T16:50:16Z

+<img src="/images/blog/mori/curve1.png"
+     style="display: block; margin: 20px auto 0; width: 75%; max-width: 100%; height: auto;">
+
+*Figure 2: Full pareto curve — throughput vs interactivity for AMD Instinct™ MI355X and B200 configurations*


These are the curves using direct-cast FP8 EP combine, where GSM8K accuracy is not good. The FP8 blockwise EP combine probably performs slightly worse than direct-cast FP8.

I would recommend shipping the InferenceX change that fixes accuracy and then updating the curve.

[AMD] add mori blog

9e7d4aa

HaiShaw approved these changes May 26, 2026

rename

a5ba4e2

functionstackx reviewed May 26, 2026

fix author

fbf12ef

billishyahao wants to merge 3 commits into lm-sys:main from billishyahao:add_mori_blog

3 participants
