test(lit/vmi): add SimdVF per-token FP8 cast target IR by WenboCodes · Pull Request #487 · mouliangyu/PTOAS

WenboCodes · 2026-06-24T08:37:17Z

Summary

Add a design-reference .pto to test/lit/vmi/ capturing the target pto.vmi lowering for the SimdVF per-token cast-to-FP8 kernel.

Contents (verbatim from PTO-Gym/docs/simtvf-per-token-cast-to-fp8-pto-vmi-mapping.md):

Header comment holds the original TileLang with T.SimdVF(): source (section 1 of the doc).
Body holds the forward-looking pseudo MLIR (section 3 / AFTER) of the VMI lowering.

Why this shape

The VMI code in the source doc is explicitly design-level pseudo MLIR. It uses surface ops that are not yet implemented in the PTOAS VMI dialect:

Pseudo op in this file	Real VMI op today
`pto.vmi.vlds` / `pto.vmi.vsts`	`pto.vmi.load` / `pto.vmi.store`
`pto.vmi.pset_b32 "PAT_ALL"`	`pto.vmi.create_mask`
`pto.vmi.vreduce_max {sub_group = 2}`	none — group reductions exist only for `group_reduce_addf`/`group_reduce_addi`; there is no group max-reduce
`pto.vmi.vbr {to = 256, sub_group = 2}`	`pto.vmi.group_broadcast {num_groups = N}`
`pto.vmi.vcvt {to = ..., rnd = "R", sat = "SAT"}`	`pto.vmi.truncf`
`pto.vmi.vsts {mode = "ONE_PER_SUB_GROUP"}`	not yet implemented

So it cannot be parsed by ptoas/pto-test-opt. It is stored as a no-op (RUN: true) forward-looking spec rather than an executable regression test. See the upstream doc's section 8 ("需要 PTO/VMI surface 补齐的点") for the full list of missing surface points.

Test plan

lit test/lit/vmi/vmi_simdvf_per_token_cast_to_fp8.pto → passes as no-op (RUN: true).
No pto-test-opt execution, by design.

🤖 Generated with Claude Code

Add design-reference .pto capturing the pto.vmi lowering for the SimdVF per-token cast-to-FP8 kernel, verbatim from PTO-Gym/docs/simtvf-per-token-cast-to-fp8-pto-vmi-mapping.md. The file holds the original TileLang SimdVF source in its header comment and the forward-looking pseudo MLIR (vreduce_max{sub_group}, vbr{to,sub_group}, vcvt{to,rnd,sat}, pset_b32 "PAT_ALL", etc.) which uses surface ops not yet implemented in the VMI dialect. It is a no-op (RUN: true) spec, not an executable regression test. Co-Authored-By: Claude <noreply@anthropic.com>

mouliangyu and others added 27 commits June 24, 2026 09:13

feat: first stage of vmi

e312159

feat: support num_groups layout

ab6bc04

feat: new layout-lowering design

9d63f30

Add VMI layout assignment lowering coverage

50bffab

Support S32 partial grouped mask lowering

d225422

Support dynamic S32 grouped mask lowering

bd18dc4

Clarify VMI layout case coverage gaps

fcf1096

Record VMI layout coverage audit

6f04810

Add dynamic S32 group mask runtime coverage

4b3d5be

Detail VMI layout assignment request rules

604fd50

Complete VMI layout request builder coverage

e353ae0

Inline private VMI physical helpers before VPTO emission

cf9a04d

Validate required VMI selected plans

e96ba6c

Document VMI layout closure matrix

c1e74fb

Add VMI dense reduce multi-consumer case

067f699

Remove VMI selected plan attrs

e550b80

Implement VMI layout optimization pipeline

bb88c2c

Support multi-chunk VMI group reduce slots

7686028

Implement typed VMI group reduce lowering

58787c2

Implement VMI layout support lowering

221f02e

Support partial packed VMI group slots

85a98cb

Support arith select in VPTO LLVM lowering

c9604ad

Add VMI introduction design doc

fd5fc11

Fold deinterleaved VMI loads through vldsx2

46942f0

Document VMI layout assignment mechanism

910a2a9

Illustrate VMI layout equivalence classes

31abc14

mouliangyu force-pushed the feature-vmi branch from 1b67f26 to 87fc3fa Compare July 2, 2026 08:31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

test(lit/vmi): add SimdVF per-token FP8 cast target IR#487

test(lit/vmi): add SimdVF per-token FP8 cast target IR#487
WenboCodes wants to merge 27 commits into
mouliangyu:feature-vmifrom
WenboCodes:simdvf-per-token-fp8-vmi-lit-test

WenboCodes commented Jun 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

WenboCodes commented Jun 24, 2026

Summary

Why this shape

Test plan

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants