slots(bench): benchmark 35B-A3B MoE ROCmMTP candidates on the agent slot (Crown / Chadrock / ace-saber) #636

@thinmintdev

Description

What to build

Once the agent (MoE) slot is set up (#634), benchmark the recently-added 35B-A3B MoE ROCmMTP candidates sequentially on the same tuned agent-slot config:

  1. Crown (locate exact registry id)
  2. Chadrock: chadrock3.6-35b-uncensored
  3. ace/saber: chadrock-35b-ace-saber (Qwen3.6-35B-A3B-NSC-ACE-SABER-MTP)

Record per model on a fixed prompt set: gen tok/s, TTFT, prompt-eval rate, GTT/VRAM, short quality impression. Recommend which to use primarily for the agent role. For the top 2, run thinking ON vs OFF and record the delta (tok/s + quality).
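
One way the per-prompt metrics above could be derived is from client-side timestamps. This is a minimal sketch, not the project's harness: `RunTiming`, its fields, and the helper names are all hypothetical, and the prompt-eval rate here is only an approximation that treats TTFT as prompt-processing time. If the serving stack reports its own timings, those are likely more accurate.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass
class RunTiming:
    """Client-side timing for one prompt (hypothetical structure)."""
    request_sent: float    # seconds on a monotonic clock, request dispatched
    first_token: float     # seconds, first generated token received
    last_token: float      # seconds, final generated token received
    prompt_tokens: int     # tokens in the prompt
    generated_tokens: int  # tokens generated in the response


def ttft(run: RunTiming) -> float:
    """Time to first token, in seconds."""
    return run.first_token - run.request_sent


def gen_tok_per_s(run: RunTiming) -> float:
    """Decode rate over the intervals between the first and last token."""
    decode_time = run.last_token - run.first_token
    if run.generated_tokens < 2 or decode_time <= 0:
        return 0.0
    return (run.generated_tokens - 1) / decode_time


def prompt_eval_rate(run: RunTiming) -> float:
    """Approximate prompt tok/s, assuming TTFT is dominated by prompt eval."""
    elapsed = ttft(run)
    return run.prompt_tokens / elapsed if elapsed > 0 else 0.0


def summarize(runs: list[RunTiming]) -> dict[str, float]:
    """Average each metric across the fixed prompt set for one model."""
    return {
        "ttft_s": mean(ttft(r) for r in runs),
        "gen_tok_s": mean(gen_tok_per_s(r) for r in runs),
        "prompt_tok_s": mean(prompt_eval_rate(r) for r in runs),
    }


def thinking_delta(on: dict[str, float], off: dict[str, float]) -> float:
    """tok/s difference between thinking ON and OFF (positive = ON faster)."""
    return on["gen_tok_s"] - off["gen_tok_s"]
```

Averaging per prompt rather than pooling all tokens keeps one long response from dominating the summary; pooling would be an equally defensible choice as long as it is applied to all three models.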

Acceptance criteria

  • All 3 benched sequentially on the identical agent-slot config (same ctx/-b/-ub/MTP/ngl)
  • Results table recorded to a doc (tok/s, TTFT, GTT, quality notes)
  • Primary recommendation with rationale
  • Top-2 thinking ON vs OFF comparison
  • Any load failures noted
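
The first two criteria could be checked mechanically along these lines. This is a sketch under stated assumptions: the config keys mirror the settings named above (`ctx`, `b`, `ub`, `mtp`, `ngl`), but the dict layout, the function names, and the GiB unit for GTT are illustrative choices, not something the repo defines. No benchmark numbers are implied.

```python
# Settings that must match across all benched models (names are assumptions).
AGENT_SLOT_KEYS = ("ctx", "b", "ub", "mtp", "ngl")


def config_mismatches(configs: dict[str, dict]) -> dict[str, list[str]]:
    """Compare every model's config to the first one.

    Returns {model: [keys that differ]}; an empty dict means all identical.
    """
    names = list(configs)
    reference = configs[names[0]]
    mismatches = {}
    for name in names[1:]:
        diff = [k for k in AGENT_SLOT_KEYS
                if configs[name].get(k) != reference.get(k)]
        if diff:
            mismatches[name] = diff
    return mismatches


def fmt(value, digits: int = 1) -> str:
    """Format a number, or 'n/a' when the metric is missing (e.g. load failure)."""
    return "n/a" if value is None else f"{value:.{digits}f}"


def results_table(rows: list[dict]) -> str:
    """Render benchmark rows as a Markdown table for the results doc."""
    lines = [
        "| Model | gen tok/s | TTFT (s) | GTT (GiB) | Quality notes |",
        "| --- | --- | --- | --- | --- |",
    ]
    for r in rows:
        lines.append(
            f"| {r['model']} | {fmt(r.get('gen_tok_s'))} "
            f"| {fmt(r.get('ttft_s'), 2)} | {fmt(r.get('gtt_gib'))} "
            f"| {r.get('notes', '')} |"
        )
    return "\n".join(lines)
```

A model that fails to load can still get a row: leave its metrics out and put the failure in `notes`, so the table itself records it.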

Blocked by

Metadata

Assignees

No one assigned

    Labels

    ready-for-human: Needs human implementation
    slots: Slot roles / model assignment / perf tuning
    v0.5: v0.5 scope (MCP admin + memory wiring across UI and agents)
