[AMD] Default Qwen3.5 architectures to text-only path #1

Open
lishuoshuo-amd wants to merge 1 commit into main from amd/qwen35-disable-mm

@lishuoshuo-amd lishuoshuo-amd commented May 18, 2026

Motivation

Qwen3.5 text architectures include a vision_config in their Hugging Face config. With the current automatic multimodal detection, Qwen3_5ForConditionalGeneration and Qwen3_5MoeForConditionalGeneration default to the multimodal serving path even when users are serving them as text-only models.
This can trigger unnecessary multimodal initialization in text-only deployments. Users who need multimodal serving can still opt in explicitly with --enable-multimodal.

Modifications

  • Add Qwen3_5ForConditionalGeneration and Qwen3_5MoeForConditionalGeneration to mm_disabled_models.
  • Add a unit test covering the default text-only behavior and explicit multimodal opt-in.

Test plan

PYTHONPATH=python python3 -m pytest test/registered/unit/configs/test_qwen3_5_model_config.py -q

@lishuoshuo-amd lishuoshuo-amd changed the title from "[AMD] Disable multimodal auto-detection for text-only Qwen3.5" to "[AMD] Default Qwen3.5 architectures to text-only path" May 18, 2026
@lishuoshuo-amd lishuoshuo-amd force-pushed the amd/qwen35-disable-mm branch from 514b9fb to b50fcc7 May 18, 2026 12:20
Default Qwen3.5 text architectures to the text-only path so their vision_config does not trigger multimodal initialization unless users explicitly opt in.
@lishuoshuo-amd lishuoshuo-amd force-pushed the amd/qwen35-disable-mm branch from b50fcc7 to 7aee0b7 May 18, 2026 12:21