Skip to content

Automated merge - qualcomm/Startup-Demos:feature_rel_20260617_210121 -> qualcomm/Startup-Demos:main#293

Closed
Jethin Sekhar R (smartcoder00) wants to merge 3 commits into
mainfrom
feature_rel_20260617_210121
Closed

Automated merge - qualcomm/Startup-Demos:feature_rel_20260617_210121 -> qualcomm/Startup-Demos:main#293
Jethin Sekhar R (smartcoder00) wants to merge 3 commits into
mainfrom
feature_rel_20260617_210121

Conversation

@smartcoder00

Copy link
Copy Markdown
Contributor

Automated merge via script.

Ryan, Chu (ZGarfield) and others added 3 commits June 4, 2026 16:18
- Added LLM serving benchmark using vLLM on Qualcomm AIC100 (QAIC)
- Implemented multi-endpoint setup with device-level mapping (1:1 endpoint to QAIC device)
- Enabled concurrency scaling tests across multiple decode workloads (32 / 128 / 512 tokens)
- Collected and analyzed throughput (TPS), tokens/sec, latency, and failure behavior
- Demonstrated balanced multi-device utilization via per-endpoint performance analysis

Signed-off-by: Chu(Temp), Ryan <ryachu@qti.qualcomm.com>
- Added LLM serving benchmark using vLLM on Qualcomm AIC100 (QAIC)
- Implemented multi-endpoint setup with device-level mapping (1:1
endpoint to QAIC device)
- Enabled concurrency scaling tests across multiple decode workloads (32
/ 128 / 512 tokens)
- Collected and analyzed throughput (TPS), tokens/sec, latency, and
failure behavior
- Demonstrated balanced multi-device utilization via per-endpoint
performance analysis
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants