Gemma4 vllm fixes by RobMulla · Pull Request #227 · AI-Hypercomputer/tpu-recipes

RobMulla · 2026-04-20T20:53:15Z

Fixes the Gemma 4 vLLM recipes for Ironwood (v7x) based on feedback.

Key Changes:

Single-chip configuration: Updated gemma4-server.yaml to use tensor-parallel-size=1 and 1x1x1 topology for a single-chip v7x deployment.
Documentation fixes: Fixed the manifest filename in README.md and added the required namespace to the secret creation command.

Signed-off-by: Rob Mulla <rob.mulla@gmail.com>

karan · 2026-04-20T22:44:31Z

        - --seed=42
        # TODO: Update tensor-parallel-size to match the number of chips in your topology
-        - --tensor-parallel-size=8
+        - --tensor-parallel-size=1


Did you mean to set TP=1 (replicated) and not TP=2 (sharded)?

hosseinsarshar · 2026-04-20T23:19:59Z

      nodeSelector:
        cloud.google.com/gke-tpu-accelerator: tpu7x
-        cloud.google.com/gke-tpu-topology: 2x2x1
+        cloud.google.com/gke-tpu-topology: 1x1x1


This is going to cause an issue given the nodepool is creating a 2x2x1:

    gcloud container node-pools create ${NODEPOOL_NAME} \
      --project=${PROJECT_ID} \
      --location=${REGION} \
      --node-locations=${ZONE} \
      --num-nodes=1 \
      --machine-type=tpu7x-standard-4t \
      --cluster=${CLUSTER_NAME}

Also, 1x1x1 is not readily available to third parties (3Ps); the GKE team is still testing it for external use.
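To make the two review concerns concrete (the manifest topology not matching the node pool, and the TP value not matching the device count), here is a minimal sketch of a consistency check. It is not part of the recipe. The function names are hypothetical, and the figure of 2 devices per v7x chip is an assumption inferred from the "4 chips with TP=8" configuration in this PR, not taken from official documentation. It also assumes the intent is to shard across every device; a replicated setup would legitimately use a smaller TP.

```python
from math import prod

# Assumption: each v7x chip exposes 2 devices to vLLM, inferred from the
# "4 chips with TP=8" configuration in this PR. Adjust if that is wrong.
ASSUMED_DEVICES_PER_CHIP = 2


def chips_in_topology(topology: str) -> int:
    """Return the chip count for a topology string such as '2x2x1'."""
    return prod(int(dim) for dim in topology.lower().split("x"))


def expected_tensor_parallel_size(
    topology: str, devices_per_chip: int = ASSUMED_DEVICES_PER_CHIP
) -> int:
    """TP value that shards the model across every device in the topology."""
    return chips_in_topology(topology) * devices_per_chip


def check_recipe(
    manifest_topology: str,
    nodepool_topology: str,
    tensor_parallel_size: int,
    devices_per_chip: int = ASSUMED_DEVICES_PER_CHIP,
) -> list[str]:
    """Return a list of human-readable problems; an empty list means consistent."""
    problems = []
    # The pod's nodeSelector topology must match what the node pool provides,
    # otherwise the pod cannot be scheduled onto it.
    if manifest_topology != nodepool_topology:
        problems.append(
            f"manifest topology {manifest_topology} does not match "
            f"node pool topology {nodepool_topology}"
        )
    # Flag a TP value that does not cover every device (fully sharded case).
    expected = expected_tensor_parallel_size(manifest_topology, devices_per_chip)
    if tensor_parallel_size != expected:
        problems.append(
            f"tensor-parallel-size={tensor_parallel_size}, but "
            f"{manifest_topology} has {expected} devices under the "
            f"assumed {devices_per_chip} devices per chip"
        )
    return problems
```

Under these assumptions, `check_recipe("1x1x1", "2x2x1", 1)` reports both the topology mismatch and the TP question raised above, while `check_recipe("2x2x1", "2x2x1", 8)` returns an empty list.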

RobMulla added 3 commits April 20, 2026 16:52

Fix Gemma4 vLLM recipe for single chip v7x and fix documentation

5390810

Signed-off-by: Rob Mulla <rob.mulla@gmail.com>

Merge remote-tracking branch 'origin/main' into gemma4-vllm-fixes

e7cbfa5

Signed-off-by: Rob Mulla <rob.mulla@gmail.com>

Simplify Gemma4 vLLM recipe to use default namespace

58dac49

Signed-off-by: Rob Mulla <rob.mulla@gmail.com>

karan approved these changes Apr 20, 2026

karan reviewed Apr 20, 2026

hosseinsarshar reviewed Apr 20, 2026

Revert Gemma 4 recipe to use 4 chips with TP=8 as verified

eaa3dc5

Signed-off-by: Rob Mulla <rob.mulla@gmail.com>

RobMulla force-pushed the gemma4-vllm-fixes branch from 6eb5da4 to eaa3dc5 on April 21, 2026 19:07

RobMulla wants to merge 4 commits into AI-Hypercomputer:main from RobMulla:gemma4-vllm-fixes
