Skip to content

fix(connectordiscoverymcp): wire /healthz readiness probe + ALB HC annotation#174

Merged
sam-at-luther merged 2 commits into
mainfrom
fix/connectordiscoverymcp-healthcheck
May 26, 2026
Merged

fix(connectordiscoverymcp): wire /healthz readiness probe + ALB HC annotation#174
sam-at-luther merged 2 commits into
mainfrom
fix/connectordiscoverymcp-healthcheck

Conversation

@sam-at-luther
Copy link
Copy Markdown
Member

@sam-at-luther sam-at-luther commented May 26, 2026

Summary

Completes the work started in #173 for the last laggard target group, connectordiscoverymcp.

  • Adds readinessProbe + livenessProbe on the container, both hitting GET /healthz on the http (8000) port. Previously there were no probes at all, so K8s reported the pod ready immediately on container start — masking a misconfigured / crashing supergateway.
  • Adds the standard ALB health-check annotation block to the Service (same eight annotations and values as the sibling subcharts mars#173 wired up).

Pairs with luthersystems/agent-builder#47 which adds --healthEndpoint /healthz to the supergateway CMD in agent-builder/connector-mcp/Dockerfile. That PR is what actually makes luthersystems/connector-mcp-discovery:latest serve GET /healthz → 200 "ok".

Closes #172.

DO NOT MERGE until upstream image ships

This PR must not be merged until:

  1. The agent-builder companion PR is merged, AND
  2. The new luthersystems/connector-mcp-discovery:latest image is live on Docker Hub.

If merged before that, the next umbrella deploy will roll a pod where the supergateway does not serve /healthz, the new readinessProbe will fail, the pod will never become Ready, and the ALB will pull traffic — taking the connector-discovery MCP surface down.

Rendered locally via helm template to confirm both probe blocks and the annotation block come out with the right indentation.

Test plan

  • Verify agent-builder companion PR merged + new :latest image timestamp on Docker Hub
  • Merge this PR
  • Cut mars release (next minor, probably v0.107.0)
  • Bump .mars-version in ui-infrastructure → deploy umbrella on test
  • aws elbv2 describe-target-health on k8s-ui-umbrella-9fb257ad5f → all targets healthy
  • kubectl -n ui exec deploy/umbrella-connectordiscoverymcp -- python3 -c 'import urllib.request; print(urllib.request.urlopen("http://localhost:8000/healthz", timeout=2).read())'b'ok'

…ng at /healthz

The chart had no probes at all, so K8s reported the pod ready based on
container start alone — which meant a misconfigured / crashing supergateway
went undetected. Add readiness + liveness against /healthz (newly exposed
by the connector-mcp image, see luthersystems/agent-builder PR companion).

Closes the remaining part of #172.
…ervice

Mirrors the pattern established in #173 for the other umbrella subcharts.
ALB now probes /healthz (served by supergateway after the agent-builder
companion PR ships :latest), flipping this TG to healthy.
@sam-at-luther sam-at-luther merged commit d239bef into main May 26, 2026
4 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

ALB TGs unhealthy with Target.ResponseCodeMismatch: subcharts don't expose ALB health-check annotations on their Services

1 participant