chokevin/README.md

Hi, I'm Kevin Cho

I'm an AI infrastructure engineer focused on Kubernetes, GPU systems, and developer tooling for reliable AI workloads. My work spans Azure Kubernetes Service (AKS), distributed training and inference infrastructure, scheduling systems, runtime performance, and agent-assisted engineering workflows.

I specialize in turning complex research and platform ideas into practical systems that can be tested, operated, and improved in real environments. I bring a systems-level perspective across cloud infrastructure, ML runtimes, and developer experience, with an emphasis on reliability, observability, and repeatable engineering practices.

What I'm focused on

  • GPU orchestration and scheduling for AI workloads
  • Kubernetes-native infrastructure for distributed training and inference
  • Runtime performance, reliability, and production readiness
  • Developer tools and agent workflows that make engineering teams more effective

Background

  • Current role: Member of the Azure Kubernetes Service (AKS) team at Microsoft
  • Previous: Snap Inc. — Content Infrastructure
  • Previous: Amazon — AWS Lex

Connect

LinkedIn

Pinned

  1. kstack (Public)

    Personal AI software factory — slash-command skills for Copilot CLI (and friends). Inspired by gstack.

    Shell 1

  2. swordfish (Public)

    GPU profiling and kernel-performance lab for A100/H100/H200 evidence, NCU/NSYS/torch traces, and upstream inference-kernel contributions.

    Python