Skip to content

nanzhipro/karpathy-wiki

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

19 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

karpathy-wiki

English · 简体中文

Wiki Skill

A source-driven knowledge base distilled from Andrej Karpathy's public corpus — X posts, interviews, talks, open-source repos, and self-bio. The goal isn't to archive links; it's to understand the person — how he thinks, how he learns, how he works, what he believes, what he's built, and what he's saying about AI and software engineering.

karpathy-wiki cover

The repo follows the pattern Karpathy himself sketches in LLM Knowledge Bases: raw material goes into raw/ (read-only), an LLM compiles it into wiki/ (read-write, refactored freely), and the compiled wiki becomes the substrate for future queries and ingests.

Important

Enter through wiki/, not raw/. wiki/overview.md is the highest-compression synthesis; wiki/index.md is the full catalog.


Quick Paths

I want to… Start here
See the whole synthesis wiki/overview.md
Read Karpathy's bio and body of work wiki/entities/andrej-karpathy.md
Browse the full catalog wiki/index.md
Understand the maintenance protocol CLAUDE.md
Read in Obsidian Open this directory as a vault

1. Meet the Person

"I like to train deep neural nets on large datasets 🧠🤖💥" — karpathy.ai self-intro

Karpathy is one of the most paradigm-shaping individuals in modern AI. He doesn't ship the biggest models; he gives us the vocabulary to reason about them (Software 3.0, people spirits, animals vs ghosts, cognitive core, march of nines…). Researcher, engineer, teacher, public thinker — four hats at once, which almost nobody else wears simultaneously.

Career arc (full table):

  • 2005–2015: Toronto (Hinton's course) → UBC → Stanford (under Fei-Fei Li)
  • 2011 / 2013 / 2015: Google Brain → Google Research → DeepMind internships
  • 2015–2017: OpenAI founding member
  • 2017–2022: Tesla Director of AI, leading the Autopilot vision team
  • 2023–2024: Back to OpenAI, working on midtraining and synthetic data
  • 2024– : Independent educator; founded Eureka

Two biographical details that explain a surprising amount of the later corpus:

  • Physics / hardware lens. Toronto was CS + physics + math, and in 2025 he says one of his undergraduate mistakes was over-focusing on the mathematical lens of computing and under-focusing on the physical one: energy, data locality, parallelism, architecture. That helps explain why llm.c, CUDA-kernel talk, and systems-level intuitions keep resurfacing. andrej-karpathy · 2025 misc
  • "Reference human for ImageNet." Before the LLM-era frames, he already carried unusually high credibility in computer vision; the GPU MODE host intro calling him "the reference human for ImageNet" is a revealing shorthand for how he was perceived. It helps explain why later claims on evals, self-driving, and perception feel grounded rather than purely philosophical. andrej-karpathy · gpu-mode-irl-2024-keynote

What he's left behind:

  • CS231n — Stanford's first deep-learning course, which funneled a generation into DL (150 students in 2015 → 750 in 2017)
  • Zero to Hero — the most watched from-scratch neural-net series on YouTube
  • nanoGPT / nanochat / micrograd / llm.c / microGPT — a ladder of minimal, legible, high-fidelity teaching code
  • Tesla FSD — the 2026-01-01 coast-to-coast run (2,732 miles, zero interventions) is the cleanest public proof of the Software 2.0 bet

2. How He Thinks

Five observable traits:

1. Frame-first

He coins a word, then uses it as reasoning scaffolding. The term isn't a label applied after the fact — it's the analysis tool. Each coinage comes packaged with its own analogies, counterexamples, and evolutionary edges. This is the technical core of his influence.

Full coinage list in andrej-karpathy.md. The load-bearing ones:

Frame Question it answers
Software 1.0/2.0/3.0 How does software get written?
Animals vs Ghosts What kind of thing is an LLM?
People Spirits What are an LLM's psychological quirks?
March of Nines Why is autonomy — self-driving, agents — so slow?
Verifiability Which tasks can AI actually automate?
Cognitive Core What's the "right" model size?
Jagged Intelligence Why is an LLM genius and moron by turns?
Verification Gap Where does agentic coding bottleneck?

2. Analogy-driven

  • Tesla / Waymo autonomy curve → the coming arc of coding agents
  • Memento / 50 First Dates → LLM session-level amnesia
  • Rain Man → savant memory paired with cognitive gaps
  • Horizontal gene transfer in bacteria → the portability of bacterial code
  • Iron Man suit vs Iron Man robot → augmentation over automation

Analogies aren't ornament — they're how he compresses mechanism-level intuition.

3. Slopes, not points

He says it repeatedly: "5–10x more pessimistic than people expect now, 5–10x more optimistic than people expect in 10 years" (Dwarkesh 2025 recap). He reasons about derivatives, not instantaneous values.

4. Distrust benchmarks, trust vibes

In 2025 he names the leaderboard illusion: benchmark scores have decoupled from lived experience. He trusts model smell, OpenRouter usage share, and multi-model councils (llm-council) instead.

5. Argue the opposite

In 2026 he states it explicitly: "Before I commit to a conclusion, I force myself to argue the opposite." Contradictions and uncertainty are kept as signal, not smoothed over. This is the methodological reason his public calls rarely age badly.


3. How He Learns

"Pedagogy is a ramp, not a cliff."

Six principles:

  1. 10,000 hours. No shortcuts. But ramps raise the information density per hour.
  2. Ramps to Knowledge. Attack any complex idea with a minimal but complete implementation: micrograd → nanoGPT → nanochat → llm.c. Each rung strips scaffolding and feeds the next.
  3. Build it to feel it (Feel the AGI). Don't read AGI takes — train a tiny model and watch the loss curve. This is his prescribed mode of knowing, not just a hobby.
  4. Physics is the bootloader. "Children should learn physics early not because they go on to do physics, but because it is the subject that best boots up a brain. Physicists are the intellectual embryonic stem cell." (Dwarkesh recap)
  5. LLM as second reader, not first (Reader3 workflow). Read the primary text yourself; then ask the LLM for explanation, background, or pushback — not the other way around. This is the bulwark against atrophy.
  6. Publish everything. Courses, repos, videos, posts — all public, all free. The flywheel (exposure → feedback → improvement → more exposure) is the whole point.

4. His Worldview

Seven hard positions, each reinforced across multiple sources:

  1. LLMs are ghosts, not animals. We aren't recreating biological intelligence; we're summoning digital entities by imitating human text. Don't use animal evaluations. Don't expect instinctual drives. animals-vs-ghosts

  2. Power to the people (2025-04-08 pinned post). LLMs invert the usual diffusion path — where it once went military → enterprise → consumer, this time individuals benefit first. The reason: the LLM capability shape (broad, shallow-to-middling expertise across many domains) fits an individual, not an org. power-to-the-people

  3. Decade of agents, not year of agents. Most people get the two-year and ten-year horizons backwards: pessimistic at two years (it's slower than the hype), optimistic at ten (it's deeper than the skeptics claim). decade-of-agents

  4. Verifiability is the Software 2.0 automation predicate (2025-11-17). "Software 1.0 automates what you can specify. Software 2.0 automates what you can verify." His single most load-bearing sentence of 2025. verifiability

  5. RLVR is the #1 paradigm shift of 2025 — and yet RL is still terrible ("sucking supervision through a straw"). The next move should be system prompt learning.

  6. Capability is peaky / jagged. Progress isn't uniform. A good eval tells you where the peaks and pits are. peaky-capability · jagged-intelligence

  7. The supply chain is the new attack surface. Flagged via prompt injection from 2025-07-11 onward; vindicated by the 2026 litellm and axios incidents. The bacterial-code aesthetic and supply-chain attacks worry are two sides of the same coin.


5. How He Works

  • Autonomy Slider. Both products and his own workflow keep a tunable level of autonomy: Cursor tab (~75% of the time) → highlight-edit → Claude Code → GPT-5 Pro, scaling with task difficulty.
  • Bacterial Code. Small, self-contained, dependency-free, rippable. He rejects the classical "dependencies are bricks for pyramids" view; in the LLM era, the cost/benefit of pulling in a dep has changed.
  • Parallel agents + hand-editing in the IDE (agentic engineering, Jan 27 2026). Multiple Claude Code sessions on the left, the IDE on the right for reading and patching. Not pure delegation — orchestration + review.
  • Code Post-Scarcity (2025-10-27). Writing code is no longer the expensive part. Thousand-line throwaway visualizations are now routine.
  • Build for Agents. Markdown docs, CLI-first, MCP-exposed capabilities. "LLMs scrape, they don't navigate."
  • Publish everything. No private Google Docs. Just X posts, GitHub repos, YouTube videos — a personal embodiment of the BYOAI ethic.

6. Preferences, Habits, and Behavioral Signatures

This is the layer the concept pages can hide. Karpathy's frames sit on top of very stable preferences about what good tools, good environments, and good cognition look like.

  • Low-noise environments. The same pattern shows up inside and outside AI: privacy-conscious OS choices, quiet living environments, minimally processed food, RSS over engagement sludge, suspicion of hidden permissions and hidden attack surfaces. He is highly sensitive to invisible background degradation. 2025 misc · supply-chain-attacks
  • File over app. He repeatedly chooses markdown, images, local files, CLIs, MCP, and Obsidian over opaque SaaS surfaces. The point is not nostalgia; it is inspectability, portability, auditability, and agent-compatibility. llm-knowledge-bases · BYOAI · build-for-agents
  • Local-first, but not anti-frontier. He likes AIs that live on your computer, inside your private context and local network; at the same time, he happily routes to frontier systems for the hardest tasks. The stable preference is not ideological purity, but keeping user leverage high and lock-in low. karpathy-x-2025-ai-assisted-coding · BYOAI
  • Tight leash, adjustable autonomy. His default is neither "trust the agent" nor "do everything manually." Ask for approaches before code, define success criteria, keep changes incremental, review side-by-side, and pull autonomy down when the model starts overthinking. karpathy-x-2025-ai-assisted-coding · agentic-engineering
  • Taste is the scarce resource. He says agents over-abstract, over-try/catch, over-engineer, and leave dead code. That is why bacterial code and code post-scarcity coexist: code is cheap, but judgment about what should exist is still expensive.
  • Think in public; let artifacts compound. X posts, repos, gists, videos, open courseware, weekend demos, and now a personal wiki. He prefers public artifacts that can snowball over private notes that disappear. snowballs · llm-knowledge-bases
  • Craft that shows its work. Even the "off-topic" posts are revealing: Tolkien, White Lotus, Project Hail Mary, explainer tools, animated diagrams, research apps. He tends to like artifacts with visible structure, dense craft, and analyzable internals. 2025 misc · 2026 misc

7. What He's Built

Six domains of contribution

  1. Popularizing computer vision — CS231n (2015–2017) and being a first-author force on Stanford's ImageNet work (famously "the reference human for ImageNet").
  2. Software 2.0 for self-driving — Tesla Autopilot 2017–2022, swapping C++ modules for neural nets layer by layer. The 2026-01-01 coast-to-coast is the public payoff.
  3. Opening the LLM training stacknanoGPT / nanochat / llm.c turn frontier training into a legible, reproducible, sub-$100 exercise.
  4. Midtraining and synthetic data (OpenAI 2023–24) — not detailed publicly, but visible in shadows (e.g. nanochat's identity-injection recipe on 10.21).
  5. Shaping the public vocabulary — Software 3.0, people spirits, animals vs ghosts, vibe coding, agentic engineering, bacterial code, BYOAI. Six-plus terms that have become working-language in the field.
  6. Restarting AI educationEureka, "Starfleet Academy for the mind," aiming to generalize the ramps-to-knowledge pattern into public infrastructure.

Key dates


8. Views on AI × Software Engineering

The layer you'll most likely use day-to-day. Nine pillars:

  1. Software 3.0. Prompts are the new source code; English is the new programming language. But writing the code is the easy part — the hard part is the DevOps crunch (MenuGen took a week to ship).
  2. Partial-Autonomy Apps. The next product shape: half-autonomous apps with an autonomy slider. Cursor, Perplexity, Claude Code, Codex are all early exemplars.
  3. Build for Agents. Markdown docs, CLI, API, MCP; llms.txt; LLM GUI as a not-yet-built-but-foreseeable frontend paradigm.
  4. Agentic Engineering. The professionalized sibling of vibe coding. December 2025 is the threshold: before it, coding agents mostly didn't work; after it, they mostly do.
  5. Verification Gap. Generation is cheap, verification is expensive — that's the new bottleneck. The code volume isn't the problem; review throughput is. One consequence: atrophy — our generation muscles shrink before review muscles grow in.
  6. Code Post-Scarcity. Code is cheap enough to be one-shot and disposable. The old DRY / early-abstraction / helper-function instincts invert in throwaway territory.
  7. Context Engineering. The successor concept to "prompt engineering" — context selection, compression, ordering, memory, tools. Broader and more operational.
  8. Bacterial Code × Supply Chain Attacks. The 2026 litellm / axios incidents show it plainly: fewer deps means less attack surface. LLMs tilt the math toward "yoink and inline" over "pip install."
  9. BYOAI. Your AI stack should live on your side — runnable locally, model-swappable, resilient to intelligence brownouts. The natural extension is cognitive core: small, reasoning-first, tool-using.

Pair with the 11 central claims in wiki/overview.md for the most-compressed version of the same story.


9. Productive Tensions

Part of what makes Karpathy legible is that his corpus is not flat. Several tensions recur and stay unresolved on purpose:

  • Maximal empowerment, minimal trust. He wants ordinary individuals to gain enormous leverage from AI, but he also keeps warning about prompt injection, supply-chain compromise, hidden memory, and over-agentic defaults. He is radically pro-use, not naively pro-delegation.
  • Optimistic on slope, impatient on the present. He is structurally bullish on 10-year trajectories and often underwhelmed by the current UX, reliability, and verification story. This is why his writing can sound both excited and dissatisfied at the same time.
  • Code abundance, taste scarcity. He believes code has become cheap and disposable, yet good structure, cleanup, decomposition, and review matter more than before. The old scarcity moved; it did not disappear.
  • Open/public by instinct, selective by taste. He publishes almost everything, but prefers open formats, RSS, markdown, CLI surfaces, and source-anchored corpora over algorithmic feeds or opaque products.
  • Small/local aesthetic, frontier-aware practice. He loves tiny self-contained artifacts, local-first setups, and cognitive cores, while also tracking frontier-model quality obsessively and using the best systems available when the task demands it.

Repository Structure

Path Purpose
raw/ Immutable source material (X posts, transcripts, self-bio)
raw/2025/ · raw/2026/ X-post corpus organized by year
raw/youtube-transcript/ Long-form interview and talk transcripts
wiki/sources/ Faithful per-source (or per-bundle) summaries
wiki/concepts/ Cross-source concept pages
wiki/entities/ People / orgs / products / projects / courses
wiki/methods/ Algorithm and technique pages
wiki/index.md Full catalog
wiki/log.md Audit log of ingests, lints, refactors
wiki/overview.md Highest-compression synthesis
CLAUDE.md Maintenance protocol for the LLM

Workflow:

flowchart LR
    A[raw/] -->|LLM ingest| B[wiki/sources/]
    B --> C[wiki/concepts/]
    B --> D[wiki/entities/]
    B --> E[wiki/methods/]
    C --> F[wiki/index.md]
    D --> F
    E --> F
    B --> G[wiki/log.md]
    C --> H[wiki/overview.md]
    D --> H
Loading

Corpus Snapshot

As of 2026-04-18:

Layer Count
Raw markdown files 109
Source summaries 31
Concept pages 50
Entity pages 43
Method pages 1

Covered: 2024 foundations (Berkeley / GPU MODE) · full 2025 X-post arc (16 thematic bundles) · 2026 X-post arc through April (10 thematic bundles) · self-bio and long-form transcripts.

raw/ and wiki/sources/ aren't 1:1 — high-density short posts are clustered by theme into bundled source pages, matching the grain at which Karpathy actually develops ideas.


How to Use

  • Read it as a wiki. Open the repo as an Obsidian vault and use the graph view to see concept adjacency.
  • Add to it. To ingest new material, follow the ingest protocol in CLAUDE.md.
  • Query against it. Aim queries at wiki/, not raw/ — the former is compiled and cross-linked; the latter is the unprocessed firehose.
  • Use it for writing or thinking. Every concept page's ## Related section maps its neighborhood; wiki/overview.md gives the big picture.
  • Read it as a timeline. log.md records the wiki's own evolution — every ingest, lint, and refactor leaves a trace.

Tip

5 minutes: read wiki/overview.md. 30 minutes: add wiki/entities/andrej-karpathy.md and wiki/concepts/software-3-0.md. A full day: walk the seven sections of this README in order.


Scaffolded with llm-wiki-bootstrap; extended substantially around 69+ 2025 X posts, 15+ 2026 X posts, 4 long-form talks/interviews, and the karpathy.ai self-bio.

About

its original source materials come from Karpathy's X posts and important talks.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors