English · 简体中文
A source-driven knowledge base distilled from Andrej Karpathy's public corpus — X posts, interviews, talks, open-source repos, and self-bio. The goal isn't to archive links; it's to understand the person — how he thinks, how he learns, how he works, what he believes, what he's built, and what he's saying about AI and software engineering.
The repo follows the pattern Karpathy himself sketches in LLM Knowledge Bases: raw material goes into raw/ (read-only), an LLM compiles it into wiki/ (read-write, refactored freely), and the compiled wiki becomes the substrate for future queries and ingests.
Important
Enter through wiki/, not raw/. wiki/overview.md is the highest-compression synthesis; wiki/index.md is the full catalog.
| I want to… | Start here |
|---|---|
| See the whole synthesis | wiki/overview.md |
| Read Karpathy's bio and body of work | wiki/entities/andrej-karpathy.md |
| Browse the full catalog | wiki/index.md |
| Understand the maintenance protocol | CLAUDE.md |
| Read in Obsidian | Open this directory as a vault |
"I like to train deep neural nets on large datasets 🧠🤖💥" — karpathy.ai self-intro
Karpathy is one of the most paradigm-shaping individuals in modern AI. He doesn't ship the biggest models; he gives us the vocabulary to reason about them (Software 3.0, people spirits, animals vs ghosts, cognitive core, march of nines…). Researcher, engineer, teacher, public thinker — four hats at once, which almost nobody else wears simultaneously.
Career arc (full table):
- 2005–2015: Toronto (Hinton's course) → UBC → Stanford (under Fei-Fei Li)
- 2011 / 2013 / 2015: Google Brain → Google Research → DeepMind internships
- 2015–2017: OpenAI founding member
- 2017–2022: Tesla Director of AI, leading the Autopilot vision team
- 2023–2024: Back to OpenAI, working on midtraining and synthetic data
- 2024– : Independent educator; founded Eureka
Two biographical details that explain a surprising amount of the later corpus:
- Physics / hardware lens. Toronto was CS + physics + math, and in 2025 he says one of his undergraduate mistakes was over-focusing on the mathematical lens of computing and under-focusing on the physical one: energy, data locality, parallelism, architecture. That helps explain why llm.c, CUDA-kernel talk, and systems-level intuitions keep resurfacing. andrej-karpathy · 2025 misc
- "Reference human for ImageNet." Before the LLM-era frames, he already carried unusually high credibility in computer vision; the GPU MODE host intro calling him "the reference human for ImageNet" is a revealing shorthand for how he was perceived. It helps explain why later claims on evals, self-driving, and perception feel grounded rather than purely philosophical. andrej-karpathy · gpu-mode-irl-2024-keynote
What he's left behind:
- CS231n — Stanford's first deep-learning course, which funneled a generation into DL (150 students in 2015 → 750 in 2017)
- Zero to Hero — the most watched from-scratch neural-net series on YouTube
- nanoGPT / nanochat / micrograd / llm.c / microGPT — a ladder of minimal, legible, high-fidelity teaching code
- Tesla FSD — the 2026-01-01 coast-to-coast run (2,732 miles, zero interventions) is the cleanest public proof of the Software 2.0 bet
Five observable traits:
He coins a word, then uses it as reasoning scaffolding. The term isn't a label applied after the fact — it's the analysis tool. Each coinage comes packaged with its own analogies, counterexamples, and evolutionary edges. This is the technical core of his influence.
Full coinage list in andrej-karpathy.md. The load-bearing ones:
| Frame | Question it answers |
|---|---|
| Software 1.0/2.0/3.0 | How does software get written? |
| Animals vs Ghosts | What kind of thing is an LLM? |
| People Spirits | What are an LLM's psychological quirks? |
| March of Nines | Why is autonomy — self-driving, agents — so slow? |
| Verifiability | Which tasks can AI actually automate? |
| Cognitive Core | What's the "right" model size? |
| Jagged Intelligence | Why is an LLM genius and moron by turns? |
| Verification Gap | Where does agentic coding bottleneck? |
- Tesla / Waymo autonomy curve → the coming arc of coding agents
- Memento / 50 First Dates → LLM session-level amnesia
- Rain Man → savant memory paired with cognitive gaps
- Horizontal gene transfer in bacteria → the portability of bacterial code
- Iron Man suit vs Iron Man robot → augmentation over automation
Analogies aren't ornament — they're how he compresses mechanism-level intuition.
He says it repeatedly: "5–10x more pessimistic than people expect now, 5–10x more optimistic than people expect in 10 years" (Dwarkesh 2025 recap). He reasons about derivatives, not instantaneous values.
In 2025 he names the leaderboard illusion: benchmark scores have decoupled from lived experience. He trusts model smell, OpenRouter usage share, and multi-model councils (llm-council) instead.
In 2026 he states it explicitly: "Before I commit to a conclusion, I force myself to argue the opposite." Contradictions and uncertainty are kept as signal, not smoothed over. This is the methodological reason his public calls rarely age badly.
"Pedagogy is a ramp, not a cliff."
Six principles:
- 10,000 hours. No shortcuts. But ramps raise the information density per hour.
- Ramps to Knowledge. Attack any complex idea with a minimal but complete implementation: micrograd → nanoGPT → nanochat → llm.c. Each rung strips scaffolding and feeds the next.
- Build it to feel it (Feel the AGI). Don't read AGI takes — train a tiny model and watch the loss curve. This is his prescribed mode of knowing, not just a hobby.
- Physics is the bootloader. "Children should learn physics early not because they go on to do physics, but because it is the subject that best boots up a brain. Physicists are the intellectual embryonic stem cell." (Dwarkesh recap)
- LLM as second reader, not first (Reader3 workflow). Read the primary text yourself; then ask the LLM for explanation, background, or pushback — not the other way around. This is the bulwark against atrophy.
- Publish everything. Courses, repos, videos, posts — all public, all free. The flywheel (exposure → feedback → improvement → more exposure) is the whole point.
Seven hard positions, each reinforced across multiple sources:
-
LLMs are ghosts, not animals. We aren't recreating biological intelligence; we're summoning digital entities by imitating human text. Don't use animal evaluations. Don't expect instinctual drives. animals-vs-ghosts
-
Power to the people (2025-04-08 pinned post). LLMs invert the usual diffusion path — where it once went military → enterprise → consumer, this time individuals benefit first. The reason: the LLM capability shape (broad, shallow-to-middling expertise across many domains) fits an individual, not an org. power-to-the-people
-
Decade of agents, not year of agents. Most people get the two-year and ten-year horizons backwards: pessimistic at two years (it's slower than the hype), optimistic at ten (it's deeper than the skeptics claim). decade-of-agents
-
Verifiability is the Software 2.0 automation predicate (2025-11-17). "Software 1.0 automates what you can specify. Software 2.0 automates what you can verify." His single most load-bearing sentence of 2025. verifiability
-
RLVR is the #1 paradigm shift of 2025 — and yet RL is still terrible ("sucking supervision through a straw"). The next move should be system prompt learning.
-
Capability is peaky / jagged. Progress isn't uniform. A good eval tells you where the peaks and pits are. peaky-capability · jagged-intelligence
-
The supply chain is the new attack surface. Flagged via prompt injection from 2025-07-11 onward; vindicated by the 2026 litellm and axios incidents. The bacterial-code aesthetic and supply-chain attacks worry are two sides of the same coin.
- Autonomy Slider. Both products and his own workflow keep a tunable level of autonomy: Cursor tab (~75% of the time) → highlight-edit → Claude Code → GPT-5 Pro, scaling with task difficulty.
- Bacterial Code. Small, self-contained, dependency-free, rippable. He rejects the classical "dependencies are bricks for pyramids" view; in the LLM era, the cost/benefit of pulling in a dep has changed.
- Parallel agents + hand-editing in the IDE (agentic engineering, Jan 27 2026). Multiple Claude Code sessions on the left, the IDE on the right for reading and patching. Not pure delegation — orchestration + review.
- Code Post-Scarcity (2025-10-27). Writing code is no longer the expensive part. Thousand-line throwaway visualizations are now routine.
- Build for Agents. Markdown docs, CLI-first, MCP-exposed capabilities. "LLMs scrape, they don't navigate."
- Publish everything. No private Google Docs. Just X posts, GitHub repos, YouTube videos — a personal embodiment of the BYOAI ethic.
This is the layer the concept pages can hide. Karpathy's frames sit on top of very stable preferences about what good tools, good environments, and good cognition look like.
- Low-noise environments. The same pattern shows up inside and outside AI: privacy-conscious OS choices, quiet living environments, minimally processed food, RSS over engagement sludge, suspicion of hidden permissions and hidden attack surfaces. He is highly sensitive to invisible background degradation. 2025 misc · supply-chain-attacks
- File over app. He repeatedly chooses markdown, images, local files, CLIs, MCP, and Obsidian over opaque SaaS surfaces. The point is not nostalgia; it is inspectability, portability, auditability, and agent-compatibility. llm-knowledge-bases · BYOAI · build-for-agents
- Local-first, but not anti-frontier. He likes AIs that live on your computer, inside your private context and local network; at the same time, he happily routes to frontier systems for the hardest tasks. The stable preference is not ideological purity, but keeping user leverage high and lock-in low. karpathy-x-2025-ai-assisted-coding · BYOAI
- Tight leash, adjustable autonomy. His default is neither "trust the agent" nor "do everything manually." Ask for approaches before code, define success criteria, keep changes incremental, review side-by-side, and pull autonomy down when the model starts overthinking. karpathy-x-2025-ai-assisted-coding · agentic-engineering
- Taste is the scarce resource. He says agents over-abstract, over-try/catch, over-engineer, and leave dead code. That is why bacterial code and code post-scarcity coexist: code is cheap, but judgment about what should exist is still expensive.
- Think in public; let artifacts compound. X posts, repos, gists, videos, open courseware, weekend demos, and now a personal wiki. He prefers public artifacts that can snowball over private notes that disappear. snowballs · llm-knowledge-bases
- Craft that shows its work. Even the "off-topic" posts are revealing: Tolkien, White Lotus, Project Hail Mary, explainer tools, animated diagrams, research apps. He tends to like artifacts with visible structure, dense craft, and analyzable internals. 2025 misc · 2026 misc
- Popularizing computer vision — CS231n (2015–2017) and being a first-author force on Stanford's ImageNet work (famously "the reference human for ImageNet").
- Software 2.0 for self-driving — Tesla Autopilot 2017–2022, swapping C++ modules for neural nets layer by layer. The 2026-01-01 coast-to-coast is the public payoff.
- Opening the LLM training stack — nanoGPT / nanochat / llm.c turn frontier training into a legible, reproducible, sub-$100 exercise.
- Midtraining and synthetic data (OpenAI 2023–24) — not detailed publicly, but visible in shadows (e.g. nanochat's identity-injection recipe on 10.21).
- Shaping the public vocabulary — Software 3.0, people spirits, animals vs ghosts, vibe coding, agentic engineering, bacterial code, BYOAI. Six-plus terms that have become working-language in the field.
- Restarting AI education — Eureka, "Starfleet Academy for the mind," aiming to generalize the ramps-to-knowledge pattern into public infrastructure.
- 2025-04-08 Power to the People — the diffusion-inversion thesis
- 2025-07-06 Bacterial code — coinage
- 2025-07-27 Cognitive core — full spec
- 2025-10-13 nanochat released
- 2025-11-17 Verifiability — the defining sentence
- 2025-12-20 Year-in-review — RLVR declared #1 paradigm shift
- 2026-01-01 Tesla FSD coast-to-coast (2,732 miles, 0 interventions)
- 2026-01-27 Claude coding notes — atrophy named
- 2026-02-05 Agentic engineering — coinage
- 2026-02-25 "The phase shift happened in December 2025" — stated plainly
- 2026-04-05 BYOAI — the personal-AI-stack stance
The layer you'll most likely use day-to-day. Nine pillars:
- Software 3.0. Prompts are the new source code; English is the new programming language. But writing the code is the easy part — the hard part is the DevOps crunch (MenuGen took a week to ship).
- Partial-Autonomy Apps. The next product shape: half-autonomous apps with an autonomy slider. Cursor, Perplexity, Claude Code, Codex are all early exemplars.
- Build for Agents. Markdown docs, CLI, API, MCP;
llms.txt; LLM GUI as a not-yet-built-but-foreseeable frontend paradigm. - Agentic Engineering. The professionalized sibling of vibe coding. December 2025 is the threshold: before it, coding agents mostly didn't work; after it, they mostly do.
- Verification Gap. Generation is cheap, verification is expensive — that's the new bottleneck. The code volume isn't the problem; review throughput is. One consequence: atrophy — our generation muscles shrink before review muscles grow in.
- Code Post-Scarcity. Code is cheap enough to be one-shot and disposable. The old DRY / early-abstraction / helper-function instincts invert in throwaway territory.
- Context Engineering. The successor concept to "prompt engineering" — context selection, compression, ordering, memory, tools. Broader and more operational.
- Bacterial Code × Supply Chain Attacks. The 2026 litellm / axios incidents show it plainly: fewer deps means less attack surface. LLMs tilt the math toward "yoink and inline" over "pip install."
- BYOAI. Your AI stack should live on your side — runnable locally, model-swappable, resilient to intelligence brownouts. The natural extension is cognitive core: small, reasoning-first, tool-using.
Pair with the 11 central claims in wiki/overview.md for the most-compressed version of the same story.
Part of what makes Karpathy legible is that his corpus is not flat. Several tensions recur and stay unresolved on purpose:
- Maximal empowerment, minimal trust. He wants ordinary individuals to gain enormous leverage from AI, but he also keeps warning about prompt injection, supply-chain compromise, hidden memory, and over-agentic defaults. He is radically pro-use, not naively pro-delegation.
- Optimistic on slope, impatient on the present. He is structurally bullish on 10-year trajectories and often underwhelmed by the current UX, reliability, and verification story. This is why his writing can sound both excited and dissatisfied at the same time.
- Code abundance, taste scarcity. He believes code has become cheap and disposable, yet good structure, cleanup, decomposition, and review matter more than before. The old scarcity moved; it did not disappear.
- Open/public by instinct, selective by taste. He publishes almost everything, but prefers open formats, RSS, markdown, CLI surfaces, and source-anchored corpora over algorithmic feeds or opaque products.
- Small/local aesthetic, frontier-aware practice. He loves tiny self-contained artifacts, local-first setups, and cognitive cores, while also tracking frontier-model quality obsessively and using the best systems available when the task demands it.
| Path | Purpose |
|---|---|
raw/ |
Immutable source material (X posts, transcripts, self-bio) |
raw/2025/ · raw/2026/ |
X-post corpus organized by year |
raw/youtube-transcript/ |
Long-form interview and talk transcripts |
wiki/sources/ |
Faithful per-source (or per-bundle) summaries |
wiki/concepts/ |
Cross-source concept pages |
wiki/entities/ |
People / orgs / products / projects / courses |
wiki/methods/ |
Algorithm and technique pages |
wiki/index.md |
Full catalog |
wiki/log.md |
Audit log of ingests, lints, refactors |
wiki/overview.md |
Highest-compression synthesis |
CLAUDE.md |
Maintenance protocol for the LLM |
Workflow:
flowchart LR
A[raw/] -->|LLM ingest| B[wiki/sources/]
B --> C[wiki/concepts/]
B --> D[wiki/entities/]
B --> E[wiki/methods/]
C --> F[wiki/index.md]
D --> F
E --> F
B --> G[wiki/log.md]
C --> H[wiki/overview.md]
D --> H
As of 2026-04-18:
| Layer | Count |
|---|---|
| Raw markdown files | 109 |
| Source summaries | 31 |
| Concept pages | 50 |
| Entity pages | 43 |
| Method pages | 1 |
Covered: 2024 foundations (Berkeley / GPU MODE) · full 2025 X-post arc (16 thematic bundles) · 2026 X-post arc through April (10 thematic bundles) · self-bio and long-form transcripts.
raw/ and wiki/sources/ aren't 1:1 — high-density short posts are clustered by theme into bundled source pages, matching the grain at which Karpathy actually develops ideas.
- Read it as a wiki. Open the repo as an Obsidian vault and use the graph view to see concept adjacency.
- Add to it. To ingest new material, follow the ingest protocol in CLAUDE.md.
- Query against it. Aim queries at
wiki/, notraw/— the former is compiled and cross-linked; the latter is the unprocessed firehose. - Use it for writing or thinking. Every concept page's
## Relatedsection maps its neighborhood; wiki/overview.md gives the big picture. - Read it as a timeline. log.md records the wiki's own evolution — every ingest, lint, and refactor leaves a trace.
Tip
5 minutes: read wiki/overview.md. 30 minutes: add wiki/entities/andrej-karpathy.md and wiki/concepts/software-3-0.md. A full day: walk the seven sections of this README in order.
Scaffolded with llm-wiki-bootstrap; extended substantially around 69+ 2025 X posts, 15+ 2026 X posts, 4 long-form talks/interviews, and the karpathy.ai self-bio.
