karpathy-wiki

A source-driven knowledge base distilled from Andrej Karpathy's public corpus — X posts, interviews, talks, open-source repos, and self-bio. The goal isn't to archive links; it's to understand the person — how he thinks, how he learns, how he works, what he believes, what he's built, and what he's saying about AI and software engineering.

The repo follows the pattern Karpathy himself sketches in LLM Knowledge Bases: raw material goes into raw/ (read-only), an LLM compiles it into wiki/ (read-write, refactored freely), and the compiled wiki becomes the substrate for future queries and ingests.

Important

Enter through wiki/, not raw/. wiki/overview.md is the highest-compression synthesis; wiki/index.md is the full catalog.

Quick Paths

I want to…	Start here
See the whole synthesis	wiki/overview.md
Read Karpathy's bio and body of work	wiki/entities/andrej-karpathy.md
Browse the full catalog	wiki/index.md
Understand the maintenance protocol	CLAUDE.md
Read in Obsidian	Open this directory as a vault

1. Meet the Person

"I like to train deep neural nets on large datasets 🧠🤖💥" — karpathy.ai self-intro

Karpathy is one of the most paradigm-shaping individuals in modern AI. He doesn't ship the biggest models; he gives us the vocabulary to reason about them (Software 3.0, people spirits, animals vs ghosts, cognitive core, march of nines…). Researcher, engineer, teacher, public thinker — four hats at once, which almost nobody else wears simultaneously.

Career arc (full table):

2005–2015: Toronto (Hinton's course) → UBC → Stanford (under Fei-Fei Li)
2011 / 2013 / 2015: Google Brain → Google Research → DeepMind internships
2015–2017: OpenAI founding member
2017–2022: Tesla Director of AI, leading the Autopilot vision team
2023–2024: Back to OpenAI, working on midtraining and synthetic data
2024– : Independent educator; founded Eureka

Two biographical details that explain a surprising amount of the later corpus:

Physics / hardware lens. Toronto was CS + physics + math, and in 2025 he says one of his undergraduate mistakes was over-focusing on the mathematical lens of computing and under-focusing on the physical one: energy, data locality, parallelism, architecture. That helps explain why llm.c, CUDA-kernel talk, and systems-level intuitions keep resurfacing. andrej-karpathy · 2025 misc
"Reference human for ImageNet." Before the LLM-era frames, he already carried unusually high credibility in computer vision; the GPU MODE host intro calling him "the reference human for ImageNet" is a revealing shorthand for how he was perceived. It helps explain why later claims on evals, self-driving, and perception feel grounded rather than purely philosophical. andrej-karpathy · gpu-mode-irl-2024-keynote

What he's left behind:

CS231n — Stanford's first deep-learning course, which funneled a generation into DL (150 students in 2015 → 750 in 2017)
Zero to Hero — the most watched from-scratch neural-net series on YouTube
nanoGPT / nanochat / micrograd / llm.c / microGPT — a ladder of minimal, legible, high-fidelity teaching code
Tesla FSD — the 2026-01-01 coast-to-coast run (2,732 miles, zero interventions) is the cleanest public proof of the Software 2.0 bet

2. How He Thinks

Five observable traits:

1. Frame-first

He coins a word, then uses it as reasoning scaffolding. The term isn't a label applied after the fact — it's the analysis tool. Each coinage comes packaged with its own analogies, counterexamples, and evolutionary edges. This is the technical core of his influence.

Full coinage list in andrej-karpathy.md. The load-bearing ones:

Frame	Question it answers
Software 1.0/2.0/3.0	How does software get written?
Animals vs Ghosts	What kind of thing is an LLM?
People Spirits	What are an LLM's psychological quirks?
March of Nines	Why is autonomy — self-driving, agents — so slow?
Verifiability	Which tasks can AI actually automate?
Cognitive Core	What's the "right" model size?
Jagged Intelligence	Why is an LLM genius and moron by turns?
Verification Gap	Where does agentic coding bottleneck?

2. Analogy-driven

Tesla / Waymo autonomy curve → the coming arc of coding agents
Memento / 50 First Dates → LLM session-level amnesia
Rain Man → savant memory paired with cognitive gaps
Horizontal gene transfer in bacteria → the portability of bacterial code
Iron Man suit vs Iron Man robot → augmentation over automation

Analogies aren't ornament — they're how he compresses mechanism-level intuition.

3. Slopes, not points

He says it repeatedly: "5–10x more pessimistic than people expect now, 5–10x more optimistic than people expect in 10 years" (Dwarkesh 2025 recap). He reasons about derivatives, not instantaneous values.

4. Distrust benchmarks, trust vibes

In 2025 he names the leaderboard illusion: benchmark scores have decoupled from lived experience. He trusts model smell, OpenRouter usage share, and multi-model councils (llm-council) instead.

5. Argue the opposite

In 2026 he states it explicitly: "Before I commit to a conclusion, I force myself to argue the opposite." Contradictions and uncertainty are kept as signal, not smoothed over. This is the methodological reason his public calls rarely age badly.

3. How He Learns

"Pedagogy is a ramp, not a cliff."

Six principles:

10,000 hours. No shortcuts. But ramps raise the information density per hour.
Ramps to Knowledge. Attack any complex idea with a minimal but complete implementation: micrograd → nanoGPT → nanochat → llm.c. Each rung strips scaffolding and feeds the next.
Build it to feel it (Feel the AGI). Don't read AGI takes — train a tiny model and watch the loss curve. This is his prescribed mode of knowing, not just a hobby.
Physics is the bootloader. "Children should learn physics early not because they go on to do physics, but because it is the subject that best boots up a brain. Physicists are the intellectual embryonic stem cell." (Dwarkesh recap)
LLM as second reader, not first (Reader3 workflow). Read the primary text yourself; then ask the LLM for explanation, background, or pushback — not the other way around. This is the bulwark against atrophy.
Publish everything. Courses, repos, videos, posts — all public, all free. The flywheel (exposure → feedback → improvement → more exposure) is the whole point.

4. His Worldview

Seven hard positions, each reinforced across multiple sources:

LLMs are ghosts, not animals. We aren't recreating biological intelligence; we're summoning digital entities by imitating human text. Don't use animal evaluations. Don't expect instinctual drives. animals-vs-ghosts
Power to the people (2025-04-08 pinned post). LLMs invert the usual diffusion path — where it once went military → enterprise → consumer, this time individuals benefit first. The reason: the LLM capability shape (broad, shallow-to-middling expertise across many domains) fits an individual, not an org. power-to-the-people
Decade of agents, not year of agents. Most people get the two-year and ten-year horizons backwards: pessimistic at two years (it's slower than the hype), optimistic at ten (it's deeper than the skeptics claim). decade-of-agents
Verifiability is the Software 2.0 automation predicate (2025-11-17). "Software 1.0 automates what you can specify. Software 2.0 automates what you can verify." His single most load-bearing sentence of 2025. verifiability
RLVR is the #1 paradigm shift of 2025 — and yet RL is still terrible ("sucking supervision through a straw"). The next move should be system prompt learning.
Capability is peaky / jagged. Progress isn't uniform. A good eval tells you where the peaks and pits are. peaky-capability · jagged-intelligence
The supply chain is the new attack surface. Flagged via prompt injection from 2025-07-11 onward; vindicated by the 2026 litellm and axios incidents. The bacterial-code aesthetic and supply-chain attacks worry are two sides of the same coin.

5. How He Works

Autonomy Slider. Both products and his own workflow keep a tunable level of autonomy: Cursor tab (~75% of the time) → highlight-edit → Claude Code → GPT-5 Pro, scaling with task difficulty.
Bacterial Code. Small, self-contained, dependency-free, rippable. He rejects the classical "dependencies are bricks for pyramids" view; in the LLM era, the cost/benefit of pulling in a dep has changed.
Parallel agents + hand-editing in the IDE (agentic engineering, Jan 27 2026). Multiple Claude Code sessions on the left, the IDE on the right for reading and patching. Not pure delegation — orchestration + review.
Code Post-Scarcity (2025-10-27). Writing code is no longer the expensive part. Thousand-line throwaway visualizations are now routine.
Build for Agents. Markdown docs, CLI-first, MCP-exposed capabilities. "LLMs scrape, they don't navigate."
Publish everything. No private Google Docs. Just X posts, GitHub repos, YouTube videos — a personal embodiment of the BYOAI ethic.

6. Preferences, Habits, and Behavioral Signatures

This is the layer the concept pages can hide. Karpathy's frames sit on top of very stable preferences about what good tools, good environments, and good cognition look like.

Low-noise environments. The same pattern shows up inside and outside AI: privacy-conscious OS choices, quiet living environments, minimally processed food, RSS over engagement sludge, suspicion of hidden permissions and hidden attack surfaces. He is highly sensitive to invisible background degradation. 2025 misc · supply-chain-attacks
File over app. He repeatedly chooses markdown, images, local files, CLIs, MCP, and Obsidian over opaque SaaS surfaces. The point is not nostalgia; it is inspectability, portability, auditability, and agent-compatibility. llm-knowledge-bases · BYOAI · build-for-agents
Local-first, but not anti-frontier. He likes AIs that live on your computer, inside your private context and local network; at the same time, he happily routes to frontier systems for the hardest tasks. The stable preference is not ideological purity, but keeping user leverage high and lock-in low. karpathy-x-2025-ai-assisted-coding · BYOAI
Tight leash, adjustable autonomy. His default is neither "trust the agent" nor "do everything manually." Ask for approaches before code, define success criteria, keep changes incremental, review side-by-side, and pull autonomy down when the model starts overthinking. karpathy-x-2025-ai-assisted-coding · agentic-engineering
Taste is the scarce resource. He says agents over-abstract, over-try/catch, over-engineer, and leave dead code. That is why bacterial code and code post-scarcity coexist: code is cheap, but judgment about what should exist is still expensive.
Think in public; let artifacts compound. X posts, repos, gists, videos, open courseware, weekend demos, and now a personal wiki. He prefers public artifacts that can snowball over private notes that disappear. snowballs · llm-knowledge-bases
Craft that shows its work. Even the "off-topic" posts are revealing: Tolkien, White Lotus, Project Hail Mary, explainer tools, animated diagrams, research apps. He tends to like artifacts with visible structure, dense craft, and analyzable internals. 2025 misc · 2026 misc

7. What He's Built

Six domains of contribution

Popularizing computer vision — CS231n (2015–2017) and being a first-author force on Stanford's ImageNet work (famously "the reference human for ImageNet").
Software 2.0 for self-driving — Tesla Autopilot 2017–2022, swapping C++ modules for neural nets layer by layer. The 2026-01-01 coast-to-coast is the public payoff.
Opening the LLM training stack — nanoGPT / nanochat / llm.c turn frontier training into a legible, reproducible, sub-$100 exercise.
Midtraining and synthetic data (OpenAI 2023–24) — not detailed publicly, but visible in shadows (e.g. nanochat's identity-injection recipe on 10.21).
Shaping the public vocabulary — Software 3.0, people spirits, animals vs ghosts, vibe coding, agentic engineering, bacterial code, BYOAI. Six-plus terms that have become working-language in the field.
Restarting AI education — Eureka, "Starfleet Academy for the mind," aiming to generalize the ramps-to-knowledge pattern into public infrastructure.

Key dates

2025-04-08 Power to the People — the diffusion-inversion thesis
2025-07-06 Bacterial code — coinage
2025-07-27 Cognitive core — full spec
2025-10-13 nanochat released
2025-11-17 Verifiability — the defining sentence
2025-12-20 Year-in-review — RLVR declared #1 paradigm shift
2026-01-01 Tesla FSD coast-to-coast (2,732 miles, 0 interventions)
2026-01-27 Claude coding notes — atrophy named
2026-02-05 Agentic engineering — coinage
2026-02-25 "The phase shift happened in December 2025" — stated plainly
2026-04-05 BYOAI — the personal-AI-stack stance

8. Views on AI × Software Engineering

The layer you'll most likely use day-to-day. Nine pillars:

Software 3.0. Prompts are the new source code; English is the new programming language. But writing the code is the easy part — the hard part is the DevOps crunch (MenuGen took a week to ship).
Partial-Autonomy Apps. The next product shape: half-autonomous apps with an autonomy slider. Cursor, Perplexity, Claude Code, Codex are all early exemplars.
Build for Agents. Markdown docs, CLI, API, MCP; llms.txt; LLM GUI as a not-yet-built-but-foreseeable frontend paradigm.
Agentic Engineering. The professionalized sibling of vibe coding. December 2025 is the threshold: before it, coding agents mostly didn't work; after it, they mostly do.
Verification Gap. Generation is cheap, verification is expensive — that's the new bottleneck. The code volume isn't the problem; review throughput is. One consequence: atrophy — our generation muscles shrink before review muscles grow in.
Code Post-Scarcity. Code is cheap enough to be one-shot and disposable. The old DRY / early-abstraction / helper-function instincts invert in throwaway territory.
Context Engineering. The successor concept to "prompt engineering" — context selection, compression, ordering, memory, tools. Broader and more operational.
Bacterial Code × Supply Chain Attacks. The 2026 litellm / axios incidents show it plainly: fewer deps means less attack surface. LLMs tilt the math toward "yoink and inline" over "pip install."
BYOAI. Your AI stack should live on your side — runnable locally, model-swappable, resilient to intelligence brownouts. The natural extension is cognitive core: small, reasoning-first, tool-using.

Pair with the 11 central claims in wiki/overview.md for the most-compressed version of the same story.

9. Productive Tensions

Part of what makes Karpathy legible is that his corpus is not flat. Several tensions recur and stay unresolved on purpose:

Maximal empowerment, minimal trust. He wants ordinary individuals to gain enormous leverage from AI, but he also keeps warning about prompt injection, supply-chain compromise, hidden memory, and over-agentic defaults. He is radically pro-use, not naively pro-delegation.
Optimistic on slope, impatient on the present. He is structurally bullish on 10-year trajectories and often underwhelmed by the current UX, reliability, and verification story. This is why his writing can sound both excited and dissatisfied at the same time.
Code abundance, taste scarcity. He believes code has become cheap and disposable, yet good structure, cleanup, decomposition, and review matter more than before. The old scarcity moved; it did not disappear.
Open/public by instinct, selective by taste. He publishes almost everything, but prefers open formats, RSS, markdown, CLI surfaces, and source-anchored corpora over algorithmic feeds or opaque products.
Small/local aesthetic, frontier-aware practice. He loves tiny self-contained artifacts, local-first setups, and cognitive cores, while also tracking frontier-model quality obsessively and using the best systems available when the task demands it.

Repository Structure

Path	Purpose
`raw/`	Immutable source material (X posts, transcripts, self-bio)
`raw/2025/` · `raw/2026/`	X-post corpus organized by year
`raw/youtube-transcript/`	Long-form interview and talk transcripts
`wiki/sources/`	Faithful per-source (or per-bundle) summaries
`wiki/concepts/`	Cross-source concept pages
`wiki/entities/`	People / orgs / products / projects / courses
`wiki/methods/`	Algorithm and technique pages
`wiki/index.md`	Full catalog
`wiki/log.md`	Audit log of ingests, lints, refactors
`wiki/overview.md`	Highest-compression synthesis
`CLAUDE.md`	Maintenance protocol for the LLM

Workflow:

flowchart LR
    A[raw/] -->|LLM ingest| B[wiki/sources/]
    B --> C[wiki/concepts/]
    B --> D[wiki/entities/]
    B --> E[wiki/methods/]
    C --> F[wiki/index.md]
    D --> F
    E --> F
    B --> G[wiki/log.md]
    C --> H[wiki/overview.md]
    D --> H

Corpus Snapshot

As of 2026-04-18:

Layer	Count
Raw markdown files	109
Source summaries	31
Concept pages	50
Entity pages	43
Method pages	1

Covered: 2024 foundations (Berkeley / GPU MODE) · full 2025 X-post arc (16 thematic bundles) · 2026 X-post arc through April (10 thematic bundles) · self-bio and long-form transcripts.

raw/ and wiki/sources/ aren't 1:1 — high-density short posts are clustered by theme into bundled source pages, matching the grain at which Karpathy actually develops ideas.

How to Use

Read it as a wiki. Open the repo as an Obsidian vault and use the graph view to see concept adjacency.
Add to it. To ingest new material, follow the ingest protocol in CLAUDE.md.
Query against it. Aim queries at wiki/, not raw/ — the former is compiled and cross-linked; the latter is the unprocessed firehose.
Use it for writing or thinking. Every concept page's ## Related section maps its neighborhood; wiki/overview.md gives the big picture.
Read it as a timeline. log.md records the wiki's own evolution — every ingest, lint, and refactor leaves a trace.

Tip

5 minutes: read wiki/overview.md. 30 minutes: add wiki/entities/andrej-karpathy.md and wiki/concepts/software-3-0.md. A full day: walk the seven sections of this README in order.

Scaffolded with llm-wiki-bootstrap; extended substantially around 69+ 2025 X posts, 15+ 2026 X posts, 4 long-form talks/interviews, and the karpathy.ai self-bio.

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
cover-image/karpathy-wiki		cover-image/karpathy-wiki
raw		raw
wiki		wiki
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
README.md		README.md
README.zh-CN.md		README.zh-CN.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

karpathy-wiki

Quick Paths

1. Meet the Person

2. How He Thinks

1. Frame-first

2. Analogy-driven

3. Slopes, not points

4. Distrust benchmarks, trust vibes

5. Argue the opposite

3. How He Learns

4. His Worldview

5. How He Works

6. Preferences, Habits, and Behavioral Signatures

7. What He's Built

Six domains of contribution

Key dates

8. Views on AI × Software Engineering

9. Productive Tensions

Repository Structure

Corpus Snapshot

How to Use

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

karpathy-wiki

Quick Paths

1. Meet the Person

2. How He Thinks

1. Frame-first

2. Analogy-driven

3. Slopes, not points

4. Distrust benchmarks, trust vibes

5. Argue the opposite

3. How He Learns

4. His Worldview

5. How He Works

6. Preferences, Habits, and Behavioral Signatures

7. What He's Built

Six domains of contribution

Key dates

8. Views on AI × Software Engineering

9. Productive Tensions

Repository Structure

Corpus Snapshot

How to Use

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Packages