Andrej Karpathy on Software 3.0: Programming LLMs in English and Building for Agents

Y Combinator

Summary:

Andrej Karpathy discusses the profound shifts in software, introducing "Software 3.0" where Large Language Models (LLMs) are programmed in natural language, fundamentally changing how software is developed and interacted with.

  • Software has evolved from traditional code [1] to neural network weights [2] and now to natural language prompts [3], making programming accessible to everyone ("vibe coding").
  • LLMs behave like utilities, fabs, and especially operating systems, centralizing compute but enabling widespread access, similar to computing in the 1960s.
  • LLMs possess emergent human-like "psychology," offering superhuman knowledge but also exhibiting cognitive deficits like hallucinations and amnesia.
  • The future of software involves building "partial autonomy" LLM applications that integrate AI for generation and humans for verification, emphasizing fast human-AI collaboration loops and configurable "autonomy sliders."
  • To support this shift, digital infrastructure must be redesigned for agents, including LLM-friendly documentation and APIs that allow agents to interact directly.
    Software evolution from 1.0 to 3.0, showing Software 3.0 eating through the stack
    Software evolution from 1.0 to 3.0, showing Software 3.0 eating through the stack [ 00:05:37 ]

Intro [00:00]

Software is undergoing a fundamental change, arguably the most significant in 70 years, with two rapid shifts occurring recently. This transformation presents immense opportunities for those entering the industry, requiring the ability to write and rewrite vast amounts of software.

Software Evolution: From 1.0 to 3.0 [01:25]

The talk outlines a progression in software paradigms:

LLMs as Utilities, Fabs, and Operating Systems [06:10]

Andrej Karpathy proposes analogies to understand LLMs:

Historical images of 1950s-1970s mainframe and time-sharing computers, illustrating centralized, expensive computing
Historical images of 1950s-1970s mainframe and time-sharing computers, illustrating centralized, expensive computing [ 00:11:08 ]
* Personal computing for LLMs hasn't fully happened yet, though early hints exist (e.g., Mac Minis for LLM inference).
Images of stacked Mac Minis used for personal LLM computing, suggesting early hints of a personal computing evolution
Images of stacked Mac Minis used for personal LLM computing, suggesting early hints of a personal computing evolution [ 00:11:48 ]
* Direct text chat with LLMs feels like interacting with an operating system through a terminal; a general GUI for LLMs is yet to be invented.
Meme comparing direct text chat with ChatGPT to interacting with an old computer terminal, suggesting the current lack of a general GUI for LLMs
Meme comparing direct text chat with ChatGPT to interacting with an old computer terminal, suggesting the current lack of a general GUI for LLMs [ 00:12:17 ]
* LLMs Flip Technology Diffusion [12:49] * Traditionally, transformative technologies diffuse from government and corporations to consumers (e.g., electricity, cryptography, computing). * LLMs have flipped this script: initial widespread use is by consumers (e.g., asking "how to boil an egg"), with governments and corporations lagging in adoption.
Meme illustrating how LLMs flip the script on technology diffusion, showing consumer adoption first, contrasting military ballistics with boiling an egg
Meme illustrating how LLMs flip the script on technology diffusion, showing consumer adoption first, contrasting military ballistics with boiling an egg [ 00:13:27 ]

LLM Psychology: People Spirits and Cognitive Quirks [14:39]

To effectively program LLMs, one must understand their "psychology."

Designing LLM Apps with Partial Autonomy [18:22]

There are significant opportunities in building LLM-powered applications.

Vibe Coding: Everyone is Now a Programmer [29:06]

The ability to program LLMs in English has made software highly accessible, leading to a new phenomenon called "vibe coding."

Building for Agents: Future-Ready Digital Infrastructure [33:39]

The rise of human-like AI agents necessitates a fundamental shift in how digital infrastructure is designed.

Summary: We’re in the 1960s of LLMs — Time to Build [38:14]