Director of AI at Tesla. Previously a research scientist at OpenAI and CS PhD student at Stanford. I like to train deep neural nets on large datasets 🧠🤖💥
I think this is mostly right.
- LLMs created a whole new layer of abstraction and profession.
- I've so far called this role "Prompt Engineer" but agree it is misleading. It's not just prompting alone, there's a lot of glue code/infra around it. Maybe "AI Engineer" is ~usable, though it takes something quite specific and stretches it a bit too broad.
- ML people train algorithms/networks, usually from scratch, usually at lower capability.
- LLM training is becoming sufficiently different from classic ML because of its systems-heavy workloads that it is splitting off into a new kind of role, focused on very large-scale training of transformers on supercomputers.
- In numbers, there are probably going to be significantly more AI Engineers than ML engineers / LLM engineers.
- One can be quite successful in this role without ever training anything.
- I don't fully follow the Software 1.0/2.0 framing. Software 3.0 (imo ~prompting LLMs) is amusing because prompts are human-designed "code", but in English, and interpreted by an LLM (itself now a Software 2.0 artifact). AI Engineers simultaneously program in all 3 paradigms (a sketch of all three below). It's a bit 😵‍💫
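To make the three paradigms concrete, here is a minimal sketch (mine, not from the thread) of the same toy task written three ways. The `llm` argument in the 3.0 version is deliberately left as an abstract function, since any specific client or model name would be an assumption:

```python
# Illustrative sketch: sentiment classification in each paradigm.

# Software 1.0: the logic is explicit, human-written code.
def sentiment_1_0(text: str) -> str:
    positive = {"great", "love", "excellent"}
    negative = {"bad", "hate", "terrible"}
    words = set(text.lower().split())
    score = len(words & positive) - len(words & negative)
    return "positive" if score >= 0 else "negative"

# Software 2.0: the logic lives in learned weights; the code just runs them.
# (These weights are stand-ins for parameters produced by training.)
WEIGHTS = {"great": 1.2, "love": 1.5, "bad": -1.3, "hate": -1.8}

def sentiment_2_0(text: str) -> str:
    score = sum(WEIGHTS.get(w, 0.0) for w in text.lower().split())
    return "positive" if score >= 0 else "negative"

# Software 3.0: the "program" is an English prompt, interpreted by an LLM
# (which is itself a Software 2.0 artifact).
PROMPT = "Classify the sentiment of the following text as positive or negative:\n{text}"

def sentiment_3_0(text: str, llm) -> str:
    # llm: any text-completion callable, e.g. a wrapper around an API client
    return llm(PROMPT.format(text=text))
```

The point of the sketch: an AI Engineer routinely writes all three at once, e.g. 1.0-style glue code around a 3.0-style prompt sent to a 2.0-style model.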
+1 to the best AI newsletter atm, which I enjoy skimming; great/ambitious work by @swyx & friends: [link]. "Skimming" because the issues are very long. Not sure how it is built; sounds like there is a lot of LLM aid going on, indexing ~356 Twitters, ~21 Discords, etc.
Excellent and unintuitive read on GPUs. The chip doing the compute has a tiny amount of memory & is connected to main memory through what amounts to a straw. Most of the energy goes to data movement too. Many repercussions. E.g. latency is better predicted by # activations than by # flops.
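One way to see that last point: compare each op's arithmetic intensity (FLOPs per byte moved) to the chip's compute-to-bandwidth ratio; ops below that ridge are limited by data movement, not by flops. A back-of-envelope sketch; the hardware numbers are my own roughly-A100-class assumptions, not figures from the article:

```python
# An op is memory-bound when its FLOPs per byte moved falls below the
# chip's (peak compute) / (peak bandwidth) ratio.

PEAK_FLOPS = 312e12   # ~A100 bf16 tensor-core peak, FLOPs/s (assumed)
PEAK_BW    = 2.0e12   # ~A100 HBM bandwidth, bytes/s (assumed)
RIDGE      = PEAK_FLOPS / PEAK_BW   # ~156 FLOPs/byte to stay compute-bound

def matmul_intensity(m, k, n, bytes_per_el=2):
    flops = 2 * m * k * n                            # multiply-accumulates
    bytes_moved = bytes_per_el * (m*k + k*n + m*n)   # read A, B; write C
    return flops / bytes_moved

def elementwise_intensity(n, bytes_per_el=2):
    flops = n                                        # one op per element
    bytes_moved = bytes_per_el * 2 * n               # read input, write output
    return flops / bytes_moved

for name, ai in [("matmul 4096^3",            matmul_intensity(4096, 4096, 4096)),
                 ("matmul m=1 (decode step)", matmul_intensity(1, 4096, 4096)),
                 ("elementwise (e.g. GeLU)",  elementwise_intensity(4096 * 4096))]:
    bound = "compute-bound" if ai > RIDGE else "memory-bound"
    print(f"{name}: {ai:.1f} FLOPs/byte -> {bound}")
```

The m=1 matmul is the shape of an autoregressive decode step: same weights, almost no flops per byte moved, which is why generation latency tracks bytes of activations/weights moved rather than raw flops.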