FlareStart
HomeNewsHow ToSources
FlareStart

Where developers start their day. All the tech news & tutorials that matter, in one place.

Quick Links

  • Home
  • News
  • Tutorials
  • Sources
  • Privacy Policy

Connect

© 2026 FlareStart. All rights reserved.

Back to articles
ArticleMachine Learning

Let's reproduce GPT-2 (124M)

via Andrej KarpathyAndrej Karpathy1y ago

We reproduce the GPT-2 (124M) from scratch. This video covers the whole process: First we build the GPT-2 network, then we optimize its training to be really fast, then we set up the training run following the GPT-2 and GPT-3 paper and their hyperparameters, then we hit run, and come back the next morning to see our results, and enjoy some amusing model generations. Keep in mind that in some places this video builds on the knowledge from earlier videos in the Zero to Hero Playlist (see my channel). You could also see this video as building my nanoGPT repo, which by the end is about 90% similar. Links: - build-nanogpt GitHub repo, with all the changes in this video as individual commits: https://github.com/karpathy/build-nanogpt - nanoGPT repo: https://github.com/karpathy/nanoGPT - llm.c repo: https://github.com/karpathy/llm.c - my website: https://karpathy.ai - my twitter: https://twitter.com/karpathy - our Discord channel: https://discord.gg/3zy8kqD9Cp Supplementary links: - Attention

Watch on Andrej Karpathy

Opens in a new tab

Watch on YouTube
20 views

Related Articles

Kids and young people: stay curious and be willing to engage with others
Article

Kids and young people: stay curious and be willing to engage with others

freeCodeCamp.org • 1d ago

I really miss coding.
Article

I really miss coding.

Theo • 1d ago

Here's a cool and easy way to work with colors in your Three.js projects
Article

Here's a cool and easy way to work with colors in your Three.js projects

freeCodeCamp.org • 2d ago

Learn a bit about camera position in Three.js
Article

Learn a bit about camera position in Three.js

freeCodeCamp.org • 5d ago

A great novel can be written in any language - just like great programs.
Article

A great novel can be written in any language - just like great programs.

freeCodeCamp.org • 6d ago

Discover More Articles