
AGI is not coming!
jack Morris's investigation into GPT-OSS training data https://x.com/jxmnop/status/1953899426075816164?t=3YRhVQDwQLk2gouTSACoqA&s=09

jack Morris's investigation into GPT-OSS training data https://x.com/jxmnop/status/1953899426075816164?t=3YRhVQDwQLk2gouTSACoqA&s=09

I got a bit concerningly obsessed with birds for a few months. Follow Sarah and The Mouth! https://linktr.ee/inkydragon Older bat vid: https://www.you...

Paper: https://research.trychroma.com/context-rot Abstract: Large Language Models (LLMs) are typically presumed to process context uniformly—that is,...

I was diagnosed with hearing loss. The problem is that I don't have hearing loss. Here's how to take care of your hearing in reality. Thanks @DrCliffA...

Take my hand while I gradually show you how to spy in ways that will make KGB agents look like noobs. Thanks @WonderboyMMA for being a great high spee...

Sign up now to access ChatLLM: https://bit.ly/42RlGDV Get a Free System Design PDF with 158 pages by subscribing to our weekly newsletter: https://bit...

An in-depth look at Anthropic's Transformer Circuit Blog Post Part 1 here: https://youtu.be/mU3g2YPKlsA Discord here: https;//ykilcher.com/discord htt...

This Video is Sponsored by Rocket Money. Try Rocket Money for free: https://rocketmoney.com/osman Send your inventions to me: https://opensauce.com/ex...

An in-depth look at Anthropic's Transformer Circuit Blog Post https://transformer-circuits.pub/2025/attribution-graphs/biology.html Abstract: We inves...

The example-driven, practical walkthrough of Large Language Models and their growing list of related features, as a new entry to my general audience s...

This is a general audience deep dive into the Large Language Model (LLM) AI technology that powers ChatGPT and related products. It is covers the full...
![[GRPO Explained] DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models](/_next/image?url=https%3A%2F%2Fi3.ytimg.com%2Fvi%2FbAWV_yrqx4w%2Fhqdefault.jpg&w=1200&q=75)
#deepseek #llm #grpo GRPO is one of the core advancements used in Deepseek-R1, but was introduced already last year in this paper that uses a combinat...

#tokenization #llm #meta This paper does away with tokenization and creates an LLM architecture that operates on dynamically sized "patches" instead o...


I like programming. Fonts tried in this video 1. Menlo 2. Comic Shanns 3. Fira Code 4. JetBrains Mono 5. MonoLisa #benawad ---- Follow me online: http...

We reproduce the GPT-2 (124M) from scratch. This video covers the whole process: First we build the GPT-2 network, then we optimize its training to be...

Go to http://crunchlabs.com/williamosman and claim your free kit today! Get your Open Sauce Tickets before they sell out!! https://opensauce.com/ticke...

Go to http://crunchlabs.com/backyardscientist and claim your free kit today! Also, bring your project to Open Sauce! http://opensauce.com Allens Video...

The Tokenizer is a necessary and pervasive component of Large Language Models (LLMs), where it translates between strings and tokens (text chunks). To...
![[1hr Talk] Intro to Large Language Models](/_next/image?url=https%3A%2F%2Fi3.ytimg.com%2Fvi%2FzjkBMFhNj_g%2Fhqdefault.jpg&w=1200&q=75)
This is a 1 hour general-audience introduction to Large Language Models: the core technical component behind systems like ChatGPT, Claude, and Bard. W...
Showing 11801 - 11820 of 11828 articles