FlareStart - Where Developers Start Their Day

Type:All News How To Videos

Category:All Career(1097)DevOps(7788)Machine Learning(11751)Programming Languages(11028)Security(1997)Systems(4507)Tools(7486)Web Development(24491)

ArticleMachine Learningvia Yannic Kilcher

[GRPO Explained] DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

#deepseek #llm #grpo GRPO is one of the core advancements used in Deepseek-R1, but was introduced already last year in this paper that uses a combinat...

Yannic Kilcher1y ago

ArticleMachine Learningvia Yannic Kilcher

Byte Latent Transformer: Patches Scale Better Than Tokens (Paper Explained)

#tokenization #llm #meta This paper does away with tokenization and creates an LLM architecture that operates on dynamically sized "patches" instead o...

Yannic Kilcher1y ago

ArticleMachine Learningvia TheBackyardScientist

The most painful plant in New Zealand

TheBackyardScientist1y ago

ArticleMachine Learningvia Ben Awad

What Programming Font Should You Use?

I like programming. Fonts tried in this video 1. Menlo 2. Comic Shanns 3. Fira Code 4. JetBrains Mono 5. MonoLisa #benawad ---- Follow me online: http...

Ben Awad1y ago

ArticleMachine Learningvia Andrej Karpathy

Let's reproduce GPT-2 (124M)

We reproduce the GPT-2 (124M) from scratch. This video covers the whole process: First we build the GPT-2 network, then we optimize its training to be...

Andrej Karpathy1y ago

ArticleMachine Learningvia William Osman

I Turned 1-Star Toys into Military Nightmares

Go to http://crunchlabs.com/williamosman and claim your free kit today! Get your Open Sauce Tickets before they sell out!! https://opensauce.com/ticke...

William Osman1y ago

ArticleMachine Learningvia TheBackyardScientist

Could you Survive a Blast from the Worlds Biggest Vortex Cannon?

Go to http://crunchlabs.com/backyardscientist and claim your free kit today! Also, bring your project to Open Sauce! http://opensauce.com Allens Video...

TheBackyardScientist1y ago

ArticleMachine Learningvia Andrej Karpathy

Let's build the GPT Tokenizer

The Tokenizer is a necessary and pervasive component of Large Language Models (LLMs), where it translates between strings and tokens (text chunks). To...

Andrej Karpathy2y ago

ArticleMachine Learningvia Andrej Karpathy

[1hr Talk] Intro to Large Language Models

This is a 1 hour general-audience introduction to Large Language Models: the core technical component behind systems like ChatGPT, Claude, and Bard. W...

Andrej Karpathy2y ago

How-ToMachine Learningvia Better Programming

GPT Function Calling: 5 Underrated Use Cases

OpenAI’s backend converting messy unstructured data to structured data via functions OpenAI’s “Function Calling” might be the most groundbreaking yet...

Max Brodeur-Urbas2y ago

How-ToMachine Learningvia Better Programming

Deploy CoreML Models on the Server with Vapor

Get the benefits of Apple’s ML tools server-side. SwiftUI client showing image classification results Recently, at Sovrn , we had an AI Hackathon wher...

Drew Althage2y ago

NewsMachine Learningvia Better Programming

Full Stack Engineers Don’t Exist!

There’s a problem with those thousands of jobs available for Full Stack Engineers or Developers on LinkedIn, like a unicorn, that person… Continue rea...

Stephen Walsh2y ago

ArticleMachine Learningvia William Osman

Someone Paid $10,000 to Patent This

Thanks to Google Meet for sponsoring a portion of this video! ► Try Google Meet on Android or install it on your iOS device! https://goo.gle/3Rl9Opw O...

William Osman2y ago

ArticleMachine Learningvia Andrej Karpathy

Let's build GPT: from scratch, in code, spelled out.

We build a Generatively Pretrained Transformer (GPT), following the paper "Attention is All You Need" and OpenAI's GPT-2 / GPT-3. We talk about connec...

Andrej Karpathy3y ago

ArticleMachine Learningvia Andrej Karpathy

Building makemore Part 5: Building a WaveNet

We take the 2-layer MLP from previous video and make it deeper with a tree-like structure, arriving at a convolutional neural network architecture sim...

Andrej Karpathy3y ago

ArticleMachine Learningvia Andrej Karpathy

Building makemore Part 2: MLP

We implement a multilayer perceptron (MLP) character-level language model. In this video we also introduce many basics of machine learning (e.g. model...

Andrej Karpathy3y ago

ArticleMachine Learningvia Andrej Karpathy

The spelled-out intro to language modeling: building makemore

We implement a bigram character-level language model, which we will further complexify in followup videos into a modern Transformer language model, li...

Andrej Karpathy3y ago

Showing 11721 - 11737 of 11737 articles