FlareStart
HomeNewsHow ToSources
FlareStart

Where developers start their day. All the tech news & tutorials that matter, in one place.

Quick Links

  • Home
  • News
  • Tutorials
  • Sources
  • Privacy Policy

Connect

© 2026 FlareStart. All rights reserved.

Back to articles
Show HN: Duplicate 3 layers in a 24B LLM, logical deduction .22→.76. No training
How-ToMachine Learning

Show HN: Duplicate 3 layers in a 24B LLM, logical deduction .22→.76. No training

via Hacker Newsxlayn7h ago

I replicated David Ng's RYS method ( https://dnhkng.github.io/posts/rys/ ) on consumer AMD GPUs (RX 7900 XT + RX 6950 XT) and found something I didn't expect. Transformers appear to have discrete "reasoning circuits" — contiguous blocks of 3-4 layers that act as indivisible cognitive units. Duplicate the right block and the model runs its reasoning pipeline twice. No weights change. No training. The model just thinks longer. The results on standard benchmarks (lm-evaluation-harness, n=50): Devstral-24B, layers 12-14 duplicated once: - BBH Logical Deduction: 0.22 → 0.76 - GSM8K (strict): 0.48 → 0.64 - MBPP (code gen): 0.72 → 0.78 - Nothing degraded Qwen2.5-Coder-32B, layers 7-9 duplicated once: - Reasoning probe: 76% → 94% The weird part: different duplication patterns create different cognitive "modes" from the same weights. Double-pass boosts math. Triple-pass boosts emotional reasoning. Interleaved doubling (13,13,14,14,15,15,16) creates a pure math specialist. Same model, same VRAM,

Continue reading on Hacker News

Opens in a new tab

Read Full Article
0 views

Related Articles

How to Structure Large Flutter Projects Like Senior Developers
How-To

How to Structure Large Flutter Projects Like Senior Developers

Medium Programming • 32m ago

Why the Monolith is a Dead End for the Weekend Indie Developer
How-To

Why the Monolith is a Dead End for the Weekend Indie Developer

Medium Programming • 33m ago

Understand OpenClaw by Building One —Part 3
How-To

Understand OpenClaw by Building One —Part 3

Medium Programming • 51m ago

DSL — Recursive Descent Parser
How-To

DSL — Recursive Descent Parser

Medium Programming • 1h ago

A simple web-based log viewer
How-To

A simple web-based log viewer

Medium Programming • 1h ago

Discover More Articles