Show HN: We fingerprinted 178 AI models' writing styles and similarity clusters

We have a dataset of 3,095 standardized AI responses across 43 prompts. From each response, we extract a 32-dimension stylometric fingerprint (lexical richness, sentence structure, punctuation habits, formatting patterns, discourse markers).

Some findings:

- 9 clone clusters (>90% cosine similarity on z-normalized feature vectors)
- Mistral Large 2 and Large 3 2512 score 84.8% on a composite metric combining 5 independent signals
- Gemini 2.5 Flash Lite writes 78% like Claude 3 Opus. Costs 185x less
- Meta has the strongest provider "house style" (37.5x distinctiveness ratio)
- "Satirical fake news" is the prompt that causes the most writing convergence across all models
- "Count letters" causes the most divergence

The composite clone score combines: prompt-controlled head-to-head similarity, per-feature Pearson correlation across challenges, response length correlation, cross-prompt consistency, and aggregate cosine similarity.

Tech: stylometric extraction in Node.js, z-score normaliz
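
For readers who want to see what ">90% cosine similarity on z-normalized feature vectors" could look like in practice, here is a minimal Node.js sketch. It is not the project's code: the function names (`zNormalize`, `cosineSimilarity`, `findClonePairs`), the use of population standard deviation, and the toy 3-feature vectors are all assumptions made for illustration (the post describes 32 features per response). It also only flags pairs above the threshold and does not attempt the grouping into clusters.

```javascript
// Illustrative sketch only; all names and data here are hypothetical.

// Z-normalize each feature column across all models (mean 0, std 1).
// Assumption: population standard deviation; constant columns map to 0.
function zNormalize(rows) {
  const dims = rows[0].length;
  const means = Array(dims).fill(0);
  const variances = Array(dims).fill(0);
  for (const row of rows) {
    row.forEach((v, j) => { means[j] += v / rows.length; });
  }
  for (const row of rows) {
    row.forEach((v, j) => { variances[j] += (v - means[j]) ** 2 / rows.length; });
  }
  return rows.map((row) =>
    row.map((v, j) => {
      const sd = Math.sqrt(variances[j]);
      return sd === 0 ? 0 : (v - means[j]) / sd;
    })
  );
}

// Cosine similarity of two equal-length vectors; 0 if either is all zeros.
function cosineSimilarity(a, b) {
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  if (normA === 0 || normB === 0) return 0;
  return dot / Math.sqrt(normA * normB);
}

// Return every pair of models whose normalized fingerprints exceed the threshold.
function findClonePairs(names, rows, threshold = 0.9) {
  const z = zNormalize(rows);
  const pairs = [];
  for (let i = 0; i < z.length; i++) {
    for (let j = i + 1; j < z.length; j++) {
      const similarity = cosineSimilarity(z[i], z[j]);
      if (similarity > threshold) {
        pairs.push({ a: names[i], b: names[j], similarity });
      }
    }
  }
  return pairs;
}

// Toy data: made-up model names with 3 made-up features each.
const exampleNames = ["model-a", "model-b", "model-c", "model-d"];
const exampleRows = [
  [1, 2, 3],
  [1, 2, 3], // same fingerprint as model-a, so this pair should be flagged
  [3, 1, 0],
  [0, 4, 1],
];
console.log(findClonePairs(exampleNames, exampleRows));
```

Normalizing per feature before taking the cosine keeps any single large-valued feature (say, raw response length) from dominating the similarity, which is presumably why the post specifies z-normalized vectors.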


