Policy-Locked Triage for Messy Citizen Text: A Municipal-Style Routing PoC with SFT and Preference Alignment

How I stabilized noisy 311-style requests with supervised training and reviewer preferences in Python

TL;DR

This write-up is an experimental account of how I built a small routing proof of concept for synthetic municipal-style service requests. The goal was not to ship a city-wide system. From my perspective, the interesting part is the training story: start with labeled text, fit a transparent classifier, then inject reviewer-style preferences so the policy moves toward routes that match operational nuance.

The repository is public, fully synthetic, and designed to run on a laptop without calling a hosted large language model. If you are looking for a polished civic product, this is not it. If you are looking for a clean, inspectable playground that mirrors how I think about aligning lightweight agents before any serious conversation about production, this article walks through the motivation, design, code, and limitations in depth.

Introduction

I have spent a fair amount of time thin

