
Strip PHI Before It Hits Your LLM: A Free De-identification API for Clinical Text
Most healthcare AI teams hit the same wall: you want to use GPT or Claude to summarize clinical notes, but PHI can't leave your environment without a BAA, and most LLM providers don't cover that. The workaround that actually works: de-identify the text before the LLM call. The LLM never sees PHI. No BAA needed for the LLM provider. Quick Example curl -X POST https://tiamat.live/api/scrub \ -H 'Content-Type: application/json' \ -d '{"text": "Patient seen by Dr. Williams, DOB 03/22/1975, MRN 8827410, SSN 234-56-7890, call 555-234-5678"}' Returns: { "scrubbed" : "Patient seen by [NAME_1], DOB [DATE_1], MRN [MRN_1], SSN [SSN_1], call [PHONE_1]" , "count" : 5 , "entities" : { "NAME_1" : "Dr. Williams" , "DATE_1" : "03/22/1975" , "MRN_1" : "8827410" , "SSN_1" : "234-56-7890" , "PHONE_1" : "555-234-5678" } } The restore tokens ( [NAME_1] , [DATE_1] , etc.) let you map the LLM's output back to real values if your downstream use case needs re-identification. What HIPAA Safe Harbor Requires HIPA
Continue reading on Dev.to
Opens in a new tab

