
I built an open-source real-time LLM hallucination guardrail — here are the benchmarks
## What is Director-Class AI?

An open-source Python library that guards LLM output in real time. It watches tokens as they stream and halts generation the moment it detects a hallucination. It uses NLI (Natural Language Inference via DeBERTa/FactCG) and optional RAG knowledge grounding to score each claim against source documents.

```shell
pip install director-ai
```

Two-line integration:

```python
import openai
from director_ai import guard

client = guard(openai.OpenAI())  # wraps any OpenAI/Anthropic client
```

## Benchmarks (measured, not aspirational)

| Metric | Value | Conditions |
| --- | --- | --- |
| Balanced accuracy | 75.8% | FactCG on LLM-AggreFact (29,320 samples) |
| GPU latency | 14.6 ms/pair | GTX 1060, ONNX, batch=16 |
| L40S latency | 0.5 ms/pair | FP16, batch=32 |
| E2E catch rate | 90.7% | Hybrid mode, 600 HaluEval traces |
| Rust BM25 speedup | 10.2x | Over pure Python implementation |

## Framework Integrations

LangChain, LlamaIndex, LangGraph, CrewAI, Haystack, DSPy, Semantic Kernel, and SDK Guard (wraps OpenAI/Anthropic/Bedrock/Gemini/Cohere clients).

## Honest Limitations

NLI-onl
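To make the "halts generation the moment it detects a hallucination" behavior concrete, here is a minimal sketch of a streaming guard loop. Everything in it is illustrative: `score_claim` is a toy word-overlap stand-in for the NLI entailment model the library actually uses, and `guarded_stream` and the 0.5 threshold are hypothetical names, not director-ai's real API.

```python
# Hypothetical sketch of a streaming hallucination guard: buffer tokens into
# sentences, score each completed sentence against source documents, and halt
# generation when a sentence's support score drops below a threshold.
# `score_claim` stands in for an NLI model's entailment probability; names
# and the threshold are illustrative, not director-ai's actual API.

def score_claim(claim: str, sources: list[str]) -> float:
    """Toy scorer: fraction of claim words found in any source document.
    A real guard would use DeBERTa/FactCG entailment scores instead."""
    words = set(claim.lower().split())
    if not words:
        return 1.0
    supported = {w for w in words if any(w in s.lower() for s in sources)}
    return len(supported) / len(words)

def guarded_stream(tokens, sources, threshold=0.5):
    """Yield tokens sentence by sentence; stop at the first unsupported one."""
    buffer = []
    for tok in tokens:
        buffer.append(tok)
        if tok.endswith((".", "!", "?")):       # crude sentence boundary
            sentence = "".join(buffer)
            if score_claim(sentence, sources) < threshold:
                return                          # halt: drop the bad sentence
            yield from buffer
            buffer = []
    yield from buffer                           # trailing partial sentence

sources = ["the eiffel tower is in paris."]
tokens = ["The", " Eiffel", " Tower", " is", " in", " Paris", ".",
          " It", " was", " built", " on", " Mars", "."]
out = "".join(guarded_stream(tokens, sources))  # grounded sentence passes,
                                                # the Mars claim is cut
```

In this sketch the unsupported sentence is held back entirely; a lower-latency variant could stream tokens immediately and only signal a halt at the boundary, trading a leaked sentence for responsiveness.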
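For context on the Rust BM25 row in the benchmarks table, this is the kind of pure-Python scoring loop such a port accelerates. It is a standard Okapi BM25 with the usual k1/b defaults, written for clarity; it is not director-ai's implementation, and the function name and example corpus are made up for illustration.

```python
import math
from collections import Counter

# Minimal pure-Python Okapi BM25 ranking, the style of hot loop a Rust
# rewrite can speed up ~10x. Standard formula, illustrative only.
def bm25_scores(query: str, docs: list[str], k1: float = 1.5, b: float = 0.75):
    tokenized = [d.lower().split() for d in docs]
    n = len(tokenized)
    avgdl = sum(len(d) for d in tokenized) / n
    df = Counter()                       # document frequency per term
    for d in tokenized:
        df.update(set(d))
    scores = []
    for d in tokenized:
        tf = Counter(d)                  # term frequency in this doc
        score = 0.0
        for term in query.lower().split():
            if term not in tf:
                continue
            idf = math.log((n - df[term] + 0.5) / (df[term] + 0.5) + 1)
            score += idf * tf[term] * (k1 + 1) / (
                tf[term] + k1 * (1 - b + b * len(d) / avgdl))
        scores.append(score)
    return scores

docs = ["the cat sat on the mat", "dogs chase cats", "the mat was red"]
scores = bm25_scores("cat mat", docs)
best = max(range(len(docs)), key=scores.__getitem__)  # doc matching both terms
```

The inner loop over terms and documents is pure arithmetic on small dicts, which is exactly where a compiled implementation wins over CPython.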
Continue reading on Dev.to

