
How to Test AI Agent Tool Calls with Pytest
Your AI agent calls the right tool in development. Then it picks the wrong one in production, sends a Slack message instead of querying your database, and you have no idea why.

The problem: LLM responses are non-deterministic. You can't write a traditional test that says "given this input, expect this exact output." So most developers skip testing their agents entirely.

The fix: don't test the LLM. Mock it. Test everything around it — the tool routing, the argument extraction, the result handling — deterministically with pytest. Here's how in under 5 minutes.

The Agent You're Testing

Let's say you have a simple agent that takes a user message, sends it to an LLM with a list of tools, and executes whichever tool the LLM picks:

```python
# agent.py
import json

from openai import OpenAI

TOOLS = {
    "search_docs": lambda query: f"Results for: {query}",
    "create_ticket": lambda title, priority="medium": f"Ticket created: {title} [{priority}]",
    # The excerpt is cut off mid-definition here; this signature is a guess.
    "send_notification": lambda channel, message: f"Sent to {channel}: {message}",
}
```
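The excerpt ends before the agent loop and its tests, but the approach it describes can be sketched deterministically. Below, `run_agent` and `fake_tool_call_response` are illustrative names, not from the original article: the agent loop dispatches on whatever tool call the (mocked) client returns, and the test stubs the client so no real LLM is involved. The fake response mirrors the shape of an OpenAI chat completion with one tool call (`choices[0].message.tool_calls[0].function.name` / `.arguments`):

```python
# A minimal sketch, assuming the TOOLS dict from the article.
# run_agent and fake_tool_call_response are hypothetical helpers.
import json
from types import SimpleNamespace
from unittest.mock import MagicMock

TOOLS = {
    "search_docs": lambda query: f"Results for: {query}",
    "create_ticket": lambda title, priority="medium": f"Ticket created: {title} [{priority}]",
}

def run_agent(client, user_message):
    """Send the message to the LLM and execute whichever tool it picks."""
    response = client.chat.completions.create(
        model="gpt-4o",
        messages=[{"role": "user", "content": user_message}],
        tools=[...],  # tool schemas omitted for brevity
    )
    call = response.choices[0].message.tool_calls[0]
    args = json.loads(call.function.arguments)  # arguments arrive as a JSON string
    return TOOLS[call.function.name](**args)    # the routing under test

def fake_tool_call_response(name, arguments):
    """Build an object shaped like an OpenAI chat completion with one tool call."""
    call = SimpleNamespace(
        function=SimpleNamespace(name=name, arguments=json.dumps(arguments))
    )
    return SimpleNamespace(
        choices=[SimpleNamespace(message=SimpleNamespace(tool_calls=[call]))]
    )

def test_routes_search_to_search_docs():
    # Mock the LLM: force a specific tool choice, then assert on the routing.
    client = MagicMock()
    client.chat.completions.create.return_value = fake_tool_call_response(
        "search_docs", {"query": "pytest"}
    )
    assert run_agent(client, "find docs about pytest") == "Results for: pytest"
```

Because the mocked client always returns the same tool call, the test exercises argument extraction and dispatch without any network call, so it passes or fails the same way on every run.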
Continue reading on Dev.to Python




