Playwright for AI Agents - test what your agent DOES, not what it SAYS. YAML-first, 3200+ tests.
-
Updated
Apr 4, 2026 - TypeScript
Playwright for AI Agents - test what your agent DOES, not what it SAYS. YAML-first, 3200+ tests.
Behavioral testing for LLM applications. pytest plugin with semantic assertions, multi-turn conversation testing, and drift detection. No LLM judge needed.
Bots I broke and how I broke them to be a future conversational Red Teamer
Agent behavioral testing -- YAML specs for tool calls, sequences, constraints
Advanced Mockito usage featuring Spies, Mocks, and behavioral verification to test a shopping cart checkout flow.
Catch AI behavioral regressions before merge. Run eval suites for prompts, agents, and workflows in GitHub Actions.
AI persona-based behavioral testing for web apps. No test scripts. YAML-configured. Vision-powered.
LLM drift detector — know within 5 min when GPT-4o, Claude, or Gemini silently changes behaviour. Open source, self-hostable.
Add a description, image, and links to the behavioral-testing topic page so that developers can more easily learn about it.
To associate your repository with the behavioral-testing topic, visit your repo's landing page and select "manage topics."