Open-source LLM evaluation framework by the UK AI Safety Institute
Inspect is an open-source framework for building LLM evaluations, developed by the UK AI Safety Institute. It provides task composition, built-in datasets, scorers, and solvers for systematic benchmarking of LLM capabilities, safety, and alignment properties.
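As a rough illustration of how a dataset, a solver, and a scorer compose into a task, here is a minimal standard-library sketch. It does not use the Inspect API: the names `Sample`, `Task`, `stub_solver`, `exact_match`, and `run` are hypothetical stand-ins chosen for this example, and the solver returns canned answers instead of calling a model.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class Sample:
    """One evaluation item: a prompt and its expected answer."""
    input: str
    target: str


# A solver turns a prompt into an output; a scorer grades output against target.
Solver = Callable[[str], str]
Scorer = Callable[[str, str], bool]


@dataclass
class Task:
    """Hypothetical task: a dataset plus the solver and scorer applied to it."""
    dataset: list[Sample]
    solver: Solver
    scorer: Scorer


def stub_solver(prompt: str) -> str:
    """Stand-in for a model call (assumption: canned answers, no LLM involved)."""
    canned = {
        "What is 2 + 2?": "4",
        "What is the capital of France?": "Lyon",  # deliberately wrong
    }
    return canned.get(prompt, "")


def exact_match(output: str, target: str) -> bool:
    """Case-insensitive exact-match scorer."""
    return output.strip().lower() == target.strip().lower()


def run(task: Task) -> float:
    """Run the solver on every sample and return the fraction scored correct."""
    if not task.dataset:
        return 0.0
    correct = sum(task.scorer(task.solver(s.input), s.target) for s in task.dataset)
    return correct / len(task.dataset)


task = Task(
    dataset=[
        Sample("What is 2 + 2?", "4"),
        Sample("What is the capital of France?", "Paris"),
    ],
    solver=stub_solver,
    scorer=exact_match,
)
accuracy = run(task)
print(f"accuracy: {accuracy:.2f}")
```

Keeping the three pieces separate means the same dataset can be re-scored with a different scorer, or re-run with a different solver, without touching the rest of the task.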
Tests, evals, and experiment tracking to measure and improve your AI output quality
AIchitect's Genome scanner detects Inspect in your project via these signals:
inspect-ai
Add to your GitHub README
[](https://aichitect.dev/tool/inspect-ai)
Explore the full AI landscape
See how Inspect fits into the bigger picture: browse all 207 tools and their relationships.