
PromptFoo vs DeepEval

A CLI and library for prompt testing and red-teaming, versus an LLM evaluation framework with 14+ metrics.


Choose PromptFoo when…

  • You want CLI-first, config-driven LLM evals (see the config sketch after this list)
  • You want to run eval suites in CI/CD pipelines
  • You need red-teaming and safety testing built in
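
A minimal sketch of that config-driven style, assuming PromptFoo's YAML config format; the prompt, provider IDs, and assertion values here are illustrative placeholders, not recommendations:

```yaml
# promptfooconfig.yaml: minimal sketch (provider IDs and values are illustrative)
prompts:
  - "Summarize the following text in one sentence: {{text}}"

providers:
  - openai:gpt-4o-mini
  - anthropic:claude-3-5-sonnet-latest

tests:
  - vars:
      text: "PromptFoo runs the same prompt against several models and checks assertions."
    assert:
      # deterministic check: fail if the output omits the key term
      - type: contains
        value: "PromptFoo"
      # model-graded check: an LLM judges the output against a rubric
      - type: llm-rubric
        value: "Is a single, accurate sentence"
```

Running `npx promptfoo@latest eval` against a file like this produces a pass/fail matrix across every prompt and provider, which is what makes it straightforward to gate in CI.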

Choose DeepEval when…

  • You want a pytest-style framework for LLM testing (see the sketch after this list)
  • You prefer unit-test-style evals for LLM outputs
  • You need RAG-specific metrics like faithfulness and relevancy
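
A minimal sketch of the pytest style, assuming DeepEval's LLMTestCase and assert_test API; the strings and thresholds are placeholders:

```python
# test_rag_answer.py: minimal DeepEval sketch (strings and thresholds are placeholders)
from deepeval import assert_test
from deepeval.metrics import AnswerRelevancyMetric, FaithfulnessMetric
from deepeval.test_case import LLMTestCase

def test_rag_answer():
    test_case = LLMTestCase(
        input="What does PromptFoo do?",
        actual_output="PromptFoo tests and compares prompts across models.",
        # retrieval_context is what the RAG metrics grade against
        retrieval_context=["PromptFoo is a CLI for testing prompts across models."],
    )
    # each metric fails the test if its score falls below its threshold
    assert_test(test_case, [
        AnswerRelevancyMetric(threshold=0.7),
        FaithfulnessMetric(threshold=0.7),
    ])
```

Because it is an ordinary pytest file, it runs under `deepeval test run` (or plain pytest) and so drops into existing CI workflows.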

Side-by-side comparison

| Field         | PromptFoo     | DeepEval      |
|---------------|---------------|---------------|
| Category      | Prompt & Eval | Prompt & Eval |
| Type          | Open Source   | Open Source   |
| Free Tier     | ✓ Yes         | ✓ Yes         |
| Pricing Plans |               |               |
| GitHub Stars  | 5,000         | 5,500         |
| Health        | 80 (Active)   | 80 (Active)   |

PromptFoo

Test and compare prompts across models. Built-in red-teaming, regression testing, and side-by-side model comparison.
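
The red-teaming lives in the same config file. A hedged sketch, assuming the `redteam` section that `promptfoo redteam init` scaffolds; the plugin and strategy names below are illustrative, not a verified list:

```yaml
# red-team sketch (plugin and strategy names are illustrative)
targets:
  - openai:gpt-4o-mini

redteam:
  purpose: "Customer-support assistant for a retail store"
  plugins:
    - pii        # probe for personal-data leakage
    - harmful    # probe for harmful-content generation
  strategies:
    - jailbreak  # wrap probes in jailbreak framings
```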

DeepEval

Open-source evaluation framework with 14+ metrics including faithfulness, relevancy, and hallucination detection. Integrates with CI/CD.
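
DeepEval's metrics also work outside of test runs. A minimal sketch, assuming the standalone measure() interface on metric objects; the strings are placeholders:

```python
# standalone metric sketch (strings are placeholders)
from deepeval.metrics import HallucinationMetric
from deepeval.test_case import LLMTestCase

test_case = LLMTestCase(
    input="When was the tool first released?",
    actual_output="It was first released in 2019 by a team at NASA.",
    # HallucinationMetric grades the output against this trusted context
    context=["The tool was first released in 2023."],
)

metric = HallucinationMetric(threshold=0.5)
metric.measure(test_case)
print(metric.score, metric.reason)  # numeric score plus an LLM-generated explanation
```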

Shared Connections: 3 tools both integrate with

Only PromptFoo (3)

Vellum, DeepEval, Agenta

Only DeepEval (4)

RAGAS, PromptFoo, TruLens, Inspect

Explore the full AI landscape

See how PromptFoo and DeepEval fit into the bigger picture — 207 tools, 452 relationships, all mapped.

Open in Explore →