These tools integrates with

DeepEvalvsLangfuse

LLM evaluation framework — 14+ metrics versus OSS LLM engineering platform (acquired by ClickHouse, Jan 2026)

Compare interactively in Explore →

Choose DeepEval when…

•You want a pytest-style framework for LLM testing
•Unit-test-like evals for LLM outputs fit your workflow
•You need RAG-specific metrics like faithfulness and relevancy

Choose Langfuse when…

•You want open-source LLM observability
•Self-hosting your tracing stack is important
•You need cost tracking across models and users

Field

DeepEval

Langfuse

DeepEval

Open-source evaluation framework with 14+ metrics including faithfulness, relevancy, and hallucination detection. Integrates with CI/CD.

Website ↗GitHub ↗

Langfuse

Open-source platform for tracing, evaluations, and prompt management. Self-hostable alternative to LangSmith with broad framework support. Acquired by ClickHouse in January 2026; OSS core continues with deeper ClickHouse-backed analytics.

Website ↗GitHub ↗

Shared Connections3 tools both integrate with

RAGAS PromptFoo OpenAI API

Only DeepEval (4)

LangfuseTruLensInspectGalileo

Only Langfuse (28)

Claude CodeCrewAICursorLangGraphLangChainOpenHandsAutoGenLlamaIndexDifyMastra

Explore the full AI landscape

See how DeepEval and Langfuse fit into the bigger picture — 235 tools, 543 relationships, all mapped.

Open in Explore →

DeepEvalvsLangfuse

Choose DeepEval when…

Choose Langfuse when…

Side-by-side comparison

DeepEval

Langfuse

Shared Connections3 tools both integrate with

Only DeepEval (4)

Only Langfuse (28)