Cloudflare AI Gateway

LLM gateway with caching, analytics, and rate limiting

App Infrastructure

About

LLM gateway from Cloudflare that sits in front of any AI provider and adds request caching, cost analytics, rate limiting, and fallback routing. Zero-config for teams already on Cloudflare Workers. Free tier available on all Cloudflare plans.

Choose Cloudflare AI Gateway when…

•You're already deploying on Cloudflare Workers or Pages
•You want LLM caching and cost analytics with zero infrastructure overhead
•You need a lightweight gateway without running LiteLLM yourself

Builder Slot

Which models does your stack route through?Optional for most stacks

A gateway that normalizes calls across providers — one API for all models, with fallbacks

Dev Tools

Not applicable

App Infra

Optional

Hybrid

Optional

Other tools in this slot:

LiteLLM OpenRouter PortKey Unify Martian Not Diamond

Stack Genome Detection

AIchitect's Genome scanner detects Cloudflare AI Gateway in your project via these signals:

env vars

CLOUDFLARE_API_TOKEN

Integrates with (1)

OpenAI APILLM Infrastructure

Cloudflare AI Gateway proxies OpenAI traffic, adding caching, rate limits, fallbacks, and analytics without code changes — clients keep using the OpenAI SDK, just pointed at the gateway URL.

→ Add caching, fallbacks, and rate limits in front of OpenAI without touching application code.

Compare →

Alternatives to consider (2)

PortKeycompare →LiteLLMcompare →

Pricing

✦ Free tier available

FreeFree

Pro$5/mo

Pulse

● No incidents in the last 90 days

Badge

Add to your GitHub README

[![Cloudflare AI Gateway](https://www.aichitect.dev/badge/tool/cloudflare-ai-gateway)](https://www.aichitect.dev/tool/cloudflare-ai-gateway)

Explore the full AI landscape

See how Cloudflare AI Gateway fits into the bigger picture — browse all 207 tools and their relationships.

Explore graph →