Inference servingOpen sourceUpdated 2026
LiteLLM
Intermediate · LLM gateway/proxy
Proxy and SDK for routing requests across many LLM providers with OpenAI-compatible interfaces.
Best for
Teams managing multiple model providers, budgets, keys, and routing rules.
Why use it
Useful when you want one gateway across local, open, and commercial models.
Tradeoffs
A gateway adds another operational layer; keep routing policies simple.
Key features
- Provider routing
- OpenAI-compatible proxy
- Spend controls
Alternatives
OpenRouter, Portkey, LocalAI
Where it fits
LiteLLM belongs in the inference serving layer of an open AI stack. Evaluate it against your model runtime, privacy needs, deployment target, and the amount of operational complexity your team can support.
CategoryInference servingLicenseMITDeploymentLLM gateway/proxyModeSelf-hosted proxy or cloud
LiteLLM GitHub →Recommendation
Use LiteLLM when model routing and provider abstraction matter.