[Preview] v1.78.0-stable - MCP Gateway: Control Tool Access by Team/Key
Deploy this versionโ
- Docker
- Pip
pip install litellm==1.78.0
Key Highlightsโ
- MCP Gateway Enhancements - Fine-grained tool control at team/key level, OpenAPI to MCP server conversion, and per-tool parameter allowlists
- GPT-5 Pro & GPT-Image-1-Mini - Day 0 support for OpenAI's GPT-5 Pro (400K context) and gpt-image-1-mini image generation
- UI Performance Boost - Replaces bloated key list calls with lean key aliases endpoint, Turbopack for faster development, and major UI refactors
- EnkryptAI Guardrails - New guardrail integration for content moderation
- Tag-Based Budgets - Support for setting budgets based on request tags
- Azure AD & SSO - Enhanced Azure AD default credentials selection and EntraID app roles support
New Models / Updated Modelsโ
New Model Supportโ
Provider | Model | Context Window | Input ($/1M tokens) | Output ($/1M tokens) | Features |
---|---|---|---|---|---|
OpenAI | gpt-5-pro | 400K | $15.00 | $120.00 | Responses API, reasoning, vision, function calling, prompt caching, web search |
OpenAI | gpt-5-pro-2025-10-06 | 400K | $15.00 | $120.00 | Responses API, reasoning, vision, function calling, prompt caching, web search |
OpenAI | gpt-image-1-mini | - | $2.00/img | - | Image generation and editing |
OpenAI | gpt-realtime-mini | 128K | $0.60 | $2.40 | Realtime audio, function calling |
Azure AI | azure_ai/Phi-4-mini-reasoning | 131K | $0.08 | $0.32 | Function calling |
Azure AI | azure_ai/Phi-4-reasoning | 32K | $0.125 | $0.50 | Function calling, reasoning |
Azure AI | azure_ai/MAI-DS-R1 | 128K | $1.35 | $5.40 | Reasoning, function calling |
Bedrock | au.anthropic.claude-sonnet-4-5-20250929-v1:0 | 200K | $3.30 | $16.50 | Chat, reasoning, vision, function calling, prompt caching |
Bedrock | global.anthropic.claude-sonnet-4-5-20250929-v1:0 | 200K | $3.00 | $15.00 | Chat, reasoning, vision, function calling, prompt caching |
Bedrock | global.anthropic.claude-sonnet-4-20250514-v1:0 | 1M | $3.00 | $15.00 | Chat, reasoning, vision, function calling, prompt caching |
Bedrock | cohere.embed-v4:0 | 128K | $0.12 | - | Embeddings, image input support |
OCI | oci/cohere.command-latest | 128K | $1.56 | $1.56 | Function calling |
OCI | oci/cohere.command-a-03-2025 | 256K | $1.56 | $1.56 | Function calling |
OCI | oci/cohere.command-plus-latest | 128K | $1.56 | $1.56 | Function calling |
Together AI | together_ai/moonshotai/Kimi-K2-Instruct-0905 | 262K | $1.00 | $3.00 | Function calling |
Together AI | together_ai/Qwen/Qwen3-Next-80B-A3B-Instruct | 262K | $0.15 | $1.50 | Function calling |
Together AI | together_ai/Qwen/Qwen3-Next-80B-A3B-Thinking | 262K | $0.15 | $1.50 | Function calling |
Vertex AI | MedGemma models | Varies | Varies | Varies | Medical-focused Gemma models on custom endpoints |
Watson X | 27 new foundation models | Varies | Varies | Varies | Granite, Llama, Mistral families |
Featuresโ
-
- Add GPT-5 Pro model configuration and documentation - PR #15258
- Add stop parameter to non-supported params for GPT-5 - PR #15244
- Day 0 Support, Add gpt-image-1-mini - PR #15259
- Add gpt-realtime-mini support - PR #15283
- Add gpt-5-pro-2025-10-06 to model costs - PR #15344
- Minimal fix: gpt5 models should not go on cooldown when called with temperature!=1 - PR #15330
-
- Add function calling support for Snowflake Cortex REST API - PR #15221
-
- Fix header forwarding for Gemini/Vertex AI providers in proxy mode - PR #15231
-
- Add Global Cross-Region Inference - PR #15210
- Add Cohere Embed v4 support for AWS Bedrock - PR #15298
- Fix(bedrock): include cacheWriteInputTokens in prompt_tokens calculation - PR #15292
- Add Bedrock AU Cross-Region Inference for Claude Sonnet 4.5 - PR #15402
- Converse โ /v1/messages streaming doesn't handle parallel tool calls with Claude models - PR #15315
-
- Add OCI Cohere support with tool calling and streaming capabilities - PR #15365
-
- Add new together models - PR #15383
Bug Fixesโ
- General
LLM API Endpointsโ
Featuresโ
-
- Feat(files): add @client decorator to file operations - PR #15339
-
- Fix gemini cli by actually streaming the response - PR #15264
-
- Azure - passthrough support with router models - PR #15240
Bugsโ
- General
- Fix x-litellm-cache-key header not being returned on cache hit - PR #15348
Management Endpoints / UIโ
Featuresโ
-
Proxy CLI Auth
- Proxy CLI - dont store existing key in the URL, store it in the state param - PR #15290
-
Models + Endpoints
- Make PATCH
/model/{model_id}/update
handleteam_id
consistently with POST/model/new
- PR #15297 - Feature: adds Infinity as a provider in the UI - PR #15285
- Fix: model + endpoints page crash when config file contains router_settings.model_group_alias - PR #15308
- Models & Endpoints Initial Refactor - PR #15435
- Litellm UI API Reference page updates - PR #15438
- Make PATCH
-
Teams
-
UI Infrastructure
- Added prettier to autoformat frontend - PR #15215
- Adds turbopack to the npm run dev command in UI to build faster during development - PR #15250
- (perf) fix: Replaces bloated key list calls with lean key aliases endpoint - PR #15252
- Potentially fixes a UI spasm issue with an expired cookie - PR #15309
- LiteLLM UI Refactor Infrastructure - PR #15236
- Enforces removal of unused imports from UI - PR #15416
- Fix: usage page >> Model Activity >> spend per day graph: y-axis clipping on large spend values - PR #15389
- Updates guardrail provider logos - PR #15421
-
Admin Settings
-
SSO
- SSO - support EntraID app roles - PR #15351
Logging / Guardrail / Prompt Management Integrationsโ
Featuresโ
Guardrailsโ
Spend Tracking, Budgets and Rate Limitingโ
-
Tag Management
- Tag Management - Add support for setting tag based budgets - PR #15433
-
Dynamic Rate Limiter v3
-
Shared Health Check
- Implement Shared Health Check State Across Pods - PR #15380
MCP Gatewayโ
-
Tool Control
- MCP Gateway - UI - Select allowed tools for Key, Teams - PR #15241
- MCP Gateway - Backend - Allow storing allowed tools by team/key - PR #15243
- MCP Gateway - Fine-grained Database Object Storage Control - PR #15255
- MCP Gateway - Litellm mcp fixes team control - PR #15304
- MCP Gateway - QA/Fixes - Ensure Team/Key level enforcement works for MCPs - PR #15305
- Feature: Include server_name in /v1/mcp/server/health endpoint response - PR #15431
-
OpenAPI Integration
-
Configuration
Performance / Loadbalancing / Reliability improvementsโ
-
Router Optimizations
-
Session Management
-
SSL/TLS Performance
- Perf: optimize SSL/TLS handshake performance with prioritized cipher - PR #15398
-
Dependencies
- Upgrades tenacity version to 8.5.0 - PR #15303
-
Data Masking
- Fix - SensitiveDataMasker converts lists to string - PR #15420
General AI Gateway Improvementsโ
Securityโ
- General
- Fix: redact AWS credentials when redact_user_api_key_info enabled - PR #15321
Documentation Updatesโ
-
Provider Documentation
-
Deployment
- Deletion of docker-compose buggy comment that cause
config.yaml
based startup fail - PR #15425
- Deletion of docker-compose buggy comment that cause
New Contributorsโ
- @Gal-bloch made their first contribution in PR #15219
- @lcfyi made their first contribution in PR #15315
- @ashengstd made their first contribution in PR #15362
- @vkolehmainen made their first contribution in PR #15363
- @jlan-nl made their first contribution in PR #15330
- @BCook98 made their first contribution in PR #15402
- @PabloGmz96 made their first contribution in PR #15425