🚅 LiteLLM
Call all LLM APIs using the OpenAI format [Bedrock, Huggingface, VertexAI, TogetherAI, Azure, OpenAI, Groq etc.]
LiteLLM Proxy Server (LLM Gateway) | Hosted Proxy (Preview) | Enterprise Tier
LiteLLM manages:
- Translate inputs to the provider's `completion`, `embedding`, and `image_generation` endpoints
- Consistent output: text responses will always be available at `['choices'][0]['message']['content']`
- Retry/fallback logic across multiple deployments (e.g. Azure/OpenAI) - Router (see the sketch after this list)
- Set budgets & rate limits per project, API key, and model - LiteLLM Proxy Server (LLM Gateway)
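To illustrate the retry/fallback routing above, here is a minimal sketch using `litellm.Router`; the deployment names, keys, and endpoints below are placeholders, not real credentials.

```python
from litellm import Router

# Two deployments serving the same public model name; the Router
# load-balances across them and retries/falls back on failure.
model_list = [
    {
        "model_name": "gpt-4o",  # alias callers use
        "litellm_params": {
            "model": "azure/my-gpt4o-deployment",  # hypothetical Azure deployment
            "api_key": "your-azure-key",
            "api_base": "https://my-endpoint.openai.azure.com/",
        },
    },
    {
        "model_name": "gpt-4o",
        "litellm_params": {"model": "openai/gpt-4o", "api_key": "your-openai-key"},
    },
]

router = Router(model_list=model_list, num_retries=2)

response = router.completion(
    model="gpt-4o",
    messages=[{"role": "user", "content": "Hey, how's it going?"}],
)
print(response["choices"][0]["message"]["content"])
```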
Jump to LiteLLM Proxy (LLM Gateway) Docs
Jump to Supported LLM Providers
🚨 Stable Release: Use docker images with the `-stable` tag. These have undergone 12-hour load tests before being published. More information about the release cycle here
Support for more providers. Missing a provider or LLM platform? Raise a feature request.
Usage (Docs)
Important
LiteLLM v1.0.0 now requires `openai>=1.0.0`. Migration guide here.
LiteLLM v1.40.14+ now requires `pydantic>=2.0.0`. No changes required.
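A minimal sketch of a basic `completion` call; the API keys below are placeholders:

```python
from litellm import completion
import os

# Placeholder keys - set the env vars for the providers you actually use
os.environ["OPENAI_API_KEY"] = "your-openai-key"
os.environ["ANTHROPIC_API_KEY"] = "your-anthropic-key"

messages = [{"content": "Hello, how are you?", "role": "user"}]

# OpenAI call
response = completion(model="openai/gpt-4o", messages=messages)

# Anthropic call - same interface, different model string
response = completion(model="anthropic/claude-3-sonnet-20240229", messages=messages)
print(response)
```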
Response (OpenAI Format)
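A representative response shape (IDs, timestamps, and token counts below are illustrative):

```json
{
    "id": "chatcmpl-565d891b-a42e-4c39-8d14-82a1f5208885",
    "created": 1734366691,
    "model": "claude-3-sonnet-20240229",
    "object": "chat.completion",
    "choices": [
        {
            "finish_reason": "stop",
            "index": 0,
            "message": {
                "content": "Hello! I'm doing well, thank you. How can I help you today?",
                "role": "assistant"
            }
        }
    ],
    "usage": {
        "completion_tokens": 26,
        "prompt_tokens": 13,
        "total_tokens": 39
    }
}
```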
Call any model supported by a provider, with `model=<provider_name>/<model_name>`. There may be provider-specific details, so refer to the provider docs for more information.
Async (Docs)
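A minimal async sketch using `acompletion` (assumes the same placeholder keys as above):

```python
from litellm import acompletion
import asyncio

async def test_get_response():
    user_message = "Hello, how are you?"
    messages = [{"content": user_message, "role": "user"}]
    # Same call shape as completion(), but awaitable
    response = await acompletion(model="openai/gpt-4o", messages=messages)
    return response

response = asyncio.run(test_get_response())
print(response)
```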
Streaming (Docs)
liteLLM supports streaming the model response back; pass `stream=True` to get a streaming iterator in the response.
Streaming is supported for all models (Bedrock, Huggingface, TogetherAI, Azure, OpenAI, etc.)
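A minimal streaming sketch (placeholder key assumed):

```python
from litellm import completion

response = completion(
    model="openai/gpt-4o",
    messages=[{"content": "Hello, how are you?", "role": "user"}],
    stream=True,
)
for part in response:
    # Each chunk mirrors the OpenAI streaming format; content can be None
    print(part.choices[0].delta.content or "", end="")
```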
Response chunk (OpenAI Format)
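An illustrative chunk (IDs and timestamps are examples):

```json
{
    "object": "chat.completion.chunk",
    "id": "chatcmpl-2be06597-de94-4313-b976-74c52dfe3c35",
    "created": 1684017726,
    "model": "gpt-4o",
    "choices": [
        {
            "index": 0,
            "finish_reason": null,
            "delta": {"content": "Hello"}
        }
    ]
}
```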
Logging Observability (Docs)
LiteLLM exposes pre-defined callbacks to send data to Lunary, MLflow, Langfuse, DynamoDB, S3 buckets, Helicone, Promptlayer, Traceloop, Athina, and Slack.
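A minimal sketch wiring up success callbacks; the integration names come from the list above, and the keys are placeholders:

```python
import os
import litellm
from litellm import completion

# Placeholder credentials for the logging integrations you enable
os.environ["LANGFUSE_PUBLIC_KEY"] = "your-langfuse-public-key"
os.environ["LANGFUSE_SECRET_KEY"] = "your-langfuse-secret-key"
os.environ["HELICONE_API_KEY"] = "your-helicone-key"
os.environ["OPENAI_API_KEY"] = "your-openai-key"

# Log successful calls to these integrations
litellm.success_callback = ["langfuse", "helicone"]

response = completion(
    model="openai/gpt-4o",
    messages=[{"role": "user", "content": "Hi 👋"}],
)
```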
LiteLLM Proxy Server (LLM Gateway) - (Docs)
Track spend + Load Balance across multiple projects
The proxy provides:
1. Hooks for auth
2. Hooks for logging
3. Cost tracking
4. Rate limiting
📖 Proxy Endpoints - Swagger Docs
Quick Start Proxy - CLI
Step 1: Start litellm proxy
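For example (the Hugging Face model here is just one choice):

```shell
pip install 'litellm[proxy]'

litellm --model huggingface/bigcode/starcoder
# INFO: Proxy running on http://0.0.0.0:4000
```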
Step 2: Make ChatCompletions Request to Proxy
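For example, using the OpenAI Python SDK pointed at the proxy; the `api_key` can be any placeholder string if the proxy has no key auth configured:

```python
import openai  # openai v1.0.0+

# Point the OpenAI client at the local LiteLLM proxy
client = openai.OpenAI(api_key="anything", base_url="http://0.0.0.0:4000")

response = client.chat.completions.create(
    model="gpt-3.5-turbo",
    messages=[{"role": "user", "content": "this is a test request, write a short poem"}],
)
print(response)
```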
Important: Use LiteLLM Proxy with Langchain (Python, JS), OpenAI SDK (Python, JS), Anthropic SDK, Mistral SDK, LlamaIndex, Instructor, and Curl.
Proxy Key Management (Docs)
Connect the proxy with a Postgres DB to create proxy keys
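One way to do this is via the repo's docker-compose setup; a minimal sketch (the master key value is a placeholder you should change):

```shell
# Get the code
git clone https://github.com/BerriAI/litellm
cd litellm

# Set a master key (placeholder) used to authorize admin/key-management requests
echo 'LITELLM_MASTER_KEY="sk-1234"' > .env
source .env

# Start the proxy and its Postgres DB
docker-compose up
```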
UI on `/ui` on your proxy server
Set budgets and rate limits across multiple projects
Request
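For example, generating a scoped key via the proxy's `/key/generate` endpoint (master key, models, and metadata below are placeholders):

```shell
curl 'http://0.0.0.0:4000/key/generate' \
  --header 'Authorization: Bearer sk-1234' \
  --header 'Content-Type: application/json' \
  --data-raw '{"models": ["gpt-3.5-turbo", "gpt-4", "claude-2"], "duration": "20m", "metadata": {"user": "ishaan@berri.ai", "team": "core-infra"}}'
```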
Expected Response
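An illustrative response (key and expiry values are examples):

```json
{
    "key": "sk-kdEXbIqZRwEeEiHwdg7sFA",
    "expires": "2023-11-19T01:38:25.838000+00:00"
}
```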
Supported Providers (Docs)
| Provider | Completion | Streaming | Async Completion | Async Streaming | Async Embedding | Async Image Generation |
|---|---|---|---|---|---|---|
| openai | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
| Meta - Llama API | ✅ | ✅ | ✅ | ✅ | | |
| azure | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
| AI/ML API | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
| aws - sagemaker | ✅ | ✅ | ✅ | ✅ | ✅ | |
| aws - bedrock | ✅ | ✅ | ✅ | ✅ | ✅ | |
| google - vertex_ai | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
| google - palm | ✅ | ✅ | ✅ | ✅ | | |
| google AI Studio - gemini | ✅ | ✅ | ✅ | ✅ | | |
| mistral ai api | ✅ | ✅ | ✅ | ✅ | ✅ | |
| cloudflare AI Workers | ✅ | ✅ | ✅ | ✅ | | |
| cohere | ✅ | ✅ | ✅ | ✅ | ✅ | |
| anthropic | ✅ | ✅ | ✅ | ✅ | | |
| empower | ✅ | ✅ | ✅ | ✅ | | |
| huggingface | ✅ | ✅ | ✅ | ✅ | ✅ | |
| replicate | ✅ | ✅ | ✅ | ✅ | | |
| together_ai | ✅ | ✅ | ✅ | ✅ | | |
| openrouter | ✅ | ✅ | ✅ | ✅ | | |
| ai21 | ✅ | ✅ | ✅ | ✅ | | |
| baseten | ✅ | ✅ | ✅ | ✅ | | |
| vllm | ✅ | ✅ | ✅ | ✅ | | |
| nlp_cloud | ✅ | ✅ | ✅ | ✅ | | |
| aleph alpha | ✅ | ✅ | ✅ | ✅ | | |
| petals | ✅ | ✅ | ✅ | ✅ | | |
| ollama | ✅ | ✅ | ✅ | ✅ | ✅ | |
| deepinfra | ✅ | ✅ | ✅ | ✅ | | |
| perplexity-ai | ✅ | ✅ | ✅ | ✅ | | |
| Groq AI | ✅ | ✅ | ✅ | ✅ | | |
| Deepseek | ✅ | ✅ | ✅ | ✅ | | |
| anyscale | ✅ | ✅ | ✅ | ✅ | | |
| IBM - watsonx.ai | ✅ | ✅ | ✅ | ✅ | ✅ | |
| voyage ai | | | | | ✅ | |
| xinference [Xorbits Inference] | | | | | ✅ | |
| FriendliAI | ✅ | ✅ | ✅ | ✅ | | |
| Galadriel | ✅ | ✅ | ✅ | ✅ | | |
| Novita AI | ✅ | ✅ | ✅ | ✅ | | |
| Featherless AI | ✅ | ✅ | ✅ | ✅ | | |
| Nebius AI Studio | ✅ | ✅ | ✅ | ✅ | ✅ | |
Contributing
Interested in contributing? Contributions to the LiteLLM Python SDK, Proxy Server, and LLM integrations are all accepted and highly encouraged!
Quick start: `git clone` → `make install-dev` → `make format` → `make lint` → `make test-unit`
See our comprehensive Contributing Guide (CONTRIBUTING.md) for detailed instructions.
Enterprise
For companies that need better security, user management, and professional support
This covers:
- ✅ Features under the LiteLLM Commercial License:
- ✅ Feature Prioritization
- ✅ Custom Integrations
- ✅ Professional Support - Dedicated discord + slack
- ✅ Custom SLAs
- ✅ Secure access with Single Sign-On
Contributing
We welcome contributions to LiteLLM! Whether you're fixing bugs, adding features, or improving documentation, we appreciate your help.
Quick Start for Contributors
For detailed contributing guidelines, see CONTRIBUTING.md.
Code Quality / Linting
LiteLLM follows the Google Python Style Guide.
Our automated checks include:
- Black for code formatting
- Ruff for linting and code quality
- MyPy for type checking
- Circular import detection
- Import safety checks
Run all checks locally:
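Assuming the repo's Makefile targets, for example:

```shell
make lint          # Run all linting (Black, Ruff, MyPy, circular imports, import safety)
make format-check  # Check formatting without modifying files
```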
All these checks must pass before your PR can be merged.
Support / talk with founders
- Schedule Demo 👋
- Community Discord 💭
- Our numbers 📞 +1 (770) 8783-106 / +1 (412) 618-6238
- Our emails ✉️ ishaan@berri.ai / krrish@berri.ai
Why did we build this
- Need for simplicity: Our code started to get extremely complicated managing & translating calls between Azure, OpenAI and Cohere.
Contributors
Run in Developer mode
Services
- Set up a `.env` file in the repo root
- Run dependent services: `docker-compose up db prometheus`
Backend
- (In root) create a virtual environment: `python -m venv .venv`
- Activate the virtual environment: `source .venv/bin/activate`
- Install dependencies: `pip install -e ".[all]"`
- Start the proxy backend: `uvicorn litellm.proxy.proxy_server:app --host localhost --port 4000 --reload`
Frontend
- Navigate to `ui/litellm-dashboard`
- Install dependencies: `npm install`
- Run `npm run dev` to start the dashboard