Overview

Users

Errors

Cache

Feedback

Metadata Summary

Charts

Analytics

Comprehensive documentation for Portkey's AI Gateway, Guardrails, Observability, Prompts, and Governance features.

Portkey Docs

Portkey AI is a comprehensive platform designed to streamline and enhance AI integration for developers and organizations. It serves as a unified interface for interacting with over 250 AI models, offering advanced tools for control, visibility, and security in your Generative AI apps.

What is Portkey?

Integrate Portkey and analyze your first LLM call in 2 minutes!

Make Your First Request

Portkey Features

Integrations

Submit an Integration

Portkey connects with all major LLM providers and orchestration frameworks.

Learn to integrate OpenAI with Portkey, enabling seamless completions, prompt management, and advanced functionalities like streaming, function calling and fine-tuning.

OpenAI

Structured Outputs ensure that the model always follows your supplied [JSON schema](https://json-schema.org/overview/what-is-jsonschema). Portkey supports OpenAI's Structured Outputs feature out of the box with our SDKs & APIs. 

Structured Outputs

Prompt Caching

Files

Batches

Anthropic

Google Gemini

Google Vertex AI

Controlled Generations ensure that the model always follows your supplied [JSON schema](https://json-schema.org/overview/what-is-jsonschema). Portkey supports Vertex AI's Controlled Generations feature out of the box with our SDKs & APIs. 

Controlled Generations

Azure OpenAI is a great alternative to accessing the best models including GPT-4 and more in your private environments. Portkey provides complete support for Azure OpenAI.

Azure OpenAI

AWS Bedrock

Upload files to S3 for Bedrock batch inference

Route to your AWS Sagemaker models through Portkey

AWS SageMaker

Ollama

LocalAI

Integrate vLLM-hosted custom models with Portkey and take them to production

vLLM

Integrate Trtiton-hosted custom models with Portkey and take them to production

Triton

AI21

Integrate Anyscale endpoints with Portkey seamlessly and make your OSS models production-ready

Anyscale

Cerebras

Cohere

Fireworks

Integrate Dashscope with Portkey for seamless completions, prompt management, and advanced features like streaming, function calling, and fine-tuning.

Dashscope

Deepinfra

Portkey provides a robust and secure gateway to facilitate the integration of various Large Language Models (LLMs) into your applications, including [Deepbricks](https://deepbricks.ai/). 

Deepbricks

Portkey provides a robust and secure gateway to use and observe Deepgrm's Speech-to-Text API.

Deepgram

Portkey provides a robust and secure gateway to facilitate the integration of various Large Language Models (LLMs) into your applications, including DeepSeek models. 

DeepSeek

Github

Groq

Hugging Face

Portkey provides a robust and secure gateway to facilitate the integration of various Large Language Models (LLMs) into your applications, including the models hosted on [Inference.net](https://www.inference.net/). 

Inference.net

Jina AI

Integrate Lambda with Portkey AI for seamless completions, prompt management, and advanced features like streaming and function calling.

Lambda Labs

Integrate LemonFox with Portkey for seamless completions, prompt management, and advanced features like streaming, function calling, and fine-tuning.

Lemonfox-AI

Lingyi (01.ai)

Mistral AI

MonsterAPIs provides access to generative AI model APIs at 80% lower costs. Connect to MonsterAPI LLM APIs seamlessly through Portkey's AI gateway.

Monster API

Moonshot

Nomic

Novita AI

OpenRouter

Perplexity AI

Predibase

Reka AI

Portkey provides a robust and secure gateway to facilitate the integration of various Large Language Models (LLMs) into your applications, including [SambaNova AI](https://sambanova.ai/).

SambaNova

Segmind

Stability AI

SiliconFlow

Together AI

Integrate Upstage with Portkey AI for seamless completions, prompt management, and advanced features like streaming and embedding.

Upstage AI

Portkey provides a robust and secure gateway to facilitate the integration of various Large Language Models (LLMs) into your applications, including Voyage AI's embedding and Re-rank endpoints. 

Voyage AI

Workers AI

Portkey supports xAI's chat completions, completions, and embeddings APIs.

ZhipuAI / ChatGLM / BigModel

Replicate

Suggest a new integration!

Bring Your Own LLM

Milvus

Qdrant

Portkey helps bring your agents to production

Use Portkey with Autogen to take your AI Agents to production

Autogen

Use Portkey with Control Flow to take your AI Agents to production

Control Flow

Use Portkey with CrewAI to take your AI Agents to production

CrewAI

Langchain Agents

Use Portkey with LangGraph to take your AI Agents to production

LangGraph Agents

Use Portkey with Llama Agents to take your AI Agents to production

Llama Agents by Llamaindex

The Portkey x Swarm integration brings advanced AI gateway capabilities, full-stack observability, and reliability features to build production-ready AI agents.

OpenAI Swarm

Use Portkey with Phidata to take your AI Agents to production

Phidata

You can also use Portkey if you are doing custom agent orchestration!

Bring Your own Agents

AutoGen is a framework that enables the development of LLM applications using multiple agents that can converse with each other to solve tasks.

Integrate DSPy with Portkey for production-ready LLM pipelines

DSPy

With Portkey, you can confidently take your Instructor pipelines to production and get complete observability over all of your calls + make them reliable - all with a 2 LOC change!

Instructor

Portkey adds core production capabilities to any Langchain app.

Langchain (Python)

Langchain (JS/TS)

Cost tracking, observability, and more for LibreChat

LibreChat

Cost tracking, observability, and more for Open WebUI

Open WebUI

Add usage tracking, cost controls, and security guardrails to your Anything LLM deployment

Anything LLM

The **Portkey x LlamaIndex** integration brings advanced **AI gateway** capabilities, full-stack **observability**, and **prompt management** to apps built on LlamaIndex.

LlamaIndex (Python)

Portkey brings advanced **AI gateway** capabilities, full-stack **observability**, and **prompt management** + **versioning** to your **Promptfoo** projects. This document provides an overview of how to leverage the strengths of both the platforms to streamline your AI development workflow.

Promptfoo

Integrate Portkey with Vercel AI SDK for production-ready and reliable AI apps

Vercel

Integrate MindsDB with Portkey to build enterprise-grade AI use-cases

MindsDb

ToolJet is a low-code platform that lets you build apps by connecting APIs and data sources, with Portkey integration adding AI features like chat interfaces and automation.

ToolJet

MongoDB

Supabase

Learn how to integrate Portkey's enterprise features with Zed for enhanced observability, reliability and governance.

Learn how to integrate Portkey's enterprise features with any OpenAI Compliant project for enhanced observability, reliability and governance.

Portkey with Any OpenAI Compatible Project

Gain real-time insights, track key metrics, and streamline debugging with our comprehensive observability suite.

Observability (OpenTelemetry)

The Logs section presents a chronological list of all the requests processed through Portkey.

Logs

The **Tracing** capabilities in Portkey empowers you to monitor the lifecycle of your LLM requests in a unified, chronological view.

Tracing

Portkey's Feedback APIs provide a simple way to get weighted feedback from customers on any request you served, at any stage in your app.

Metadata

Filters

Easily access your Portkey logs data for further analysis and reporting

Logs Export

Budget Limits

The world's fastest AI Gateway with advanced routing & integrated Guardrails.

AI Gateway

Portkey's Universal API provides a consistent interface to integrate a wide range of modalities (text, vision, audio) and LLMs (hosted OR local) into your apps.

Universal API

This feature is available on all Portkey plans.

Configs

Conditional Routing

Cache (Simple & Semantic)

Fallbacks

LLM APIs often have inexplicable failures. With Portkey, you can rescue a substantial number of your requests with our in-built automatic retries feature. 

Automatic Retries

Use OpenAI's Realtime API with logs, cost tracking, and more!

Realtime API

Load Balancing feature efficiently distributes network traffic across multiple LLMs.

Load Balancing

You can use Portkey's AI gateway to also canary test new models or prompts in different environments. 

Canary Testing

Strict OpenAI Compliance

Manage unpredictable LLM latencies effectively with Portkey's **Request Timeouts**.

Request Timeouts

Upload files to Portkey and reuse the content in your requests

With Portkey's Prompt Library, you can seamlessly create and manage prompts along with all associated model parameters.

Prompt Library

With Prompt Templates, you can seamlessly create and manage your LLM prompts in one place, and deploy them with just an API call.

Prompt Templates

With Prompt Partials, you can save your commonly used templates (which could be your instruction set, data structure explanation, examples etc.) separately from your prompts and flexibly incorporate them wherever required.

Prompt Partials

See how to use Portkey's prompt templates with OpenAI (or any other provider) SDKs

Retrieve Prompts

Advanced Prompting with JSON Mode

Ship to production confidently with Portkey Guardrails on your requests & responses

Guardrails

List of Guardrail Checks

Replace any sensitive data in requests with standard identifiers

PII Redaction

Patronus excels in industry-specific guardrails for RAG workflows.

Patronus AI

Aporia

Pillar

Pangea AI Guard helps analyze and redact text to prevent model manipulation and malicious content.

Pangea

Bring Your Own Guardrails

With the raw Guardrails mode, we let you define your Guardrail checks & actions however you want, directly in code.

Creating Raw Guardrails (in JSON)

Automatically create, manage, and execute fine-tuning jobs for Large Language Models (LLMs) across multiple providers.

Introduction

Product

Support

Analytics

Charts

Overview

Users

Errors

Cache

Feedback

Metadata Summary

Introduction

Product

Support

​Charts

​Overview

​Users

​Errors

​Cache

​Feedback

​Metadata Summary

Charts

Overview

Users

Errors

Cache

Feedback

Metadata Summary