Easily switch between LLM providers. Call various LLMs such as Anthropic, Gemini, Mistral, Azure OpenAI, Google Vertex AI, AWS Bedrock and much more by simply changing the provider and API key in the LLM object.
Improve performance and reduce costs on your Agent’s LLM calls by storing past responses in the Portkey cache. Choose between Simple and Semantic cache modes in your Portkey’s gateway config.
Copy
Ask AI
{ "cache": { "mode": "semantic" // Choose between "simple" or "semantic" }}
Set up fallbacks between different LLMs or providers, load balance your requests across multiple instances or API keys, set automatic retries, and request timeouts. Ensure your agents’ resilience with advanced reliability features.
Portkey automatically logs key details about your agent runs, including cost, tokens used, response time, etc. For agent-specific observability, add Trace IDs to the request headers for each agent. This enables filtering analytics by Trace IDs, ensuring deeper monitoring and analysis.
Access a dedicated section to view records of action executions, including parameters, outcomes, and errors. Filter logs of your agent run based on multiple parameters such as trace ID, model, tokens used, metadata, etc.
Use Portkey as a centralized hub to store, version, and experiment with your agent’s prompts across multiple LLMs. Easily modify your prompts and run A/B tests without worrying about the breaking prod.
Improve your Agent runs by capturing qualitative & quantitative user feedback on your requests, and then using that feedback to make your prompts AND LLMs themselves better.