We hosted a watch party for OpenAI's DevDay on our Discord channel and had a lot of fun discussing everything new and improved that was launched. If you're just catching up, read about all the updates here on the OpenAI website. Since we're all about LLM Apps in Production, let's
💡 This is Portkey's first collaboration with the Hasura Team. Hasura helps you build robust RAG data pipelines by unifying multiple private data sources (relational DB, vector DB, etc.) and letting you query the data securely with production-grade controls. LLMs have been around for some time now and have shown that
Over the past few months, we've been keenly observing latencies for both GPT-3.5 and GPT-4, and the emerging patterns have been intriguing. The standout observation? GPT-4 is catching up in speed, closing the latency gap with GPT-3.5. Our findings reveal a consistent decline in GPT-4 latency. While your
This paper presents a method for compressing prompts in large language models (LLMs) to accelerate model inference and reduce cost. The method involves a budget controller, a token-level iterative compression algorithm, and an instruction-tuning-based method for distribution alignment. Experimental
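To make the budget-controller idea concrete, here is a toy sketch: keep dropping the lowest-information tokens until the prompt fits a token budget. The stopword heuristic and the `compress` function are illustrative assumptions, not the paper's algorithm, which scores tokens with a small language model and aligns distributions via instruction tuning.

```python
# Toy sketch of budget-controlled prompt compression. Assumption:
# stopwords carry the least information, so they are dropped first;
# the real method uses a small LM to score token importance.
STOPWORDS = {"the", "a", "an", "of", "to", "is", "and", "in", "that", "for"}

def compress(prompt: str, budget: int) -> str:
    tokens = prompt.split()
    # First pass: drop low-information tokens (stopwords here).
    kept = [t for t in tokens if t.lower() not in STOPWORDS]
    # If still over budget, hard-truncate to the budget.
    if len(kept) > budget:
        kept = kept[:budget]
    return " ".join(kept)

short = compress("Summarize the history of the internet in a paragraph for me", 6)
```

The compressed prompt stays under the budget while keeping the content-bearing words, which is the trade-off the budget controller manages.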
It's been some time since Llama 2's celebrated launch, and we've seen the dust settle a bit and real use cases come to life. In this blog post, we answer frequently asked questions about Llama 2's capabilities and when you should be using it. Let's dive in! What is Llama
As developers and founders, you might find yourself asking how your startup differentiates from ChatGPT. More importantly, how do you convince a customer to try your product over a generic chat client?
Portkey is building a full-stack LLMOps platform that empowers AI builders to productionize their Gen AI apps reliably and securely.
This paper introduces the Skeleton-of-Thought (SoT) method to decrease the generation latency of large language models (LLMs). SoT guides LLMs to first generate the skeleton of the answer and then conducts parallel API calls or batched decoding to complete the contents of each skeleton point. The m
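The two-stage flow SoT describes can be sketched in a few lines of Python. The `llm` function below is a stand-in stub for any chat-completion call (an assumption so the sketch runs without an API key); the point is the structure: one sequential call produces the skeleton, then each point is expanded by parallel calls.

```python
from concurrent.futures import ThreadPoolExecutor

def llm(prompt: str) -> str:
    # Stand-in for a real chat-completion call; returns canned text
    # so the sketch is runnable without an API key.
    if "skeleton" in prompt:
        return "1. Define terms\n2. Compare options\n3. Conclude"
    return f"Expanded: {prompt.splitlines()[-1]}"

def skeleton_of_thought(question: str) -> str:
    # Stage 1: one sequential call produces a terse outline (the skeleton).
    skeleton = llm(f"Give a short numbered skeleton for: {question}")
    points = [p for p in skeleton.splitlines() if p.strip()]

    # Stage 2: each skeleton point is expanded independently, so the
    # calls can run in parallel instead of one long sequential decode.
    with ThreadPoolExecutor(max_workers=len(points)) as pool:
        expansions = list(pool.map(
            lambda p: llm(f"Expand this point of the answer:\n{p}"), points
        ))
    return "\n\n".join(expansions)

answer = skeleton_of_thought("Is SQL or NoSQL better for analytics?")
```

Because the per-point expansions no longer wait on one another, end-to-end latency is bounded by the longest single expansion rather than the sum of all of them.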
As I reflect on the past year of my coding journey, I am struck by how deeply Artificial Intelligence (AI) has woven itself into my development practices. From GitHub Copilot to the genius behind ChatGPT, my approach to writing code has undergone a transformative shift. Allow me to
Portkey's Analytics 2.0 gives our users complete visibility into their LLM calls across requests, users, errors, cache, and feedback.