⭐️ Semantic Cache for Large Language Models
Learn how semantic caching for large language models reduces cost, improves latency, and stabilizes high-volume AI applications by reusing responses based on intent rather than exact text.
⭐️ Decoding OpenAI Evals
Learn how to use OpenAI's Evals framework to evaluate models and prompts, and optimise LLM systems for the best outputs.
⭐️ Ranking LLMs with Elo Ratings
Choosing an LLM from the 50+ models available today is hard. We explore Elo ratings as a method to objectively rank models and pick the best performers for our use case.