> ## Documentation Index
> Fetch the complete documentation index at: https://docs.portkey.ai/docs/llms.txt
> Use this file to discover all available pages before exploring further.

# AI Engineering Hours

> Discussion notes from the weekly AI engineering meetup

<CardGroup cols={2}>
  <Card title="Weekly Calendar" horizontal="True" href="https://lu.ma/portkey" icon="bell" />

  <Card title="Community Discord" horizontal="True" href="https://portkey.wiki/community" icon="discord" />
</CardGroup>

<Update label="29 Nov">
  <ResponseField name="Summary" />

  Teams from Springworks and Haptik shared hard-won insights from running LLMs in production: Gemini outperforms gpt-4o for Hinglish translation, and shifting to managed Gateways cuts latency in half. Plus practical tips on caching and RAG optimization at scale.

  <ResponseField name="Attendees" />

  <CardGroup cols={4}>
    <Card horizontal title="Karan Trehan" href="https://www.linkedin.com/in/karantrehan3/" img="https://media.licdn.com/dms/image/v2/D4D03AQElEQb4M5pJlQ/profile-displayphoto-shrink_800_800/profile-displayphoto-shrink_800_800/0/1669909061328?e=1738195200&v=beta&t=feWNcyW5m8NE-FIZa3Nzkv0UJGN2Kdf786KUJWE5hwQ">
      SDE-2, Springworks
    </Card>

    <Card horizontal title="Komal Singh" href="https://www.linkedin.com/in/komalssingh/" img="https://media.licdn.com/dms/image/v2/C5603AQE70AgYTTTAOQ/profile-displayphoto-shrink_800_800/profile-displayphoto-shrink_800_800/0/1651298543992?e=1738195200&v=beta&t=aTBlzx-QeEdJuIlVWdiQwRkXQvhS4OZ0XzWk9jryrks">
      DevOps Engineer, Jio Haptik
    </Card>

    <Card horizontal title="Pratham Naveen" href="https://www.linkedin.com/in/prathamnaveen/" img="https://media.licdn.com/dms/image/v2/D5603AQHvpXjRqux0Gw/profile-displayphoto-shrink_800_800/profile-displayphoto-shrink_800_800/0/1695470961960?e=1738195200&v=beta&t=fx26DQLSOCtYz9PCbd8gSeISOejWAW3dZxsI0f7SU9c">
      Gen AI, NetApp
    </Card>

    <Card horizontal title="Vinodraj V K" href="https://www.linkedin.com/in/vinodraj-v-k-426884280/" img="https://media.licdn.com/dms/image/v2/D5603AQESQMPCJlfXjQ/profile-displayphoto-shrink_800_800/profile-displayphoto-shrink_800_800/0/1687632716123?e=1738195200&v=beta&t=fiX3EBZdIRa0KoArzWt-TeO28ha7jxPRg25pWiWf-Ck">
      Gen AI, NetApp
    </Card>
  </CardGroup>

  <ResponseField name="Notes" />

  **On Production Patterns**

  * Haptik & Springworks map Portkey virtual keys to their model deployments, making it simple for engineers to prototype & build AI features
  * Monitor Portkey analytics to understand deployment behavior and pre-scale resources to avoid rate limits
  * For secure testing, use short-lived virtual keys instead of sharing long-term access

  **Some Learnings**

  * Infrastructure insight: Each additional middleware layer (auth, rate limiting) compounds latency at scale - consider using Gateway features directly instead of custom layers
  * Plan for caching early: Auxiliary services inevitably add latency at scale - implement caching in your initial development cycle
  * In RAG pipelines, Vector DB operations become bottlenecks before LLM calls - optimize these first
  * For Hinglish audio translations, especially with noise, Gemini proves more reliable than gpt-4o
</Update>
