Portkey Enterprise offers a secure hybrid deployment model that balances security, flexibility, and fast deployment timelines:

  • Data Plane runs within your VPC, keeping sensitive LLM data and AI traffic in your environment
  • Control Plane hosted by Portkey handles administration, configs, and analytics

Portkey Enterprise Hybrid Deployment Architecture

Schedule an Enterprise Architecture Demo

Want to learn more about our hybrid deployment model? Schedule a personalized demo with our solutions team to see how Portkey Enterprise can fit your security and compliance requirements.

Core Architecture Components

Data Plane (Your VPC)

The Data Plane is deployed in your cloud environment and processes all your AI traffic:

ComponentDescriptionSecurity Benefit
AI GatewayCore engine that routes traffic across LLM providers and implements metering, access control, and guardrailsAll LLM requests remain in your network perimeter
Cache StoreLocal cache storage for gateway consumptionEliminates runtime dependency on Control Plane
Data StoreStorage for LLM request/response logsKeep sensitive LLM data completely in your environment

The AI Gateway runs as containerized workloads in your infrastructure, deployable via your preferred orchestration method (Kubernetes, ECS, etc.).

Control Plane (Portkey VPC)

The Control Plane is fully managed by Portkey and provides the administrative layer for your deployment:

  • Hosts the web dashboard for managing configurations, tracking analytics, and viewing logs
  • Maintains routing configs, provider integrations
  • Stores non-sensitive metadata and aggregated metrics
  • Automatically updates with new features and provider integrations without requiring changes to your infrastructure

Data Flow Between Planes

Deployment Architecture

Portkey AI Gateway is deployed as containerized workloads using Helm charts for Kubernetes environments, with flexible deployment options for various cloud providers.

Infrastructure Components

ComponentDescriptionConfiguration Options
AI GatewayCore container running the routing logicDeployed as stateless containers that can scale horizontally
Cache SystemStores routing configs, virtual keys, and moreRedis (in-cluster, AWS ElastiCache, or custom endpoint)
Log StoragePersistence for request/response dataMultiple options (see below)

Storage Options

S3-compatible storage options including:

  • AWS S3 (standard credentials or assumed roles)
  • Google Cloud Storage (S3 compatible interoperability mode)
  • Azure Blob Storage (key, managed identity, or Entra ID)
  • Any S3-compatible Blob Storage

Authentication Methods

  • IAM roles for service accounts (IRSA) in Kubernetes
  • Instance Metadata Service (IMDS) for EC2/ECS
  • Managed identities in Azure environments

Infrastructure Requirements

  • Kubernetes Cluster: K8s 1.20+ with Helm 3.x
  • Outbound Network: HTTPS access to Control Plane endpoints
  • Container Registry Access: For pulling gateway container images
  • Recommended Resource Requirements:
    • CPU: 1-2 cores per gateway instance
    • Memory: 2-4GB per gateway instance
    • Storage: Dependent on logging configuration

Data Security & Encryption

Your Sensitive Data Stays in Your VPC

  • All prompt content and LLM responses remain within your network
  • Only anonymized metrics data cross network boundaries
  • Log storage location is configurable based on your requirements

Advantages of Hybrid Architecture

BenefitTechnical ImplementationBusiness Value
Security & Compliance- Sensitive data never leaves VPC
- Configurable encryption methods
- Flexible authentication options
- Meets data residency requirements
- Supports regulated industries
- Simplifies security reviews
Operational Efficiency- No database management overhead
- Automatic model config updates
- Horizontally scalable architecture
- Low operational burden
- Always up-to-date with LLM ecosystem
- Scales with your traffic patterns
Deployment Flexibility- Kubernetes-native deployment
- Support for major cloud providers
- Multiple storage backend options
- Fits into existing infrastructure
- Avoids vendor lock-in
- Customizable to specific needs
Developer Experience- OpenAI-compatible API
- Simple integration patterns
- Comprehensive observability
- Minimal code changes needed
- Smooth developer onboarding
- Full visibility into system behavior

Technical Rationale

Sample Files

These samples demonstrate the typical data patterns flowing between systems:

Resources & Next Steps

Have Questions?

Our solution architects are available to discuss your specific deployment requirements and security needs.

Schedule Architecture Discussion

Book a personalized consultation with our enterprise team to explore how Portkey’s architecture can be tailored to your organization’s specific requirements.