Enforce Budget Limits and Rate Limits for Your API Keys
Configure budget and rate limits on API keys to effectively manage AI spending and usage across your organization
Available on Enterprise plan and select Pro customers.
Overview
For enterprises deploying AI at scale, maintaining financial oversight and operational control is crucial. Portkey’s governance features for API keys provide finance teams, IT departments, and executives with the transparency and guardrails needed to confidently scale AI adoption across the organization.
By implementing budget and rate limits on API keys at both organization and workspace levels, you can:
- Prevent unexpected cost overruns through automated spending caps
- Maintain performance and availability through usage rate controls
- Receive notifications when usage reaches a configured alert threshold
- Enforce consistent governance policies across teams and departments
These capabilities ensure your organization can innovate with AI while maintaining predictable costs and usage patterns.
Budget Limits
Budget limits allow you to set maximum LLM spending or token usage thresholds on your API keys, automatically preventing further usage when limits are reached.
When creating or editing an API key, you can set spending parameters that align with your financial planning.
Setting Up Budget Limits
When creating a new API key or editing an existing one:
- Toggle on Add Budget Limit
- Choose between two limit types:
  - Cost: Set a maximum spend in USD (minimum $1)
  - Tokens: Set a maximum token usage
Budget limit configuration
Alert Thresholds
You can configure alert thresholds to receive notifications before reaching your full budget:
- Enter a value in the Alert Threshold field
- When usage reaches this threshold, notifications will be sent to configured recipients
- The API key continues to function until the full budget limit is reached
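The interaction between the budget limit and the alert threshold can be sketched as a small local model. This is an illustration only, not Portkey's implementation or API: the `BudgetLimit` class, its field names, and the status strings are hypothetical, and it assumes the alert threshold is expressed in the same unit as the limit and sits below it.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class BudgetLimit:
    """Hypothetical local model of a budget limit (not a Portkey API object)."""

    limit_type: str                          # "cost" (USD) or "tokens"
    limit: float                             # maximum spend or token count
    alert_threshold: Optional[float] = None  # assumed: same unit as `limit`

    def __post_init__(self) -> None:
        if self.limit_type not in ("cost", "tokens"):
            raise ValueError("limit_type must be 'cost' or 'tokens'")
        if self.limit <= 0:
            raise ValueError("limit must be positive")
        # The documentation states a $1 minimum for cost limits.
        if self.limit_type == "cost" and self.limit < 1:
            raise ValueError("cost limits must be at least $1")
        # Assumption: a threshold only makes sense below the full limit.
        if self.alert_threshold is not None and not (
            0 < self.alert_threshold < self.limit
        ):
            raise ValueError("alert_threshold must be between 0 and limit")

    def status(self, usage: float) -> str:
        """Classify current usage as 'ok', 'alert', or 'blocked'."""
        if usage >= self.limit:
            return "blocked"   # limit reached: further usage is prevented
        if self.alert_threshold is not None and usage >= self.alert_threshold:
            return "alert"     # notify recipients; the key keeps working
        return "ok"
```

For example, `BudgetLimit("cost", 100.0, alert_threshold=80.0).status(85.0)` returns `"alert"`: recipients would be notified, but requests continue until usage reaches 100.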
Periodic Reset Options
Budget limits can be set to automatically reset at regular intervals:
Periodic reset options
- No Periodic Reset: The budget limit applies until exhausted
- Reset Weekly: Budget limits reset every Sunday at 12:00 AM (midnight) UTC
- Reset Monthly: Budget limits reset on the 1st of each month at 12:00 AM (midnight) UTC
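To see when a given budget would next reset under the schedule above, the following sketch computes the next reset instant with Python's standard `datetime` module. The function name `next_reset` and the period strings `"none"`, `"weekly"`, and `"monthly"` are hypothetical labels chosen for this example; only the Sunday and 1st-of-month midnight UTC rules come from the text.

```python
from datetime import datetime, timedelta, timezone
from typing import Optional


def next_reset(now: datetime, period: str) -> Optional[datetime]:
    """Return the next reset time (UTC) strictly after `now`.

    `now` must be timezone-aware. `period` is one of the hypothetical
    labels "none", "weekly", or "monthly".
    """
    if period == "none":
        return None  # no periodic reset: the limit applies until exhausted

    now_utc = now.astimezone(timezone.utc)
    midnight = now_utc.replace(hour=0, minute=0, second=0, microsecond=0)

    if period == "weekly":
        # datetime.weekday(): Monday is 0, Sunday is 6.
        # On a Sunday, the next reset is the following Sunday.
        days_ahead = (6 - midnight.weekday()) % 7 or 7
        return midnight + timedelta(days=days_ahead)

    if period == "monthly":
        if midnight.month == 12:
            return midnight.replace(year=midnight.year + 1, month=1, day=1)
        return midnight.replace(month=midnight.month + 1, day=1)

    raise ValueError(f"unknown period: {period!r}")
```

Called with a Wednesday such as 15 May 2024, the weekly option yields Sunday 19 May at 00:00 UTC and the monthly option yields 1 June at 00:00 UTC.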
Rate Limits
Rate limits control how frequently an API key can be used, helping you maintain application performance and prevent unexpected usage spikes.
Setting Up Rate Limits
When creating a new API key or editing an existing one:
- Toggle on Add Rate Limit
- Choose your limit type:
  - Requests: Limit based on number of API calls
  - Tokens: Limit based on token consumption
- Specify the limit value and time interval
Rate limit configuration
Time Intervals
Rate limits can be applied using three different time intervals:
- Per Minute: For granular control of high-frequency applications
- Per Hour: For balanced control of moderate usage
- Per Day: For broader usage management
When a rate limit is reached, subsequent requests are rejected until the time interval resets.
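The reject-until-reset behaviour described above resembles a fixed-window counter. The sketch below is a local illustration of that technique, not Portkey's actual enforcement logic: the `FixedWindowLimiter` class and its parameters are hypothetical, and it assumes windows are aligned to the clock, which the text does not specify.

```python
import time
from typing import Callable, Optional

# Seconds in each of the three supported intervals.
INTERVAL_SECONDS = {"minute": 60, "hour": 3600, "day": 86400}


class FixedWindowLimiter:
    """Hypothetical fixed-window limiter for requests or tokens."""

    def __init__(
        self,
        limit: int,
        interval: str,
        clock: Callable[[], float] = time.time,
    ) -> None:
        self.limit = limit
        self.window = INTERVAL_SECONDS[interval]
        self.clock = clock                      # injectable for testing
        self.current_window: Optional[int] = None
        self.used = 0

    def allow(self, cost: int = 1) -> bool:
        """Return True and record usage if `cost` fits in this window.

        Use cost=1 per call for a request limit, or the token count
        of the call for a token limit.
        """
        window_id = int(self.clock() // self.window)
        if window_id != self.current_window:
            # A new interval has started: reset the counter.
            self.current_window = window_id
            self.used = 0
        if self.used + cost > self.limit:
            return False                        # rejected until the reset
        self.used += cost
        return True
```

With `FixedWindowLimiter(100, "minute")`, the 101st call to `allow()` inside the same minute returns `False`, and calls succeed again once the next minute begins.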
Email Notifications
Email notifications inform relevant stakeholders when an API key's usage approaches or reaches its limits.
Configuring Notifications
To set up email notifications for an API key with budget limits:
- Toggle on Email Notifications when creating/editing an API key
- Add recipient email addresses:
  - Type an email address and click New or press Enter
  - Add multiple recipients as needed
Email notification setup
Default Recipients
When limits are reached or thresholds are crossed, Portkey automatically sends notifications to:
- Organization administrators
- Organization owners
- The API key creator/owner
You can also add other recipients, such as finance team members, department heads, or project managers, who need visibility into AI usage.
Availability
These features are available to Portkey Enterprise customers and select Pro users. To enable these features for your account, please contact [email protected] or join the Portkey Discord community.
To learn more about the Portkey Enterprise plan, schedule a consultation.