Area | Key Updates |
---|---|
Benchmark | • Released LLMs in Prod Report 2025 analyzing 2T+ tokens • Key finding: Multi-LLM deployment is now standard • Average prompt size up 4x, with 40% cost savings from caching |
Security | • Advanced PII redaction with automatic standardized identifiers • JWT authentication support for enterprise deployments • Comprehensive audit logs for all critical actions • Enforced metadata schemas for better governance • Attach default configs & metadata to API keys • Granular workspace management controls |
Platform | • Unified API for files & batches across major providers • Support for private LLM deployments • Enhanced virtual keys with granular controls |
New Models | • DeepSeek R1 available across 7+ providers • Added Gemini thinking model • Support for Perplexity Sonar models • o3-mini integration |
Integrations | • AWS Bedrock Guardrails support • Milvus DB & Replicate integrations • Expanded Open WebUI support • Guardrails for embedding requests |
Community | • We did a deep dive into MCP and event-driven architecture for agentic systems |

`Inverse` option in the UI, which triggers a `TRUE` verdict when the Guardrail verdict fails.
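
As a rough illustration of the inverted verdict described above, here is a minimal Python sketch. The function name `final_verdict` and its parameters are hypothetical and not part of any real SDK. It also assumes that a passing check yields `FALSE` when the option is on, which the text above does not state.

```python
def final_verdict(check_passed: bool, inverse: bool = False) -> bool:
    """Return the verdict reported for a guardrail check.

    Hypothetical helper for illustration only.
    """
    if inverse:
        # Inverse on: a failed check reports True.
        # Assumption: a passed check then reports False.
        return not check_passed
    # Inverse off: the verdict is the check result itself.
    return check_passed


# A failed check with the option enabled reports True.
print(final_verdict(check_passed=False, inverse=True))
```

This can be useful when a guardrail is phrased as a positive match (for example, "contains a banned term") but the request should pass only when nothing matches.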