gpt-4o (excludes gpt-4o-2024-05-13)
gpt-4o-mini
o1-preview
o1-mini
tools can be cached, contributing to the minimum 1024-token requirement.

The cached_tokens field of usage.prompt_tokens_details in the chat completions object indicates how many of the prompt tokens were a cache hit. For requests under 1024 tokens, cached_tokens will be zero.
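As a rough illustration of reading the field described above, the following sketch pulls cached_tokens out of a response that has already been parsed into a plain dictionary. The helper names (cached_token_count, cache_hit_ratio) and the numbers in sample_response are hypothetical and chosen only for the example; the only field names taken from the text are usage, prompt_tokens_details, and cached_tokens, plus the standard prompt_tokens count.

```python
from typing import Any, Mapping


def cached_token_count(response: Mapping[str, Any]) -> int:
    """Return usage.prompt_tokens_details.cached_tokens, or 0 if it is missing."""
    usage = response.get("usage") or {}
    details = usage.get("prompt_tokens_details") or {}
    return int(details.get("cached_tokens") or 0)


def cache_hit_ratio(response: Mapping[str, Any]) -> float:
    """Return the fraction of prompt tokens served from the cache (0.0 if unknown)."""
    usage = response.get("usage") or {}
    prompt_tokens = int(usage.get("prompt_tokens") or 0)
    if prompt_tokens == 0:
        return 0.0
    return cached_token_count(response) / prompt_tokens


# Hypothetical payload: the values are invented for illustration only.
sample_response = {
    "usage": {
        "prompt_tokens": 2006,
        "completion_tokens": 300,
        "total_tokens": 2306,
        "prompt_tokens_details": {"cached_tokens": 1920},
    }
}

if __name__ == "__main__":
    print("cached tokens:", cached_token_count(sample_response))
    print("cache hit ratio:", round(cache_hit_ratio(sample_response), 3))
```

Defaulting to zero when the field is absent mirrors the behavior stated for requests under 1024 tokens, so callers can treat "no details" and "no cache hit" the same way.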