POST
/
audio
/
speech

Authorizations

x-portkey-api-key
string
headerrequired
x-portkey-virtual-key
string
headerrequired

Body

application/json
model
required

One of the available TTS models: tts-1 or tts-1-hd

input
string
required

The text to generate audio for. The maximum length is 4096 characters.

Maximum length: 4096
voice
enum<string>
required

The voice to use when generating the audio. Supported voices are alloy, echo, fable, onyx, nova, and shimmer. Previews of the voices are available in the Text to speech guide.

Available options:
alloy,
echo,
fable,
onyx,
nova,
shimmer
response_format
enum<string>
default: mp3

The format to audio in. Supported formats are mp3, opus, aac, flac, wav, and pcm.

Available options:
mp3,
opus,
aac,
flac,
wav,
pcm
speed
number
default: 1

The speed of the generated audio. Select a value from 0.25 to 4.0. 1.0 is the default.

Required range: 0.25 < x < 4

Response

200 - application/octet-stream

The response is of type file.