Deepfake Generation API
Generate realistic voice conversions and text-to-speech
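The Deepfake Generation API lets you synthesize speech from text, optionally cloning a voice from a reference recording, and convert the voice in an existing audio file using a selectable voice conversion model.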
Text to Speech
Generate speech from text
Request Parameters
The text to convert to speech
Optional. Reference audio file to clone the voice from
Optional. Transcript of the reference audio
Example Request
Response
Returns the generated audio file as a streaming response with Content-Type: audio/wav.
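The request described above could be assembled roughly as follows. This is a minimal sketch, not the service's documented client: the endpoint URL, the field names `text`, `reference_audio` and `reference_text`, and the use of multipart/form-data for the file upload are all assumptions, since this page does not state them. The request is built but not sent.

```python
import urllib.request
import uuid
from typing import Optional

# Hypothetical endpoint URL; the real path is not given in this reference.
TTS_URL = "https://api.example.com/tts"


def build_tts_request(
    text: str,
    reference_audio: Optional[bytes] = None,
    reference_text: Optional[str] = None,
) -> urllib.request.Request:
    """Build (but do not send) a multipart/form-data POST for text to speech.

    The field names used here are placeholders for the three documented
    parameters: the text, the optional reference audio, and its transcript.
    """
    if not text:
        raise ValueError("text is required")

    boundary = uuid.uuid4().hex
    parts = []

    def add_part(disposition: str, payload: bytes, content_type: Optional[str] = None) -> None:
        # Each part: boundary line, part headers, blank line, payload.
        headers = [f"--{boundary}", f"Content-Disposition: form-data; {disposition}"]
        if content_type:
            headers.append(f"Content-Type: {content_type}")
        parts.append("\r\n".join(headers).encode("utf-8") + b"\r\n\r\n" + payload + b"\r\n")

    add_part('name="text"', text.encode("utf-8"))
    if reference_audio is not None:
        add_part(
            'name="reference_audio"; filename="reference"',
            reference_audio,
            "application/octet-stream",
        )
    if reference_text is not None:
        add_part('name="reference_text"', reference_text.encode("utf-8"))

    # The closing boundary ends the multipart body.
    body = b"".join(parts) + f"--{boundary}--\r\n".encode("utf-8")
    return urllib.request.Request(
        TTS_URL,
        data=body,
        headers={"Content-Type": f"multipart/form-data; boundary={boundary}"},
        method="POST",
    )


# Usage: only the required parameter is set, so the optional parts are omitted.
request = build_tts_request("Hello, world.")
```

Passing the result to `urllib.request.urlopen` and reading the response in chunks would match the streaming audio/wav response, provided the assumed URL and field names are replaced with the real ones.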
Voice Conversion Models
Get available voice conversion models
Example Response
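The response schema is not shown on this page. As an illustration only, the sketch below assumes a JSON body with a top-level `models` array whose entries carry an `id` field, the value that the voice conversion request later expects as its model ID. Both the shape and the sample IDs are hypothetical.

```python
import json
from typing import List


def extract_model_ids(body: str) -> List[str]:
    """Return the model IDs from a JSON body of the assumed shape."""
    payload = json.loads(body)
    return [model["id"] for model in payload["models"]]


# Hypothetical payload; the real field names and IDs may differ.
sample_body = '{"models": [{"id": "model-a"}, {"id": "model-b"}]}'
print(extract_model_ids(sample_body))  # ['model-a', 'model-b']
```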
Voice Conversion
Convert voice using the specified model
Request Parameters
The ID of the voice conversion model to use
The amount to shift the pitch (in semitones). Range: -12 to 12. Default: 0
The audio file to convert
Example Request
Response
Returns the converted audio file as a streaming response with Content-Type: audio/wav.
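The pitch parameter is the one input here with a stated range, so a client may want to check it before uploading audio. The sketch below validates the documented range of -12 to 12 and shows what a semitone shift means as a frequency multiplier, assuming the service uses standard equal-tempered semitones (a factor of 2^(1/12) each). The function names are illustrative, and whether the API accepts fractional values is not stated.

```python
def validate_pitch_shift(semitones: float = 0) -> float:
    """Return the pitch shift if it lies in the documented range of -12 to 12."""
    if not -12 <= semitones <= 12:
        raise ValueError(f"pitch shift must be between -12 and 12, got {semitones}")
    return semitones


def frequency_ratio(semitones: float) -> float:
    """Frequency multiplier for a shift measured in equal-tempered semitones."""
    return 2.0 ** (semitones / 12.0)


print(frequency_ratio(validate_pitch_shift(12)))   # 2.0, one octave up
print(frequency_ratio(validate_pitch_shift(-12)))  # 0.5, one octave down
print(frequency_ratio(validate_pitch_shift()))     # 1.0, the default leaves pitch unchanged
```

Under that assumption, the documented range corresponds to at most one octave in either direction.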
Get Audio File
Get a previously generated audio file
Path Parameters
The name of the audio file to retrieve
Example Request
Response
Returns the audio file as a streaming response with Content-Type: audio/wav.
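Because the file comes back as a stream, a client would typically copy it to disk in chunks instead of loading it whole. The following sketch shows that pattern along with URL construction for the file name path parameter. The base URL is a placeholder, and the in-memory stream stands in for a real HTTP response so the example runs without network access.

```python
import io
from typing import BinaryIO
from urllib.parse import quote

# Hypothetical base URL; the real path is not given in this reference.
BASE_URL = "https://api.example.com/audio"


def audio_file_url(filename: str) -> str:
    """Percent-encode the file name so it is safe to use as one path segment."""
    return f"{BASE_URL}/{quote(filename, safe='')}"


def looks_like_wav(header: bytes) -> bool:
    """Check the RIFF/WAVE magic bytes found at the start of a WAV file."""
    return len(header) >= 12 and header[:4] == b"RIFF" and header[8:12] == b"WAVE"


def save_stream(source: BinaryIO, dest: BinaryIO, chunk_size: int = 8192) -> int:
    """Copy a streamed body to dest in chunks; return the number of bytes written."""
    total = 0
    while True:
        chunk = source.read(chunk_size)
        if not chunk:
            break
        dest.write(chunk)
        total += len(chunk)
    return total


# Usage with an in-memory stand-in for the streamed response body.
fake_response = io.BytesIO(b"RIFF" + (36).to_bytes(4, "little") + b"WAVE" + bytes(36))
saved = io.BytesIO()
written = save_stream(fake_response, saved)
```

With a real response, `source` would be the object returned by `urllib.request.urlopen(audio_file_url(name))` and `dest` a file opened in binary write mode.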