Deepfake Generation API
The Deepfake Generation API allows you to create realistic voice conversions and text-to-speech outputs.Text to Speech
/playground/text-to-speech
Generate speech from text
Request Parameters
The text to convert to speech
Optional. Reference audio file to clone the voice from
Optional. Transcript of the reference audio
Example Request
Response
Returns the generated audio file as a streaming response withContent-Type: audio/wav.
Voice Conversion Models
/playground/voice-conversion/models
Get available voice conversion models
Example Response
Voice Conversion
/playground/voice-conversion
Convert voice using the specified model
Request Parameters
The ID of the voice conversion model to use
The amount to shift the pitch (in semitones). Range: -12 to 12. Default: 0
The audio file to convert
Example Request
Response
Returns the converted audio file as a streaming response withContent-Type: audio/wav.
Get Audio File
/playground/audio/{filename}
Get a previously generated audio file
Path Parameters
The name of the audio file to retrieve
Example Request
Response
Returns the audio file as a streaming response withContent-Type: audio/wav.

