Starts a background job that creates an AI-generated script and produces an audio file from it in convo mode.
The response contains a job_id that can be used with the /podcast/{job_id} endpoint
to query for status.
Requires a valid X-API-TOKEN header.
Note that generating via Convo Mode yields more natural results but takes longer. Allow at least 1 minute of processing for each minute of generated audio before investigating a potential failure.
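The waiting guidance above can be turned into a polling loop. The following is a minimal sketch, not the service's client: `poll_until`, `fetch`, `is_finished`, and `interval_s` are hypothetical names, and because the shape of the status response is not described here, the caller supplies both the function that queries the status and the check that decides whether the job is finished.

```python
import time
from typing import Callable, Optional, TypeVar

T = TypeVar("T")


def poll_until(
    fetch: Callable[[], T],
    is_finished: Callable[[T], bool],
    audio_minutes: float,
    interval_s: float = 10.0,
    sleep: Callable[[float], None] = time.sleep,
    clock: Callable[[], float] = time.monotonic,
) -> Optional[T]:
    """Call fetch() repeatedly until is_finished(result) is true.

    Returns the finished result, or None once the patience window has
    passed. The window follows the guidance above: one minute of waiting
    per expected minute of audio (an assumption about the caller's target
    duration, not a value reported by the service).
    """
    deadline = clock() + audio_minutes * 60.0
    while True:
        result = fetch()
        if is_finished(result):
            return result
        if clock() >= deadline:
            # Window exhausted: time to investigate a potential failure.
            return None
        sleep(interval_s)
```

The `sleep` and `clock` parameters exist only so the loop can be exercised without real waiting; in normal use they can be left at their defaults.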
Throws:
A request to start a job that generates a podcast audio file in convo mode, using a script that the AI writes from the provided prompt. An optional list of voice IDs can be provided to control the audio generation output. If no voice_ids are provided, up to 2 default convo-mode voices will be chosen.
The prompt that instructs the AI to generate the script. You can specify the duration and include links. The AI sometimes hallucinates, so it helps to state in the prompt that the podcast has two hosts.
A list of voice_ids to use for the generated audio. If provided, the list must contain exactly 2 voices for this endpoint, and all IDs in it must be unique. If the script has multiple speakers, the voices are assigned in the order they appear in this list. If no IDs are provided, 2 defaults will be used.
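The voice_ids rules above (either omitted, or exactly 2 unique IDs) can be checked on the client before the request is sent. This is a minimal sketch under that reading; `validate_voice_ids` is a hypothetical helper, not part of the API, and the server's own validation may differ.

```python
from typing import List, Optional, Sequence


def validate_voice_ids(voice_ids: Optional[Sequence[str]]) -> Optional[List[str]]:
    """Check a voice_ids value against the rules described above.

    Returns None when no IDs are given, so the caller can omit the field
    and let the service choose its 2 defaults. Otherwise returns the IDs
    as a list, preserving order, since order decides speaker assignment.
    """
    if not voice_ids:
        return None
    ids = list(voice_ids)
    if len(ids) != 2:
        raise ValueError(f"expected exactly 2 voice IDs, got {len(ids)}")
    if len(set(ids)) != len(ids):
        raise ValueError("voice IDs must be unique")
    return ids
```

For example, `validate_voice_ids(["voice-a", "voice-b"])` returns the list unchanged, while `validate_voice_ids(["voice-a", "voice-a"])` raises `ValueError`. The ID strings here are placeholders.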
Optional delivery instructions for the generated audio, used to guide how the audio is generated.
An optional spec for adding background music to the generated script. Note that if the chosen music track is longer than the duration of the generated script, the track will be cut off.
Successful Response
The job ID for the episode generation. The status of this job can be queried using the /podcast/{job_id} endpoint.
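As a small illustration of how the returned job_id feeds the status query, the sketch below builds the status URL and the authentication header. The base URL `https://api.example.com` is a placeholder and both helper names are hypothetical; only the `/podcast/{job_id}` path and the `X-API-TOKEN` header name come from the description above.

```python
from typing import Dict
from urllib.parse import quote


def status_url(base_url: str, job_id: str) -> str:
    """Build the status URL for a job from the /podcast/{job_id} path.

    base_url is an assumption supplied by the caller; the job_id is
    percent-encoded so it is safe to place in a URL path segment.
    """
    return f"{base_url.rstrip('/')}/podcast/{quote(job_id, safe='')}"


def auth_headers(api_token: str) -> Dict[str, str]:
    """Return the X-API-TOKEN header required by the endpoint."""
    return {"X-API-TOKEN": api_token}


if __name__ == "__main__":
    # Placeholder values for illustration only.
    print(status_url("https://api.example.com", "abc123"))
    print(auth_headers("my-token"))
```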