FAQs
Common questions and answers.
On this page you’ll find answers to common questions about our API and AI models.
Why am I getting a 'Could not load audio' error?
Why am I getting a 'Could not load audio' error?
This error typically occurs when the provided audio file URL is not a direct link or when there are issues with accessing the file. Make sure the URL is direct and publicly accessible, without requiring additional confirmation steps.
Please refer to the troubleshooting guide for more information on how to resolve this issue.
What is the maximum number of voiceprints supported in speaker identification?
What is the maximum number of voiceprints supported in speaker identification?
Identification jobs support up to 50 voiceprints.
What is the maximum file size supported by the API?
What is the maximum file size supported by the API?
The API supports files of up to 1GiB for diarization and identification jobs, and up to 100MiB for voiceprint jobs.
What is the maximum duration of an audio file for diarization and speaker identification?
What is the maximum duration of an audio file for diarization and speaker identification?
The maximum duration of an audio file for diarization and speaker identification is 24 hours. For voiceprints, the maximum is 30 seconds.
I haven't received a webhook request after submitting a job.
I haven't received a webhook request after submitting a job.
Please check the status of your job using the get job API endpoint. If the job is still in progress, please wait for it to complete. If the job is completed, but you haven’t received the webhook request, please check your webhook configuration and make sure that the webhook URL is correct. For more information, see webhooks.
How can I secure my webhook endpoint for pyannoteAI API?
How can I secure my webhook endpoint for pyannoteAI API?
pyannoteAI supports webhook signatures, which can be used to verify the authenticity of webhook requests. See verifying webhooks for more information.
Which audio file formats are supported by pyannoteAI?
Which audio file formats are supported by pyannoteAI?
Most major audio formats are supported, including mp3, wav, m4a, ogg, flac, among others. We recommend the use of compressed audio to make sure the file size is under the 1GiB limit.
Is the speaker diarization model on the API the same as the one in the pyannote open-source library?
Is the speaker diarization model on the API the same as the one in the pyannote open-source library?
No, the speaker diarization model used in the API is an improved version of the pyannote open-source model. It is faster and more accurate.
How can I contact support?
How can I contact support?
Please first check the documentation and the troubleshooting section for common issues. If you still have questions, please contact us at support@pyannote.ai.