Python – What audio formats does Azure Cognitive Services Speech Service (SST) support?

What audio formats does Azure Cognitive Services Speech Service (SST) support?… here is a solution to the problem.

What audio formats does Azure Cognitive Services Speech Service (SST) support?

Keep in mind that as far as I know, the Speech service of Microsoft/Azure Cognitive Services is currently undergoing rationalization efforts

https://learn.microsoft.com/en-us/azure/cognitive-services/speech-service/rest-apis#speech-to-text

https://learn.microsoft.com/en-us/azure/cognitive-services/speech/home

Only .wav binaries are acceptable, anything else gives a response:

{"Message":"Unsupported audio format"}

Is there any other way to discover acceptable audio formats/encodings etc or is that it?

[Bonus points for prompts for preprocessing arbitrary/.m4a audio formats in python pydub so that they meet the criteria – currently for .mp3 but not for .m4a].

Thanks!

Solution

Related Problems and Solutions