Skip to content

Audio

OpenGateway exposes the OpenAI Audio shapes, normalized across Hugging Face providers.

POST /<lane>/v1/audio/speech returns audio bytes.

Terminal window
curl "https://api.opengateway.one/oss/v1/audio/speech" \
-H "Authorization: Bearer $OPENGATEWAY_API_KEY" \
-H "Content-Type: application/json" \
-d '{ "model": "hexgrad/Kokoro-82M", "input": "Welcome to the edge.", "voice": "default" }' \
--output speech.mp3

POST /<lane>/v1/audio/transcriptions accepts multipart form data.

Terminal window
curl "https://api.opengateway.one/oss/v1/audio/transcriptions" \
-H "Authorization: Bearer $OPENGATEWAY_API_KEY" \
-F "model=openai/whisper-large-v3" \
-F "file=@sample.mp3"
{ "text": "Welcome to the edge." }

Translation is available at /v1/audio/translations. Requesting a non-audio model returns 422 model_task_mismatch.