Skip to content

Responses

POST /<lane>/v1/responses — the OpenAI Responses API. This is the only wire protocol Codex supports (wire_api = "responses"), and the OpenAI SDK can use it via client.responses.create.

Terminal window
curl "https://api.opengateway.one/oss/v1/responses" \
-H "Authorization: Bearer $OPENGATEWAY_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"model": "claude-turbo-hub-gpt-oss-120b",
"input": "Say PONG only.",
"reasoning": { "effort": "low" }
}'
{
"id": "resp_...",
"object": "response",
"model": "claude-turbo-hub-gpt-oss-120b",
"output": [
{
"type": "message",
"role": "assistant",
"content": [{ "type": "output_text", "text": "PONG" }]
}
],
"usage": { "input_tokens": 5, "output_tokens": 1, "total_tokens": 6 }
}
  • Set stream: true for SSE response events.
  • reasoning.effort accepts none|minimal|low|medium|high|xhigh — Codex maps its model_reasoning_effort onto this. See Effort levels.
  • previous_response_id continuity is supported for multi-turn flows.

See Codex setup for the full ~/.codex/config.toml.