Responses
POST /<lane>/v1/responses — the OpenAI Responses API. This is the only wire
protocol Codex supports (wire_api = "responses"), and the OpenAI SDK can use it
via client.responses.create.
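As a rough sketch of the SDK route, the snippet below sends the same prompt as the request example on this page through `client.responses.create`. It assumes the `openai` Python package is installed and that the SDK base URL is the request URL with the trailing `/responses` removed (`https://api.opengateway.one/oss/v1`). The helper names `build_request_kwargs` and `ask` are hypothetical, not part of any library.

```python
import os


def build_request_kwargs(prompt: str, effort: str = "low") -> dict:
    """Build keyword arguments for client.responses.create (no network access)."""
    return {
        "model": "claude-turbo-hub-gpt-oss-120b",
        "input": prompt,
        "reasoning": {"effort": effort},
    }


def ask(prompt: str) -> str:
    """Send one prompt through the gateway and return the generated text."""
    # Imported lazily so build_request_kwargs works without the SDK installed.
    from openai import OpenAI

    client = OpenAI(
        # Assumption: the curl URL on this page minus the "/responses" suffix.
        base_url="https://api.opengateway.one/oss/v1",
        api_key=os.environ["OPENGATEWAY_API_KEY"],
    )
    response = client.responses.create(**build_request_kwargs(prompt))
    # output_text is the SDK's convenience accessor for the concatenated text output.
    return response.output_text
```

Calling `ask("Say PONG only.")` with `OPENGATEWAY_API_KEY` set would issue the request. Keeping the argument builder separate makes the payload easy to inspect without touching the network.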
Request
    curl "https://api.opengateway.one/oss/v1/responses" \
      -H "Authorization: Bearer $OPENGATEWAY_API_KEY" \
      -H "Content-Type: application/json" \
      -d '{
        "model": "claude-turbo-hub-gpt-oss-120b",
        "input": "Say PONG only.",
        "reasoning": { "effort": "low" }
      }'
Response
    {
      "id": "resp_...",
      "object": "response",
      "model": "claude-turbo-hub-gpt-oss-120b",
      "output": [
        {
          "type": "message",
          "role": "assistant",
          "content": [{ "type": "output_text", "text": "PONG" }]
        }
      ],
      "usage": { "input_tokens": 5, "output_tokens": 1, "total_tokens": 6 }
    }
Streaming & effort
- Set stream: true for SSE response events.
- reasoning.effort accepts none|minimal|low|medium|high|xhigh; Codex maps its model_reasoning_effort onto this. See Effort levels.
- previous_response_id continuity is supported for multi-turn flows.
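The options above can be combined in a single request body. The following is a minimal sketch using only the Python standard library. It builds a payload with `stream`, `reasoning.effort`, and `previous_response_id`, and extracts the text from a response shaped like the example on this page. The function names `build_payload` and `output_text` are hypothetical helpers, and client-side validation of the effort value is an illustrative choice rather than something the gateway requires.

```python
# Effort values listed in this section.
EFFORT_LEVELS = ("none", "minimal", "low", "medium", "high", "xhigh")


def build_payload(model, user_input, effort="low", stream=False,
                  previous_response_id=None):
    """Build a JSON-serializable body for POST /<lane>/v1/responses."""
    if effort not in EFFORT_LEVELS:
        raise ValueError(f"unsupported reasoning effort: {effort!r}")
    payload = {
        "model": model,
        "input": user_input,
        "reasoning": {"effort": effort},
    }
    if stream:
        # Ask for SSE response events instead of a single JSON body.
        payload["stream"] = True
    if previous_response_id is not None:
        # Continue a multi-turn flow from an earlier response.
        payload["previous_response_id"] = previous_response_id
    return payload


def output_text(response):
    """Concatenate every output_text block found in message output items."""
    parts = []
    for item in response.get("output", []):
        if item.get("type") != "message":
            continue
        for block in item.get("content", []):
            if block.get("type") == "output_text":
                parts.append(block.get("text", ""))
    return "".join(parts)
```

For a follow-up turn, pass the `id` of the earlier response as `previous_response_id`, for example `build_payload(model, "And again?", previous_response_id=first["id"])`, where `first` is the parsed JSON of the earlier response.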
See Codex setup for the full ~/.codex/config.toml.