OpenAI
OpenAI-compatible /chat/completions format — drop-in replacement
Access Google Gemini through API in One's OpenAI-compatible chat API. Advanced text generation, reasoning, and conversation capabilities. 5 credits are pre-deducted, then adjusted by token usage.
These capabilities come from the current model config and backend route. Unreleased APIs are not listed.
OpenAI-compatible /chat/completions format — drop-in replacement
One of the cheapest LLM APIs available
128K token context window for long documents
No Google Cloud or Vertex AI setup required
Clients call same-origin API routes; the server BFF forwards to the matching Worker.
POST/api/v1/chat/completions5 creditsGoogle's most capable AI model for text generation, reasoning, and conversation.
messagesarrayArray of message objects with role (user/assistant/system/developer) and content
RequiredstreambooleanEnable SSE streaming responses
Optional · Default trueinclude_thoughtsbooleanInclude model reasoning/thinking steps
Optional · Default truereasoning_effortstringReasoning depth: "low" or "high"
Optional · Default hightoolsarrayTool definitions for function calling (OpenAI-compatible format)
Optionalresponse_formatobjectForce structured output (e.g. {type: "json_object"})
Optional{
"endpoint": "/api/v1/chat/completions",
"headers": {
"Authorization": "Bearer <API_KEY>",
"Content-Type": "application/json"
},
"body": {
"model": "gemini",
"messages": [
{
"role": "user",
"content": "Explain quantum computing in simple terms."
}
],
"stream": false,
"include_thoughts": true
}
}~5 credits pre-deduct, pay-per-token
These workflows are supported by the current model and backend node.
Build AI-powered chatbots and customer support agents
Generate and summarize text content at scale
Create AI coding assistants and code generation tools
Power reasoning and analysis features in your application
Yes. API in One uses the standard OpenAI chat/completions format. You can switch from OpenAI to Gemini by changing the base URL and model name — no other code changes needed.
5 credits are pre-deducted per request, then adjusted based on actual token usage (pay-per-token). One of the most affordable LLM APIs. Free credits included on sign up.
Yes. Streaming is enabled by default via SSE (Server-Sent Events). Set stream to false for synchronous responses.
Gemini 3 Flash supports up to 1M tokens of context, suitable for processing long documents and maintaining extended conversations.