Use this in the request body model field.
gemini API
Access Google Gemini through API in One's OpenAI-compatible chat API. Advanced text generation, reasoning, and conversation capabilities. 5 credits are pre-deducted, then adjusted by token usage.
The currently implemented BFF route.
Input types: text
Keys are managed by the shared dashboard.
This route does not require webhook handling.
Live capabilities
These capabilities come from the current model config and backend route. Unreleased APIs are not listed.
OpenAI
OpenAI-compatible /chat/completions format — drop-in replacement
One of the cheapest LLM APIs available
One of the cheapest LLM APIs available
128K token context window for long documents
128K token context window for long documents
No Google Cloud or Vertex AI setup required
No Google Cloud or Vertex AI setup required
Request format
Clients call same-origin API routes; the server BFF forwards to the matching Worker.
POST/api/v1/chat/completions5 creditsGoogle's most capable AI model for text generation, reasoning, and conversation.
Parameters
messagesarrayArray of message objects with role (user/assistant/system/developer) and content
RequiredstreambooleanEnable SSE streaming responses
Optional · Default trueinclude_thoughtsbooleanInclude model reasoning/thinking steps
Optional · Default truereasoning_effortstringReasoning depth: "low" or "high"
Optional · Default hightoolsarrayTool definitions for function calling (OpenAI-compatible format)
Optionalresponse_formatobjectForce structured output (e.g. {type: "json_object"})
OptionalExample request
curl -X POST https://apiin.one/api/v1/chat/completions \
-H "Authorization: Bearer aio_your_api_key" \
-H "Content-Type: application/json" \
--data-raw '{
"model": "gemini",
"messages": [
{
"role": "user",
"content": "Explain quantum computing in simple terms."
}
],
"stream": false,
"include_thoughts": true
}'Integration flow
These steps use the current live route and shared account system; login, payment, orders, credits, and profile remain shared.
Create a shared API key
Create a key in the existing dashboard. Top-up, orders, credit balance, and profile stay in the shared console.
Call the real endpoint
Send a request with the endpoint, model id, and parameters shown on this page. /api/v1/chat/completions
Poll task status
Chat models return directly without task polling.
Pricing
~5 credits pre-deduct, pay-per-token
Use cases
These workflows are supported by the current model and backend node.
Build AI-powered chatbots and customer support agents
Generate and summarize text content at scale
Create AI coding assistants and code generation tools
Power reasoning and analysis features in your application
FAQ
Is the Gemini API compatible with OpenAI's format?
Yes. API in One uses the standard OpenAI chat/completions format. You can switch from OpenAI to Gemini by changing the base URL and model name — no other code changes needed.
How much does the Gemini API cost?
5 credits are pre-deducted per request, then adjusted based on actual token usage (pay-per-token). One of the most affordable LLM APIs. Free credits included on sign up.
Does the Gemini API support streaming?
Yes. Streaming is enabled by default via SSE (Server-Sent Events). Set stream to false for synchronous responses.
What is the maximum context length?
Gemini 3 Flash supports up to 1M tokens of context, suitable for processing long documents and maintaining extended conversations.