Stable production route
OpenAI-compatible /chat/completions format — drop-in replacement
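Gemini is connected through API in One to the live production route /api/v1/chat/completions, served by backend node node62. A single API key is enough to call it, and billing applies only to successful requests.
The capabilities below come from the currently connected model configuration and backend route; APIs that are not yet live are not shown.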
One of the cheapest LLM APIs available
128K token context window for long documents
No Google Cloud or Vertex AI setup required
The browser only calls the same-origin API; the server-side BFF then forwards each request to the corresponding Worker.
messages (array, required): Array of message objects with role (user/assistant/system/developer) and content
stream (boolean, optional, default true): Enable SSE streaming responses
include_thoughts (boolean, optional, default true): Include model reasoning/thinking steps
reasoning_effort (string, optional, default "high"): Reasoning depth: "low" or "high"
tools (array, optional): Tool definitions for function calling (OpenAI-compatible format)
response_format (object, optional): Force structured output (e.g. {type: "json_object"})
{
"endpoint": "/api/v1/chat/completions",
"headers": {
"Authorization": "Bearer <API_KEY>",
"Content-Type": "application/json"
},
"body": {
"model": "gemini",
"messages": [
{
"role": "user",
"content": "Explain quantum computing in simple terms."
}
],
"stream": false,
"include_thoughts": true
}
}
~5 credits pre-deducted per request, pay-per-token
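The request body above can be assembled programmatically. The following is a minimal sketch, assuming the documented parameter names and defaults apply as listed; the helper name build_chat_body is hypothetical, and the empty-messages check is an assumption rather than documented server behavior.

```python
import json
from typing import Any, Optional


def build_chat_body(
    messages: list[dict[str, str]],
    stream: bool = True,
    include_thoughts: bool = True,
    reasoning_effort: str = "high",
    tools: Optional[list[dict[str, Any]]] = None,
    response_format: Optional[dict[str, str]] = None,
) -> dict[str, Any]:
    """Assemble a request body from the documented parameters (hypothetical helper)."""
    # Assumption: an empty messages array is rejected; the page only says it is required.
    if not messages:
        raise ValueError("messages is required and must not be empty")
    if reasoning_effort not in ("low", "high"):
        raise ValueError('reasoning_effort must be "low" or "high"')
    body: dict[str, Any] = {
        "model": "gemini",
        "messages": messages,
        "stream": stream,
        "include_thoughts": include_thoughts,
        "reasoning_effort": reasoning_effort,
    }
    # Optional fields are only included when the caller supplies them.
    if tools is not None:
        body["tools"] = tools
    if response_format is not None:
        body["response_format"] = response_format
    return body


body = build_chat_body(
    [{"role": "user", "content": "Explain quantum computing in simple terms."}],
    stream=False,
    response_format={"type": "json_object"},
)
print(json.dumps(body, indent=2))
```

Passing stream=False mirrors the synchronous example above; leaving it out keeps the documented default of streaming.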
这些场景可通过当前模型和对应后端节点实现。
Build AI-powered chatbots and customer support agents
Generate and summarize text content at scale
Create AI coding assistants and code generation tools
Power reasoning and analysis features in your application
Yes. API in One uses the standard OpenAI chat/completions format. You can switch from OpenAI to Gemini by changing the base URL and model name — no other code changes needed.
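As a rough illustration of that switch, the sketch below builds the same request for two providers using only the Python standard library. The API in One host is a placeholder (the page gives only the path /api/v1/chat/completions), and the helper name make_request and the OpenAI model name are illustrative choices, not values from this page.

```python
import json
import urllib.request

# Placeholder host: replace with the real API in One base URL.
API_IN_ONE_BASE_URL = "https://api-in-one.example/api/v1"
OPENAI_BASE_URL = "https://api.openai.com/v1"


def make_request(base_url: str, api_key: str, model: str, prompt: str) -> urllib.request.Request:
    """Build (but do not send) a chat/completions request (hypothetical helper)."""
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "stream": False,
    }
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


# Only the base URL, key, and model name differ between the two calls.
openai_req = make_request(OPENAI_BASE_URL, "<OPENAI_KEY>", "gpt-4o-mini", "Hello")
gemini_req = make_request(API_IN_ONE_BASE_URL, "<API_KEY>", "gemini", "Hello")
print(gemini_req.full_url)
```

Sending the request would be a call such as urllib.request.urlopen(gemini_req); it is left out here so the sketch runs without network access or a real key.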
5 credits are pre-deducted per request, then adjusted based on actual token usage (pay-per-token), which makes it one of the most affordable LLM APIs. Free credits are included at sign-up.
Yes. Streaming is enabled by default via SSE (Server-Sent Events). Set stream to false for synchronous responses.
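A streamed response can be consumed line by line. The sketch below assumes the stream follows the usual OpenAI-compatible SSE shape ("data: {json}" lines carrying choices[].delta.content, ending with "data: [DONE]"); this page does not confirm the exact chunk format, and the sample lines are hand-written for illustration, not captured output.

```python
import json
from typing import Iterable, Iterator


def iter_text_deltas(lines: Iterable[str]) -> Iterator[str]:
    """Yield text fragments from OpenAI-style SSE lines (assumed chunk format)."""
    for raw in lines:
        line = raw.strip()
        if not line.startswith("data:"):
            continue  # skip blank separators, comments, and other SSE fields
        data = line[len("data:"):].strip()
        if data == "[DONE]":
            break  # assumed end-of-stream sentinel
        chunk = json.loads(data)
        for choice in chunk.get("choices", []):
            text = choice.get("delta", {}).get("content")
            if text:
                yield text


# Hand-written sample stream for illustration only.
sample = [
    'data: {"choices": [{"delta": {"role": "assistant"}}]}',
    "",
    'data: {"choices": [{"delta": {"content": "Hel"}}]}',
    "",
    'data: {"choices": [{"delta": {"content": "lo"}}]}',
    "",
    "data: [DONE]",
]
print("".join(iter_text_deltas(sample)))  # prints "Hello"
```

In a real client the lines would come from the HTTP response body decoded as UTF-8 text; with include_thoughts enabled, reasoning output may arrive in fields this sketch does not read.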
Gemini 3 Flash supports up to 1M tokens of context, suitable for processing long documents and maintaining extended conversations.