Get Started
Quick Start
Base URL
Authentication
Chat Completion (OpenAI-compatible)
curl -X POST https://apigateway-q66ryynraa-uc.a.run.app/v1/chat/completions \
-H "Authorization: Bearer aaas_xxx" \
-H "Content-Type: application/json" \
-d '{
"model": "google/gemini-2.5-flash",
"messages": [
{"role": "user", "content": "Hello!"}
]
}'22 Endpoints
Grouped by category. All authenticated endpoints require a Bearer token.
System
LLM
Foundry
Intelligence
Billing
Keys
Model Pricing
Per 1M tokens. Credits deducted based on actual usage. Gemini models available now; others activating.
| Model | Provider | Input/1M | Output/1M | Context | Status |
|---|---|---|---|---|---|
| google/gemini-2.5-flash | $0.15 | $0.60 | 1M | ||
| google/gemini-2.5-pro | $1.25 | $5.00 | 1M | ||
| anthropic/claude-4.5-sonnet | Anthropic | $3.00 | $15.00 | 200K | |
| anthropic/claude-4.5-haiku | Anthropic | $0.80 | $4.00 | 200K | |
| openai/gpt-5 | OpenAI | $2.50 | $10.00 | 128K | |
| openai/gpt-5-mini | OpenAI | $0.15 | $0.60 | 128K | |
| deepseek/deepseek-r1 | OpenRouter | $0.55 | $2.19 | 128K | |
| meta/llama-3.3-70b | OpenRouter | $0.40 | $0.40 | 128K | |
| perplexity/sonar-pro | Perplexity | $3.00 | $15.00 | 200K |
Live now Activating (needs provider key)
Not Just an LLM Router
Language Models
Claude, GPT, Gemini, DeepSeek, Llama, Mistral, Perplexity — all via one key. OpenAI-compatible format means zero migration cost.
Agent Chains
Multi-agent orchestration via API. DevOps incident response, revenue pipeline, customer success, HR, finance, legal — each chain composes autonomous workflows. No competitor offers this.
Composable Skills
A skill vault that agents draw from. Anomaly detection, contract redlining, bias mitigation, flight-risk modeling — typed capabilities that compose into chains.
Credit System
Pay-as-you-go. $5 minimum. Credits never expire. Negative balance is mathematically impossible — every request is credit-checked before execution.