Public Models
Public models are pre-deployed inference endpoints operated by QDivZero. You call them by name — no launch wizard, no GPU selection, no scheduler — and pay per token consumed. Under the hood, the platform routes the request across the providers that host each model.
Public models vs. Compute instances
Catalog
The current catalog. The platform adds models as providers expose them; the pricing is in EUR per million tokens (input and output priced separately).
| Model | Workload | European | Input / M | Output / M |
|---|---|---|---|---|
| deepseek-v3.2 | chat | No | €0.35 | €1.43 |
| deepseek-v3.2-european | chat | Yes | €0.81 | €2.18 |
| gpt-oss-120b-european | chat | Yes | €0.34 | €1.37 |
| gpt-oss-safeguard-120b-european | chat | Yes | €0.34 | €1.37 |
| gpt-oss-20b-european | chat | Yes | €0.09 | €0.39 |
| gpt-oss-safeguard-20b-european | chat | Yes | €0.09 | €0.39 |
Variants ending in -european
Where to go next
Calling public models →
curl and OpenAI SDK examples, plus which features are forwarded to the provider.
Pricing model →
How token usage is billed and the ledger shape.
Use in Smart Balancers →
Reference a public model as a route destination or as the intent classifier.
Inference →
Full request shape: backend selection, firewall, tools, and streaming.