Create a Smart Balancer
The create wizard is opinionated. The defaults cover the most common case (an intent classifier with three routes and a default). Most production balancers only need a few additional routes on top of that.
Steps
Pick a name and a serving name
The name is internal, the serving name is what clients will send as the model id.
Choose the workload kind
chat or embeddings. The balancer rejects mismatched calls.
Choose a routing mode
intent_classifier for many specialised routes, ordered for failover or weighted routing.
Pick a router model
Required for intent_classifier. A small chat instance is enough.
Add routes
Each route needs a name, an intent (a short declarative sentence), and a destination. Mark one route as default.
Save and test
The platform validates the configuration and shows a sample call you can paste into the playground.
Default route is required
Target groups (optional)
A target group is a named pool of instances. Use it when several balancers should share the same pool, or when you want to rotate capacity without editing each balancer.
1{
2 "name": "tech-support-pool",
3 "display_name": "Tech support pool",
4 "description": "Pool of tech-support instances behind a single name.",
5 "workload_kind": "chat",
6 "selection_policy": "weighted_round_robin",
7 "enabled": true,
8 "members": [
9 { "instance_id": "inst_a", "weight": 100, "enabled": true, "priority": 1 },
10 { "instance_id": "inst_b", "weight": 100, "enabled": true, "priority": 2 }
11 ]
12}Smoke testing
After the balancer is created, the detail page shows a sample call you can paste into any OpenAI-compatible client. The smoke test hits the default route by default; the playground exposes a dropdown to pick a specific route for testing.