Hydite Vtslx AO
The unified LLM API aggregation and orchestration platform
Hydite Vtslx AO is a unified API aggregation platform that brings together hundreds of large language models (LLMs) from dozens of providers into a single, consistent API interface. Whether you're building AI-powered applications, running model evaluations, or operating an enterprise-scale AI infrastructure, Vtslx AO eliminates the complexity of managing multiple provider SDKs, API keys, and billing relationships.
Supported models and providers#
Vtslx AO aggregates models across the entire LLM ecosystem:
- Global frontier models — OpenAI (GPT-5.5, GPT-5.5 Pro, GPT-5.2, GPT-5.5 Instant), Anthropic (Claude Opus 4.7, Claude Sonnet 4.6, Claude Haiku 4.5, Claude Mythos), Google (Gemini 3.1, Gemini 3.1 Flash), Meta (Llama 4 Behemoth, Llama 4 Maverick, Muse Spark)
- Chinese domestic models — DeepSeek (V4 Pro, V4 Flash, R1-0528), Qwen (Qwen3.6, Qwen3.6-Max, Qwen3-Coder-Next, QwQ-32B), Zhipu (GLM-5), Moonshot (Kimi K2.6), MiniMax (M2.7), ByteDance (Doubao Seed 2.0), StepFun (Step 3.5 Flash)
- Open-source and specialized models — Mistral (Large 3, Small 4), xAI (Grok Heavy, Grok 4), Cohere (Command R+), and more
All models are accessible through a single API endpoint with OpenAI-compatible request/response format, meaning you can switch between models by changing one parameter — no SDK changes required.
Deployment options#
Vtslx AO offers two deployment tiers to match your scale and security requirements:
Shared instance (Edge API)#
Shared instances are deployed across globally distributed edge nodes, providing low-latency access from any region. This tier is ideal for:
- Development and testing
- Small to medium-scale production applications
- Projects that don't require data residency guarantees
Shared instances share compute resources with other tenants, with fair-use rate limiting to ensure consistent performance.
Dedicated instance (Enterprise)#
Dedicated instances provision isolated compute nodes exclusively for your organization. This tier delivers:
- Higher concurrency — No shared throttling; burst to your licensed capacity
- Faster response times — Dedicated GPU/CPU resources eliminate noisy-neighbor latency
- Private model deployment — Deploy fine-tuned or proprietary models directly to your dedicated nodes
- Data residency — Choose deployment regions that meet your compliance requirements
- Custom domain — Serve API traffic through your own domain with full TLS control
- SLA guarantees — Enterprise-grade uptime commitments and priority support
Key features#
Unified API format#
All models speak the same language. Vtslx AO exposes an OpenAI-compatible /v1/chat/completions endpoint, so any existing OpenAI SDK or tooling works out of the box with every supported model.
API key groups#
Organize API keys into logical groups with shared configuration — rate limits, budget caps, allowed model lists, and system prompts. Groups make it easy to manage access for different teams, environments, or customer tiers.
Fine-grained rate limiting#
Control usage at both the group and individual key level:
- RPM (requests per minute)
- TPM (tokens per minute)
- Budget caps (daily, weekly, monthly, or yearly spend limits)
System prompts at the gateway#
Inject system prompts at the API gateway level, ensuring consistent behavior across all consumers of a key group without requiring client-side changes.
Usage analytics#
Track token consumption, request volumes, latency distributions, and cost breakdowns by model, key, and group — all from the Vtslx AO dashboard.
Getting started#
- Create an organization in the Vtslx AO Dashboard
- Create an API key group with your desired rate limits and model access
- Generate an API key and start making requests:
1curl https://api.hydite.com/v1/chat/completions \2 -H "Content-Type: application/json" \3 -H "Authorization: Bearer YOUR_API_KEY" \4 -d '{5 "model": "gpt-5.5",6 "messages": [{"role": "user", "content": "Hello!"}]7 }'To switch models, just change the model field — no other code changes needed.