
Hydite Vtslx AO

The unified LLM API aggregation and orchestration platform

Hydite Vtslx AO is a unified API aggregation platform that brings together hundreds of large language models (LLMs) from dozens of providers behind a single, consistent API interface. Whether you're building AI-powered applications, running model evaluations, or operating enterprise-scale AI infrastructure, Vtslx AO removes the need to manage multiple provider SDKs, API keys, and billing relationships.

Supported models and providers

OpenAI, Anthropic, Google, Meta, DeepSeek, Qwen, Zhipu, Moonshot, MiniMax, ByteDance, StepFun, xAI, Mistral, Cohere

Vtslx AO aggregates models across the entire LLM ecosystem:

Global frontier models — OpenAI (GPT-5.5, GPT-5.5 Pro, GPT-5.2, GPT-5.5 Instant), Anthropic (Claude Opus 4.7, Claude Sonnet 4.6, Claude Haiku 4.5, Claude Mythos), Google (Gemini 3.1, Gemini 3.1 Flash), Meta (Llama 4 Behemoth, Llama 4 Maverick, Muse Spark)
Chinese domestic models — DeepSeek (V4 Pro, V4 Flash, R1-0528), Qwen (Qwen3.6, Qwen3.6-Max, Qwen3-Coder-Next, QwQ-32B), Zhipu (GLM-5), Moonshot (Kimi K2.6), MiniMax (M2.7), ByteDance (Doubao Seed 2.0), StepFun (Step 3.5 Flash)
Open-source and specialized models — Mistral (Large 3, Small 4), xAI (Grok Heavy, Grok 4), Cohere (Command R+), and more

All models are accessible through a single API endpoint that uses an OpenAI-compatible request/response format, so you can switch between models by changing one parameter, with no SDK changes required.

Deployment options

Vtslx AO offers two deployment tiers to match your scale and security requirements:

Shared instance (Edge API)

Shared instances are deployed across globally distributed edge nodes, providing low-latency access from any region. This tier is ideal for:

Development and testing
Small to medium-scale production applications
Projects that don't require data residency guarantees

Shared instances run on compute resources shared with other tenants; fair-use rate limiting keeps performance consistent.

Dedicated instance (Enterprise)

Dedicated instances provision isolated compute nodes exclusively for your organization. This tier delivers:

Higher concurrency — No shared throttling; burst to your licensed capacity
Faster response times — Dedicated GPU/CPU resources eliminate noisy-neighbor latency
Private model deployment — Deploy fine-tuned or proprietary models directly to your dedicated nodes
Data residency — Choose deployment regions that meet your compliance requirements
Custom domain — Serve API traffic through your own domain with full TLS control
SLA guarantees — Enterprise-grade uptime commitments and priority support

Key features

Unified API format

All models speak the same language. Vtslx AO exposes an OpenAI-compatible /v1/chat/completions endpoint, so any existing OpenAI SDK or tooling works out of the box with every supported model.
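
As a rough illustration of what "OpenAI-compatible" means in practice, the sketch below builds a chat completion request with Python's standard library and reads the reply from an OpenAI-style response body. The endpoint URL is the one used in the Getting started example; the helper names `build_chat_request` and `extract_reply` are made up for this sketch, and nothing is sent over the network.

```python
import json
import urllib.request

# Endpoint taken from the Getting started example on this page.
API_URL = "https://api.hydite.com/v1/chat/completions"


def build_chat_request(api_key: str, model: str, user_text: str) -> urllib.request.Request:
    """Build (but do not send) an OpenAI-compatible chat completion request.

    Hypothetical helper for illustration only.
    """
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": user_text}],
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


def extract_reply(response_body: dict) -> str:
    """Read the assistant text from an OpenAI-style chat completion response."""
    return response_body["choices"][0]["message"]["content"]


request = build_chat_request("YOUR_API_KEY", "gpt-5.5", "Hello!")
# To actually send it: urllib.request.urlopen(request), then json.load() the result.
```

Because the request and response shapes follow the OpenAI format, the same two helpers should apply regardless of which model is named in the payload.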

API key groups

Organize API keys into logical groups with shared configuration — rate limits, budget caps, allowed model lists, and system prompts. Groups make it easy to manage access for different teams, environments, or customer tiers.
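
One way to picture a key group is as a small record of shared settings that every key in the group inherits. The sketch below is only a mental model: the `KeyGroup` class and all of its field names are assumptions made for illustration, not the platform's actual schema or API.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class KeyGroup:
    """Hypothetical model of a key group's shared configuration."""

    name: str
    rpm_limit: int                      # requests per minute
    tpm_limit: int                      # tokens per minute
    monthly_budget_usd: float           # budget cap (assumed unit and period)
    allowed_models: set[str] = field(default_factory=set)
    system_prompt: Optional[str] = None
    api_keys: list[str] = field(default_factory=list)

    def allows(self, model: str) -> bool:
        """True if keys in this group may call the given model."""
        return model in self.allowed_models


# Example: a staging group limited to one model, shared by two keys.
staging = KeyGroup(
    name="staging",
    rpm_limit=60,
    tpm_limit=100_000,
    monthly_budget_usd=50.0,
    allowed_models={"gpt-5.5"},
    api_keys=["key-a", "key-b"],
)
```

Modeling the group rather than the key as the unit of configuration is what lets one change, such as tightening the allowed model list, apply to every key in a team or environment at once.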

Fine-grained rate limiting

Control usage at both the group and individual key level:

RPM (requests per minute)
TPM (tokens per minute)
Budget caps (daily, weekly, monthly, or yearly spend limits)
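
To make the RPM and TPM limits concrete, here is a minimal sliding-window limiter that tracks both counters over the last 60 seconds. It is a sketch of the general technique, not a description of how Vtslx AO enforces limits internally; the class name and its behavior at the window boundary are assumptions.

```python
from collections import deque


class MinuteWindowLimiter:
    """Sliding 60-second window over request count (RPM) and token count (TPM)."""

    def __init__(self, rpm: int, tpm: int) -> None:
        self.rpm = rpm
        self.tpm = tpm
        self._events = deque()          # (timestamp_seconds, tokens) per accepted request
        self._tokens_in_window = 0

    def allow(self, tokens: int, now: float) -> bool:
        """Accept the request if it fits both limits; record it only when accepted."""
        # Evict requests that are 60 seconds old or older.
        while self._events and now - self._events[0][0] >= 60.0:
            _, old_tokens = self._events.popleft()
            self._tokens_in_window -= old_tokens
        if len(self._events) + 1 > self.rpm:
            return False                # would exceed requests per minute
        if self._tokens_in_window + tokens > self.tpm:
            return False                # would exceed tokens per minute
        self._events.append((now, tokens))
        self._tokens_in_window += tokens
        return True


limiter = MinuteWindowLimiter(rpm=2, tpm=100)
first = limiter.allow(40, now=0.0)      # accepted
second = limiter.allow(40, now=1.0)     # accepted
third = limiter.allow(10, now=2.0)      # rejected: a third request within the minute
```

Passing `now` explicitly keeps the sketch deterministic; a real caller would supply `time.monotonic()`.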

System prompts at the gateway

Inject system prompts at the API gateway level, ensuring consistent behavior across all consumers of a key group without requiring client-side changes.
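
Conceptually, gateway-level injection amounts to adding a system message to the conversation before it reaches the model. The sketch below assumes the group prompt is simply prepended to the client's messages; how the gateway actually merges it with a client-supplied system message is not specified here, and `inject_system_prompt` is a hypothetical name.

```python
def inject_system_prompt(messages: list[dict], group_prompt: str) -> list[dict]:
    """Return a new message list with the group's system prompt placed first.

    Assumption: the prompt is prepended and the client's messages are left untouched.
    """
    return [{"role": "system", "content": group_prompt}, *messages]


client_messages = [{"role": "user", "content": "Hello!"}]
forwarded = inject_system_prompt(client_messages, "Answer in formal English.")
```

The client still sends only its own messages, which is why no client-side change is needed when the group prompt is updated.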

Usage analytics

Track token consumption, request volumes, latency distributions, and cost breakdowns by model, key, and group — all from the Vtslx AO dashboard.
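
If you also want a client-side tally to compare against the dashboard, OpenAI-style responses carry a `usage` object with token counts that can be summed per model. The sketch below assumes you have collected response bodies as dictionaries containing `model` and `usage.total_tokens`; the sample records are invented for illustration.

```python
from collections import defaultdict


def tokens_by_model(responses: list[dict]) -> dict[str, int]:
    """Sum `usage.total_tokens` per model across OpenAI-style response bodies."""
    totals = defaultdict(int)
    for body in responses:
        totals[body["model"]] += body["usage"]["total_tokens"]
    return dict(totals)


# Invented sample records, shaped like the relevant part of a response body.
sample = [
    {"model": "gpt-5.5", "usage": {"total_tokens": 120}},
    {"model": "gpt-5.5", "usage": {"total_tokens": 80}},
    {"model": "other-model", "usage": {"total_tokens": 30}},
]
totals = tokens_by_model(sample)
```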

Getting started

1. Create an organization in the Vtslx AO Dashboard
2. Create an API key group with your desired rate limits and model access
3. Generate an API key and start making requests:

curl https://api.hydite.com/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -d '{
    "model": "gpt-5.5",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

To switch models, just change the model field — no other code changes needed.
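
The same idea in Python: the request body stays identical and only the `model` value varies. In this sketch `gpt-5.5` comes from the curl example above, while the other two identifiers are placeholders guessed for illustration; check the model list in your dashboard for the real IDs.

```python
import json


def chat_payload(model: str, prompt: str) -> dict:
    """OpenAI-compatible chat body; only `model` differs between calls."""
    return {"model": model, "messages": [{"role": "user", "content": prompt}]}


# "gpt-5.5" is from the curl example; the other two IDs are hypothetical placeholders.
MODEL_IDS = ["gpt-5.5", "claude-sonnet-4.6", "deepseek-v4-flash"]

bodies = [chat_payload(model_id, "Hello!") for model_id in MODEL_IDS]
for body in bodies:
    print(json.dumps(body))
```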
