Every model, one account. We curate, you choose.

Anthropic, OpenAI, Google, DeepSeek, Meta, Mistral, xAI — 38 models available through Vera Cloud. Pick any model you want for any task. One account, one bill, full transparency on every token.

Pick any model. Use it directly.

These aren't models we've wrapped or fine-tuned. They're the real thing — Claude, GPT, Gemini, DeepSeek, Llama, Mistral, Grok — available directly through one account. You choose which model to use for each task, each mode, each session.

No lock-in to a single provider. No guessing which model is behind an API. You see exactly what you're using and what it costs.

Get started See pricing

01.1

Model	Provider	Speed	Notes
Claude Opus 4.7	Anthropic	36 tps	Latest Opus. Same price, sharper brain.
Claude Opus 4.6	Anthropic	36 tps	The best coder alive. Slow and expensive, but worth it.
Claude Sonnet 4.6	Anthropic	38 tps	90% of Opus brains at half the price.
Claude Sonnet 4.5	Anthropic	52 tps	Previous-gen Sonnet. Still great, cheaper.
Claude Haiku 4.5	Anthropic	68 tps	Anthropic’s lightweight model. Fast and cheap.
Claude Haiku 3.5	Anthropic	27 tps	Legacy Haiku. Still available.
GPT-5.4	OpenAI	41 tps	OpenAI’s flagship. 2x cheaper than Opus.
GPT-5.4 Mini	OpenAI	72 tps	Budget Codex. Handles most coding tasks just fine.
GPT-5.4 Nano	OpenAI	88 tps	20x cheaper than Opus. 88 tps.
GPT-5.3 Codex	OpenAI	45 tps	Code-tuned 5.3. Strong on refactoring.
o3	OpenAI	12 tps	OpenAI’s reasoning model. Deep thinking, slow output.
o4-mini	OpenAI	58 tps	Lightweight reasoning. Good for structured analysis.
Gemini 3 Pro	Google	73 tps	Google’s heavyweight. Million-token context window.
Gemini 3 Flash	Google	88 tps	Fast and capable. Best bang-for-buck from Google.
Gemini 3 Flash Lite	Google	90 tps	Google’s speed demon. 17x cheaper than Opus.
Gemini 2.5 Pro	Google	64 tps	Previous-gen Pro. Still excellent for long context.
DeepSeek V3.2 Speciale	DeepSeek	27 tps	21x cheaper than Opus. Strong on reasoning.
DeepSeek R1	DeepSeek	18 tps	Open-weight reasoning model. MIT licensed.
Llama 4 Scout	Meta	82 tps	Open-weight. Fast and free to self-host.
Llama 4 Maverick	Meta	65 tps	Meta’s large open model. Strong on code.
Mistral Large	Mistral AI	44 tps	European flagship. Optimized for code generation.
Codestral	Mistral AI	71 tps	Mistral’s dedicated coding model.
Grok 3	xAI	56 tps	xAI’s reasoning model. Real-time knowledge.
Grok 3 Mini	xAI	78 tps	Lightweight Grok. Good speed-to-quality ratio.
MiniMax M2.7	MiniMax	37 tps	Punches way above its price.
Qwen 3 235B	Alibaba	79 tps	250x cheaper than Opus on Cerebras.
Qwen 3 32B	Alibaba	120 tps	Tiny Qwen. Blazing on Groq.
GLM 4.7	Zhipu AI	61 tps	Chinese all-rounder. Strong multilingual support.
Kimi K2.5	Moonshot	48 tps	Long-context specialist. 128k native.
Llama 4 Scout (Cerebras)	Cerebras	2,200 tps	Same model, 2,200 tps on Cerebras hardware.
Llama 4 Scout (Groq)	Groq	1,400 tps	Same model, 1,400 tps on Groq LPUs.
Qwen 3 32B (Cerebras)	Cerebras	1,800 tps	Same model, 1,800 tps on Cerebras.
Nano Banana Pro	Google	—	Best image generation. Text-to-image built in.
FAL Flux Pro	fal.ai	—	Fast image gen. Great for UI mockups and assets.
Kling v3	fal.ai	—	Most consistent video gen we’ve tested.
Veo 3	Google	—	Google’s video model. Strong on cinematic style.
ElevenLabs STT	ElevenLabs	—	Speech-to-text. Meeting transcription, voice notes.
ElevenLabs TTS	ElevenLabs	—	Text-to-speech. Natural voiceovers and audio.

38models and counting. New models added as they ship — no update required on your end.

We've curated the best defaults so you don't have to.

We test every model and pick the best option for each job. Vera's three dynamic models route to whatever is strongest right now — they're our recommended defaults, and they update automatically as models improve.

Want a specific model instead? Swap any default for anything in the garden, any time.

02.1

Vera Frontier

Maximum intelligence. Always current.

Proxies to whatever is the best model right now — currently GPT-5.4 → Gemini 3.1 Pro → Opus 4.6. Auto-failover if a provider goes down.

Always SOTAAuto-failover

Vera Deep

Thinking models for hard problems.

Routes to the best reasoning model available. Optimized for complex analysis and architectural decisions.

Deep reasoning5x cheaper than Opus

Vera Lightning

Fast and smart. Our default.

Routes across Cerebras and Groq for maximum speed with sub-second time-to-first-token.

~2,200 tok/s14x cheaper than Opus

Modes over models

Each mode has a default model — but you can swap it for any model in the garden. Want to plan with Opus and build with Flash? Go for it. Every combination works.

Defaults change as models improve. You always have full control.

03.1

Mode	What it does	Default model
Interview	Conversational discovery	Vera Lightning
Plan	Thorough deliberation	Vera Deep
Build	Productive execution	Vera Lightning
Chat	Fast Q&A	Vera Lightning
Prototype	Visual mockups	Vera Lightning
QA Review	Rapid-fire triage	Vera Lightning
Document	Structured writing	Vera Lightning
Yeet	Full send	Vera Lightning

Every default is swappable. Use any model from the garden for any mode, any time.

Read the essay that started it all

In AI, Trust Is All You Need

Silent nerfs, fake benchmarks, and the dirty secret the foundation model companies don't want you to know.