Every model, one account. We curate, you choose.

Anthropic, OpenAI, Google, DeepSeek, Meta, Mistral, xAI — 38 models available through Vera Cloud. Pick any model you want for any task. One account, one bill, full transparency on every token.

01

Pick any model. Use it directly.

These aren't models we've wrapped or fine-tuned. They're the real thing — Claude, GPT, Gemini, DeepSeek, Llama, Mistral, Grok — available directly through one account. You choose which model to use for each task, each mode, each session.

No lock-in to a single provider. No guessing which model is behind an API. You see exactly what you're using and what it costs.

Vera Studio — Model Garden
01.1
ModelProviderSpeedNotes
Claude Opus 4.7Anthropic36 tpsLatest Opus. Same price, sharper brain.
Claude Opus 4.6Anthropic36 tpsThe best coder alive. Slow and expensive, but worth it.
Claude Sonnet 4.6Anthropic38 tps90% of Opus brains at half the price.
Claude Sonnet 4.5Anthropic52 tpsPrevious-gen Sonnet. Still great, cheaper.
Claude Haiku 4.5Anthropic68 tpsAnthropic’s lightweight model. Fast and cheap.
Claude Haiku 3.5Anthropic27 tpsLegacy Haiku. Still available.
GPT-5.4OpenAI41 tpsOpenAI’s flagship. 2x cheaper than Opus.
GPT-5.4 MiniOpenAI72 tpsBudget Codex. Handles most coding tasks just fine.
GPT-5.4 NanoOpenAI88 tps20x cheaper than Opus. 88 tps.
GPT-5.3 CodexOpenAI45 tpsCode-tuned 5.3. Strong on refactoring.
o3OpenAI12 tpsOpenAI’s reasoning model. Deep thinking, slow output.
o4-miniOpenAI58 tpsLightweight reasoning. Good for structured analysis.
Gemini 3 ProGoogle73 tpsGoogle’s heavyweight. Million-token context window.
Gemini 3 FlashGoogle88 tpsFast and capable. Best bang-for-buck from Google.
Gemini 3 Flash LiteGoogle90 tpsGoogle’s speed demon. 17x cheaper than Opus.
Gemini 2.5 ProGoogle64 tpsPrevious-gen Pro. Still excellent for long context.
DeepSeek V3.2 SpecialeDeepSeek27 tps21x cheaper than Opus. Strong on reasoning.
DeepSeek R1DeepSeek18 tpsOpen-weight reasoning model. MIT licensed.
Llama 4 ScoutMeta82 tpsOpen-weight. Fast and free to self-host.
Llama 4 MaverickMeta65 tpsMeta’s large open model. Strong on code.
Mistral LargeMistral AI44 tpsEuropean flagship. Optimized for code generation.
CodestralMistral AI71 tpsMistral’s dedicated coding model.
Grok 3xAI56 tpsxAI’s reasoning model. Real-time knowledge.
Grok 3 MinixAI78 tpsLightweight Grok. Good speed-to-quality ratio.
MiniMax M2.7MiniMax37 tpsPunches way above its price.
Qwen 3 235BAlibaba79 tps250x cheaper than Opus on Cerebras.
Qwen 3 32BAlibaba120 tpsTiny Qwen. Blazing on Groq.
GLM 4.7Zhipu AI61 tpsChinese all-rounder. Strong multilingual support.
Kimi K2.5Moonshot48 tpsLong-context specialist. 128k native.
Llama 4 Scout (Cerebras)Cerebras2,200 tpsSame model, 2,200 tps on Cerebras hardware.
Llama 4 Scout (Groq)Groq1,400 tpsSame model, 1,400 tps on Groq LPUs.
Qwen 3 32B (Cerebras)Cerebras1,800 tpsSame model, 1,800 tps on Cerebras.
Nano Banana ProGoogleBest image generation. Text-to-image built in.
FAL Flux Profal.aiFast image gen. Great for UI mockups and assets.
Kling v3fal.aiMost consistent video gen we’ve tested.
Veo 3GoogleGoogle’s video model. Strong on cinematic style.
ElevenLabs STTElevenLabsSpeech-to-text. Meeting transcription, voice notes.
ElevenLabs TTSElevenLabsText-to-speech. Natural voiceovers and audio.

38models and counting. New models added as they ship — no update required on your end.

02

We've curated the best defaults so you don't have to.

We test every model and pick the best option for each job. Vera's three dynamic models route to whatever is strongest right now — they're our recommended defaults, and they update automatically as models improve.

Want a specific model instead? Swap any default for anything in the garden, any time.

02.1

Vera Frontier

Maximum intelligence. Always current.

Proxies to whatever is the best model right now — currently GPT-5.4 → Gemini 3.1 Pro → Opus 4.6. Auto-failover if a provider goes down.

Always SOTAAuto-failover

Vera Deep

Thinking models for hard problems.

Routes to the best reasoning model available. Optimized for complex analysis and architectural decisions.

Deep reasoning5x cheaper than Opus

Vera Lightning

Fast and smart. Our default.

Routes across Cerebras and Groq for maximum speed with sub-second time-to-first-token.

~2,200 tok/s14x cheaper than Opus
03

Modes over models

Each mode has a default model — but you can swap it for any model in the garden. Want to plan with Opus and build with Flash? Go for it. Every combination works.

Defaults change as models improve. You always have full control.

03.1
ModeWhat it doesDefault model
InterviewConversational discoveryVera Lightning
PlanThorough deliberationVera Deep
BuildProductive executionVera Lightning
ChatFast Q&AVera Lightning
PrototypeVisual mockupsVera Lightning
QA ReviewRapid-fire triageVera Lightning
DocumentStructured writingVera Lightning
YeetFull sendVera Lightning

Every default is swappable. Use any model from the garden for any mode, any time.