Model catalog

Browse every model. Filter to yours.

Frontier and open models with capabilities, pricing, latency, use-case fit, persona fit, and HIPAA / SOC2 / GDPR / FedRAMP — verified against provider docs. Unknown values stay blank.

Models tracked
19
Providers
8
HIPAA-eligible
14
Open weights
5
19 of 19 models
OpenAI

GPT Realtime mini

Ultra Fast

Smaller, cheaper Realtime voice model for high-volume voice agents.

Reasoning2/5
Coding2/5
Multilingual5/5
Modalities3/5
Context
32K
In / 1M
$0.600
Out / 1M
$2.40
Voice
Chat
HIPAASOC2GDPRFedRAMP
Verified 2026-05-08Compare →
Anthropic

Claude Opus 4.5

Standard

Anthropic's most capable model. Best-in-class for agentic coding and tool use.

Reasoning5/5
Coding5/5
Multilingual5/5
Modalities2/5
Context
200K
In / 1M
$5.00
Out / 1M
$25.00
Coding
Agents
Reasoning
Long Context
HIPAASOC2GDPRFedRAMP
Verified 2026-04-15Compare →
Anthropic

Claude Haiku 4.5

Fast

Fast, low-cost model with frontier-level coding for its tier.

Reasoning4/5
Coding4/5
Multilingual4/5
Modalities2/5
Context
200K
In / 1M
$1.00
Out / 1M
$5.00
Chat
Rag
Summarization
Structured Output
HIPAASOC2GDPRFedRAMP
Verified 2026-04-15Compare →
Anthropic

Claude Sonnet 4.5

Fast

Balanced flagship: strong coding and agent performance at much lower cost than Opus.

Reasoning5/5
Coding5/5
Multilingual5/5
Modalities2/5
Context
200K
In / 1M
$3.00
Out / 1M
$15.00
Coding
Agents
Rag
Structured Output
HIPAASOC2GDPRFedRAMP
Verified 2026-04-15Compare →
OpenAI

GPT Realtime

Ultra Fast

Speech-to-speech model for low-latency voice agents. Native audio understanding and generation over the Realtime API.

Reasoning3/5
Coding3/5
Multilingual5/5
Modalities3/5
Context
32K
In / 1M
$4.00
Out / 1M
$16.00
Voice
Chat
Agents
HIPAASOC2GDPRFedRAMP
Verified 2026-05-08Compare →
DeepSeek

DeepSeek-V3.1

Open weights
Standard

Open MoE with hybrid thinking/non-thinking modes. Excellent price-to-performance.

Reasoning4/5
Coding5/5
Multilingual4/5
Modalities1/5
Context
128K
In / 1M
$0.270
Out / 1M
$1.10
Coding
Reasoning
Agents
Structured Output
HIPAASOC2GDPRFedRAMP
Verified 2026-04-15Compare →
OpenAI

GPT-5

Standard

OpenAI's flagship multimodal reasoning model with adaptive thinking depth.

Reasoning5/5
Coding5/5
Multilingual5/5
Modalities2/5
Context
400K
In / 1M
$1.25
Out / 1M
$10.00
Reasoning
Coding
Agents
Long Context
HIPAASOC2GDPRFedRAMP
Verified 2026-04-15Compare →
OpenAI

GPT-5 mini

Fast

Smaller, cheaper GPT-5 with most of the reasoning quality.

Reasoning4/5
Coding4/5
Multilingual4/5
Modalities2/5
Context
400K
In / 1M
$0.250
Out / 1M
$2.00
Chat
Rag
Structured Output
Summarization
HIPAASOC2GDPRFedRAMP
Verified 2026-04-15Compare →
OpenAI

GPT-5 nano

Ultra Fast

Ultra-fast, ultra-cheap GPT-5 variant for high-volume tasks.

Reasoning2/5
Coding3/5
Multilingual3/5
Modalities1/5
Context
400K
In / 1M
$0.050
Out / 1M
$0.400
Chat
Summarization
Structured Output
Translation
HIPAASOC2GDPRFedRAMP
Verified 2026-04-15Compare →
Google

Gemini 2.5 Flash-Lite

Ultra Fast

Cheapest Gemini for high-volume classification, extraction, and translation.

Reasoning2/5
Coding3/5
Multilingual5/5
Modalities3/5
Context
1.0M
In / 1M
$0.100
Out / 1M
$0.400
Summarization
Translation
Structured Output
Chat
HIPAASOC2GDPRFedRAMP
Verified 2026-04-15Compare →
xAI

Grok 4

Standard

Reasoning-first model with native tool use and real-time X search integration.

Reasoning5/5
Coding4/5
Multilingual4/5
Modalities2/5
Context
256K
In / 1M
$3.00
Out / 1M
$15.00
Reasoning
Agents
Long Context
HIPAASOC2GDPRFedRAMP
Verified 2026-04-15Compare →
Google

Gemini 2.5 Pro

Standard

Top-tier multimodal model with very long context and native audio/video understanding.

Reasoning5/5
Coding5/5
Multilingual5/5
Modalities4/5
Context
1.0M
In / 1M
$1.25
Out / 1M
$10.00
Long Context
Vision
Reasoning
Rag
HIPAASOC2GDPRFedRAMP
Verified 2026-04-15Compare →
Google

Gemini 2.5 Flash

Fast

Fast, cheap multimodal workhorse with controllable thinking budget.

Reasoning4/5
Coding4/5
Multilingual5/5
Modalities4/5
Context
1.0M
In / 1M
$0.300
Out / 1M
$2.50
Chat
Rag
Summarization
Vision
HIPAASOC2GDPRFedRAMP
Verified 2026-04-15Compare →
Google

Gemini 2.5 Flash with Native Audio (Live API)

Fast

Real-time speech-in / speech-out conversational model powering Gemini Live. Native audio with affective dialog and tool use.

Reasoning4/5
Coding3/5
Multilingual5/5
Modalities5/5
Context
1.0M
In / 1M
Out / 1M
Voice
Chat
Agents
HIPAASOC2GDPRFedRAMP
Verified 2026-05-08Compare →
OpenAI

o3

Slow

Deliberate reasoning model that thinks before answering. Strong at math and science.

Reasoning5/5
Coding5/5
Multilingual4/5
Modalities2/5
Context
200K
In / 1M
$2.00
Out / 1M
$8.00
Reasoning
Coding
Agents
HIPAASOC2GDPRFedRAMP
Verified 2026-04-15Compare →
Meta

Llama 4 Maverick

Open weights
Fast

Open-weights mixture-of-experts multimodal model. 17B active / 400B total params.

Reasoning4/5
Coding4/5
Multilingual5/5
Modalities2/5
Context
1M
In / 1M
Out / 1M
Chat
Long Context
Vision
Rag
HIPAASOC2GDPRFedRAMP
Verified 2026-04-15Compare →
Meta

Llama 4 Scout

Open weights
Fast

Smaller open MoE (17B active / 109B total) with up to 10M-token context.

Reasoning3/5
Coding4/5
Multilingual5/5
Modalities2/5
Context
10M
In / 1M
Out / 1M
Long Context
Rag
Summarization
Chat
HIPAASOC2GDPRFedRAMP
Verified 2026-04-15Compare →
Cohere

Command A

Open weights
Fast

Enterprise-focused model optimized for RAG, tool use, and private deployment.

Reasoning4/5
Coding4/5
Multilingual5/5
Modalities1/5
Context
256K
In / 1M
$2.50
Out / 1M
$10.00
Rag
Agents
Structured Output
Translation
HIPAASOC2GDPRFedRAMP
Verified 2026-04-15Compare →
Mistral AI

Mistral Large 2

Open weights
Standard

Flagship dense 123B model. Strong multilingual and code performance.

Reasoning4/5
Coding4/5
Multilingual5/5
Modalities1/5
Context
128K
In / 1M
$2.00
Out / 1M
$6.00
Coding
Translation
Structured Output
Chat
HIPAASOC2GDPRFedRAMP
Verified 2026-04-15Compare →