Model catalog

Browse every model. Filter to yours.

Frontier and open models with capabilities, pricing, latency, use-case fit, persona fit, and HIPAA / SOC2 / GDPR / FedRAMP — verified against provider docs. Unknown values stay blank.

Models tracked

Providers

HIPAA-eligible

Open weights

19 of 19 models

OpenAI

GPT Realtime mini

Ultra Fast

Smaller, cheaper Realtime voice model for high-volume voice agents.

Reasoning2/5

Coding2/5

Multilingual5/5

Modalities3/5

Context

32K

In / 1M

$0.600

Out / 1M

$2.40

Voice

Chat

HIPAASOC2GDPRFedRAMP

Verified 2026-05-08Compare →

Anthropic

Claude Opus 4.5

Standard

Anthropic's most capable model. Best-in-class for agentic coding and tool use.

Reasoning5/5

Coding5/5

Multilingual5/5

Modalities2/5

Context

200K

In / 1M

$5.00

Out / 1M

$25.00

Coding

Agents

Reasoning

Long Context

HIPAASOC2GDPRFedRAMP

Verified 2026-04-15Compare →

Anthropic

Claude Haiku 4.5

Fast

Fast, low-cost model with frontier-level coding for its tier.

Reasoning4/5

Coding4/5

Multilingual4/5

Modalities2/5

Context

200K

In / 1M

$1.00

Out / 1M

$5.00

Chat

Rag

Summarization

Structured Output

HIPAASOC2GDPRFedRAMP

Verified 2026-04-15Compare →

Anthropic

Claude Sonnet 4.5

Fast

Balanced flagship: strong coding and agent performance at much lower cost than Opus.

Reasoning5/5

Coding5/5

Multilingual5/5

Modalities2/5

Context

200K

In / 1M

$3.00

Out / 1M

$15.00

Coding

Agents

Rag

Structured Output

HIPAASOC2GDPRFedRAMP

Verified 2026-04-15Compare →

OpenAI

GPT Realtime

Ultra Fast

Speech-to-speech model for low-latency voice agents. Native audio understanding and generation over the Realtime API.

Reasoning3/5

Coding3/5

Multilingual5/5

Modalities3/5

Context

32K

In / 1M

$4.00

Out / 1M

$16.00

Voice

Chat

Agents

HIPAASOC2GDPRFedRAMP

Verified 2026-05-08Compare →

DeepSeek

DeepSeek-V3.1

Open weights

Standard

Open MoE with hybrid thinking/non-thinking modes. Excellent price-to-performance.

Reasoning4/5

Coding5/5

Multilingual4/5

Modalities1/5

Context

128K

In / 1M

$0.270

Out / 1M

$1.10

Coding

Reasoning

Agents

Structured Output

HIPAASOC2GDPRFedRAMP

Verified 2026-04-15Compare →

OpenAI

GPT-5

Standard

OpenAI's flagship multimodal reasoning model with adaptive thinking depth.

Reasoning5/5

Coding5/5

Multilingual5/5

Modalities2/5

Context

400K

In / 1M

$1.25

Out / 1M

$10.00

Reasoning

Coding

Agents

Long Context

HIPAASOC2GDPRFedRAMP

Verified 2026-04-15Compare →

OpenAI

GPT-5 mini

Fast

Smaller, cheaper GPT-5 with most of the reasoning quality.

Reasoning4/5

Coding4/5

Multilingual4/5

Modalities2/5

Context

400K

In / 1M

$0.250

Out / 1M

$2.00

Chat

Rag

Structured Output

Summarization

HIPAASOC2GDPRFedRAMP

Verified 2026-04-15Compare →

OpenAI

GPT-5 nano

Ultra Fast

Ultra-fast, ultra-cheap GPT-5 variant for high-volume tasks.

Reasoning2/5

Coding3/5

Multilingual3/5

Modalities1/5

Context

400K

In / 1M

$0.050

Out / 1M

$0.400

Chat

Summarization

Structured Output

Translation

HIPAASOC2GDPRFedRAMP

Verified 2026-04-15Compare →

Google

Gemini 2.5 Flash-Lite

Ultra Fast

Cheapest Gemini for high-volume classification, extraction, and translation.

Reasoning2/5

Coding3/5

Multilingual5/5

Modalities3/5

Context

1.0M

In / 1M

$0.100

Out / 1M

$0.400

Summarization

Translation

Structured Output

Chat

HIPAASOC2GDPRFedRAMP

Verified 2026-04-15Compare →

xAI

Grok 4

Standard

Reasoning-first model with native tool use and real-time X search integration.

Reasoning5/5

Coding4/5

Multilingual4/5

Modalities2/5

Context

256K

In / 1M

$3.00

Out / 1M

$15.00

Reasoning

Agents

Long Context

HIPAASOC2GDPRFedRAMP

Verified 2026-04-15Compare →

Google

Gemini 2.5 Pro

Standard

Top-tier multimodal model with very long context and native audio/video understanding.

Reasoning5/5

Coding5/5

Multilingual5/5

Modalities4/5

Context

1.0M

In / 1M

$1.25

Out / 1M

$10.00

Long Context

Vision

Reasoning

Rag

HIPAASOC2GDPRFedRAMP

Verified 2026-04-15Compare →

Google

Gemini 2.5 Flash

Fast

Fast, cheap multimodal workhorse with controllable thinking budget.

Reasoning4/5

Coding4/5

Multilingual5/5

Modalities4/5

Context

1.0M

In / 1M

$0.300

Out / 1M

$2.50

Chat

Rag

Summarization

Vision

HIPAASOC2GDPRFedRAMP

Verified 2026-04-15Compare →

Google

Gemini 2.5 Flash with Native Audio (Live API)

Fast

Real-time speech-in / speech-out conversational model powering Gemini Live. Native audio with affective dialog and tool use.

Reasoning4/5

Coding3/5

Multilingual5/5

Modalities5/5

Context

1.0M

In / 1M

—

Out / 1M

—

Voice

Chat

Agents

HIPAASOC2GDPRFedRAMP

Verified 2026-05-08Compare →

OpenAI

o3

Slow

Deliberate reasoning model that thinks before answering. Strong at math and science.

Reasoning5/5

Coding5/5

Multilingual4/5

Modalities2/5

Context

200K

In / 1M

$2.00

Out / 1M

$8.00

Reasoning

Coding

Agents

HIPAASOC2GDPRFedRAMP

Verified 2026-04-15Compare →

Llama 4 Maverick

Open weights

Fast

Open-weights mixture-of-experts multimodal model. 17B active / 400B total params.

Reasoning4/5

Coding4/5

Multilingual5/5

Modalities2/5

Context

In / 1M

—

Out / 1M

—

Chat

Long Context

Vision

Rag

HIPAASOC2GDPRFedRAMP

Verified 2026-04-15Compare →

Llama 4 Scout

Open weights

Fast

Smaller open MoE (17B active / 109B total) with up to 10M-token context.

Reasoning3/5

Coding4/5

Multilingual5/5

Modalities2/5

Context

10M

In / 1M

—

Out / 1M

—

Long Context

Rag

Summarization

Chat

HIPAASOC2GDPRFedRAMP

Verified 2026-04-15Compare →

Cohere

Command A

Open weights

Fast

Enterprise-focused model optimized for RAG, tool use, and private deployment.

Reasoning4/5

Coding4/5

Multilingual5/5

Modalities1/5

Context

256K

In / 1M

$2.50

Out / 1M

$10.00

Rag

Agents

Structured Output

Translation

HIPAASOC2GDPRFedRAMP

Verified 2026-04-15Compare →

Mistral AI

Mistral Large 2

Open weights

Standard

Flagship dense 123B model. Strong multilingual and code performance.

Reasoning4/5

Coding4/5

Multilingual5/5

Modalities1/5

Context

128K

In / 1M

$2.00

Out / 1M

$6.00

Coding

Translation

Structured Output

Chat

HIPAASOC2GDPRFedRAMP

Verified 2026-04-15Compare →