300+ AI Models Directory - Browse GPT-5, Claude, Gemini & More

Sakana

Sakana: Fugu Ultra

Fugu Ultra is the higher-performance model in Sakana AI's Fugu family. Rather than a single monolithic model, Fugu is a learned multi-agent orchestration system: a language model trained to route...

1000K context premium

Cohere

Cohere: North Mini Code (free)

North Mini Code is Cohere's first agentic coding model and the debut of its North family. A sparse mixture-of-experts model with 30B total parameters and 3B active, it is optimized...

256K context 30B budget

Z-ai

GLM 5.2 is a large-scale reasoning model from Z.ai. It supports text input and output with a 1M-token context window, and is suited for long-horizon agent workflows, project-level software engineering

1049K context standard

Moonshotai

MoonshotAI: Kimi K2.7 Code

MoonshotAI: Kimi K2.7 Code is a coding-focused model in Moonshot AI's Kimi K2 family, built to complete end-to-end programming tasks reliably over long contexts. It uses a native multimodal mixture-of

262K context standard

~anthropic

Anthropic: Claude Fable Latest

This model always redirects to the latest model in the Claude Fable family.

1000K context premium

Anthropic

Anthropic: Claude Fable 5

Claude Fable 5 is a Mythos-class model from Anthropic, built for autonomous knowledge work and coding. It supports text, image, and file inputs with text output, with reasoning support and...

1000K context premium

Nex-agi

Nex AGI: Nex-N2-Pro

Nex-N2-Pro is an agentic mixture-of-experts model from Nex AGI, with 17B active parameters out of 397B total. Built on the Qwen3.5 architecture, it accepts text and image input and produces...

262K context 17B standard

NVIDIA

NVIDIA: Nemotron 3.5 Content Safety (free)

NVIDIA Nemotron 3.5 Content Safety is a compact 4B-parameter multimodal guardrail model from NVIDIA, fine-tuned from Google Gemma-3-4B. It moderates both inputs to and responses from LLMs and VLMs, ac

128K context 4B budget

NVIDIA

NVIDIA: Nemotron 3 Ultra (free)

NVIDIA Nemotron 3 Ultra is an open frontier-reasoning and orchestration model from NVIDIA, with 55B active parameters out of 550B total (MoE). Built on a hybrid Transformer-Mamba mixture-of-experts ar

1000K context 55B budget

NVIDIA

NVIDIA: Nemotron 3 Ultra

NVIDIA Nemotron 3 Ultra is an open frontier-reasoning and orchestration model from NVIDIA, with 55B active parameters out of 550B total (MoE). Built on a hybrid Transformer-Mamba mixture-of-experts ar

1000K context 55B standard

Qwen

Qwen: Qwen3.7 Plus

Qwen3.7-Plus is a cost-effective model in Alibaba's Qwen3.7 series. It supports text and image input with text output, building on the series' text capabilities with a comprehensive upgrade to its...

1000K context standard

Minimax

MiniMax: MiniMax M3

MiniMax-M3 is a multimodal foundation model from MiniMax. It supports text, image, and video inputs with text output, a 1M-token context window, and is suited for long-horizon agentic work, coding,...

1049K context standard

Stepfun

StepFun: Step 3.7 Flash

Step 3.7 Flash is StepFun's latest high-efficiency multimodal Mixture-of-Experts model. It pairs a 196B-parameter language backbone with a vision encoder for native image and video understanding, acti

256K context 196B standard

Anthropic

Anthropic: Claude Opus 4.8 (Fast)

Fast-mode variant of [Opus 4.8](/anthropic/claude-opus-4.8) - identical capabilities with higher output speed at 2x pricing relative to regular Opus 4.8. Learn more in Anthropic's docs: https://platf

1000K context premium

Anthropic

Anthropic: Claude Opus 4.8

Claude Opus 4.8 is Anthropic's most capable generally available model in the Opus family. It supports text, image, and file inputs with text output, with reasoning support and a 1M-token...

1000K context premium

Qwen

Qwen: Qwen3.7 Max

Qwen3.7-Max is the flagship model in Alibaba's Qwen3.7 series. It supports text input and output and is designed for agent-centric workloads, with particular strengths in coding, office and productivi

1000K context standard

xAI

xAI: Grok Build 0.1

Grok Build 0.1 is xAI’s fast coding model trained specifically for agentic software engineering workflows. It supports text and image inputs with text output, and is optimized for interactive coding..

256K context standard

Google

Google: Gemini 3.5 Flash

Gemini 3.5 Flash is Google's high-efficiency multimodal model, bringing near-Pro level coding and reasoning at Flash-tier cost and speed. It is highly optimized for coding proficiency and parallel age

1049K context standard

Anthropic

Anthropic: Claude Opus 4.7 (Fast)

Fast-mode variant of [Opus 4.7](/anthropic/claude-opus-4.7) - identical capabilities with higher output speed at premium 6x pricing. Learn more in Anthropic's docs: https://platform.claude.com/docs/e

1000K context premium

Perceptron

Perceptron: Perceptron Mk1

Perceptron Mk1 (Mark One) is Perceptron's highest-quality vision-language model for video and embodied reasoning.** It accepts image and video inputs paired with natural language queries, and produces

33K context standard

Inclusionai

inclusionAI: Ring-2.6-1T

Ring-2.6-1T is a 1T-parameter-scale thinking model with 63B active parameters, built for real-world agent workflows that require both strong capability and operational efficiency. It is optimized for

262K context 63B budget

Google

Google: Gemini 3.1 Flash Lite

Gemini 3.1 Flash Lite is Google’s GA high-efficiency multimodal model optimized for low-latency, high-volume workloads. It supports text, image, video, audio, and PDF inputs, and is designed for light

1049K context standard

OpenAI

OpenAI: GPT Chat Latest

GPT Chat Latest points to OpenAI's stable API alias `chat-latest` that always resolves to the latest Instant chat model used in ChatGPT. As OpenAI rolls out new Instant model updates...

400K context premium

xAI

xAI: Grok 4.3

Grok 4.3 is a reasoning model from xAI. It accepts text and image inputs with text output, and is suited for agentic workflows, instruction-following tasks, and applications requiring high factual...

1000K context standard

Ibm-granite

IBM: Granite 4.1 8B

Granite 4.1 8B is a dense, decoder-only 8-billion-parameter language model from IBM, part of the Granite 4.1 family. It supports a 131K-token context window and is designed for enterprise tasks...

131K context 8B budget

Mistral AI

Mistral: Mistral Medium 3.5

Mistral Medium 3.5 is a dense 128B instruction-following model from Mistral AI. It supports text and image inputs with text output, and is designed for agentic workflows, coding, and complex...

262K context 128B standard

NVIDIA

NVIDIA: Nemotron 3 Nano Omni (free)

NVIDIA Nemotron™ 3 Nano Omni is a 30B-A3B open multimodal model designed to function as a perception and context sub-agent in enterprise agent systems. It accepts text, image, video, and...

256K context 30B budget

Poolside

Poolside: Laguna XS.2 (free)

Laguna XS.2 is the second-generation model in the XS size class from [Poolside](https://poolside.ai/), their efficient coding agent series. It combines tool calling and reasoning capabilities with a c

262K context budget

Poolside

Poolside: Laguna XS.2

Laguna XS.2 is the second-generation model in the XS size class from [Poolside](https://poolside.ai/), their efficient coding agent series. It combines tool calling and reasoning capabilities with a c

262K context budget

Poolside

Poolside: Laguna M.1 (free)

Laguna M.1 is the flagship coding agent model from [Poolside](https://poolside.ai/), optimized for complex software engineering tasks. Designed for agentic coding workflows, it supports tool calling a

262K context budget

Poolside

Poolside: Laguna M.1

Laguna M.1 is the flagship coding agent model from [Poolside](https://poolside.ai/), optimized for complex software engineering tasks. Designed for agentic coding workflows, it supports tool calling a

262K context budget

~anthropic

Anthropic Claude Haiku Latest

This model always redirects to the latest model in the Anthropic Claude Haiku family.

200K context standard

~openai

OpenAI GPT Mini Latest

This model always redirects to the latest model in the OpenAI GPT Mini family.

400K context standard

~google

Google Gemini Pro Latest

This model always redirects to the latest model in the Google Gemini Pro family.

1049K context premium

~moonshotai

MoonshotAI Kimi Latest

This model always redirects to the latest model in the MoonshotAI Kimi family.

262K context standard

~google

Google Gemini Flash Latest

This model always redirects to the latest model in the Google Gemini Flash family.

1049K context standard

~anthropic

Anthropic Claude Sonnet Latest

This model always redirects to the latest model in the Anthropic Claude Sonnet family.

1000K context premium

~openai

OpenAI GPT Latest

This model always redirects to the latest model in the OpenAI GPT family.

1050K context premium

Qwen

Qwen: Qwen3.5 Plus 2026-04-20

Qwen3.5 Plus (April 2026) is a large-scale multimodal language model from Alibaba. It accepts text, image, and video input and produces text output, with a 1M token context window. This...

1000K context standard

Qwen

Qwen: Qwen3.6 Flash

Qwen3.6 Flash is a fast, efficient language model from Alibaba's Qwen 3.6 series. It supports text, image, and video input with a 1M token context window. Tiered pricing kicks in...

1000K context standard

Qwen

Qwen: Qwen3.6 35B A3B

Qwen3.6-35B-A3B is an open-weight multimodal model from Alibaba Cloud with 35 billion total parameters and 3 billion active parameters per token. It uses a hybrid sparse mixture-of-experts architectur

262K context 35B standard

Qwen

Qwen: Qwen3.6 Max Preview

Qwen3.6-Max-Preview is a proprietary frontier model from Alibaba Cloud built on a sparse mixture-of-experts architecture with approximately 1 trillion total parameters. It is optimized for agentic cod

262K context standard

Qwen

Qwen: Qwen3.6 27B

Qwen3.6 27B is a dense 27-billion-parameter language model from the Qwen Team at Alibaba, released in April 2026. It features hybrid multimodal capabilities — accepting text, image, and video inputs..

262K context 27B standard

OpenAI

OpenAI: GPT-5.5 Pro

GPT-5.5 Pro is OpenAI’s high-capability model optimized for deep reasoning and accuracy on complex, high-stakes workloads. It features a 1M+ token context window (922K input, 128K output) with support

1050K context premium

OpenAI

OpenAI: GPT-5.5

GPT-5.5 is OpenAI’s frontier model designed for complex professional workloads, building on GPT-5.4 with stronger reasoning, higher reliability, and improved token efficiency on hard tasks. It feature

1050K context premium

DeepSeek

DeepSeek: DeepSeek V4 Pro

DeepSeek V4 Pro is a large-scale Mixture-of-Experts model from DeepSeek with 1.6T total parameters and 49B activated parameters, supporting a 1M-token context window. It is designed for advanced reaso

1049K context 49B budget

DeepSeek

DeepSeek: DeepSeek V4 Flash

DeepSeek V4 Flash is an efficiency-optimized Mixture-of-Experts model from DeepSeek with 284B total parameters and 13B activated parameters, supporting a 1M-token context window. It is designed for fa

1049K context 284B budget

Inclusionai

inclusionAI: Ling-2.6-1T

Ling-2.6-1T is an instant (instruct) model from inclusionAI and the company’s trillion-parameter flagship, designed for real-world agents that require fast execution and high efficiency at scale. It u

262K context budget

Browse 300+ AI Models

Sakana: Fugu Ultra

Cohere: North Mini Code (free)

Z.ai: GLM 5.2

MoonshotAI: Kimi K2.7 Code

Anthropic: Claude Fable Latest

Anthropic: Claude Fable 5

Nex AGI: Nex-N2-Pro

NVIDIA: Nemotron 3.5 Content Safety (free)

NVIDIA: Nemotron 3 Ultra (free)

NVIDIA: Nemotron 3 Ultra

Qwen: Qwen3.7 Plus

MiniMax: MiniMax M3

StepFun: Step 3.7 Flash

Anthropic: Claude Opus 4.8 (Fast)

Anthropic: Claude Opus 4.8

Qwen: Qwen3.7 Max

xAI: Grok Build 0.1

Google: Gemini 3.5 Flash

Anthropic: Claude Opus 4.7 (Fast)

Perceptron: Perceptron Mk1

inclusionAI: Ring-2.6-1T

Google: Gemini 3.1 Flash Lite

OpenAI: GPT Chat Latest

xAI: Grok 4.3

IBM: Granite 4.1 8B

Mistral: Mistral Medium 3.5

NVIDIA: Nemotron 3 Nano Omni (free)

Poolside: Laguna XS.2 (free)

Poolside: Laguna XS.2

Poolside: Laguna M.1 (free)

Poolside: Laguna M.1

Anthropic Claude Haiku Latest

OpenAI GPT Mini Latest

Google Gemini Pro Latest

MoonshotAI Kimi Latest

Google Gemini Flash Latest

Anthropic Claude Sonnet Latest

OpenAI GPT Latest

Qwen: Qwen3.5 Plus 2026-04-20

Qwen: Qwen3.6 Flash

Qwen: Qwen3.6 35B A3B

Qwen: Qwen3.6 Max Preview

Qwen: Qwen3.6 27B

OpenAI: GPT-5.5 Pro

OpenAI: GPT-5.5

DeepSeek: DeepSeek V4 Pro

DeepSeek: DeepSeek V4 Flash

inclusionAI: Ling-2.6-1T

Popular AI Model Comparisons

Try Any AI Model Instantly