realtime multimodal modelGPT-4o
GPT-4o is a Foundation models product from OpenAI, focused on realtime multimodal model with tags such as Foundation model, Multimodal.
realtime multimodal modelGPT-4o is a Foundation models product from OpenAI, focused on realtime multimodal model with tags such as Foundation model, Multimodal.
flagship multimodal modelGemini 2.5 Pro is a Foundation models product from Google DeepMind, focused on flagship multimodal model with tags such as Foundation model, Multimodal.
lightweight multimodal modelGemini 2.5 Flash is a Foundation models product from Google DeepMind, focused on lightweight multimodal model with tags such as Foundation model, Multimodal.
high-end reasoning modelClaude Opus 4 is a Foundation models product from Anthropic, focused on high-end reasoning model with tags such as Foundation model, Multimodal.
mainstream reasoning modelClaude Sonnet 4 is a Foundation models product from Anthropic, focused on mainstream reasoning model with tags such as Foundation model, Multimodal.
omni model familyQwen Omni is a Foundation models product from 阿里通义, focused on omni model family with tags such as Foundation model, Multimodal, Open source.
multimodal model familyQwen VL is a Foundation models product from 阿里通义, focused on multimodal model family with tags such as Foundation model, Multimodal, Open source.
audio understanding modelQwen Audio is a Foundation models product from 阿里通义, focused on audio understanding model with tags such as Foundation model, Multimodal, Open source.
open multimodal modelsDeepSeek VL is a Foundation models product from DeepSeek, focused on open multimodal models with tags such as Foundation model, Multimodal, Open source.
multimodal reasoning modelSeed 1.5 is a Foundation models product from 字节 Seed, focused on multimodal reasoning model with tags such as Foundation model, Multimodal, China ecosystem.
multimodal foundation modelsHunyuan Models is a Foundation models product from 腾讯混元, focused on multimodal foundation models with tags such as Foundation model, Multimodal, China ecosystem.
ERNIE model familyERNIE Models is a Foundation models product from 百度文心, focused on ERNIE model family with tags such as Foundation model, China ecosystem.
multimodal model familyMiniMax Models is useful for seeing MiniMax across text, voice, video, and globally-oriented product layers.
multimodal model familyStep Models is a Foundation models product from 阶跃星辰, focused on multimodal model family with tags such as Foundation model, Multimodal, China ecosystem.
video generation modelWan 2.1 is a Foundation models product from 阿里通义, focused on video generation model with tags such as Foundation model, Multimodal, Open source.
video generation modelCogVideoX is a Foundation models product from 智谱 AI, focused on video generation model with tags such as Foundation model, Multimodal, Open source.
video generation modelSora 2 is a Foundation models product from OpenAI, focused on video generation model with tags such as Foundation model, Multimodal.
image generation modelGPT Image 1.5 is a Foundation models product from OpenAI, focused on image generation model with tags such as Foundation model, Multimodal.
vision language modelInternVL is a Foundation models product from OpenGVLab, focused on vision language model with tags such as Foundation model, Multimodal, Open source.
lightweight multimodalMiniCPM-V is a Foundation models product from OpenBMB, focused on lightweight multimodal with tags such as Foundation model, Multimodal, Open source.
vision language modelPixtral is a Foundation models product from Mistral AI, focused on vision language model with tags such as Foundation model, Multimodal.
multimodal base modelPaliGemma is a Foundation models product from Google, focused on multimodal base model with tags such as Foundation model, Multimodal, Open source.
open multimodal modelMolmo is a Foundation models product from Allen AI, focused on open multimodal model with tags such as Foundation model, Multimodal, Open source.
vision foundation modelFlorence-2 is a Foundation models product from Microsoft, focused on vision foundation model with tags such as Foundation model, Multimodal, Open source.
PopularChatGPT is OpenAI's mainstream AI assistant entry point, combining general Q&A, writing, search, file analysis, and multimodal interaction in one product.
Gemini is one of Google's consumer AI entry points, with real strength coming from its linkage to search, Google Workspace, Android, and multimodal capabilities.
Long-formClaude is Anthropic's main end-user assistant, best known for long-form handling, stable writing, document understanding, and enterprise-oriented safety.
Grok is a text model product from xAI, focused on realtime workflows and official access.
Kimi is Moonshot AI's most representative consumer product page, best known for long context, Chinese experience, and information synthesis.
Mass-market AIDoubao is ByteDance's mainstream AI entry, with value in broad consumer reach, low barrier for Chinese users, and linkage to ByteDance's content ecosystem.
AlibabaTongyi Qianwen is Alibaba's major end-user AI entry, but its real importance lies in how it connects to the Qwen family, Alibaba Cloud, and enterprise ecosystem.
ReasoningDeepSeek is one of the most watched reasoning-oriented AI assistants in China, with its main appeal in reasoning quality and cost efficiency rather than flashy features.
Tencent腾讯元宝 is a China AI model product from Tencent, focused on tencent workflows and official access.
Zhipu智谱清言是基于 GLM-5 的全能 AI 助手,支持精通对话、写作与编程。为你答疑解惑,激发创意,更能理解图片与文档,提升学习与工作效率。
official modelAurora Image is a Foundation models product from xAI, focused on official model with tags such as Foundation model, API, Multimodal.
official modelAya Vision 32B is a Foundation models product from Cohere, focused on official model with tags such as Foundation model, API, Open source.
open modelChameleon 7B is a Foundation models product from Meta, focused on open model with tags such as Foundation model, Open source, Multimodal.
official modelCogView 4 is a Foundation models product from 智谱 AI, focused on official model with tags such as Foundation model, API, China ecosystem.
official modelCogVLM2 is a Foundation models product from 智谱 AI, focused on official model with tags such as Foundation model, API, China ecosystem.
official modelCommand A Vision is a Foundation models product from Cohere, focused on official model with tags such as Foundation model, API, Multimodal.
official modelCosmos Predict1 is a Foundation models product from NVIDIA, focused on official model with tags such as Foundation model, API, Multimodal.
official modelCosmos Reason1 is a Foundation models product from NVIDIA, focused on official model with tags such as Foundation model, API, Multimodal.
official modelCosmos Transfer1 is a Foundation models product from NVIDIA, focused on official model with tags such as Foundation model, API, Multimodal.
official modelDeepSeek Janus Pro is a Foundation models product from DeepSeek, focused on official model with tags such as Foundation model, API, China ecosystem.
reasoning model familyDeepSeek Models is useful for viewing DeepSeek's overall layout across general reasoning, deep thinking, multimodal capability, and API cost efficiency.
official modelDeepSeek VL2 is a Foundation models product from DeepSeek, focused on official model with tags such as Foundation model, API, China ecosystem.
official modelGemini 1.5 Flash is a Foundation models product from Google DeepMind, focused on official model with tags such as Foundation model, API, Multimodal.
official modelGemini 1.5 Pro is a Foundation models product from Google DeepMind, focused on official model with tags such as Foundation model, API, Multimodal.
official modelGemini 2.0 Flash is a Foundation models product from Google DeepMind, focused on official model with tags such as Foundation model, API, Multimodal.
official modelGemini 2.0 Flash Lite is a Foundation models product from Google DeepMind, focused on official model with tags such as Foundation model, API, Multimodal.
official modelGemini 2.0 Flash Live is a Foundation models product from Google DeepMind, focused on official model with tags such as Foundation model, API, Multimodal.
multimodal model familyGemini Models is a Foundation models product from Google DeepMind, focused on multimodal model family with tags such as Foundation model, Multimodal.
GLM model familyGLM Models summarizes Zhipu AI's Chinese model family, reasoning capabilities, and platform entry points.
official modelGLM-4V 9B is a Foundation models product from 智谱 AI, focused on official model with tags such as Foundation model, API, China ecosystem.
official modelGPT Image 1 is a Foundation models product from OpenAI, focused on official model with tags such as Foundation model, API, Multimodal.
general multimodal modelGPT-4.1 is a Foundation models product from OpenAI, focused on general multimodal model with tags such as Foundation model, Multimodal.
official modelGPT-4o mini is a Foundation models product from OpenAI, focused on official model with tags such as Foundation model, API, Multimodal.
official modelGPT-4o mini Realtime is a Foundation models product from OpenAI, focused on official model with tags such as Foundation model, API, Multimodal.
official modelGPT-4o Realtime is a Foundation models product from OpenAI, focused on official model with tags such as Foundation model, API, Multimodal.
open modelGranite Vision 3.2 is a Foundation models product from IBM, focused on open model with tags such as Foundation model, Open source, Multimodal.
official modelGrok 2 Vision is a Foundation models product from xAI, focused on official model with tags such as Foundation model, API, Multimodal.
official modelGrok Live is a Foundation models product from xAI, focused on official model with tags such as Foundation model, API, Multimodal.
reasoning model familyGrok Models is a Foundation models product from xAI, focused on reasoning model family with tags such as Foundation model, Multimodal.
official modelHailuo 02 is a Foundation models product from MiniMax, focused on official model with tags such as Foundation model, API, China ecosystem.
official modelHailuo Video 01 is a Foundation models product from MiniMax, focused on official model with tags such as Foundation model, API, China ecosystem.
official modelHunyuan 3D 2.0 is a Foundation models product from 腾讯混元, focused on official model with tags such as Foundation model, API, China ecosystem.
official modelHunyuan Video is a Foundation models product from 腾讯混元, focused on official model with tags such as Foundation model, API, China ecosystem.
official modelHunyuan Vision is a Foundation models product from 腾讯混元, focused on official model with tags such as Foundation model, API, China ecosystem.
image generationImagen 3 is a Image generation product from Google DeepMind, focused on image generation with tags such as Image generation, Multimodal.
official modelImagen 4 is a Foundation models product from Google DeepMind, focused on official model with tags such as Foundation model, API, Multimodal.
open model familyLlama is a Foundation models product from Meta, focused on open model family with tags such as Foundation model, Multimodal, Open source.
open modelLlama 3.2 11B Vision is a Foundation models product from Meta, focused on open model with tags such as Foundation model, Open source, Multimodal.
open modelLlama 3.2 90B Vision is a Foundation models product from Meta, focused on open model with tags such as Foundation model, Open source, Multimodal.
open modelLlama 4 Maverick is a Foundation models product from Meta, focused on open model with tags such as Foundation model, Open source, Multimodal.
open modelLlama 4 Scout is a Foundation models product from Meta, focused on open model with tags such as Foundation model, Open source, Multimodal.
official modelMedGemma is a Foundation models product from Google DeepMind, focused on official model with tags such as Foundation model, API, Open source.
MiniMax是全球领先的通用人工智能科技公司,致力于"与所有人共创智能",自主研发了一系列多模态通用大模型,并面向全球推出一系列AI原生产品,已服务逾 2亿名用户
reasoning model familyNemotron is a Foundation models product from NVIDIA, focused on reasoning model family with tags such as Foundation model, Multimodal.
closed model familyOpenAI Models aggregates OpenAI's flagship foundation, reasoning, realtime, and embedding model entry points.
official modelPaddleOCR-VL is a Foundation models product from 百度文心, focused on official model with tags such as Foundation model, API, China ecosystem.
open modelPhi-3.5 Vision is a Foundation models product from Microsoft, focused on open model with tags such as Foundation model, Open source, Multimodal.
open modelPhi-4 Multimodal is a Foundation models product from Microsoft, focused on open model with tags such as Foundation model, Open source, Multimodal.
official modelPixtral 12B is a Foundation models product from Mistral AI, focused on official model with tags such as Foundation model, API, Multimodal.
open modelQVQ-72B Preview is a Foundation models product from 阿里通义, focused on open model with tags such as Foundation model, China ecosystem, Open source.
open model familyQwen is one of the most complete China-based open model families, spanning text, vision, audio, coding, and omni directions.
Qwen2.5 Audio 7B is a Foundation models product from 阿里通义, focused on open model with tags such as Foundation model, China ecosystem, Open source.
Qwen2.5 Omni 7B is a Foundation models product from 阿里通义, focused on open model with tags such as Foundation model, China ecosystem, Open source.
Qwen2.5 VL 72B is a Foundation models product from 阿里通义, focused on open model with tags such as Foundation model, China ecosystem, Open source.
Qwen2.5 VL 7B is a Foundation models product from 阿里通义, focused on open model with tags such as Foundation model, China ecosystem, Open source.
foundation model familySeed Models aggregates ByteDance foundation-model and multimodal entry points with an emphasis on productization and content workflows.
multimodal model platformSenseNova is a Foundation models product from 商汤科技, focused on multimodal model platform with tags such as Foundation model, Multimodal, China ecosystem.
official modelSenseNova 3D is a Foundation models product from 商汤日日新, focused on official model with tags such as Foundation model, API, China ecosystem.
official modelSenseNova Vision is a Foundation models product from 商汤日日新, focused on official model with tags such as Foundation model, API, China ecosystem.
open modelSmolVLM 500M is a Foundation models product from Hugging Face, focused on open model with tags such as Foundation model, Open source, Multimodal.
open modelSmolVLM2 2.2B is a Foundation models product from Hugging Face, focused on open model with tags such as Foundation model, Open source, Multimodal.
text to video modelSora is a Video generation product from OpenAI, focused on text to video model with tags such as Multimodal, Video editing.
official modelStep 1V is a Foundation models product from 阶跃星辰, focused on official model with tags such as Foundation model, API, China ecosystem.
official modelStep Audio is a Foundation models product from 阶跃星辰, focused on official model with tags such as Foundation model, API, China ecosystem.
official modelStep R1 V Mini is a Foundation models product from 阶跃星辰, focused on official model with tags such as Foundation model, API, China ecosystem.
official modelStep Video I2V is a Foundation models product from 阶跃星辰, focused on official model with tags such as Foundation model, API, China ecosystem.
official modelStep Video T2V is a Foundation models product from 阶跃星辰, focused on official model with tags such as Foundation model, API, China ecosystem.
video modelVeo is a Video generation product from Google DeepMind, focused on video model with tags such as Multimodal, Video editing.
video generationVeo 2 is a Video generation product from Google DeepMind, focused on video generation with tags such as Video editing, Multimodal.
official modelVeo 3 is a Foundation models product from Google DeepMind, focused on official model with tags such as Foundation model, API, Multimodal.
official modelVoxtral Mini is a Foundation models product from Mistral AI, focused on official model with tags such as Foundation model, API, Multimodal.
official modelVoxtral Small is a Foundation models product from Mistral AI, focused on official model with tags such as Foundation model, API, Multimodal.
official modelYi Vision is a Foundation models product from 零一万物, focused on official model with tags such as Foundation model, API, China ecosystem.
Start with official access, pricing model, API support, open/closed status, and common use cases.