Recommended Models by Use Case
| Use Case | Recommended Model | Model String | Alternatives | Learn More |
|---|---|---|---|---|
| Chat | Kimi K2.5 (instant mode) | moonshotai/Kimi-K2.5 | deepseek-ai/DeepSeek-V3.1, openai/gpt-oss-120b | Chat |
| Reasoning | Kimi K2.5 (reasoning mode) | moonshotai/Kimi-K2.5 | deepseek-ai/DeepSeek-R1, Qwen/Qwen3-235B-A22B-Thinking-2507 | Reasoning Guide, DeepSeek R1 |
| Coding Agents | Kimi K2.5 (reasoning mode) | moonshotai/Kimi-K2.5 | Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8, deepseek-ai/DeepSeek-V3.1 | Building Agents |
| Small & Fast | GPT-OSS 20B | openai/gpt-oss-20b | Qwen/Qwen2.5-7B-Instruct-Turbo, meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo | - |
| Medium General Purpose | GPT-OSS 120B | openai/gpt-oss-120b | zai-org/GLM-4.5-Air-FP8, Qwen/Qwen3-Next-80B-A3B-Instruct | - |
| Function Calling | GLM 4.7 | zai-org/GLM-4.7 | moonshotai/Kimi-K2.5, moonshotai/Kimi-K2-Instruct-0905 | Function Calling |
| Vision | Kimi K2.5 | moonshotai/Kimi-K2.5 | meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8, Qwen/Qwen3-VL-32B-Instruct | Vision, OCR |
| Image Generation | Flash Image 2.5 (Nano Banana) | google/flash-image-2.5 | black-forest-labs/FLUX.2-pro, ByteDance-Seed/Seedream-4.0 | Images |
| Image-to-Image | Flash Image 2.5 (Nano Banana) | google/flash-image-2.5 | black-forest-labs/FLUX.1-kontext-max, google/gemini-3-pro-image | Flux Kontext |
| Text-to-Video | Sora 2 | openai/sora-2-pro | google/veo-3.0, ByteDance/Seedance-1.0-pro | Video Generation |
| Image-to-Video | Veo 3.0 | google/veo-3.0 | ByteDance/Seedance-1.0-pro, kwaivgI/kling-2.1-master | Video Generation |
| Text-to-Speech | Cartesia Sonic 3 | cartesia/sonic-3 | canopylabs/orpheus-3b-0.1-ft, hexgrad/Kokoro-82M | Text-to-Speech |
| Speech-to-Text | Whisper Large v3 | openai/whisper-large-v3 | mistralai/Voxtral-Mini-3B-2507 | Speech-to-Text |
| Embeddings | GTE-Modernbert-base | Alibaba-NLP/gte-modernbert-base | intfloat/multilingual-e5-large-instruct | Embeddings |
| Rerank | MixedBread Rerank Large | mixedbread-ai/Mxbai-Rerank-Large-V2 | - | Rerank, Guide |
| Moderation | Virtue Guard | VirtueAI/VirtueGuard-Text-Lite | meta-llama/Llama-Guard-4-12B | - |
Need Help Choosing?
- Check our Serverless Models page for complete specifications
- See our WhichLLM page which provides categorical benchmarks for the above usecases
- Review Rate Limits for your tier
- See Pricing for cost information
- Visit Inference FAQs for common questions