Qwen is a family of AI models developed by Alibaba Cloud, spanning large language models (LLMs), vision-language models (VL), and multimodal models that also handle audio and video.
Qwen2.5 Series
- Qwen2.5: General-purpose LLMs, released as base and instruction-tuned variants in sizes from 0.5B to 72B parameters.
- Qwen2.5-Coder: Specialized for coding tasks.
- Qwen2.5-Math: Tailored for advanced mathematical reasoning.
- Qwen2.5-VL: Vision-language model for document understanding and long-video comprehension, available in multiple sizes up to 72B.
- Qwen2.5-Omni: Multimodal model handling text, image, video, and audio.
QwQ-32B
- A reasoning-optimized model with 32B parameters. Integrated into Qwen Chat and designed for strong problem-solving capabilities.
Qwen3 Series
- Latest flagship series. Includes dense and Mixture-of-Experts (MoE) models ranging from 0.6B to 235B total parameters; the largest MoE variant activates 22B of its 235B parameters per token.
- Introduces a thinking mode (for deep reasoning) and a non-thinking mode (for speed), along with a thinking budget that limits how many tokens the model spends on reasoning.
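
The difference between the two modes shows up in the raw model output: in thinking mode the reply carries a reasoning trace ahead of the final answer, while in non-thinking mode the trace is empty or absent. The sketch below shows one way a client might separate the two parts. It assumes the trace is delimited by `<think>` and `</think>` tags, as in Qwen3's published chat format; `ParsedReply` and `split_thinking` are hypothetical names chosen for illustration, not part of any Qwen library.

```python
from typing import NamedTuple


class ParsedReply(NamedTuple):
    """Hypothetical container for the two parts of a model reply."""
    thinking: str  # reasoning trace; empty string in non-thinking mode
    answer: str    # text intended for the user


def split_thinking(raw: str) -> ParsedReply:
    """Split a raw reply into its reasoning trace and final answer.

    Assumption: the trace ends at the first "</think>" tag. The opening
    "<think>" tag is treated as optional, since a chat template may have
    already emitted it as part of the prompt.
    """
    head, closing_tag, tail = raw.partition("</think>")
    if not closing_tag:
        # No trace at all: the whole reply is the answer.
        return ParsedReply(thinking="", answer=raw.strip())
    # Drop the opening tag if present; any text before it stays in the trace.
    thinking = head.replace("<think>", "", 1).strip()
    return ParsedReply(thinking=thinking, answer=tail.strip())


if __name__ == "__main__":
    reply = "<think>\n2 + 2 is 4.\n</think>\n\nThe answer is 4."
    parsed = split_thinking(reply)
    print("thinking:", parsed.thinking)
    print("answer:", parsed.answer)
```

Keeping the trace separate lets an application log or hide the reasoning while showing only the answer. A reply with an empty `<think></think>` block is handled the same way as one with no block: the `thinking` field comes back as an empty string.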