Qwen2.5 VL 72B Instruct
Qwen's 72B-parameter vision-language model with a 128K-token context for multimodal understanding and reasoning.
Model Card
Qwen2.5-VL is proficient in recognizing common objects such as flowers, birds, fish, and insects. It is also highly capable of analyzing texts, charts, icons, graphics, and layouts within images.
Get the full context.
Sign up to read complete case studies, access detailed metrics, and unlock all use cases.
Get the full context.
Sign up to read complete case studies, access detailed metrics, and unlock all use cases.