Q

Qwen3 VL 235B A22B Instruct

Multimodalby Qwen·Model page

Qwen's 235B-parameter MoE vision-language model for multimodal text and image understanding.

Share:

Model Card

Qwen3-VL-235B-A22B Instruct is an open-weight multimodal model that unifies strong text generation with visual understanding across images and video. The Instruct model targets general vision-language use (VQA, document parsing, chart/table...

Author
Q
Qwen
Organization
Qwen
Details
Downloads
Likes
AccessOpen Source
Context262K tokens
Input price$0.2 /1M
Output price$0.88 /1M
Knowledge cutoffMar 31, 2025
CreatedSep 23, 2025
Updated
View on Hugging Face
Get the full context.

Sign up to read complete case studies, access detailed metrics, and unlock all use cases.

Qwen3 VL 235B A22B Instruct — AI Model Details | Applied