
GLM 4.6V

Multimodal · by Z.ai · Model page

Z.ai's multimodal GLM model with image and video understanding and a 131K-token context.


Model Card

GLM-4.6V is a large multimodal model designed for high-fidelity visual understanding and long-context reasoning across images, documents, and mixed media. It supports up to 128K tokens, processes complex page layouts...

Author: Z.ai
Organization: z-ai
Details
Access: Open Source
Context: 131K tokens
Input price: $0.3 / 1M tokens
Output price: $0.9 / 1M tokens
Created: Dec 8, 2025
