ML

Llama-3.2-11B-Vision-Instruct

Multimodalby Meta Llama·Model page

Meta's 10.7B-parameter instruction-tuned Llama 3.2 vision model for multilingual image-text understanding and generation.

Share:
Author
ML
Meta Llama
Organization · ✓
meta-llama
Details
Downloads126.2K
Likes1.6K
AccessOpen Source
Taskimage-text-to-text
Parameters10.7B
Licensellama3.2
Librarytransformers
CreatedSep 18, 2024
UpdatedDec 4, 2024
View on Hugging Face
Languages
endefritpthiesth
Get the full context.

Sign up to read complete case studies, access detailed metrics, and unlock all use cases.

Llama-3.2-11B-Vision-Instruct — AI Model Details | Applied