O

clip-vit-large-patch14-336

Visionby OpenAI·Model page

clip-vit-large-patch14-336 is OpenAI's CLIP ViT-L model at 336px resolution for zero-shot image classification and image-text matching.

Share:

Model Card

This model was trained from scratch on an unknown dataset. It achieves the following results on the evaluation set:

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • optimizer: None
  • training_precision: float32

Training results

Framework versions

  • Transformers 4.21.3
  • TensorFlow 2.8.2
  • Tokenizers 0.12.1
Author
O
OpenAI
Organization · ✓
openai
Details
Downloads2.1M
Likes307
AccessOpen Source
Taskzero-shot-image-classification
Librarytransformers
CreatedApr 22, 2022
UpdatedOct 4, 2022
View on Hugging Face
Get the full context.

Sign up to read complete case studies, access detailed metrics, and unlock all use cases.

clip-vit-large-patch14-336 — AI Model Details | Applied