HF

sft-llava-1.5-7b-hf

Otherby Hugging Face H4·Model page

HuggingFaceH4's PEFT LoRA adapter for LLaVA 1.5 7B, trained with TRL's supervised fine-tuning pipeline.

Share:

Base model

llava-hf/llava-1.5-7b-hf

Model Card

sft-llava-1.5-7b-hf

This model is a fine-tuned version of llava-hf/llava-1.5-7b-hf on an unknown dataset.

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 5e-05
  • train_batch_size: 8
  • eval_batch_size: 8
  • seed: 42
  • distributed_type: multi-GPU
  • num_devices: 8
  • gradient_accumulation_steps: 8
  • total_train_batch_size: 512
  • total_eval_batch_size: 64
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 3.0

Training results

Framework versions

  • PEFT 0.11.1
  • Transformers 4.44.0.dev0
  • Pytorch 2.3.0+cu121
  • Datasets 2.16.1
  • Tokenizers 0.19.1
Author
HF
Hugging Face H4
Organization
HuggingFaceH4
Details
Downloads3
Likes0
AccessOpen Source
Licensellama2
Librarypeft
CreatedJul 26, 2024
UpdatedJul 26, 2024
View on Hugging Face
Get the full context.

Sign up to read complete case studies, access detailed metrics, and unlock all use cases.

sft-llava-1.5-7b-hf — AI Model Details | Applied