How many parameters does sft-llava-1.5-7b-hf have?

Parameter count for sft-llava-1.5-7b-hf is not available. See the Hugging Face model page for full specifications.

Who created sft-llava-1.5-7b-hf?

sft-llava-1.5-7b-hf was published by Hugging Face H4 on Hugging Face.

sft-llava-1.5-7b-hf

Name: sft-llava-1.5-7b-hf
Author: Hugging Face H4


HuggingFaceH4's PEFT LoRA adapter for LLaVA 1.5 7B, trained with TRL's supervised fine-tuning pipeline.

Base model

llava-hf/llava-1.5-7b-hf
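
Because this repository is a PEFT LoRA adapter rather than a full checkpoint, it has to be attached to the base model at load time. The following is a minimal sketch of how that might look, not an official usage snippet. The adapter repository id `HuggingFaceH4/sft-llava-1.5-7b-hf` is an assumption built from the organization and model name listed on this page, the helper name `load_llava_with_adapter` is hypothetical, and the half-precision dtype is an illustrative choice.

```python
BASE_MODEL_ID = "llava-hf/llava-1.5-7b-hf"  # base model named on this page
# Assumption: the adapter repo id is "<organization>/<model name>" as listed here.
ADAPTER_ID = "HuggingFaceH4/sft-llava-1.5-7b-hf"


def load_llava_with_adapter(base_id=BASE_MODEL_ID, adapter_id=ADAPTER_ID):
    """Load the base LLaVA model and attach the LoRA adapter to it.

    Hypothetical helper for illustration; returns (processor, model).
    """
    # Imported lazily so this file can be imported without the heavy dependencies.
    import torch
    from peft import PeftModel
    from transformers import AutoProcessor, LlavaForConditionalGeneration

    # The processor (tokenizer + image processor) comes from the base model.
    processor = AutoProcessor.from_pretrained(base_id)
    base = LlavaForConditionalGeneration.from_pretrained(
        base_id,
        torch_dtype=torch.float16,  # illustrative choice, not stated on the page
    )
    # Wrap the base model with the adapter weights.
    model = PeftModel.from_pretrained(base, adapter_id)
    return processor, model
```

Calling `load_llava_with_adapter()` downloads both the base weights and the adapter, so it needs network access and enough memory for a 7B model.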

Model Description

sft-llava-1.5-7b-hf

This model is a fine-tuned version of llava-hf/llava-1.5-7b-hf on an unknown dataset.

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

learning_rate: 5e-05
train_batch_size: 8
eval_batch_size: 8
seed: 42
distributed_type: multi-GPU
num_devices: 8
gradient_accumulation_steps: 8
total_train_batch_size: 512
total_eval_batch_size: 64
optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
lr_scheduler_type: linear
num_epochs: 3.0
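
The total batch sizes listed above follow from the per-device values, and the linear scheduler decays the learning rate toward zero over training. The sketch below reproduces that arithmetic. The function names are hypothetical, the page does not list any warmup steps so `warmup_steps=0` is an assumption, and `total_steps` is a placeholder because the dataset size is not given.

```python
def effective_batch_sizes(per_device_train, per_device_eval, devices, grad_accum):
    """Return (total_train_batch_size, total_eval_batch_size)."""
    # Gradient accumulation only applies to training, so it multiplies the
    # training batch size but not the evaluation batch size.
    total_train = per_device_train * devices * grad_accum
    total_eval = per_device_eval * devices
    return total_train, total_eval


def linear_lr(step, total_steps, base_lr=5e-5, warmup_steps=0):
    """Learning rate under a linear schedule with optional linear warmup.

    warmup_steps=0 is an assumption; the page does not report a warmup setting.
    """
    if step < warmup_steps:
        return base_lr * step / max(1, warmup_steps)
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


# Values copied from the hyperparameter list: 8 per device, 8 devices, 8 accumulation steps.
total_train, total_eval = effective_batch_sizes(8, 8, 8, 8)
print(total_train, total_eval)  # 512 64, matching the listed totals
```

With a hypothetical `total_steps` of 100, `linear_lr(0, 100)` returns the listed peak of 5e-05 and `linear_lr(100, 100)` returns 0.0.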

Training results

Framework versions

PEFT 0.11.1
Transformers 4.44.0.dev0
PyTorch 2.3.0+cu121
Datasets 2.16.1
Tokenizers 0.19.1

Author

Hugging Face H4

Organization

HuggingFaceH4

Details

Downloads: 3
Likes: 0
Access: Open Source
License: llama2
Library: peft
Created: Jul 26, 2024
Updated: Jul 26, 2024
