HF

mistral-7b-grok

LLMby Hugging Face H4·Model page

HuggingFaceH4's 7.2B Mistral fine-tune trained on Grok-style harmless conversations and UltraChat 200k data.

Share:

Base model

mistralai/Mistral-7B-v0.1

Model Card

This model is a fine-tuned version of mistralai/Mistral-7B-v0.1 that has been aligned via Constitutional AI to mimic the style of xAI's Grok assistant.

It achieves the following results on the evaluation set:

  • Loss: 0.9348

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-05
  • train_batch_size: 8
  • eval_batch_size: 8
  • seed: 42
  • distributed_type: multi-GPU
  • num_devices: 8
  • gradient_accumulation_steps: 4
  • total_train_batch_size: 256
  • total_eval_batch_size: 64
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: cosine
  • lr_scheduler_warmup_ratio: 0.1
  • num_epochs: 1

Training results

Training Loss Epoch Step Validation Loss
0.9326 1.0 545 0.9348

Framework versions

  • Transformers 4.36.2
  • Pytorch 2.1.2+cu121
  • Datasets 2.16.1
  • Tokenizers 0.15.0
Author
HF
Hugging Face H4
Organization
HuggingFaceH4
Details
Downloads49
Likes54
AccessOpen Source
Tasktext-generation
Parameters7.2B
Licenseapache-2.0
Librarytransformers
CreatedJan 29, 2024
UpdatedFeb 1, 2024
View on Hugging Face
Get the full context.

Sign up to read complete case studies, access detailed metrics, and unlock all use cases.

mistral-7b-grok — AI Model Details | Applied