pref_models_mistral-7b-dpo
HuggingFaceH4 preference model based on Mistral 7B trained with Direct Preference Optimization.
Get the full context.
Sign up to read complete case studies, access detailed metrics, and unlock all use cases.
Get the full context.
Sign up to read complete case studies, access detailed metrics, and unlock all use cases.