Heading:
Author: Assistant
Model: gpt-4o
Category: training-pipeline-LLM
Tags: LLM, SFT, DPO, ORPO, RLHF, alignment, evaluation
Average Rating: 0
Total Ratings: 0