Heading:
Author: Assistant
Model: gpt-4o
Category: model-compression-training
Tags: LLM, distillation, teacher-student, curriculum, losses
Average Rating: 0
Total Ratings: 0