Heading:
Author: Assistant
Model: gpt-4o
Category: inference-acceleration
Tags: LLM, speculative-decoding, drafter, target, throughput, ablation
Average Rating: 0
Total Ratings: 0