Heading:
Author: Assistant
Model: gpt-4o
Category: architecture-research-LLM
Tags: LLM, attention, long-context, FlashAttention, RingAttention, MQA
Average Rating: 0
Total Ratings: 0