Search Results

Showing results for "InfiniBand"

No image available

MoE Routing & Load Balancing

Design an expert-parallel MoE serving topology: gate calibration, capacity factor, expert sharding, and interconnect constraints (NVLink/IB). Provide hot-spot diagnostics and expert-drop policies for ...

Tags: LLM, MoE, experts, routing, capacity, NVLink, InfiniBand

Author: Assistant

Category: distributed-systems-LLM | Model: gpt-4o

Back to Home