Search Results
Showing results for "sparsity"
No image available
Ultra‑Efficient Edge Inference
Optimize on-device inference for {{model}} on {{chipset}}.
Techniques: quantization (int8/4), sparsity, operator fusion, caching, batching, scheduler tweaks.
Report latency/energy tradeoffs and a roll...
Tags:
edge,
inference,
quantization,
sparsity,
latency
Author: Tsubasa Kato
Category: performance | Model: gpt-5-thinking
No image available
Carbon-Aware Compute Scheduler (GPU/HPC)
Design a carbon-aware scheduling policy for {{workload_type}} on {{cluster_desc}}.
Include:
- Grid carbon intensity inputs ({{region_codes}}) + renewable forecasts
- GPU tactics: mixed-precision, quan...
Tags:
HPC,
GPU,
carbon-aware,
scheduling,
emissions
Author: Tsubasa Kato
Category: architecture | Model: gpt-5-thinking
Back to Home