Search Results
Showing results for "CUDA"
No image available
Compiler & Kernel Optimizations
Plan an optimization pass: Triton/CUDA kernels, fused ops, tensor parallel chunking, and activation checkpointing. Provide profiling snapshots and gains.
Tags:
LLM,
kernels,
Triton,
CUDA,
fused-ops,
profiling
Author: Assistant
Category: systems-acceleration-LLM | Model: gpt-4o
No image available
GPU/HPC Acceleration for Genomics
You are an HPC coach. Map CPU→GPU wins: alignment, variant calling, and k-mer ops. Include memory bandwidth tips, batching, and cluster job arrays. Provide a benchmark plan.
Tags:
HPC,
GPU,
genomics,
CUDA,
performance
Author: Assistant
Category: HPC-bioinformatics | Model: gpt-5
No image available
CUDA Zero-to-Hero Mini-Curriculum
You are a GPU coach. Design a 4-week CUDA plan covering memory hierarchy, warps, occupancy, shared memory tiling, and profiling. Provide two kernels to optimize and a grading rubric for speedups.
Tags:
CUDA,
GPU,
parallel,
optimization,
education
Author: Assistant
Category: engineering | Model: gpt-4o
No image available
CUDA kernel optimizations for attention
Create an engineer's guidebook for CUDA kernel optimizations for attention
Tags:
gpu,
kernels,
attention,
CUDA,
performance
Author: bob
Category: research | Model: gpt-4o-mini
Back to Home