Search Results

Showing results for "NVIDIA-GPU"

No image available

LoRA/QLoRA Strategy

Recommend when to use LoRA/QLoRA vs full finetune. Define rank search, target layers, and quantization-aware adapters. Include memory/perf tables per GPU class.

Tags: LLM, LoRA, QLoRA, finetuning, adapters, GPU

Author: Assistant

Category: parameter-efficient-tuning-LLM | Model: gpt-4o

No image available

GPU/HPC Acceleration for Genomics

You are an HPC coach. Map CPU→GPU wins: alignment, variant calling, and k-mer ops. Include memory bandwidth tips, batching, and cluster job arrays. Provide a benchmark plan.

Tags: HPC, GPU, genomics, CUDA, performance

Author: Assistant

Category: HPC-bioinformatics | Model: gpt-5

No image available

CUDA Zero-to-Hero Mini-Curriculum

You are a GPU coach. Design a 4-week CUDA plan covering memory hierarchy, warps, occupancy, shared memory tiling, and profiling. Provide two kernels to optimize and a grading rubric for speedups.

Tags: CUDA, GPU, parallel, optimization, education

Author: Assistant

Category: engineering | Model: gpt-4o

No image available

Hybrid Workload Split (CPU/GPU/QPU)

Propose how to split compute between CPU, GPU, and QPU for a variational algorithm. Provide a pipeline diagram description, and give heuristics for batching, latency hiding, and caching parameter shif...

Tags: quantum|hybrid|cpu|gpu|qpu|variational

Author: Curioforce Corp. Corp.

Category: Quantum Tech | Model: gpt-5-thinking

No image available

Optimizing GenAI Workflows for Edge GPU Boards

Write a comprehensive GenAI prompt that utilizes the latest edge GPU boards to accelerate tasks such as data annotation, object detection, and predictive modeling for a team of robotics engineers. Pr...

Tags: robotics, AI, prompts, edgeGPU, GenAI

Author: Prompt Generator

Category: AI Development | Model: llama3.1:latest

No image available

Capacity and Cloud Cost Planner

For workload <service>, model 12-month capacity and cloud cost. Include CPU/GPU, storage, egress, and LLM inference. Compare reserved vs spot vs savings plans. Map to SLOs and traffic seasonality. Out...

Tags: "CTO;FinOps;capacity;SLO;cost"

Author: ChatGPT

Category: CTO | Model: GPT-5 Thinking

No image available

NVIDIA roadmap: Blackwell Ultra → Rubin CPX

Compute roadmap prioritizes long-context video/software-gen AI. Action: align capex with GB200/GB300 ramps; capacity-plan power/cooling; shift some inference to video-native accelerators.

Tags: Semiconductors, AI, HPC, Capex

Author: ChatGPT

Category: Tech Strategy | Model: GPT-5 Thinking

Back to Home