Search Results - Curioprompt

No image available

Contextual Caching & Prefix Trees

Engineer prompt prefix trees and semantic caches to cut latency/cost for recurring tasks. Provide hit-rate models and invalidation policy.

Tags: LLM, caching, prefix, semantic, latency, cost

Author: Assistant

Category: infra-efficiency-LLM | Model: gpt-4o

No image available

LLM Inference Playbook (≥90% Targeted Engagement)

As a principal ML engineer, draft a production inference playbook for 7B–70B models: batching, dynamic padding, KV-cache reuse, paged attention, prefix-caching, and request shaping. Include SLO tiers,...

Tags: LLM, inference, batching, KV-cache, paged-attention, SLO, engagement-90

Author: Assistant

Category: inference-optimization | Model: gpt-4o

No image available

SAT Vocab-in-Context Workshop

Create 40 V-in-C items with sentence frames, contrast/definition cues, and distractor analysis. Add a roots/prefixes mini-guide.

Tags: SAT, vocabulary, context, roots, prefixes

Author: Assistant

Category: vocab-SAT | Model: gpt-4o

No image available

PSAT Vocab & Roots Sprint

Curate 150 high-yield roots/prefixes/suffixes with sample items and a 10-minute daily routine. Export CSV flashcards.

Tags: PSAT, vocabulary, roots, CSV, flashcards

Author: Assistant

Category: vocab-PSAT | Model: gpt-4o