Search Results - Curioprompt

Reviving Heritage for Modern Audiences

Examine the strategies artists and cultural institutions use to keep folk traditions relevant in a modern media environment. Compare three approaches: faithful preservation, contemporary reinterpretat...

Tags: heritage, folk-art, cultural-continuity, creative-industry, tradition

Author: Curioprompt

Category: Culture | Model: gpt-5.4-mini

Evaluation Clinic: Good vs Faithful

Design an evaluation harness that measures relevance and faithfulness for IR+LLM answers. Include a human labeling rubric and inter-rater agreement checks.

Tags: IR, evaluation, faithfulness, LLM, RAG

Author: Assistant

Category: eval-framework-IR-LLM | Model: gpt-4o

Retrieval Eval Harness

Build an eval harness: recall@k, calibrated precision, answer faithfulness, and human-time-to-verify. Include topic-aware test buckets and data drift alarms.

Tags: LLM, retrieval, eval, faithfulness, drift, metrics

Author: Assistant

Category: evaluation-frameworks-LLM | Model: gpt-4o

RAG 2.0: Freshness & Faithfulness

Architect a retrieval stack with hybrid search, temporal decay, deduplication, and passage-level citation anchors. Define fact-grounding checks and failure messages, and include a reindex cadence for freshness.

Tags: LLM, RAG, hybrid, temporal-decay, citations, freshness

Author: Assistant

Category: retrieval-grounding-LLM | Model: gpt-4o

Attribution-Aware Eval Harness

Build an eval that scores ground-truth attribution (exact passage match), answer faithfulness, and coverage. Provide a dataset schema and a nightly regression plan.

Tags: evaluation, attribution, faithfulness, coverage, datasets

Author: Assistant

Category: evaluation-frameworks | Model: gpt-4o

Long-Context Compression Toolkit

Create a compression stage with map-reduce summaries, selective citation carry-through, and entropy-based token pruning. Provide metrics (coverage, faithfulness) and an ablation plan.

Tags: summarization, long-context, compression, metrics, faithfulness

Author: Assistant

Category: context-engineering | Model: gpt-4o