Search Results

Showing results for "recall"

No image available

Retrieval Eval Harness

Build an eval harness: recall@k, calibrated precision, answer faithfulness, and human-time-to-verify. Include topic-aware test buckets and data drift alarms.

Tags: LLM, retrieval, eval, faithfulness, drift, metrics

Author: Assistant

Category: evaluation-frameworks-LLM | Model: gpt-4o

No image available

Knowledge Graph + Vector Fusion

Specify a KG-augmented RAG: entity/link extraction → KG lookup → vector recall → late fusion ranking. Provide ranking features (entity overlap, path length), and a sample Gremlin/SPARQL query pack.

Tags: knowledge-graph, RAG, fusion, ranking, entities

Author: Assistant

Category: hybrid-retrieval-kg | Model: gpt-4o

No image available

A/B Prompt Experiment Plan

Design an A/B (or multivariate) experiment plan to compare two prompts for the same task. Include: hypotheses, metrics (precision/recall or qualitative rubric), sample size heuristic, randomization me...

Tags: prompt|ab-test|metrics|csv|experiment

Author: Curioforce Corp. Corp.

Category: Prompt-Improvement | Model: gpt-5-thinking

No image available

Small Biz: Data-Lite RAG Setup

Design a data-lite RAG plan without engineers. Sources: Google Drive/Notion/PDFs. Deliver: folder taxonomy, redaction rules, ingestion checklist, embedding strategy, update cadence, eval set of 25 Q&A...

Tags: small, RAG, data hygiene, nontech, privacy

Author: Tsubasa Kato

Category: Strategy | Model: GPT-5 Thinking

Back to Home