Search Results

Showing results for "offline-evals"

Retrieval Eval Harness

Build an eval harness: recall@k, calibrated precision, answer faithfulness, and human-time-to-verify. Include topic-aware test buckets and data drift alarms.

Tags: LLM, retrieval, eval, faithfulness, drift, metrics

Author: Assistant

Category: evaluation-frameworks-LLM | Model: gpt-4o

Mobile & Offline Research Mode

Design a mobile UX with offline packs, low-bandwidth retrieval, and later sync. Provide caching TTL, conflict resolution, and share sheet flows.

Tags: mobile, offline, caching, UX, sync

Author: Assistant

Category: mobile-experience | Model: gpt-4o

Navigation Stack: Map+Compass+GPS

Create a layered nav plan: map/compass refreshers, GPX file prep, offline app setup, backup wayfinding cues. Include a 10-question nav quiz and a pocket crib sheet.

Tags: camping, navigation, GPS, map, compass

Author: Assistant

Category: navigation-skills | Model: gpt-4o

Battery & Data Budget Guardrails

Act as a mobile ops coach. Provide battery/data-saving rules for long commutes: offline-first, background refresh policy, low-data media, and emergency reserve. Output a checklist.

Tags: mobile, battery, data, offline, workflow

Author: Assistant

Category: operations | Model: gpt-5

Backup & DAM 3-2-1 + IPTC

You are a DAM architect. Create a 3-2-1 backup plan, folder schema, file naming convention, and IPTC keyword strategy. Include cloud and offline options plus a restore drill.

Tags: DAM, backup, IPTC, workflow, 3-2-1

Author: Assistant

Category: photography | Model: gpt-4o

Prompt Lint & Style Guide

Create a lint checklist for prompts: brevity, unambiguous instructions, schema-first outputs, guardrails, and eval hooks. Return a checklist table with yes/no and examples. Add 5 before→after rewrites...

Tags: prompt, lint, style, guide, checklist

Author: Curioforce Corp.

Category: Prompt-Improvement | Model: gpt-5-thinking

Enterprise: Incident & Change Mgmt

Write an ITIL-aligned process for agent incidents and changes: severity matrix, rollback, shadow traffic, canary, approvals, comms templates, and post-mortems with eval deltas. Include regulator-ready...

Tags: enterprise, ITIL, incident, change, compliance

Author: Tsubasa Kato

Category: Strategy | Model: GPT-5 Thinking

Small Biz: Data-Lite RAG Setup

Design a data-lite RAG plan without engineers. Sources: Google Drive/Notion/PDFs. Deliver: folder taxonomy, redaction rules, ingestion checklist, embedding strategy, update cadence, eval set of 25 Q&A...

Tags: small, RAG, data hygiene, nontech, privacy

Author: Tsubasa Kato

Category: Strategy | Model: GPT-5 Thinking

Mid-Market: GTM Acceleration Agents

Ship marketing and sales agents: content brief generator, SEO outline, ad variants, webinar follow-up, lead enrichment, meeting note sync. Define prompts, tool chain (CRM, MAP, docs), eval rubric (bra...

Tags: medium, marketing, sales, GTM, experimentation

Author: Tsubasa Kato

Category: Strategy | Model: GPT-5 Thinking

Enterprise: Secure RAG over Data Lakes

Architect secure RAG across lakehouse/DWH: metadata-driven retrieval, policy-aware chunks, per-record ACL, caching, eval sets by domain, hallucination controls, and can’t-answer routing. Deliver refer...

Tags: enterprise, RAG, security, ACL, lakehouse

Author: Tsubasa Kato

Category: Strategy | Model: GPT-5 Thinking
