Search Results
Showing results for "statistics"
Canary Release Strategy for Model/Prompt Updates
Create a canary strategy: cohort selection, metrics, guardrails, and automatic rollback conditions. Include steps to prevent canary contamination and to interpret results statistically.
Tags: canary, experimentation, rollout, metrics, rollback
Author: Assistant
Category: recursive-ai-safety | Model: GPT-5.2
Numbers Literacy: Common Statistical Traps
List common statistical traps in media (base rates, selection bias, p-hacking, survivorship). Provide a quick “sanity check” checklist for numbers.
Tags: statistics, critical-thinking, bias, media-literacy
Author: Assistant
Category: information-reliability | Model: GPT-5.2
Research Methods: A/B Testing Basics
Explain A/B testing for product/MBA students: power, sample size, and p-hacking pitfalls. Provide a template analysis.
Tags: MBA, experimentation, AB-testing, statistics, template
Author: Assistant
Category: methods-and-metrics-MBA | Model: gpt-4o
Multi-Task Multi-Domain Evals
Create a senior-grade eval battery: reasoning (math/code), instruction-following, safety, multilingual QA, and tool-use. Include uncertainty intervals and power analysis for A/Bs.
Tags: LLM, evaluation, multidomain, statistics, AB-testing
Author: Assistant
Category: evaluation-design-LLM | Model: gpt-4o
Franca QC Sampling and SPC
Implement acceptance sampling and SPC on a Franca outsole line. Define AQL, control charts (p/X̄-R), gage R&R, and a response plan. Output operator training cards.
Tags: QC, SPC, AQL, gage-RR, Franca, footwear
Author: Assistant
Category: statistical-quality-control | Model: gpt-4o
CONSORT Trial Protocol Scaffold
Act as a clinical PM. Draft a randomized trial protocol scaffold aligned to CONSORT: arms, allocation, blinding, endpoints, statistical plan, DSMB, and stopping rules. Include a timeline Gantt skeleto...
Tags: clinical-trial, CONSORT, protocol, DSM, statistics
Author: Assistant
Category: clinical-research | Model: gpt-5
Real-World Evidence Registry Plan
You are a RWE architect. Design a disease registry: core data elements, linkage strategy, follow-up cadence, missingness handling, and governance. Include a statistical analysis plan.
Tags: RWE, registry, observational, governance, analysis
Author: Assistant
Category: health-services-research | Model: gpt-5
Evaluate a Person of Advanced Knowledge of Physics and Science
To evaluate whether a person has advanced knowledge of physics and science, consider the following ten questions covering a range of topics, including theoretical and experimental physics, as well as ...
Tags: evaluation
Author: [email protected]
Category: Evaluation, NLP | Model: GPT-4o, o1, o1-mini