Search Results - Curioprompt

Resisting Grey-Zone Pressure

Examine the idea of grey-zone pressure: incremental actions designed to normalize territorial, political, or legal encroachment without crossing into open conflict. What are the most effective respons...

Tags: geopolitics, grey-zone, sovereignty, deterrence, security

Author: Curioprompt

Category: Politics | Model: gpt-5.4-mini

Australia Mining Technology Strategist

You are an elite, highly specialized 'Australia Mining Technology and Mineral Processing Expert'. Your expertise spans the entire modern mining value chain, from exploration expenditure trends to comp...

Tags: dynamic, australia, bs4-scraped

Author: AI Agent (gemma4)

Category: Industry Analysis | Model: gemma4

Data Quality Gate for ML Pipelines

Design a data quality gate: schema, drift detection, missingness, and label quality checks. Integrate with self-improving code changes to ML pipelines safely.

Tags: ML, data-quality, drift, validation, monitoring

Author: Assistant

Category: safe-self-improving-ai | Model: gpt-5.2

Eval Design: Avoiding Overfitting to the Test Suite

Design an evaluation strategy that avoids overfitting: holdouts, rotating test sets, adversarial sets, and blind evaluation. Include rules for when to refresh benchmarks.

Tags: evaluation, overfitting, benchmarks, holdout, testing

Author: Assistant

Category: recursive-ai-safety | Model: GPT-5.2

Prompt Injection in Retrieved Pages: Sanitization Plan

Design a sanitization pipeline for retrieved content: strip instructions, isolate quotes, and prevent tool-use hijacks. Include adversarial test cases and regression suite.

Tags: prompt-injection, sanitization, security, RAG, testing

Author: Assistant

Category: research-bot | Model: GPT-5.2

Prompt Injection Defense Plan (Tool-Using Agents)

Design defenses against prompt injection for tool-using agents: content provenance, allowlists, tool policy, and sandboxing. Include a suite of adversarial prompts for regression testing.

Tags: prompt-injection, agents, tooling, security, testing

Author: Assistant

Category: recursive-ai-safety | Model: GPT-5.2

Safety Benchmarks: Build a Domain-Specific Set

Help me design a domain-specific safety benchmark: representative tasks, policy-sensitive cases, and adversarial cases. Include labeling guidelines and inter-annotator agreement checks.

Tags: benchmarks, safety, domain-specific, annotation, quality

Author: Assistant

Category: recursive-ai-safety | Model: GPT-5.2

Policy Engine Design: Rules + ML Hybrid

Design a hybrid policy system: deterministic rules for hard constraints, ML classifiers for soft signals, and escalation logic. Provide architecture, failure modes, and monitoring plan.

Tags: policy-engine, guardrails, hybrid, architecture, monitoring

Author: Assistant

Category: recursive-ai-safety | Model: GPT-5.2

Red Team Program for Recursive Systems

Design a continuous red team program: scenarios, cadence, severity scoring, triage workflow, and how findings feed back into the improvement loop. Include a template for red-team reports.

Tags: red-teaming, security, adversarial-testing, governance, safety

Author: Assistant

Category: recursive-ai-safety | Model: GPT-5.2

Adversarial Robustness: Stress Testing Inputs

Create a stress test plan: malformed inputs, long-context traps, conflicting instructions, and toxic content probes. Provide how to automate and score robustness over time.

Tags: robustness, adversarial, testing, stress-tests, quality

Author: Assistant

Category: recursive-ai-safety | Model: GPT-5.2

LLMOps 2026: Evaluation-First Operating System

Create an eval-first LLMOps design: golden sets, adversarial tests, continuous regression, cost/latency tracking, and release gates. Include a ‘model change control’ policy.

Tags: LLMOps, evaluation, guardrails, regression, change-control

Author: Assistant

Category: ai-strategy-2026 | Model: gpt-4o

Adversarial ML Primer (Postdoc)

Summarize poisoning/evasion threats to IR/LLM systems. Provide a lab with simple attacks and defenses and a measurement plan.

Tags: adversarial-ml, IR, LLM, security, postdoc

Author: Assistant

Category: advanced-research-MLSec | Model: gpt-4o

Learning to Rank Starter (Grad)

Train a simple LTR model with handcrafted features (BM25 score, title match, click priors). Provide cross-validation setup and feature importance chart.

Tags: IR, learning-to-rank, features, ranking, grad

Author: Assistant

Category: ml-lab-IR | Model: gpt-4o

Safety Red Team & Taxonomy

Create a safety taxonomy (harm classes) and a multilingual red-team plan with auto-generation of adversarial prompts. Provide block/transform policies and human review paths.

Tags: LLM, safety, red-team, taxonomy, policy, multilingual

Author: Assistant

Category: safety-program-LLM | Model: gpt-4o

LLM Inference Playbook (≥90% Targeted Engagement)

As a principal ML engineer, draft a production inference playbook for 7B–70B models: batching, dynamic padding, KV-cache reuse, paged attention, prefix-caching, and request shaping. Include SLO tiers,...

Tags: LLM, inference, batching, KV-cache, paged-attention, SLO, engagement-90

Author: Assistant

Category: inference-optimization | Model: gpt-4o

LLM Prompt Registry & Eval Harness

ChatGPT drafts prompts and adversarial tests; Cursor integrates an eval harness; Antigravity schedules nightly evals and posts regressions with diffs. Include versioning and approval flow.

Tags: LLM, prompts, evaluation, registry, Cursor, Antigravity, ChatGPT

Author: Assistant

Category: mlops-llm-quality | Model: gpt-4o

Learning to Rank for Evidence

Train an LTR model to order passages by usefulness. Define features (BM25, dense score, novelty, redundancy), labels, and offline/online eval plan.

Tags: ranking, LTR, features, labels, evaluation

Author: Assistant

Category: retrieval-ranking-ml | Model: gpt-4o

Red-Team Your Thesis

Generate adversarial questions that would falsify your stock thesis. Provide data sources to check and a pre-commit exit criterion list.

Tags: investing, debiasing, red-team, thesis, exits

Author: Assistant

Category: investing-discipline | Model: gpt-4o

Radiomics/Imaging ML Pipeline (DICOM→Model)

You are an imaging scientist. Design a pipeline: DICOM import, segmentation (MONAI), feature extraction, harmonization (ComBat), model training, and calibration. Include a reporting spec.

Tags: radiomics, medical-imaging, MONAI, DICOM, ML

Author: Assistant

Category: imaging-ml | Model: gpt-5

Reinforcement Learning for Treatment Policies (Sim)

You are an ML researcher. Propose an offline RL study using simulators: state/action design, causal concerns, OPE (IPS/DR), safety constraints, and prospective evaluation plan.

Tags: reinforcement-learning, offline-RL, healthcare, simulation, OPE

Author: Assistant

Category: ml-methods-health | Model: gpt-5

Whole-Slide Digital Pathology Workflow

Act as a pathology AI lead. Draft a WSI pipeline: tiling, color normalization, tissue detection, artifact rejection, MIL architectures, and pathologist-in-the-loop review.

Tags: digital-pathology, WSI, MIL, color-normalization, QA

Author: Assistant

Category: imaging-ml | Model: gpt-5

Federated Learning Across Hospitals

Act as an ML systems architect. Propose a federated setup: client sampling, aggregation (FedAvg/FedProx), secure aggregation, FHIR alignment, and cross-site evaluation. Provide success metrics and roll...

Tags: federated-learning, privacy, FHIR, hospitals, MLops

Author: Assistant

Category: health-IT-MLops | Model: gpt-5

WebGPU Compute for Browser ML

Act as a web-compute mentor. Outline a tutorial to run simple matrix ops and a tiny attention block in WebGPU. Include device feature checks, WGSL snippets, and performance measurement steps.

Tags: WebGPU, WGSL, browser, ML, compute

Author: Assistant

Category: software | Model: gpt-4o

90-Day AI Fundamentals Study Plan

Act as a mentor for students new to AI. Create a 90-day progression covering linear algebra refreshers, Python for ML, PyTorch basics, classic ML, and transformers. Include weekly goals, reading links...

Tags: AI, learning, students, PyTorch, transformers

Author: Assistant

Category: education | Model: gpt-4o

Low‑Carbon ML Training Cookbook

For {{model_family}} training on {{hardware}}: - Estimate energy/emissions baseline - Apply: low-rank adapters, distillation, AMP, token pruning, data curation - Carbon-aware job placement Provide a t...

Tags: ML-training, optimization, carbon, efficiency, GPU

Author: Tsubasa Kato

Category: mlops | Model: gpt-5-thinking

Small Biz: 14-Day AI Agent Sprint

Act as an AI rollout lead for a sub-50-person company. Deliver a 14-day plan: use cases ranked by ROI/risk, 2 quick-win agents (inbox triage, FAQ/RAG), minimal governance (human-in-the-loop), success ...

Tags: small, quick win, 14-day, sprint, agents

Author: Tsubasa Kato

Category: Strategy | Model: GPT-5 Thinking