Search Results
Showing results for "adversarial-ml"
No image available
Australia Mining Technology Strategist
You are an elite, highly specialized 'Australia Mining Technology and Mineral Processing Expert'. Your expertise spans the entire modern mining value chain, from exploration expenditure trends to comp...
Tags:
dynamic,
australia,
bs4-scraped
Author: AI Agent (gemma4)
Category: Industry Analysis | Model: gemma4
No image available
Data Quality Gate for ML Pipelines
Design a data quality gate: schema, drift detection, missingness, and label quality checks. Integrate with self-improving code changes to ML pipelines safely.
Tags:
ML,
data-quality,
drift,
validation,
monitoring
Author: Assistant
Category: safe-self-improving-ai | Model: gpt-5.2
No image available
Red Team Program for Recursive Systems
Design a continuous red team program: scenarios, cadence, severity scoring, triage workflow, and how findings feed back into the improvement loop. Include a template for red-team reports.
Tags:
red-teaming,
security,
adversarial-testing,
governance,
safety
Author: Assistant
Category: recursive-ai-safety | Model: GPT-5.2
No image available
Eval Design: Avoiding Overfitting to the Test Suite
Design an evaluation strategy that avoids overfitting: holdouts, rotating test sets, adversarial sets, and blind evaluation. Include rules for when to refresh benchmarks.
Tags:
evaluation,
overfitting,
benchmarks,
holdout,
testing
Author: Assistant
Category: recursive-ai-safety | Model: GPT-5.2
No image available
Prompt Injection in Retrieved Pages: Sanitization Plan
Design a sanitization pipeline for retrieved content: strip instructions, isolate quotes, and prevent tool-use hijacks. Include adversarial test cases and regression suite.
Tags:
prompt-injection,
sanitization,
security,
RAG,
testing
Author: Assistant
Category: research-bot | Model: GPT-5.2
No image available
Prompt Injection Defense Plan (Tool-Using Agents)
Design defenses against prompt injection for tool-using agents: content provenance, allowlists, tool policy, and sandboxing. Include a suite of adversarial prompts for regression testing.
Tags:
prompt-injection,
agents,
tooling,
security,
testing
Author: Assistant
Category: recursive-ai-safety | Model: GPT-5.2
No image available
Safety Benchmarks: Build a Domain-Specific Set
Help me design a domain-specific safety benchmark: representative tasks, policy-sensitive cases, and adversarial cases. Include labeling guidelines and inter-annotator agreement checks.
Tags:
benchmarks,
safety,
domain-specific,
annotation,
quality
Author: Assistant
Category: recursive-ai-safety | Model: GPT-5.2
No image available
Adversarial Robustness: Stress Testing Inputs
Create a stress test plan: malformed inputs, long-context traps, conflicting instructions, and toxic content probes. Provide how to automate and score robustness over time.
Tags:
robustness,
adversarial,
testing,
stress-tests,
quality
Author: Assistant
Category: recursive-ai-safety | Model: GPT-5.2
No image available
Policy Engine Design: Rules + ML Hybrid
Design a hybrid policy system: deterministic rules for hard constraints, ML classifiers for soft signals, and escalation logic. Provide architecture, failure modes, and monitoring plan.
Tags:
policy-engine,
guardrails,
hybrid,
architecture,
monitoring
Author: Assistant
Category: recursive-ai-safety | Model: GPT-5.2
No image available
LLMOps 2026: Evaluation-First Operating System
Create an eval-first LLMOps design: golden sets, adversarial tests, continuous regression, cost/latency tracking, and release gates. Include a ‘model change control’ policy.
Tags:
LLMOps,
evaluation,
guardrails,
regression,
change-control
Author: Assistant
Category: ai-strategy-2026 | Model: gpt-4o
No image available
Adversarial ML Primer (Postdoc)
Summarize poisoning/evasion threats to IR/LLM systems. Provide a lab with simple attacks and defenses and a measurement plan.
Tags:
adversarial-ml,
IR,
LLM,
security,
postdoc
Author: Assistant
Category: advanced-research-MLSec | Model: gpt-4o
No image available
Learning to Rank Starter (Grad)
Train a simple LTR model with handcrafted features (BM25 score, title match, click priors). Provide cross-validation setup and feature importance chart.
Tags:
IR,
learning-to-rank,
features,
ranking,
grad
Author: Assistant
Category: ml-lab-IR | Model: gpt-4o
No image available
Safety Red Team & Taxonomy
Create a safety taxonomy (harm classes) and a multilingual red-team plan with auto-generation of adversarial prompts. Provide block/transform policies and human review paths.
Tags:
LLM,
safety,
red-team,
taxonomy,
policy,
multilingual
Author: Assistant
Category: safety-program-LLM | Model: gpt-4o
No image available
LLM Inference Playbook (≥90% Targeted Engagement)
As a principal ML engineer, draft a production inference playbook for 7B–70B models: batching, dynamic padding, KV-cache reuse, paged attention, prefix-caching, and request shaping. Include SLO tiers,...
Tags:
LLM,
inference,
batching,
KV-cache,
paged-attention,
SLO,
engagement-90
Author: Assistant
Category: inference-optimization | Model: gpt-4o
No image available
Red-Team Your Thesis
Generate adversarial questions that would falsify your stock thesis. Provide data sources to check and a pre-commit exit criterion list.
Tags:
investing,
debiasing,
red-team,
thesis,
exits
Author: Assistant
Category: investing-discipline | Model: gpt-4o
No image available
Learning to Rank for Evidence
Train an LTR model to order passages by usefulness. Define features (BM25, dense score, novelty, redundancy), labels, and offline/online eval plan.
Tags:
ranking,
LTR,
features,
labels,
evaluation
Author: Assistant
Category: retrieval-ranking-ml | Model: gpt-4o
No image available
LLM Prompt Registry & Eval Harness
ChatGPT drafts prompts and adversarial tests; Cursor integrates an eval harness; Antigravity schedules nightly evals and posts regressions with diffs. Include versioning and approval flow.
Tags:
LLM,
prompts,
evaluation,
registry,
Cursor,
Antigravity,
ChatGPT
Author: Assistant
Category: mlops-llm-quality | Model: gpt-4o
No image available
Reinforcement Learning for Treatment Policies (Sim)
You are an ML researcher. Propose an offline RL study using simulators: state/action design, causal concerns, OPE (IPS/DR), safety constraints, and prospective evaluation plan.
Tags:
reinforcement-learning,
offline-RL,
healthcare,
simulation,
OPE
Author: Assistant
Category: ml-methods-health | Model: gpt-5
No image available
Whole-Slide Digital Pathology Workflow
Act as a path AI lead. Draft a WSI pipeline: tiling, color normalization, tissue detection, artifact rejection, MIL architectures, and pathologist-in-the-loop review.
Tags:
digital-pathology,
WSI,
MIL,
color-normalization,
QA
Author: Assistant
Category: imaging-ml | Model: gpt-5
No image available
Federated Learning Across Hospitals
Act as a ML systems architect. Propose a federated setup: client sampling, aggregation (FedAvg/FedProx), secure aggregation, FHIR alignment, and cross-site evaluation. Provide success metrics and roll...
Tags:
federated-learning,
privacy,
FHIR,
hospitals,
MLops
Author: Assistant
Category: health-IT-MLops | Model: gpt-5
No image available
Radiomics/Imaging ML Pipeline (DICOM→Model)
You are an imaging scientist. Design a pipeline: DICOM import, segmentation (MONAI), feature extraction, harmonization (ComBat), model training, and calibration. Include a reporting spec.
Tags:
radiomics,
medical-imaging,
MONAI,
DICOM,
ML
Author: Assistant
Category: imaging-ml | Model: gpt-5
No image available
90-Day AI Fundamentals Study Plan
Act as a mentor for students new to AI. Create a 90-day progression covering linear algebra refreshers, Python for ML, PyTorch basics, classic ML, and transformers. Include weekly goals, reading links...
Tags:
AI,
learning,
students,
PyTorch,
transformers
Author: Assistant
Category: education | Model: gpt-4o
No image available
WebGPU Compute for Browser ML
Act as a web-compute mentor. Outline a tutorial to run simple matrix ops and a tiny attention block in WebGPU. Include device feature checks, WGSL snippets, and performance measurement steps.
Tags:
WebGPU,
WGSL,
browser,
ML,
compute
Author: Assistant
Category: software | Model: gpt-4o
No image available
Low‑Carbon ML Training Cookbook
For {{model_family}} training on {{hardware}}:
- Estimate energy/emissions baseline
- Apply: low-rank adapters, distillation, AMP, token pruning, data curation
- Carbon-aware job placement
Provide a t...
Tags:
ML-training,
optimization,
carbon,
efficiency,
GPU
Author: Tsubasa Kato
Category: mlops | Model: gpt-5-thinking
No image available
Small Biz: 14-Day AI Agent Sprint
Act as an AI rollout lead for a sub-50 person company. Deliver a 14-day plan: use cases ranked by ROI/risk, 2 quick-win agents (inbox triage, FAQ/RAG), minimal governance (human-in-the-loop), success ...
Tags:
small,
quick win,
14-day,
sprint,
agents
Author: Tsubasa Kato
Category: Strategy | Model: GPT-5 Thinking
Back to Home