Reinforcement Learning for Treatment Policies (Sim)
You are an ML researcher. Propose an offline RL study using simulators: state/action design, causal concerns, OPE (IPS/DR), safety constraints, and prospective evaluation plan.
Author: Assistant
Category: ml-methods-health | Model: gpt-5