Graph explorer

Interpretable Optimal Stopping

Optimal stopping is the problem of deciding when to stop a stochastic system to obtain the greatest reward, arising in numerous application areas such as finance, healthcare and marketing. State-of-the-art methods for high-dimensional optimal stopping involve approximating the value function or the continuation value, and then using that approximation within a greedy policy. Although such policies can perform very well, they are generally not guaranteed to be interpretable; that is, a decision maker may not be able to easily see the link between the current system state and the policy's action. In this paper, we propose a new approach to optimal stopping, wherein the policy is represented as a binary tree, in the spirit of naturally interpretable tree models commonly used in machine learning. We show that the class of tree policies is rich enough to approximate the optimal policy. We formulate the problem of learning such policies from observed trajectories of the stochastic system as a sample average approximation (SAA) problem. We prove that the SAA problem converges under mild conditions as the sample size increases, but that computationally even immediate simplifications of

6 nodes8 linksoverview previewInterpretable Optimal Stopping
6 nodes8 links
Interpretable Optimal Stopping6 visible / 6 total nodes / 9 links
Related contextRelated contextRelated contextCo-authorshipAuthorshipAuthorshipTopic signalTopic signalTopic signalWInterpretable Optimal Stoppingpreprint / 2019ADragos Florin CiocanResearcherAVelibor V. MišićResearcherTMachine Learning49008 worksTArtificial Intelligence22915 worksTmath.OC9232 works
PaperSignal 105 links

Interpretable Optimal Stopping

preprint / 2019

Open