MPO has an independent prognostic value overall, most notably in patients who tested negative with a highly sensitive cardiac troponin I assay.

We introduce a new algorithm for reinforcement learning called Maximum a-posteriori Policy Optimisation (MPO) based on coordinate ascent on a relative-entropy