Penalized Q-learning for dynamic treatment regimens

Publication:2950196