Learning conditional policies for crystal design using offline reinforcement learning
Digital Discovery, 2024, 3, 769-785
DOI: 10.1039/D4DD00024B
Paper, Open Access. This article is licensed under a Creative Commons Attribution 3.0 Unported Licence.
Prashant Govindarajan, Santiago Miret, Jarrid Rector-Brooks, Mariano Phielipp, Janarthanan Rajendran, Sarath Chandar
Conservative Q-learning for band-gap conditioned crystal design with DFT evaluations – the model is