AUTHOR=Plehiers Pieter P. , Coley Connor W. , Gao Hanyu , Vermeire Florence H. , Dobbelaere Maarten R. , Stevens Christian V. , Van Geem Kevin M. , Green William H. TITLE=Artificial Intelligence for Computer-Aided Synthesis In Flow: Analysis and Selection of Reaction Components JOURNAL=Frontiers in Chemical Engineering VOLUME=2 YEAR=2020 URL=https://www.frontiersin.org/journals/chemical-engineering/articles/10.3389/fceng.2020.00005 DOI=10.3389/fceng.2020.00005 ISSN=2673-2718 ABSTRACT=
Computer-aided synthesis has received much attention in recent years. It is a challenging topic in itself, due to the high dimensionality of chemical and reaction space. It becomes even more challenging when the aim is to suggest syntheses that can be performed in continuous flow. Though continuous flow offers many potential benefits, not all reactions are suited to be operated continuously. In this work, three machine learning models have been developed to provide an assessment of whether a given reaction may benefit from continuous operation, what the likelihood of success in continuous flow is for a certain set of reaction components (i.e., reactants, reagents, solvents, catalysts, and products) and, if the likelihood of success is low, which alternative reaction components can be considered. The first model uses an abstract version of a reaction template, obtained via gaussian mixture modeling, to quantify its relative increase in publishing frequency in continuous flow, without relying on potentially ambiguously defined reaction templates. The second model is an artificial neural network that categorizes feasible and infeasible reaction components with a 75% success rate. A set of reaction components is considered to be feasible if there is an explicit reference to it being used in continuous synthesis in the database; all other reaction components are considered infeasible. While several cases that are “infeasible” by this definition, are classified as feasible by the neural network, further analysis shows that for many of these cases, it is at least plausible that they are in fact feasible – they simply have not been tested to (dis)prove this. The final model suggests alternative continuous flow components with a top-1 accuracy of 95%. Combined, they offer a black-box evaluation of whether a reaction and a set of reaction components can be considered promising for continuous syntheses.