Poster
in
Workshop: AI for Accelerated Materials Design (AI4Mat-2023)
Automatic Generation of Mechanistic Pathways of Organic Reactions with Dual Templates
SHU-AN CHEN · Ramil Babazade · Yousung Jung
Keywords: [ Reaction Dataset ] [ Mechanistic Dataset ] [ Mechanistic Annotation ] [ Organic Reaction Mechanisms ]
Understanding organic reaction mechanisms is crucial for interpreting the formation of products at the atomic and electronic level, but still remains as a domain of knowledgeable experts. The lack of a large-scale dataset with chemically reasonable mechanistic sequences also hinders the development of reliable machine learning models to predict organic reactions based on mechanisms as human chemists do. Here, we propose a method that automatically generates reaction mechanisms of a large dataset of organic reactions using autonomously extracted reaction templates and expert-coded mechanistic templates. By applying this method, we labeled 94.8\% of 33k USPTO reactions into chemically reasonable arrow-pushing diagrams, validated by expert chemists. Our method is simple, flexible, and can be expanded to cover a wider range of reactions, regardless of type or complexity. We envision it becoming an invaluable tool to propose reaction mechanisms, and to develop future reaction outcome prediction models and discover new reactions.