Computer-aided design of multi-enzyme networks
Computer-aided design of multi-enzyme networks
Disciplines
Biology (50%); Computer Sciences (30%); Mathematics (20%)
Keywords
-
Graph Theory,
Machine learning,
Biocatalysis,
Multi-Enzyme Networks,
Cheminformatics,
Retrosynthesis
The use of enzymes to manufacture chemicals has enjoyed increasing popularity during the last years, providing an eco-friendly alternative to conventional synthesis. The project targeted the increased use of enzymatic reactions through the computer-aided prediction of the reactivity and substrate range of various enzymes, as well as enzymatic synthesis pathways to a given chemical. An extensive dataset of enzymatic reactions was curated and validated from literature, which is now building the foundation of many heuristic and machine learning models of enzymatic reactivity. This work allowed, for the first time, an accurate prediction of enzymatic reactions across many different enzyme and reaction types, demonstrating that enzymatic reactions, and thus enzymatic synthesis pathways, can be predicted on the computer. The models underlying these predictions were mainly based on machine learning, where new reaction representations and model architectures were developed to facilitate the prediction of enzymatic reactivity. To this aim, the existing concept of graph-based machine learning for molecules, where each molecule is represented as a graph (a collection of nodes representing atoms, and edges connecting the nodes representing bonds between atoms) was extended to reactions. In this way, the concept of graph-convolutional neural networks, a type of machine learning model where the property of a molecule is learned from the local information in the molecular graph, became applicable to enzymatic reactions for the prediction of the reactivity and selectivity of enzymes. Also, existing frameworks for computer-aided retrosynthesis, i.e. the task of teaching a computer how to identify possibly synthesis routes to a specified chemical, were adapted to enzymatic reactions. Together with the new database described above, this yielded accurate models for enzymatic synthesis planning for the first time. Further work incorporated the quantification of errors and uncertainties in such machine learning models, so that a user can be informed about uncertain, and thus possibly wrong predictions, by the model. The project thus enabled the data-driven, accurate prediction of enzymatic reactions on the computer, and advanced this important field of research, namely to harness machine learning and deep learning to enable a more eco-friendly, sustainable chemical synthesis.
- Andrew Griffiths, ESPCI ParisTech - France
Research Output
- 285 Citations
- 30 Publications
- 9 Datasets & models
- 3 Scientific Awards
-
2021
Title Machine learning of reaction properties via learned representations of the condensed graph of reaction DOI 10.33774/chemrxiv-2021-frfhz Type Preprint Author Heid E Link Publication -
2021
Title Regio-selectivity prediction with a machine-learned reaction representation and on-the-fly quantum mechanical descriptors DOI 10.1039/d0sc04823b Type Journal Article Author Guan Y Journal Chemical Science Pages 2198-2208 Link Publication -
2022
Title On the Value of Using 3D Shape and Electrostatic Similarities in Deep Generative Methods DOI 10.1021/acs.jcim.1c01535 Type Journal Article Author Bolcato G Journal Journal of Chemical Information and Modeling Pages 1388-1398 Link Publication -
2022
Title On the value of using 3D-shape and electrostatic similarities in deep generative methods DOI 10.26434/chemrxiv-2021-sqvv9-v3 Type Preprint Author Bolcato G Link Publication -
2021
Title Influence of Template Size, Canonicalization, and Exclusivity for Retrosynthesis and Reaction Prediction Applications DOI 10.1021/acs.jcim.1c01192 Type Journal Article Author Heid E Journal Journal of Chemical Information and Modeling Pages 16-26 Link Publication -
2021
Title Machine Learning of Reaction Properties via Learned Representations of the Condensed Graph of Reaction DOI 10.1021/acs.jcim.1c00975 Type Journal Article Author Heid E Journal Journal of Chemical Information and Modeling Pages 2101-2110 Link Publication -
2021
Title Machine Learning of Reaction Properties via Learned Representations of the Condensed Graph of Reaction DOI 10.33774/chemrxiv-2021-frfhz-v2 Type Preprint Author Heid E Link Publication -
2022
Title Machine-Learning-Guided Discovery of Electrochemical Reactions DOI 10.1021/jacs.2c08997 Type Journal Article Author Zahrt A Journal Journal of the American Chemical Society Pages 22599-22610 Link Publication -
2021
Title EHreact: Extended Hasse Diagrams for the Extraction and Scoring of Enzymatic Reaction Templates DOI 10.26434/chemrxiv.14714748.v1 Type Preprint Author Heid E Link Publication -
2021
Title EHreact: Extended Hasse Diagrams for the Extraction and Scoring of Enzymatic Reaction Templates DOI 10.33774/chemrxiv-2021-jxxbh-v2 Type Preprint Author Heid E Link Publication -
2021
Title EHreact: Extended Hasse Diagrams for the Extraction and Scoring of Enzymatic Reaction Templates DOI 10.1021/acs.jcim.1c00921 Type Journal Article Author Heid E Journal Journal of Chemical Information and Modeling Pages 4949-4961 Link Publication -
2021
Title On the influence of template size, canonicalization and exclusivity for retrosynthesis and reaction prediction applications DOI 10.33774/chemrxiv-2021-9s7gj Type Preprint Author Heid E Link Publication -
2023
Title Deep Ensembles vs. Committees for Uncertainty Estimation in Neural-Network Force Fields: Comparison and Application to Active Learning DOI 10.48550/arxiv.2302.08805 Type Other Author Carrete J Link Publication -
2024
Title Chemprop: A Machine Learning Package for Chemical Property Prediction. DOI 10.1021/acs.jcim.3c01250 Type Journal Article Author Greenman Kp Journal Journal of chemical information and modeling Pages 9-17 -
2023
Title Characterizing Uncertainty in Machine Learning for Chemistry. DOI 10.1021/acs.jcim.3c00373 Type Journal Article Author Heid E Journal Journal of chemical information and modeling Pages 4012-4029 -
2023
Title Deep ensembles vs committees for uncertainty estimation in neural-network force fields: Comparison and application to active learning. DOI 10.1063/5.0146905 Type Journal Article Author Carrete J Journal The Journal of chemical physics -
2023
Title Characterizing Uncertainty in Machine Learning for Chemistry DOI 10.26434/chemrxiv-2023-00vcg Type Preprint Author Heid E -
2023
Title Characterizing Uncertainty in Machine Learning for Chemistry DOI 10.26434/chemrxiv-2023-00vcg-v2 Type Preprint Author Heid E -
2023
Title Characterizing Uncertainty in Machine Learning for Chemistry DOI 10.26434/chemrxiv-2023-00vcg-v3 Type Preprint Author Heid E -
2023
Title Chemprop: Machine Learning Package for Chemical Property Prediction DOI 10.26434/chemrxiv-2023-3zcfl Type Preprint Author Greenman K -
2023
Title EnzymeMap: Curation, validation and data-driven prediction of enzymatic reactions DOI 10.26434/chemrxiv-2023-jzw9w Type Preprint Author Heid E -
2023
Title EnzymeMap: Curation, validation and data-driven prediction of enzymatic reactions DOI 10.26434/chemrxiv-2023-jzw9w-v2 Type Preprint Author Heid E -
2023
Title Chemprop: A Machine Learning Package for Chemical Property Prediction DOI 10.26434/chemrxiv-2023-3zcfl-v3 Type Preprint Author Greenman K -
2023
Title Chemprop: A Machine Learning Package for Chemical Property Prediction DOI 10.26434/chemrxiv-2023-3zcfl-v2 Type Preprint Author Greenman K -
2023
Title EnzymeMap: curation, validation and data-driven prediction of enzymatic reactions. DOI 10.1039/d3sc02048g Type Journal Article Author Heid E Journal Chemical science Pages 14229-14242 -
2022
Title Similarity based enzymatic retrosynthesis DOI 10.1039/d2sc01588a Type Journal Article Author Sankaranarayanan K Journal Chemical Science Pages 6039-6053 Link Publication -
2021
Title On the influence of template size, canonicalization and exclusivity for retrosynthesis and reaction prediction applications DOI 10.26434/chemrxiv-2021-9s7gj Type Preprint Author Heid E Link Publication -
2021
Title Machine learning of reaction properties via learned representations of the condensed graph of reaction DOI 10.26434/chemrxiv-2021-frfhz Type Preprint Author Heid E Link Publication -
2021
Title Machine Learning of Reaction Properties via Learned Representations of the Condensed Graph of Reaction DOI 10.26434/chemrxiv-2021-frfhz-v2 Type Preprint Author Heid E Link Publication -
2021
Title EHreact: Extended Hasse Diagrams for the Extraction and Scoring of Enzymatic Reaction Templates DOI 10.26434/chemrxiv-2021-jxxbh-v2 Type Preprint Author Heid E Link Publication
-
2023
Link
Title EnzymeMap DOI 10.5281/zenodo.7841780 Type Database/Collection of data Public Access Link Link -
2023
Link
Title Benchmark Data for Chemprop DOI 10.5281/zenodo.8174267 Type Database/Collection of data Public Access Link Link -
2023
Link
Title EnzymeMap Python package Type Computer model/algorithm Public Access Link Link -
2022
Link
Title ESPsim Type Computer model/algorithm Public Access Link Link -
2021
Link
Title Chemprop Type Computer model/algorithm Public Access Link Link -
2021
Link
Title EHreact Type Computer model/algorithm Public Access Link Link -
2021
Link
Title Templatecorr Type Computer model/algorithm Public Access Link Link -
2021
Link
Title Enzymatic assay data Type Database/Collection of data Public Access Link Link -
2021
Link
Title Reaction Database Type Database/Collection of data Public Access Link Link
-
2023
Title AI4ChemMat Hands-On Series: Deep learning of reaction properties via graph-convolutional neural nets Type Personally asked as a key note speaker to a conference Level of Recognition Continental/International -
2022
Title Austrian Marshall Plan Foundation Poster Award (3rd place) Type Poster/abstract prize Level of Recognition National (any country) -
2022
Title RSC CICAG Open Source Tools for Chemistry: Scoring of shape and ESP similarity Type Personally asked as a key note speaker to a conference Level of Recognition Continental/International