Reaction classification and yield prediction using the differential reaction fingerprint DRFP.

Probst, Daniel; Schwaller, Philippe; Reymond, Jean-Louis (2022). Reaction classification and yield prediction using the differential reaction fingerprint DRFP. Digital discovery, 1(2), pp. 91-97. Royal Society of Chemistry 10.1039/d1dd00006c

[img]
Preview
Text
d1dd00006c.pdf - Published Version
Available under License Creative Commons: Attribution (CC-BY).

Download (1MB) | Preview

Predicting the nature and outcome of reactions using computational methods is a crucial tool to accelerate chemical research. The recent application of deep learning-based learned fingerprints to reaction classification and reaction yield prediction has shown an impressive increase in performance compared to previous methods such as DFT- and structure-based fingerprints. However, learned fingerprints require large training data sets, are inherently biased, and are based on complex deep learning architectures. Here we present the differential reaction fingerprint DRFP. The DRFP algorithm takes a reaction SMILES as an input and creates a binary fingerprint based on the symmetric difference of two sets containing the circular molecular n-grams generated from the molecules listed left and right from the reaction arrow, respectively, without the need for distinguishing between reactants and reagents. We show that DRFP performs better than DFT-based fingerprints in reaction yield prediction and other structure-based fingerprints in reaction classification, reaching the performance of state-of-the-art learned fingerprints in both tasks while being data-independent.

Item Type:

Journal Article (Original Article)

Division/Institute:

08 Faculty of Science > Department of Chemistry, Biochemistry and Pharmaceutical Sciences (DCBP)

UniBE Contributor:

Probst, Daniel, Reymond, Jean-Louis

Subjects:

500 Science > 570 Life sciences; biology
500 Science > 540 Chemistry

ISSN:

2635-098X

Publisher:

Royal Society of Chemistry

Language:

English

Submitter:

Pubmed Import

Date Deposited:

09 May 2022 09:55

Last Modified:

05 Dec 2022 16:19

Publisher DOI:

10.1039/d1dd00006c

PubMed ID:

35515081

BORIS DOI:

10.48350/169844

URI:

https://boris.unibe.ch/id/eprint/169844

Actions (login required)

Edit item Edit item
Provide Feedback