Kreutter, David; Reymond, Jean-Louis (2023). Multistep retrosynthesis combining a disconnection aware triple transformer loop with a route penalty score guided tree search. Chemical Science, 14(36), pp. 9959-9969. The Royal Society of Chemistry 10.1039/d3sc01604h
|
Text
d3sc01604h.pdf - Published Version Available under License Publisher holds Copyright. Download (1MB) | Preview |
Computer-aided synthesis planning (CASP) aims to automatically learn organic reactivity from literature and perform retrosynthesis of unseen molecules. CASP systems must learn reactions sufficiently precisely to propose realistic disconnections, while avoiding overfitting to leave room for diverse options, and explore possible routes such as to allow short synthetic sequences to emerge. Herein we report an open-source CASP tool proposing original solutions to both challenges. First, we use a triple transformer loop (TTL) predicting starting materials (T1), reagents (T2), and products (T3) to explore various disconnection sites defined by combining systematic, template-based, and transformer-based tagging procedures. Second, we integrate TTL into a multistep tree search algorithm (TTLA) prioritizing sequences using a route penalty score (RPScore) considering the number of steps, their confidence score, and the simplicity of all intermediates along the route. Our approach favours short synthetic routes to commercial starting materials, as exemplified by retrosynthetic analyses of recently approved drugs.
Item Type: |
Journal Article (Original Article) |
---|---|
Division/Institute: |
08 Faculty of Science > Department of Chemistry, Biochemistry and Pharmaceutical Sciences (DCBP) |
UniBE Contributor: |
Kreutter, David Patrick Joseph, Reymond, Jean-Louis |
Subjects: |
500 Science > 570 Life sciences; biology 500 Science > 540 Chemistry |
ISSN: |
2041-6520 |
Publisher: |
The Royal Society of Chemistry |
Language: |
English |
Submitter: |
Pubmed Import |
Date Deposited: |
25 Sep 2023 14:48 |
Last Modified: |
25 Sep 2023 14:57 |
Publisher DOI: |
10.1039/d3sc01604h |
PubMed ID: |
37736648 |
BORIS DOI: |
10.48350/186515 |
URI: |
https://boris.unibe.ch/id/eprint/186515 |