PubChem and ChEMBL beyond Lipinski

Capecchi, Alice; Awale, Mahendra; Probst, Daniel; Reymond, Jean-Louis (2019). PubChem and ChEMBL beyond Lipinski. Molecular informatics, 38(5), p. 1900016. Wiley 10.1002/minf.201900016

[img] Text
Capecchi_et_al-2019-Molecular_Informatics.pdf - Published Version
Restricted to registered users only
Available under License Publisher holds Copyright.

Download (8MB) | Request a copy

Seven million of the currently 94 million entries in the PubChem database break at least one of the four Lipinski constraints for oral bioavailability, 183,185 of which are also found in the ChEMBL database. These non‐Lipinski PubChem (NLP) and ChEMBL (NLC) subsets are interesting because they contain new modalities that can display biological properties not accessible to small molecule drugs. Unfortunately, the current search tools in PubChem and ChEMBL are designed for small molecules and are not well suited to explore these subsets, which therefore remain poorly appreciated. Herein we report MXFP (macromolecule extended atom‐pair fingerprint), a 217‐D fingerprint tailored to analyze large molecules in terms of molecular shape and pharmacophores. We implement MXFP in two web‐based applications, the first one to visualize NLP and NLC interactively using Faerun (http://faerun.gdb.tools/), the second one to perform MXFP nearest neighbor searches in NLP and NLC (http://similaritysearch.gdb.tools/). We show that these tools provide a meaningful insight into the diversity of large molecules in NLP and NLC. The interactive tools presented here are publicly available at http://gdb.unibe.ch and can be used freely to explore and better understand the diversity of non‐Lipinski molecules in PubChem and ChEMBL.

Item Type:

Journal Article (Original Article)

Division/Institute:

08 Faculty of Science > Department of Chemistry, Biochemistry and Pharmaceutical Sciences (DCBP)

UniBE Contributor:

Capecchi, Alice, Awale, Mahendra, Probst, Daniel, Reymond, Jean-Louis

Subjects:

500 Science > 570 Life sciences; biology
500 Science > 540 Chemistry

ISSN:

1868-1743

Publisher:

Wiley

Language:

English

Submitter:

Sandra Tanja Zbinden Di Biase

Date Deposited:

20 Jan 2020 10:48

Last Modified:

05 Dec 2022 15:35

Publisher DOI:

10.1002/minf.201900016

BORIS DOI:

10.7892/boris.138396

URI:

https://boris.unibe.ch/id/eprint/138396

Actions (login required)

Edit item Edit item
Provide Feedback