1
|
Mihalovits L, Szalai TV, Bajusz D, Keserű GM. Exploring Chemical Spaces in the Billion Range: Is Docking a Computational Alternative to DNA-Encoded Libraries? J Chem Inf Model 2024; 64:8963-8979. [PMID: 39305268 PMCID: PMC11632764 DOI: 10.1021/acs.jcim.4c00803] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2024] [Revised: 09/10/2024] [Accepted: 09/11/2024] [Indexed: 12/10/2024]
Abstract
The concept of DNA-encoded libraries (DELs) enables the experimental screening of billions of compounds simultaneously, offering an unprecedented boost in the coverage of chemical space. In parallel, however, dramatically increased access to supercomputers and a number of ultrahigh throughput virtual screening (uHTVS) tools have made screening of billion-membered virtual libraries available. Here, we investigate whether current, brute-force, or AI-enabled uHTVS approaches might constitute a computational alternative to DEL screening. While it is tempting to look at uHTVS as a computational analogue of DEL screening, we found specific advantages and limitations of both methodologies that suggest them being complementary rather than competitive.
Collapse
Affiliation(s)
- Levente
M. Mihalovits
- Medicinal
Chemistry Research Group and Drug Innovation Centre, HUN-REN Research Centre for Natural Sciences, Magyar tudósok krt. 2, 1117 Budapest, Hungary
| | - Tibor V. Szalai
- Medicinal
Chemistry Research Group and Drug Innovation Centre, HUN-REN Research Centre for Natural Sciences, Magyar tudósok krt. 2, 1117 Budapest, Hungary
- Department
of Inorganic and Analytical Chemistry, Faculty of Chemical Technology
and Biotechnology, Budapest University of
Technology and Economics, Műegyetem rkp. 3., H-1111 Budapest, Hungary
| | - Dávid Bajusz
- Medicinal
Chemistry Research Group and Drug Innovation Centre, HUN-REN Research Centre for Natural Sciences, Magyar tudósok krt. 2, 1117 Budapest, Hungary
| | - György M. Keserű
- Medicinal
Chemistry Research Group and Drug Innovation Centre, HUN-REN Research Centre for Natural Sciences, Magyar tudósok krt. 2, 1117 Budapest, Hungary
- Department
of Organic Chemistry and Technology, Faculty of Chemical Technology
and Biotechnology Budapest University of
Technology and Economics, Műegyetem rkp. 3., H-1111 Budapest, Hungary
| |
Collapse
|
2
|
Wichert M, Guasch L, Franzini RM. Challenges and Prospects of DNA-Encoded Library Data Interpretation. Chem Rev 2024; 124:12551-12572. [PMID: 39508428 DOI: 10.1021/acs.chemrev.4c00284] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2024]
Abstract
DNA-encoded library (DEL) technology is a powerful platform for the efficient identification of novel chemical matter in the early drug discovery process enabled by parallel screening of vast libraries of encoded small molecules through affinity selection and deep sequencing. While DEL selections provide rich data sets for computational drug discovery, the underlying technical factors influencing DEL data remain incompletely understood. This review systematically examines the key parameters affecting the chemical information in DEL data and their impact on hit triaging and machine learning integration. The need for rigorous data handling and interpretation is emphasized, with standardized methods being critical for the success of DEL-based approaches. Major challenges include the relationship between sequence counts and binding affinities, frequent hitters, and the influence of factors such as inhomogeneous library composition, DNA damage, and linkers on binding modes. Experimental artifacts, such as those caused by protein immobilization and screening matrix effects, further complicate data interpretation. Recent advancements in using machine learning to denoise DEL data and predict drug candidates are highlighted. This review offers practical guidance on adopting best practices for integrating robust methodologies, comprehensive data analysis, and computational tools to improve the accuracy and efficacy of DEL-driven hit discovery.
Collapse
Affiliation(s)
- Moreno Wichert
- Roche Pharma Research and Early Development, Roche Innovation Center Basel, F. Hoffmann-La Roche Ltd, Grenzacherstrasse 124, 4070 Basel, Switzerland
| | - Laura Guasch
- Roche Pharma Research and Early Development, Roche Innovation Center Basel, F. Hoffmann-La Roche Ltd, Grenzacherstrasse 124, 4070 Basel, Switzerland
| | - Raphael M Franzini
- Department of Medicinal Chemistry, University of Utah, Salt Lake City, Utah 84112, United States
- Huntsman Cancer Institute, Salt Lake City, Utah 84112, United States
| |
Collapse
|
3
|
Fitzgerald P, Dixit A, Zhang C, Mobley DL, Paegel BM. Building Block-Centric Approach to DNA-Encoded Library Design. J Chem Inf Model 2024; 64:4661-4672. [PMID: 38860710 PMCID: PMC11200258 DOI: 10.1021/acs.jcim.4c00232] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2024] [Revised: 06/02/2024] [Accepted: 06/03/2024] [Indexed: 06/12/2024]
Abstract
DNA-encoded library technology grants access to nearly infinite opportunities to explore the chemical structure space for drug discovery. Successful navigation depends on the design and synthesis of libraries with appropriate physicochemical properties (PCPs) and structural diversity while aligning with practical considerations. To this end, we analyze combinatorial library design constraints including the number of chemistry cycles, bond construction strategies, and building block (BB) class selection in pursuit of ideal library designs. We compare two-cycle library designs (amino acid + carboxylic acid, primary amine + carboxylic acid) in the context of PCPs and chemical space coverage, given different BB selection strategies and constraints. We find that broad availability of amines and acids is essential for enabling the widest exploration of chemical space. Surprisingly, cost is not a driving factor, and virtually, the same chemical space can be explored with "budget" BBs.
Collapse
Affiliation(s)
- Patrick
R. Fitzgerald
- Skaggs
Doctoral Program in the Chemical and Biological Sciences, Scripps Research, La Jolla, California 92037, United States
| | - Anjali Dixit
- Department
of Pharmaceutical Sciences, University of
California, Irvine, California 92697, United States
| | - Chris Zhang
- Department
of Chemistry, University of California, Irvine, California 92697, United States
| | - David L. Mobley
- Department
of Pharmaceutical Sciences, University of
California, Irvine, California 92697, United States
- Department
of Chemistry, University of California, Irvine, California 92697, United States
| | - Brian M. Paegel
- Department
of Pharmaceutical Sciences, University of
California, Irvine, California 92697, United States
- Department
of Chemistry, University of California, Irvine, California 92697, United States
| |
Collapse
|
4
|
Jin H, Cui D, Fan Y, Li G, Zhong Z, Wang Y. Recent advances in bioaffinity strategies for preclinical and clinical drug discovery: Screening natural products, small molecules and antibodies. Drug Discov Today 2024; 29:103885. [PMID: 38278476 DOI: 10.1016/j.drudis.2024.103885] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2023] [Revised: 12/26/2023] [Accepted: 01/11/2024] [Indexed: 01/28/2024]
Abstract
Bioaffinity drug screening strategies have gained popularity in preclinical and clinical drug discovery for natural products, small molecules and antibodies owing to their superior selectivity, the large number of compounds to be screened and their ability to minimize the time and expenses of the drug discovery process. This paper provides a systematic summary of the principles of commonly used bioaffinity-based screening methods, elaborates on the success of bioaffinity in clinical drug development and summarizes the active compounds, preclinical drugs and marketed drugs obtained through affinity screening methods. Owing to the high demand for new drugs, bioaffinity-guided screening techniques will play a greater part in clinical drug development.
Collapse
Affiliation(s)
- Haochun Jin
- Macao Centre for Research and Development in Chinese Medicine, State Key Laboratory of Quality Research in Chinese Medicine, Institute of Chinese Medical Sciences, University of Macau, Macao SAR 999078, China
| | - Dianxin Cui
- Macao Centre for Research and Development in Chinese Medicine, State Key Laboratory of Quality Research in Chinese Medicine, Institute of Chinese Medical Sciences, University of Macau, Macao SAR 999078, China
| | - Yu Fan
- Macao Centre for Research and Development in Chinese Medicine, State Key Laboratory of Quality Research in Chinese Medicine, Institute of Chinese Medical Sciences, University of Macau, Macao SAR 999078, China; Zhuhai UM Science and Technology Research Institute, Zhuhai 519031, China
| | - Guodong Li
- Macao Centre for Research and Development in Chinese Medicine, State Key Laboratory of Quality Research in Chinese Medicine, Institute of Chinese Medical Sciences, University of Macau, Macao SAR 999078, China; Zhuhai UM Science and Technology Research Institute, Zhuhai 519031, China.
| | - Zhangfeng Zhong
- Macao Centre for Research and Development in Chinese Medicine, State Key Laboratory of Quality Research in Chinese Medicine, Institute of Chinese Medical Sciences, University of Macau, Macao SAR 999078, China.
| | - Yitao Wang
- Macao Centre for Research and Development in Chinese Medicine, State Key Laboratory of Quality Research in Chinese Medicine, Institute of Chinese Medical Sciences, University of Macau, Macao SAR 999078, China.
| |
Collapse
|
5
|
Chen B, Sultan MM, Karaletsos T. Compositional Deep Probabilistic Models of DNA-Encoded Libraries. J Chem Inf Model 2024; 64:1123-1133. [PMID: 38335055 DOI: 10.1021/acs.jcim.3c01699] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/12/2024]
Abstract
DNA-encoded library (DEL) has proven to be a powerful tool that utilizes combinatorially constructed small molecules to facilitate highly efficient screening experiments. These selection experiments, involving multiple stages of washing, elution, and identification of potent binders via unique DNA barcodes, often generate complex data. This complexity can potentially mask the underlying signals, necessitating the application of computational tools, such as machine learning, to uncover valuable insights. We introduce a compositional deep probabilistic model of DEL data, DEL-Compose, which decomposes molecular representations into their monosynthon, disynthon, and trisynthon building blocks and capitalizes on the inherent hierarchical structure of these molecules by modeling latent reactions between embedded synthons. Additionally, we investigate methods to improve the observation models for DEL count data, such as integrating covariate factors to more effectively account for data noise. Across two popular public benchmark data sets (CA-IX and HRP), our model demonstrates strong performance compared to count baselines, enriches the correct pharmacophores, and offers valuable insights via its intrinsic interpretable structure, thereby providing a robust tool for the analysis of DEL data.
Collapse
Affiliation(s)
- Benson Chen
- Insitro, South San Francisco, California 94080, United States
| | | | | |
Collapse
|