1
|
Basnet BB, Zhou ZY, Wei B, Wang H. Advances in AI-based strategies and tools to facilitate natural product and drug development. Crit Rev Biotechnol 2025:1-32. [PMID: 40159111 DOI: 10.1080/07388551.2025.2478094] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2024] [Revised: 02/11/2025] [Accepted: 02/16/2025] [Indexed: 04/02/2025]
Abstract
Natural products and their derivatives have been important for treating diseases in humans, animals, and plants. However, discovering new structures from natural sources is still challenging. In recent years, artificial intelligence (AI) has greatly aided the discovery and development of natural products and drugs. AI facilitates to: connect genetic data to chemical structures or vice-versa, repurpose known natural products, predict metabolic pathways, and design and optimize metabolites biosynthesis. More recently, the emergence and improvement in neural networks such as deep learning and ensemble automated web based bioinformatics platforms have sped up the discovery process. Meanwhile, AI also improves the identification and structure elucidation of unknown compounds from raw data like mass spectrometry and nuclear magnetic resonance. This article reviews these AI-driven methods and tools, highlighting their practical applications and guide for efficient natural product discovery and drug development.
Collapse
Affiliation(s)
- Buddha Bahadur Basnet
- College of Pharmaceutical Sciences, Zhejiang University of Technology, Hangzhou, China
- Central Department of Biotechnology, Tribhuvan University, Kathmandu, Nepal
| | - Zhen-Yi Zhou
- College of Pharmaceutical Sciences, Zhejiang University of Technology, Hangzhou, China
| | - Bin Wei
- College of Pharmaceutical Sciences, Zhejiang University of Technology, Hangzhou, China
| | - Hong Wang
- College of Pharmaceutical Sciences, Zhejiang University of Technology, Hangzhou, China
- Key Laboratory of Marine Fishery Resources Exploitment, Utilization of Zhejiang Province, Zhejiang University of Technology, Hangzhou, China
| |
Collapse
|
2
|
Pillozzi S, Bernini A, Palchetti I, Crociani O, Antonuzzo L, Campanacci D, Scoccianti G. Soft Tissue Sarcoma: An Insight on Biomarkers at Molecular, Metabolic and Cellular Level. Cancers (Basel) 2021; 13:cancers13123044. [PMID: 34207243 PMCID: PMC8233868 DOI: 10.3390/cancers13123044] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2021] [Revised: 06/13/2021] [Accepted: 06/14/2021] [Indexed: 12/18/2022] Open
Abstract
Simple Summary Soft tissue sarcoma is a rare mesenchymal malignancy. Despite the advancements in the fields of radiology, pathology and surgery, these tumors often recur locally and/or with metastatic disease. STS is considered to be a diagnostic challenge due to the large variety of histological subtypes with clinical and histopathological characteristics which are not always distinct. One of the important clinical problems is a lack of useful biomarkers. Therefore, the discovery of biomarkers that can be used to detect tumors or predict tumor response to chemotherapy or radiotherapy could help clinicians provide more effective clinical management. Abstract Soft tissue sarcomas (STSs) are a heterogeneous group of rare tumors. Although constituting only 1% of all human malignancies, STSs represent the second most common type of solid tumors in children and adolescents and comprise an important group of secondary malignancies. Over 100 histologic subtypes have been characterized to date (occurring predominantly in the trunk, extremity, and retroperitoneum), and many more are being discovered due to molecular profiling. STS mortality remains high, despite adjuvant chemotherapy. New prognostic stratification markers are needed to help identify patients at risk of recurrence and possibly apply more intensive or novel treatments. Recent scientific advancements have enabled a more precise molecular characterization of sarcoma subtypes and revealed novel therapeutic targets and prognostic/predictive biomarkers. This review aims at providing a comprehensive overview of the most relevant cellular, molecular and metabolic biomarkers for STS, and highlight advances in STS-related biomarker research.
Collapse
Affiliation(s)
- Serena Pillozzi
- Medical Oncology Unit, Careggi University Hospital, Largo Brambilla 3, 50134 Florence, Italy;
- Correspondence:
| | - Andrea Bernini
- Department of Biotechnology, Chemistry and Pharmacy, University of Siena, Via Aldo Moro 2, 53100 Siena, Italy;
| | - Ilaria Palchetti
- Department of Chemistry Ugo Schiff, University of Florence, Via della Lastruccia 3, 50019 Sesto Fiorentino, Italy;
| | - Olivia Crociani
- Department of Experimental and Clinical Medicine, University of Florence, Largo Brambilla 3, 50134 Florence, Italy;
| | - Lorenzo Antonuzzo
- Medical Oncology Unit, Careggi University Hospital, Largo Brambilla 3, 50134 Florence, Italy;
- Department of Experimental and Clinical Medicine, University of Florence, Largo Brambilla 3, 50134 Florence, Italy;
| | - Domenico Campanacci
- Department of Health Science, University of Florence, Largo Brambilla 3, 50134 Florence, Italy;
| | - Guido Scoccianti
- Department of Orthopaedic Oncology and Reconstructive Surgery, University of Florence, Careggi University Hospital, Largo Brambilla 3, 50134 Florence, Italy;
| |
Collapse
|
3
|
Pesek M, Juvan A, Jakoš J, Košmrlj J, Marolt M, Gazvoda M. Database Independent Automated Structure Elucidation of Organic Molecules Based on IR, 1H NMR, 13C NMR, and MS Data. J Chem Inf Model 2021; 61:756-763. [PMID: 33378192 PMCID: PMC7903418 DOI: 10.1021/acs.jcim.0c01332] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2020] [Indexed: 01/21/2023]
Abstract
Herein, we report a computational algorithm that follows a spectroscopist-driven elucidation process of the structure of an organic molecule based on IR, 1H and 13C NMR, and MS tabular data. The algorithm is independent from database searching and is based on a bottom-up approach, building the molecular structure from small structural fragments visible in spectra. It employs an analytical combinatorial approach with a graph search technique to determine the connectivity of structural fragments that is based on the analysis of the NMR spectra, to connect the identified structural fragments into a molecular structure. After the process is completed, the interface lists the compound candidates, which are visualized by the WolframAlpha computational knowledge engine within the interface. The candidates are ranked according to the predefined rules for analyzing the spectral data. The developed elucidator has a user-friendly web interface and is publicly available (http://schmarnica.si).
Collapse
Affiliation(s)
- Matevž Pesek
- Faculty
of Computer and Information Science, University
of Ljubljana, Večna Pot 113, SI-1000 Ljubljana, Slovenia
| | - Andraž Juvan
- Faculty
of Computer and Information Science, University
of Ljubljana, Večna Pot 113, SI-1000 Ljubljana, Slovenia
| | - Jure Jakoš
- Faculty
of Chemistry and Chemical Technology, University
of Ljubljana, Večna
Pot 113, SI-1000 Ljubljana, Slovenia
| | - Janez Košmrlj
- Faculty
of Chemistry and Chemical Technology, University
of Ljubljana, Večna
Pot 113, SI-1000 Ljubljana, Slovenia
| | - Matija Marolt
- Faculty
of Computer and Information Science, University
of Ljubljana, Večna Pot 113, SI-1000 Ljubljana, Slovenia
| | - Martin Gazvoda
- Faculty
of Chemistry and Chemical Technology, University
of Ljubljana, Večna
Pot 113, SI-1000 Ljubljana, Slovenia
| |
Collapse
|
4
|
Jia W, Yang Z, Yang M, Cheng L, Lei Z, Wang X. Machine Learning Enhanced Spectrum Recognition Based on Computer Vision (SRCV) for Intelligent NMR Data Extraction. J Chem Inf Model 2020; 61:21-25. [PMID: 33170690 DOI: 10.1021/acs.jcim.0c01046] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
A machine learning enhanced spectrum recognition system called spectrum recognition based on computer vision (SRCV) for data extraction from previously analyzed 13C and 1H NMR spectra has been developed. The intelligent system was designed with four function modules to extract data from three areas of NMR images, including 13C and 1H chemical shifts, the integral, and the range of the shift values. During this study, three machine learning models were pretrained for number recognition, which is the key procedure for NMR data extraction. The k nearest neighbor (kNN) method was selected with optimized k (k = 4), which displayed a 100% recognition rate. Subsequently, the performance of SRCV was tested and validated to have high accuracy with a short processing time (11-21 s) for each NMR spectral image. Our spectrum recognizer enables high-throughput 13C and 1H NMR data extraction from abundant spectra in the literature and has the potential to be used for spectral database construction. In addition, the system may be applicable to be developed for data import to computer-assisted structure elucidation systems, which would automate this procedure significantly. SRCV can be accessed in GitHub (https://github.com/WJmodels/SRCV).
Collapse
Affiliation(s)
- Wenqiang Jia
- State Key Laboratory of Bioactive Substances and Functions of Natural Medicines, Department of Medicinal Chemistry, Beijing Key Laboratory of Active Substances Discovery and Druggability Evaluation, Institute of Materia Medica, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing 100050, China
| | - Zhuo Yang
- State Key Laboratory of Bioactive Substances and Functions of Natural Medicines, Department of Medicinal Chemistry, Beijing Key Laboratory of Active Substances Discovery and Druggability Evaluation, Institute of Materia Medica, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing 100050, China
| | - Minjian Yang
- State Key Laboratory of Bioactive Substances and Functions of Natural Medicines, Department of Medicinal Chemistry, Beijing Key Laboratory of Active Substances Discovery and Druggability Evaluation, Institute of Materia Medica, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing 100050, China
| | - Liang Cheng
- Golden Intelligence Cloud Co., Ltd., Guangzhou 510000, China
| | - Zengrong Lei
- Guangzhou Fermion Technology Co., Ltd., Guangzhou 510000, China
| | - Xiaojian Wang
- State Key Laboratory of Bioactive Substances and Functions of Natural Medicines, Department of Medicinal Chemistry, Beijing Key Laboratory of Active Substances Discovery and Druggability Evaluation, Institute of Materia Medica, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing 100050, China
| |
Collapse
|