1
|
Applying the digital data and the bioinformatics tools in SARS-CoV-2 research. Comput Struct Biotechnol J 2023; 21:4697-4705. [PMID: 37841328 PMCID: PMC10568291 DOI: 10.1016/j.csbj.2023.09.044] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2023] [Revised: 09/29/2023] [Accepted: 09/29/2023] [Indexed: 10/17/2023] Open
Abstract
Bioinformatics has been playing a crucial role in the scientific progress to fight against the pandemic of the coronavirus disease 2019 (COVID-19) caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). The advances in novel algorithms, mega data technology, artificial intelligence and deep learning assisted the development of novel bioinformatics tools to analyze daily increasing SARS-CoV-2 data in the past years. These tools were applied in genomic analyses, evolutionary tracking, epidemiological analyses, protein structure interpretation, studies in virus-host interaction and clinical performance. To promote the in-silico analysis in the future, we conducted a review which summarized the databases, web services and software applied in SARS-CoV-2 research. Those digital resources applied in SARS-CoV-2 research may also potentially contribute to the research in other coronavirus and non-coronavirus viruses.
Collapse
|
2
|
The role of machine learning in health policies during the COVID-19 pandemic and in long COVID management. Front Public Health 2023; 11:1140353. [PMID: 37113165 PMCID: PMC10126380 DOI: 10.3389/fpubh.2023.1140353] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2023] [Accepted: 03/20/2023] [Indexed: 04/29/2023] Open
Abstract
The ongoing COVID-19 pandemic is arguably one of the most challenging health crises in modern times. The development of effective strategies to control the spread of SARS-CoV-2 were major goals for governments and policy makers. Mathematical modeling and machine learning emerged as potent tools to guide and optimize the different control measures. This review briefly summarizes the SARS-CoV-2 pandemic evolution during the first 3 years. It details the main public health challenges focusing on the contribution of mathematical modeling to design and guide government action plans and spread mitigation interventions of SARS-CoV-2. Next describes the application of machine learning methods in a series of study cases, including COVID-19 clinical diagnosis, the analysis of epidemiological variables, and drug discovery by protein engineering techniques. Lastly, it explores the use of machine learning tools for investigating long COVID, by identifying patterns and relationships of symptoms, predicting risk indicators, and enabling early evaluation of COVID-19 sequelae.
Collapse
|
3
|
Bioinformatics-based investigation on the genetic influence between SARS-CoV-2 infections and idiopathic pulmonary fibrosis (IPF) diseases, and drug repurposing. Sci Rep 2023; 13:4685. [PMID: 36949176 PMCID: PMC10031699 DOI: 10.1038/s41598-023-31276-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2022] [Accepted: 03/09/2023] [Indexed: 03/24/2023] Open
Abstract
Some recent studies showed that severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infections and idiopathic pulmonary fibrosis (IPF) disease might stimulate each other through the shared genes. Therefore, in this study, an attempt was made to explore common genomic biomarkers for SARS-CoV-2 infections and IPF disease highlighting their functions, pathways, regulators and associated drug molecules. At first, we identified 32 statistically significant common differentially expressed genes (cDEGs) between disease (SARS-CoV-2 and IPF) and control samples of RNA-Seq profiles by using a statistical r-package (edgeR). Then we detected 10 cDEGs (CXCR4, TNFAIP3, VCAM1, NLRP3, TNFAIP6, SELE, MX2, IRF4, UBD and CH25H) out of 32 as the common hub genes (cHubGs) by the protein-protein interaction (PPI) network analysis. The cHubGs regulatory network analysis detected few key TFs-proteins and miRNAs as the transcriptional and post-transcriptional regulators of cHubGs. The cDEGs-set enrichment analysis identified some crucial SARS-CoV-2 and IPF causing common molecular mechanisms including biological processes, molecular functions, cellular components and signaling pathways. Then, we suggested the cHubGs-guided top-ranked 10 candidate drug molecules (Tegobuvir, Nilotinib, Digoxin, Proscillaridin, Simeprevir, Sorafenib, Torin 2, Rapamycin, Vancomycin and Hesperidin) for the treatment against SARS-CoV-2 infections with IFP diseases as comorbidity. Finally, we investigated the resistance performance of our proposed drug molecules compare to the already published molecules, against the state-of-the-art alternatives publicly available top-ranked independent receptors by molecular docking analysis. Molecular docking results suggested that our proposed drug molecules would be more effective compare to the already published drug molecules. Thus, the findings of this study might be played a vital role for diagnosis and therapies of SARS-CoV-2 infections with IPF disease as comorbidity risk.
Collapse
|
4
|
Investigation of the dual role of acyl phloroglucinols as a new hope for antibacterial and anti-SARS-CoV-2 agents employing integrated in vitro and multi-phase in silico approaches. J Biomol Struct Dyn 2023; 41:15067-15084. [PMID: 36927239 DOI: 10.1080/07391102.2023.2186712] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2022] [Accepted: 02/23/2023] [Indexed: 03/18/2023]
Abstract
With the rapid spread of multi-drug-resistant bacteria and more infectious or aggressive variants of SARS-CoV-2, it is critical to develop drugs that can quickly adapt to evolving bacterial and viral mutations. In this study, encouraged by nature, we synthesized a series of phloroglucinol (PG) derivatives, acyl phloroglucinols (ACPLs) 2a-4c by mimicking the structure of the natural antifungal 2,4-diacetylphloroglucinol 3a (2,4-DAPG). According to the quantum chemical calculation, these compounds were expected to be exceptionally favourable for intermolecular interaction with protein receptors. Intriguingly, the in vitro study of Staphylococcus aureus (S. aureus) ATCC 25923 and Escherichia coli (E. coli) ATCC 25922, showed that the four ACPLs 3a-4a had good antibacterial activity and selectivity against gram-positive bacteria, S. aureus. These results were then supported by in silico molecular docking and molecular dynamics simulations (MDs) between these potent compounds and the S. aureus FtsA protein (PDB ID: 3WQU). Furthermore, with the aid of the knowledge base Virus-CKB along with the molecular docking study, it was found that the three ACPLs 4a-4c showed potential inhibitors against SARS-CoV-2 PLpro (PDB ID: 7CMD). Additionally, 100 ns of MDs was carried out in order to assess the stability behaviour of ACPLs at the docked site. Moreover, in silico ADME/T and drug likeness of all studied ACPLs were also predicted. Finally, shape-based screening of FDA-approved drugs was performed using the most prominent synthesized ACPLs as a template, enabling us to include several medications that could be utilized as antibacterial and antiviral drug candidates.Communicated by Ramaswamy H. Sarma.
Collapse
|
5
|
Identification of host genomic biomarkers from multiple transcriptomics datasets for diagnosis and therapies of SARS-CoV-2 infections. PLoS One 2023; 18:e0281981. [PMID: 36913345 PMCID: PMC10010564 DOI: 10.1371/journal.pone.0281981] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2022] [Accepted: 02/05/2023] [Indexed: 03/14/2023] Open
Abstract
The pandemic of COVID-19 is a severe threat to human life and the global economy. Despite the success of vaccination efforts in reducing the spread of the virus, the situation remains largely uncontrolled due to the random mutation in the RNA sequence of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), which demands different variants of effective drugs. Disease-causing gene-mediated proteins are usually used as receptors to explore effective drug molecules. In this study, we analyzed two different RNA-Seq and one microarray gene expression profile datasets by integrating EdgeR, LIMMA, weighted gene co-expression network and robust rank aggregation approaches, which revealed SARS-CoV-2 infection causing eight hub-genes (HubGs) including HubGs; REL, AURKA, AURKB, FBXL3, OAS1, STAT4, MMP2 and IL6 as the host genomic biomarkers. Gene Ontology and pathway enrichment analyses of HubGs significantly enriched some crucial biological processes, molecular functions, cellular components and signaling pathways that are associated with the mechanisms of SARS-CoV-2 infections. Regulatory network analysis identified top-ranked 5 TFs (SRF, PBX1, MEIS1, ESR1 and MYC) and 5 miRNAs (hsa-miR-106b-5p, hsa-miR-20b-5p, hsa-miR-93-5p, hsa-miR-106a-5p and hsa-miR-20a-5p) as the key transcriptional and post-transcriptional regulators of HubGs. Then, we conducted a molecular docking analysis to determine potential drug candidates that could interact with HubGs-mediated receptors. This analysis resulted in the identification of top-ranked ten drug agents, including Nilotinib, Tegobuvir, Digoxin, Proscillaridin, Olysio, Simeprevir, Hesperidin, Oleanolic Acid, Naltrindole and Danoprevir. Finally, we investigated the binding stability of the top-ranked three drug molecules Nilotinib, Tegobuvir and Proscillaridin with the three top-ranked proposed receptors (AURKA, AURKB, OAS1) by using 100 ns MD-based MM-PBSA simulations and observed their stable performance. Therefore, the findings of this study might be useful resources for diagnosis and therapies of SARS-CoV-2 infections.
Collapse
|
6
|
The Spike Mutants Website: A Worldwide Used Resource against SARS-CoV-2. Int J Mol Sci 2022; 23:13082. [PMID: 36361870 PMCID: PMC9654198 DOI: 10.3390/ijms232113082] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2022] [Revised: 10/24/2022] [Accepted: 10/24/2022] [Indexed: 07/22/2023] Open
Abstract
A large number of SARS-CoV-2 mutations in a short period of time has driven scientific research related to vaccines, new drugs, and antibodies to combat the new variants of the virus. Herein, we present a web portal containing the structural information, the tridimensional coordinates, and the molecular dynamics trajectories of the SARS-CoV-2 spike protein and its main variants. The Spike Mutants website can serve as a rapid online tool for investigating the impact of novel mutations on virus fitness. Taking into account the high variability of SARS-CoV-2, this application can help the scientific community when prioritizing molecules for experimental assays, thus, accelerating the identification of promising drug candidates for COVID-19 treatment. Below we describe the main features of the platform and illustrate the possible applications for speeding up the drug discovery process and hypothesize new effective strategies to overcome the recurrent mutations in SARS-CoV-2 genome.
Collapse
|
7
|
Virus-CKB 2.0: Viral-Associated Disease-Specific Chemogenomics Knowledgebase. ACS OMEGA 2022; 7:37476-37484. [PMID: 36312370 PMCID: PMC9609052 DOI: 10.1021/acsomega.2c04258] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/06/2022] [Accepted: 09/26/2022] [Indexed: 06/16/2023]
Abstract
Transmissible and infectious viruses can cause large-scale epidemics around the world. This is because the virus can constantly mutate and produce different variants and subvariants to counter existing treatments. Therefore, a variety of treatments are urgently needed to keep up with the mutation of the viruses. To facilitate the research of such treatment, we updated our Virus-CKB 1.0 to Virus-CKB 2.0, which contains 10 kinds of viruses, including enterovirus, dengue virus, hepatitis C virus, Zika virus, herpes simplex virus, Andes orthohantavirus, human immunodeficiency virus, Ebola virus, Lassa virus, influenza virus, coronavirus, and norovirus. To date, Virus-CKB 2.0 archived at least 65 antiviral drugs (such as remdesivir, telaprevir, acyclovir, boceprevir, and nelfinavir) in the market, 178 viral-related targets with 292 available 3D crystal or cryo-EM structures, and 3766 chemical agents reported for these target proteins. Virus-CKB 2.0 is integrated with established tools for target prediction and result visualization; these include HTDocking, TargetHunter, blood-brain barrier (BBB) predictor, Spider Plot, etc. The Virus-CKB 2.0 server is accessible at https://www.cbligand.org/g/virus-ckb. By using the established chemogenomic tools and algorithms and newly developed tools, we can screen FDA-approved drugs and chemical compounds that may bind to these proteins involved in viral-associated disease regulation. If the virus strain mutates and the vaccine loses its effect, we can still screen drugs that can be used to treat the mutated virus in a fleeting time. In some cases, we can even repurpose FDA-approved drugs through Virus-CKB 2.0.
Collapse
|
8
|
Elucidation of Binding Features and Dissociation Pathways of Inhibitors and Modulators in SARS-CoV-2 Main Protease by Multiple Molecular Dynamics Simulations. Molecules 2022; 27:molecules27206823. [PMID: 36296416 PMCID: PMC9609290 DOI: 10.3390/molecules27206823] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2022] [Revised: 10/03/2022] [Accepted: 10/10/2022] [Indexed: 01/18/2023] Open
Abstract
COVID-19 can cause different neurological symptoms in some people, including smell, inability to taste, dizziness, confusion, delirium, seizures, stroke, etc. Owing to the issue of vaccine effectiveness, update and coverage, we still need one or more diversified strategies as the backstop to manage illness. Characterizing the structural basis of ligand recognition in the main protease (Mpro) of SARS-CoV-2 will facilitate its rational design and development of potential drug candidates with high affinity and selectivity against COVID-19. Up to date, covalent-, non-covalent inhibitors and allosteric modulators have been reported to bind to different active sites of Mpro. In the present work, we applied the molecular dynamics (MD) simulations to systematically characterize the potential binding features of catalytic active site and allosteric binding sites in Mpro using a dataset of 163 3D structures of Mpro-inhibitor complexes, in which our results are consistent with the current studies. In addition, umbrella sampling (US) simulations were used to explore the dissociation processes of substrate pathway and allosteric pathway. All the information provided new insights into the protein features of Mpro and will facilitate its rational drug design for COVID-19.
Collapse
|
9
|
Abstract
Drug repurposing has become an effective approach to drug discovery, as it offers a new way to explore drugs. Based on the Science Citation Index Expanded (SCI-E) and Social Sciences Citation Index (SSCI) databases of the Web of Science core collection, this study presents a bibliometric analysis of drug repurposing publications from 2010 to 2020. Data were cleaned, mined, and visualized using Derwent Data Analyzer (DDA) software. An overview of the history and development trend of the number of publications, major journals, major countries, major institutions, author keywords, major contributors, and major research fields is provided. There were 2,978 publications included in the study. The findings show that the United States leads in this area of research, followed by China, the United Kingdom, and India. The Chinese Academy of Science published the most research studies, and NIH ranked first on the h-index. The Icahn School of Medicine at Mt Sinai leads in the average number of citations per study. Sci Rep, Drug Discov. Today, and Brief. Bioinform. are the three most productive journals evaluated from three separate perspectives, and pharmacology and pharmacy are unquestionably the most commonly used subject categories. Cheng, FX; Mucke, HAM; and Butte, AJ are the top 20 most prolific and influential authors. Keyword analysis shows that in recent years, most research has focused on drug discovery/drug development, COVID-19/SARS-CoV-2/coronavirus, molecular docking, virtual screening, cancer, and other research areas. The hotspots have changed in recent years, with COVID-19/SARS-CoV-2/coronavirus being the most popular topic for current drug repurposing research.
Collapse
|
10
|
Structures of the SARS-CoV-2 spike glycoprotein and applications for novel drug development. Front Pharmacol 2022; 13:955648. [PMID: 36016554 PMCID: PMC9395726 DOI: 10.3389/fphar.2022.955648] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2022] [Accepted: 07/13/2022] [Indexed: 12/14/2022] Open
Abstract
COVID-19 caused by SARS-CoV-2 has raised a health crisis worldwide. The high morbidity and mortality associated with COVID-19 and the lack of effective drugs or vaccines for SARS-CoV-2 emphasize the urgent need for standard treatment and prophylaxis of COVID-19. The receptor-binding domain (RBD) of the glycosylated spike protein (S protein) is capable of binding to human angiotensin-converting enzyme 2 (hACE2) and initiating membrane fusion and virus entry. Hence, it is rational to inhibit the RBD activity of the S protein by blocking the RBD interaction with hACE2, which makes the glycosylated S protein a potential target for designing and developing antiviral agents. In this study, the molecular features of the S protein of SARS-CoV-2 are highlighted, such as the structures, functions, and interactions of the S protein and ACE2. Additionally, computational tools developed for the treatment of COVID-19 are provided, for example, algorithms, databases, and relevant programs. Finally, recent advances in the novel development of antivirals against the S protein are summarized, including screening of natural products, drug repurposing and rational design. This study is expected to provide novel insights for the efficient discovery of promising drug candidates against the S protein and contribute to the development of broad-spectrum anti-coronavirus drugs to fight against SARS-CoV-2.
Collapse
|
11
|
D3AI-CoV: a deep learning platform for predicting drug targets and for virtual screening against COVID-19. Brief Bioinform 2022; 23:6571526. [PMID: 35443040 PMCID: PMC9310271 DOI: 10.1093/bib/bbac147] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2022] [Revised: 03/13/2022] [Accepted: 03/31/2022] [Indexed: 11/25/2022] Open
Abstract
Target prediction and virtual screening are two powerful tools of computer-aided drug design. Target identification is of great significance for hit discovery, lead optimization, drug repurposing and elucidation of the mechanism. Virtual screening can improve the hit rate of drug screening to shorten the cycle of drug discovery and development. Therefore, target prediction and virtual screening are of great importance for developing highly effective drugs against COVID-19. Here we present D3AI-CoV, a platform for target prediction and virtual screening for the discovery of anti-COVID-19 drugs. The platform is composed of three newly developed deep learning-based models i.e., MultiDTI, MPNNs-CNN and MPNNs-CNN-R models. To compare the predictive performance of D3AI-CoV with other methods, an external test set, named Test-78, was prepared, which consists of 39 newly published independent active compounds and 39 inactive compounds from DrugBank. For target prediction, the areas under the receiver operating characteristic curves (AUCs) of MultiDTI and MPNNs-CNN models are 0.93 and 0.91, respectively, whereas the AUCs of the other reported approaches range from 0.51 to 0.74. For virtual screening, the hit rate of D3AI-CoV is also better than other methods. D3AI-CoV is available for free as a web application at http://www.d3pharma.com/D3Targets-2019-nCoV/D3AI-CoV/index.php, which can serve as a rapid online tool for predicting potential targets for active compounds and for identifying active molecules against a specific target protein for COVID-19 treatment.
Collapse
|
12
|
Databases, Knowledgebases, and Software Tools for Virus Informatics. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2022; 1368:1-19. [DOI: 10.1007/978-981-16-8969-7_1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
13
|
Advances in the computational landscape for repurposed drugs against COVID-19. Drug Discov Today 2021; 26:2800-2815. [PMID: 34339864 PMCID: PMC8323501 DOI: 10.1016/j.drudis.2021.07.026] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2021] [Revised: 04/30/2021] [Accepted: 07/26/2021] [Indexed: 02/07/2023]
Abstract
The COVID-19 pandemic has caused millions of deaths and massive societal distress worldwide. Therapeutic solutions are urgently needed, but de novo drug development remains a lengthy process. One promising alternative is computational drug repurposing, which enables the prioritization of existing compounds through fast in silico analyses. Recent efforts based on molecular docking, machine learning, and network analysis have produced actionable predictions. Some predicted drugs, targeting viral proteins and pathological host pathways are undergoing clinical trials. Here, we review this work, highlight drugs with high predicted efficacy and classify their mechanisms of action. We discuss the strengths and limitations of the published methodologies and outline possible future directions. Finally, we curate a list of COVID-19 data portals and other repositories that could be used to accelerate future research.
Collapse
|
14
|
Structure-based drug repurposing against COVID-19 and emerging infectious diseases: methods, resources and discoveries. Brief Bioinform 2021; 22:bbab113. [PMID: 33993214 PMCID: PMC8194848 DOI: 10.1093/bib/bbab113] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2020] [Revised: 02/15/2021] [Accepted: 03/13/2021] [Indexed: 01/09/2023] Open
Abstract
To attain promising pharmacotherapies, researchers have applied drug repurposing (DR) techniques to discover the candidate medicines to combat the coronavirus disease 2019 (COVID-19) outbreak. Although many DR approaches have been introduced for treating different diseases, only structure-based DR (SBDR) methods can be employed as the first therapeutic option against the COVID-19 pandemic because they rely on the rudimentary information about the diseases such as the sequence of the severe acute respiratory syndrome coronavirus 2 genome. Hence, to try out new treatments for the disease, the first attempts have been made based on the SBDR methods which seem to be among the proper choices for discovering the potential medications against the emerging and re-emerging infectious diseases. Given the importance of SBDR approaches, in the present review, well-known SBDR methods are summarized, and their merits are investigated. Then, the databases and software applications, utilized for repurposing the drugs against COVID-19, are introduced. Besides, the identified drugs are categorized based on their targets. Finally, a comparison is made between the SBDR approaches and other DR methods, and some possible future directions are proposed.
Collapse
|
15
|
COVID19db: a comprehensive database platform to discover potential drugs and targets of COVID-19 at whole transcriptomic scale. Nucleic Acids Res 2021; 50:D747-D757. [PMID: 34554255 PMCID: PMC8728200 DOI: 10.1093/nar/gkab850] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2021] [Revised: 09/08/2021] [Accepted: 09/11/2021] [Indexed: 12/26/2022] Open
Abstract
Many open access transcriptomic data of coronavirus disease 2019 (COVID-19) were generated, they have great heterogeneity and are difficult to analyze. To utilize these invaluable data for better understanding of COVID-19, additional software should be developed. Especially for researchers without bioinformatic skills, a user-friendly platform is mandatory. We developed the COVID19db platform (http://hpcc.siat.ac.cn/covid19db & http://www.biomedical-web.com/covid19db) that provides 39 930 drug–target–pathway interactions and 95 COVID-19 related datasets, which include transcriptomes of 4127 human samples across 13 body sites associated with the exposure of 33 microbes and 33 drugs/agents. To facilitate data application, each dataset was standardized and annotated with rich clinical information. The platform further provides 14 different analytical applications to analyze various mechanisms underlying COVID-19. Moreover, the 14 applications enable researchers to customize grouping and setting for different analyses and allow them to perform analyses using their own data. Furthermore, a Drug Discovery tool is designed to identify potential drugs and targets at whole transcriptomic scale. For proof of concept, we used COVID19db and identified multiple potential drugs and targets for COVID-19. In summary, COVID19db provides user-friendly web interfaces to freely analyze, download data, and submit new data for further integration, it can accelerate the identification of effective strategies against COVID-19.
Collapse
|
16
|
WADDAICA: A webserver for aiding protein drug design by artificial intelligence and classical algorithm. Comput Struct Biotechnol J 2021; 19:3573-3579. [PMID: 34194678 PMCID: PMC8234348 DOI: 10.1016/j.csbj.2021.06.017] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2021] [Revised: 06/05/2021] [Accepted: 06/12/2021] [Indexed: 10/25/2022] Open
Abstract
Artificial intelligence can train the related known drug data into deep learning models for drug design, while classical algorithms can design drugs through established and predefined procedures. Both deep learning and classical algorithms have their merits for drug design. Here, the webserver WADDAICA is built to employ the advantage of deep learning model and classical algorithms for drug design. The WADDAICA mainly contains two modules. In the first module, WADDAICA provides deep learning models for scaffold hopping of compounds to modify or design new novel drugs. The deep learning model which is used in WADDAICA shows a good scoring power based on the PDBbind database. In the second module, WADDAICA supplies functions for modifying or designing new novel drugs by classical algorithms. WADDAICA shows better Pearson and Spearman correlations of binding affinity than Autodock Vina that is considered to have the best scoring power. Besides, WADDAICA supplies a friendly and convenient web interface for users to submit drug design jobs. We believe that WADDAICA is a useful and effective tool to help researchers to modify or design novel drugs by deep learning models and classical algorithms. WADDAICA is free and accessible at https://bqflab.github.io or https://heisenberg.ucam.edu:5000.
Collapse
|
17
|
IsAb: a computational protocol for antibody design. Brief Bioinform 2021; 22:6238584. [PMID: 33876197 DOI: 10.1093/bib/bbab143] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2021] [Revised: 02/24/2021] [Accepted: 03/23/2021] [Indexed: 12/15/2022] Open
Abstract
The design of therapeutic antibodies has attracted a large amount of attention over the years. Antibodies are widely used to treat many diseases due to their high efficiency and low risk of adverse events. However, the experimental methods of antibody design are time-consuming and expensive. Although computational antibody design techniques have had significant advances in the past years, there are still some challenges that need to be solved, such as the flexibility of antigen structure, the lack of antibody structural data and the absence of standard antibody design protocol. In the present work, we elaborated on an in silico antibody design protocol for users to easily perform computer-aided antibody design. First, the Rosetta web server will be applied to generate the 3D structure of query antibodies if there is no structural information available. Then, two-step docking will be used to identify the binding pose of an antibody-antigen complex when the binding information is unknown. ClusPro is the first method to be used to conduct the global docking, and SnugDock is applied for the local docking. Sequentially, based on the predicted binding poses, in silico alanine scanning will be used to predict the potential hotspots (or key residues). Finally, computational affinity maturation protocol will be used to modify the structure of antibodies to theoretically increase their affinity and stability, which will be further validated by the bioassays in the future. As a proof of concept, we redesigned antibody D44.1 and compared it with previously reported data in order to validate IsAb protocol. To further illustrate our proposed protocol, we used cemiplimab antibody, a PD-1 checkpoint inhibitor, as an example to showcase a step-by-step tutorial.
Collapse
|
18
|
Ligand-based approach for predicting drug targets and for virtual screening against COVID-19. Brief Bioinform 2021; 22:1053-1064. [PMID: 33461215 PMCID: PMC7929377 DOI: 10.1093/bib/bbaa422] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2020] [Revised: 11/18/2020] [Accepted: 12/19/2020] [Indexed: 01/18/2023] Open
Abstract
Discovering efficient drugs and identifying target proteins are still an unmet but urgent need for curing coronavirus disease 2019 (COVID-19). Protein structure-based docking is a widely applied approach for discovering active compounds against drug targets and for predicting potential targets of active compounds. However, this approach has its inherent deficiency caused by e.g. various different conformations with largely varied binding pockets adopted by proteins, or the lack of true target proteins in the database. This deficiency may result in false negative results. As a complementary approach to the protein structure-based platform for COVID-19, termed as D3Docking in our previous work, we developed in this study a ligand-based method, named D3Similarity, which is based on the molecular similarity evaluation between the submitted molecule(s) and those in an active compound database. The database is constituted by all the reported bioactive molecules against the coronaviruses, viz., severe acute respiratory syndrome coronavirus (SARS), Middle East respiratory syndrome coronavirus (MERS), severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), human betacoronavirus 2c EMC/2012 (HCoV-EMC), human CoV 229E (HCoV-229E) and feline infectious peritonitis virus (FIPV), some of which have target or mechanism information but some do not. Based on the two-dimensional (2D) and three-dimensional (3D) similarity evaluation of molecular structures, virtual screening and target prediction could be performed according to similarity ranking results. With two examples, we demonstrated the reliability and efficiency of D3Similarity by using 2D × 3D value as score for drug discovery and target prediction against COVID-19. The database, which will be updated regularly, is available free of charge at https://www.d3pharma.com/D3Targets-2019-nCoV/D3Similarity/index.php.
Collapse
|
19
|
Bioinformatics and machine learning approach identifies potential drug targets and pathways in COVID-19. Brief Bioinform 2021; 22:6220170. [PMID: 33839760 PMCID: PMC8083354 DOI: 10.1093/bib/bbab120] [Citation(s) in RCA: 32] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2020] [Revised: 02/15/2021] [Accepted: 03/13/2021] [Indexed: 12/12/2022] Open
Abstract
Current coronavirus disease-2019 (COVID-19) pandemic has caused massive loss of lives. Clinical trials of vaccines and drugs are currently being conducted around the world; however, till now no effective drug is available for COVID-19. Identification of key genes and perturbed pathways in COVID-19 may uncover potential drug targets and biomarkers. We aimed to identify key gene modules and hub targets involved in COVID-19. We have analyzed SARS-CoV-2 infected peripheral blood mononuclear cell (PBMC) transcriptomic data through gene coexpression analysis. We identified 1520 and 1733 differentially expressed genes (DEGs) from the GSE152418 and CRA002390 PBMC datasets, respectively (FDR < 0.05). We found four key gene modules and hub gene signature based on module membership (MMhub) statistics and protein-protein interaction (PPI) networks (PPIhub). Functional annotation by enrichment analysis of the genes of these modules demonstrated immune and inflammatory response biological processes enriched by the DEGs. The pathway analysis revealed the hub genes were enriched with the IL-17 signaling pathway, cytokine-cytokine receptor interaction pathways. Then, we demonstrated the classification performance of hub genes (PLK1, AURKB, AURKA, CDK1, CDC20, KIF11, CCNB1, KIF2C, DTL and CDC6) with accuracy >0.90 suggesting the biomarker potential of the hub genes. The regulatory network analysis showed transcription factors and microRNAs that target these hub genes. Finally, drug-gene interactions analysis suggests amsacrine, BRD-K68548958, naproxol, palbociclib and teniposide as the top-scored repurposed drugs. The identified biomarkers and pathways might be therapeutic targets to the COVID-19.
Collapse
|
20
|
Bioinformatics resources facilitate understanding and harnessing clinical research of SARS-CoV-2. Brief Bioinform 2021; 22:714-725. [PMID: 33432321 PMCID: PMC7929412 DOI: 10.1093/bib/bbaa416] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2020] [Revised: 11/18/2020] [Accepted: 12/19/2020] [Indexed: 12/17/2022] Open
Abstract
The coronavirus disease 2019 (COVID-19) pandemic, caused by the coronavirus severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), has created an unprecedented threat to public health. The pandemic has been sweeping the globe, impacting more than 200 countries, with more outbreaks still lurking on the horizon. At the time of the writing, no approved drugs or vaccines are available to treat COVID-19 patients, prompting an urgent need to decipher mechanisms underlying the pathogenesis and develop curative treatments. To fight COVID-19, researchers around the world have provided specific tools and molecular information for SARS-CoV-2. These pieces of information can be integrated to aid computational investigations and facilitate clinical research. This paper reviews current knowledge, the current status of drug development and various resources for key steps toward effective treatment of COVID-19, including the phylogenetic characteristics, genomic conservation and interaction data. The final goal of this paper is to provide information that may be utilized in bioinformatics approaches and aid target prioritization and drug repurposing. Several SARS-CoV-2-related tools/databases were reviewed, and a web-portal named OverCOVID (http://bis.zju.edu.cn/overcovid/) is constructed to provide a detailed interpretation of SARS-CoV-2 basics and share a collection of resources that may contribute to therapeutic advances. These information could improve researchers' understanding of SARS-CoV-2 and help to accelerate the development of new antiviral treatments.
Collapse
|
21
|
Generative chemistry: drug discovery with deep learning generative models. J Mol Model 2021; 27:71. [PMID: 33543405 PMCID: PMC10984615 DOI: 10.1007/s00894-021-04674-8] [Citation(s) in RCA: 38] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2020] [Accepted: 01/13/2021] [Indexed: 12/15/2022]
Abstract
The de novo design of molecular structures using deep learning generative models introduces an encouraging solution to drug discovery in the face of the continuously increased cost of new drug development. From the generation of original texts, images, and videos, to the scratching of novel molecular structures the creativity of deep learning generative models exhibits the height machine intelligence can achieve. The purpose of this paper is to review the latest advances in generative chemistry which relies on generative modeling to expedite the drug discovery process. This review starts with a brief history of artificial intelligence in drug discovery to outline this emerging paradigm. Commonly used chemical databases, molecular representations, and tools in cheminformatics and machine learning are covered as the infrastructure for generative chemistry. The detailed discussions on utilizing cutting-edge generative architectures, including recurrent neural network, variational autoencoder, adversarial autoencoder, and generative adversarial network for compound generation are focused. Challenges and future perspectives follow.
Collapse
|
22
|
COVID19 Drug Repository: text-mining the literature in search of putative COVID19 therapeutics. Nucleic Acids Res 2021; 49:D1113-D1121. [PMID: 33166390 PMCID: PMC7778969 DOI: 10.1093/nar/gkaa969] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2020] [Revised: 10/07/2020] [Accepted: 11/04/2020] [Indexed: 12/12/2022] Open
Abstract
The recent outbreak of COVID-19 has generated an enormous amount of Big Data. To date, the COVID-19 Open Research Dataset (CORD-19), lists ∼130,000 articles from the WHO COVID-19 database, PubMed Central, medRxiv, and bioRxiv, as collected by Semantic Scholar. According to LitCovid (11 August 2020), ∼40,300 COVID19-related articles are currently listed in PubMed. It has been shown in clinical settings that the analysis of past research results and the mining of available data can provide novel opportunities for the successful application of currently approved therapeutics and their combinations for the treatment of conditions caused by a novel SARS-CoV-2 infection. As such, effective responses to the pandemic require the development of efficient applications, methods and algorithms for data navigation, text-mining, clustering, classification, analysis, and reasoning. Thus, our COVID19 Drug Repository represents a modular platform for drug data navigation and analysis, with an emphasis on COVID-19-related information currently being reported. The COVID19 Drug Repository enables users to focus on different levels of complexity, starting from general information about (FDA-) approved drugs, PubMed references, clinical trials, recipes as well as the descriptions of molecular mechanisms of drugs' action. Our COVID19 drug repository provide a most updated world-wide collection of drugs that has been repurposed for COVID19 treatments around the world.
Collapse
|
23
|
Pain-CKB, A Pain-Domain-Specific Chemogenomics Knowledgebase for Target Identification and Systems Pharmacology Research. J Chem Inf Model 2020; 60:4429-4435. [PMID: 32786694 DOI: 10.1021/acs.jcim.0c00633] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
A traditional single-target analgesic, though it may be highly selective and potent, may not be sufficient to mitigate pain. An alternative strategy for alleviation of pain is to seek simultaneous modulation at multiple nodes in the network of pain-signaling pathways through a multitarget analgesic or drug combinations. Here we present a comprehensive pain-domain-specific chemogenomics knowledgebase (Pain-CKB) with integrated computing tools for target identification and systems pharmacology research. Pain-CKB is constructed on the basis of our established chemogenomics technology with new features, including multiple compound support, multicavity protein support, and customizable symbol display. The determination of bioactivity is also revised to avoid the use of complex machine learning models. Our one-stop computing platform describes the chemical molecules, genes, and proteins involved in pain regulation. To date, Pain-CKB has archived 272 analgesics in the market, 84 pain-related targets with 207 available 3D crystal or cryo-EM structures, and 234 662 chemical agents reported for these target proteins. Moreover, Pain-CKB implements user-friendly web-interfaced computing tools and applications for the prediction and analysis of the relevant protein targets and visualization of the outputs, including HTDocking, TargetHunter, BBB permeation predictor, NGL viewer, Spider Plot, etc. The Pain-CKB server is accessible at https://www.cbligand.org/g/pain-ckb.
Collapse
|