1
|
Perovic V, Glisic S, Veljkovic M, Paessler S, Veljkovic V. Novel Entropy-Based Phylogenetic Algorithm: A New Approach for Classifying SARS-CoV-2 Variants. ENTROPY (BASEL, SWITZERLAND) 2023; 25:1463. [PMID: 37895584 PMCID: PMC10606860 DOI: 10.3390/e25101463] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Revised: 09/15/2023] [Accepted: 10/17/2023] [Indexed: 10/29/2023]
Abstract
The SARS-CoV-2 virus, the causative agent of COVID-19, is known for its genetic diversity. Virus variants of concern (VOCs) as well as variants of interest (VOIs) are classified by the World Health Organization (WHO) according to their potential risk to global health. This study seeks to enhance the identification and classification of such variants by developing a novel bioinformatics criterion centered on the virus's spike protein (SP1), a key player in host cell entry, immune response, and a mutational hotspot. To achieve this, we pioneered a unique phylogenetic algorithm which calculates EIIP-entropy as a distance measure based on the distribution of the electron-ion interaction potential (EIIP) of amino acids in SP1. This method offers a comprehensive, scalable, and rapid approach to analyze large genomic data sets and predict the impact of specific mutations. This innovative approach provides a robust tool for classifying emergent SARS-CoV-2 variants into potential VOCs or VOIs. It could significantly augment surveillance efforts and understanding of variant characteristics, while also offering potential applicability to the analysis and classification of other emerging viral pathogens and enhancing global readiness against emerging and re-emerging viral pathogens.
Collapse
Affiliation(s)
| | - Sanja Glisic
- Biomed Protection, Galveston, TX 77550, USA (M.V.)
| | | | - Slobodan Paessler
- Galveston National Laboratory, Department of Pathology, University of Texas Medical Branch, Galveston, TX 77555, USA;
| | | |
Collapse
|
2
|
Wang W, Zhang Y, Liu D, Zhang H, Wang X, Zhou Y. Prediction of DNA-Binding Protein–Drug-Binding Sites Using Residue Interaction Networks and Sequence Feature. Front Bioeng Biotechnol 2022; 10:822392. [PMID: 35519609 PMCID: PMC9065339 DOI: 10.3389/fbioe.2022.822392] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2021] [Accepted: 03/14/2022] [Indexed: 11/13/2022] Open
Abstract
Identification of protein–ligand binding sites plays a critical role in drug discovery. However, there is still a lack of targeted drug prediction for DNA-binding proteins. This study aims at the binding sites of DNA-binding proteins and drugs, by mining the residue interaction network features, which can describe the local and global structure of amino acids, combined with sequence feature. The predictor of DNA-binding protein–drug-binding sites is built by employing the Extreme Gradient Boosting (XGBoost) model with random under-sampling. We found that the residue interaction network features can better characterize DNA-binding proteins, and the binding sites with high betweenness value and high closeness value are more likely to interact with drugs. The model shows that the residue interaction network features can be used as an important quantitative indicator of drug-binding sites, and this method achieves high predictive performance for the binding sites of DNA-binding protein–drug. This study will help in drug discovery research for DNA-binding proteins.
Collapse
Affiliation(s)
- Wei Wang
- College of Computer and Information Engineering, Henan Normal University, Xinxiang, China
- Key Laboratory of Artificial Intelligence and Personalized Learning in Education of Henan Province, College of Computer and Information Engineering, Henan Normal University, Xinxiang, China
- *Correspondence: Wei Wang, ; Dong Liu, ; Yun Zhou,
| | - Yu Zhang
- College of Computer and Information Engineering, Henan Normal University, Xinxiang, China
| | - Dong Liu
- College of Computer and Information Engineering, Henan Normal University, Xinxiang, China
- Key Laboratory of Artificial Intelligence and Personalized Learning in Education of Henan Province, College of Computer and Information Engineering, Henan Normal University, Xinxiang, China
- *Correspondence: Wei Wang, ; Dong Liu, ; Yun Zhou,
| | - HongJun Zhang
- Computer Science and Technology, Anyang University, Anyang, China
| | - XianFang Wang
- Computer Science and Technology, Henan Institute of Technology, Xinxiang, China
| | - Yun Zhou
- College of Computer and Information Engineering, Henan Normal University, Xinxiang, China
- *Correspondence: Wei Wang, ; Dong Liu, ; Yun Zhou,
| |
Collapse
|
3
|
|
4
|
Huang CH, Zaenudin E, Tsai JJP, Kurubanjerdjit N, Dessie EY, Ng KL. Dissecting molecular network structures using a network subgraph approach. PeerJ 2020; 8:e9556. [PMID: 33005483 PMCID: PMC7512139 DOI: 10.7717/peerj.9556] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2020] [Accepted: 06/25/2020] [Indexed: 11/20/2022] Open
Abstract
Biological processes are based on molecular networks, which exhibit biological functions through interactions of genetic elements or proteins. This study presents a graph-based method to characterize molecular networks by decomposing the networks into directed multigraphs: network subgraphs. Spectral graph theory, reciprocity and complexity measures were used to quantify the network subgraphs. Graph energy, reciprocity and cyclomatic complexity can optimally specify network subgraphs with some degree of degeneracy. Seventy-one molecular networks were analyzed from three network types: cancer networks, signal transduction networks, and cellular processes. Molecular networks are built from a finite number of subgraph patterns and subgraphs with large graph energies are not present, which implies a graph energy cutoff. In addition, certain subgraph patterns are absent from the three network types. Thus, the Shannon entropy of the subgraph frequency distribution is not maximal. Furthermore, frequently-observed subgraphs are irreducible graphs. These novel findings warrant further investigation and may lead to important applications. Finally, we observed that cancer-related cellular processes are enriched with subgraph-associated driver genes. Our study provides a systematic approach for dissecting biological networks and supports the conclusion that there are organizational principles underlying molecular networks.
Collapse
Affiliation(s)
- Chien-Hung Huang
- Department of Computer Science and Information Engineering, National Formosa University, Yunlin, Taiwan
| | - Efendi Zaenudin
- Department of Bioinformatics and Medical Engineering, Asia University, Taichung, Taiwan.,Research Center for Informatics, Indonesian Institute of Sciences, Bandung, Indonesia
| | - Jeffrey J P Tsai
- Department of Bioinformatics and Medical Engineering, Asia University, Taichung, Taiwan
| | | | - Eskezeia Y Dessie
- Department of Bioinformatics and Medical Engineering, Asia University, Taichung, Taiwan
| | - Ka-Lok Ng
- Department of Bioinformatics and Medical Engineering, Asia University, Taichung, Taiwan.,Department of Medical Research, China Medical University Hospital, China Medical University, Taichung, Taiwan
| |
Collapse
|
5
|
Dey A, Sen S, Uversky VN, Maulik U. Structural facets of POU2F1 in light of the functional annotations and sequence-structure patterns. J Biomol Struct Dyn 2020; 39:1093-1105. [PMID: 32081083 DOI: 10.1080/07391102.2020.1733092] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
POU domain class 2 homebox 1 or POU2F1 is broadly known as an important transcription factor. Due to its association with different types of malignancies, POU2F1 became one of the key factors in pancancer analysis. However, in spite of considering this protein as a potential drug target, none of the drug targeting POU2F1 has been designed as of yet due to the extreme structural flexibility of this protein. In this article, we have proposed a three-level comprehensive framework for understanding the structural conservation and co-variation of POU2F1. First, a gene regulatory network based on the normal and pathological functions of POU2F1 has been created for better understanding the strong association between POU2F1 deregulation and cancers. After that, based on the evolutionary sequence space analysis, the comparative sequence dynamics of the protein members of POU domain family has been studied mostly between non-human and human species. Subsequently, the reciprocity effect of the residual co-variation has been identified through direct coupling analysis. Along with that, the structure of POU2F1 has been analyzed depending on quality assessment and normal mode-based structure network. Comparing the sequence and structure space information, the most significant set of residues viz., 3, 9, 13, 17, 20, 21, 28, 35, and 36 have been identified as structural facet for function. This study demonstrates that the structural malleability of POU2F1 serves as one of the prime reason behind its functional multiplicity in terms of protein moonlighting. Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Ashmita Dey
- Computer Science and Engineering, Jadavpur University, Kolkata, West Bengal, India
| | - Sagnik Sen
- Computer Science and Engineering, Jadavpur University, Kolkata, West Bengal, India
| | - Vladimir N Uversky
- Federal Research Center "Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences", Institute for Biological Instrumentation of the Russian Academy of Sciences, Pushchino, Moscow Region, Russia.,Department of Molecular Medicine and USF Health Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL, USA
| | - Ujjwal Maulik
- Computer Science and Engineering, Jadavpur University, Kolkata, West Bengal, India
| |
Collapse
|
6
|
Ahmad S, Gromiha MM, Raghava GPS, Schönbach C, Ranganathan S. APBioNet's annual International Conference on Bioinformatics (InCoB) returns to India in 2018. BMC Genomics 2019; 19:266. [PMID: 30999857 PMCID: PMC7402400 DOI: 10.1186/s12864-019-5582-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023] Open
Abstract
InCoB, one of the largest annual bioinformatics conferences in the Asia-Pacific region since its launch in 2002, returned to New Delhi, India after 12 years, with a conference attendance of 314 delegates. The 2018 conference had sessions on Big Data and Algorithms, Next Generation Sequencing and Omics Science, Structure, Function and Interactions, Disease and Drug Discovery and Plant and Agricultural Bioinformatics. The conference also featured an industry track as well as panel discussions on Women in Bioinformatics and Democratization vs. Quality control in academic publishing. Asia Pacific Bioinformatics Interaction & Networking Society (APbians) was launched as an APBionet Special Interest Group. Of the 52 oral presentations made, 22 were accepted in supplemental issues of BMC Bioinformatics, BMC Genomics or BMC Medical Genomics and are briefly reviewed here. Next year’s InCoB will be held in Jakarta, Indonesia from September 10–12, 2019.
Collapse
Affiliation(s)
- Shandar Ahmad
- School of Computational and Integrative Sciences, Jawaharlal Nehru University, New Delhi, 110 067, India
| | - Michael M Gromiha
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai, Tamilnadu, 600 036, India
| | - Gajendra P S Raghava
- Centre for Computational Biology, Indraprastha Institute of Information Technology, Okhla Industrial Estate, Phase III, New Delhi, 110020, India
| | - Christian Schönbach
- Department of Biology, School of Science and Technology, Nazarbayev University, Astana, Kazakhstan.,International Research Center for Medical Sciences, Graduate School of Medical Sciences, Kumamoto University, Kumamoto, 860-0811, Japan
| | - Shoba Ranganathan
- Department of Molecular Sciences, Macquarie University, Sydney, NSW, 2109, Australia. .,Transformational Bioinformatics, Health and Biosecurity, Commonwealth Scientific and Industrial Research Organisation, Sydney, Australia.
| |
Collapse
|