1
|
Yan W, Yu F, Tan L, Mengshan L, Xiaojun X, Weihong Z, Sheng S, Jun W, Fu-An W. A hybrid machine learning model with attention mechanism and multidimensional multivariate feature coding for essential gene prediction. BMC Biol 2025; 23:108. [PMID: 40275343 PMCID: PMC12023577 DOI: 10.1186/s12915-025-02209-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2024] [Accepted: 04/07/2025] [Indexed: 04/26/2025] Open
Abstract
BACKGROUND Essential genes are crucial for the development, inheritance, and survival of species. The exploration of these genes can unravel the complex mechanisms and fundamental life processes and identify potential therapeutic targets for various diseases. Therefore, the identification of essential genes is significant. Machine learning has become the mainstream approach for essential gene prediction. However, some key challenges in machine learning need to be addressed, such as the extraction of genetic features, the impact of imbalanced data, and the cross-species generalization ability. RESULTS Here, we proposed a hybrid machine learning model based on graph convolutional neural networks (GCN) and bi-directional long short-term memory (Bi-LSTM) with attention mechanism and multidimensional multivariate feature coding for essential gene prediction, called EGP Hybrid-ML. In the model, GCN was used to extract feature encoding information from the visualized graphics of gene sequences and the attention mechanism was combined with Bi-LSTM to assess the importance of each feature in gene sequences and analyze the influences of different feature encoding methods and data imbalance. Additionally, the cross-species predictive performance of the model was evaluated through cross-validation. The results indicated that the sensitivity of the EGP Hybrid-ML model reached 0.9122. CONCLUSIONS This model demonstrated the superior predictive performance and strong generalization capabilities compared to other models. The EGP Hybrid-ML model proposed in this paper has broad application prospects in bioinformatics, chemical information, and pharmaceutical information. The codes, architectures, parameters, and datasets of the proposed model are available free of charge at GitHub ( https://github.com/gnnumsli/EGP-Hybrid-ML ).
Collapse
Affiliation(s)
- Wu Yan
- Gannan Normal University, Ganzhou, Jiangxi, 341000, China.
- Jiangsu University of Science and Technology, Zhenjiang, Jiangsu, 212018, China.
- Sericultural Research Institute, Chinese Academy of Agricultural Sciences, Zhenjiang, Jiangsu, 212018, China.
| | - Fu Yu
- Ganzhou Power Supply Branch of State Grid, Jiangxi Electric Power Co., Ltd, Ganzhou, Jiangxi, 341000, China
| | - Li Tan
- Gannan Normal University, Ganzhou, Jiangxi, 341000, China
| | - Li Mengshan
- Gannan Normal University, Ganzhou, Jiangxi, 341000, China.
- Ganzhou Power Supply Branch of State Grid, Jiangxi Electric Power Co., Ltd, Ganzhou, Jiangxi, 341000, China.
| | - Xie Xiaojun
- Gannan Normal University, Ganzhou, Jiangxi, 341000, China
| | - Zhou Weihong
- Jiangsu University of Science and Technology, Zhenjiang, Jiangsu, 212018, China
- Sericultural Research Institute, Chinese Academy of Agricultural Sciences, Zhenjiang, Jiangsu, 212018, China
| | - Sheng Sheng
- Jiangsu University of Science and Technology, Zhenjiang, Jiangsu, 212018, China
- Sericultural Research Institute, Chinese Academy of Agricultural Sciences, Zhenjiang, Jiangsu, 212018, China
| | - Wang Jun
- Jiangsu University of Science and Technology, Zhenjiang, Jiangsu, 212018, China
- Sericultural Research Institute, Chinese Academy of Agricultural Sciences, Zhenjiang, Jiangsu, 212018, China
| | - Wu Fu-An
- Jiangsu University of Science and Technology, Zhenjiang, Jiangsu, 212018, China.
- Sericultural Research Institute, Chinese Academy of Agricultural Sciences, Zhenjiang, Jiangsu, 212018, China.
| |
Collapse
|
2
|
Li Y, Zhu J, Zhai F, Kong L, Li H, Jin X. Advances in the understanding of nuclear pore complexes in human diseases. J Cancer Res Clin Oncol 2024; 150:374. [PMID: 39080077 PMCID: PMC11289042 DOI: 10.1007/s00432-024-05881-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2024] [Accepted: 07/03/2024] [Indexed: 08/02/2024]
Abstract
BACKGROUND Nuclear pore complexes (NPCs) are sophisticated and dynamic protein structures that straddle the nuclear envelope and act as gatekeepers for transporting molecules between the nucleus and the cytoplasm. NPCs comprise up to 30 different proteins known as nucleoporins (NUPs). However, a growing body of research has suggested that NPCs play important roles in gene regulation, viral infections, cancer, mitosis, genetic diseases, kidney diseases, immune system diseases, and degenerative neurological and muscular pathologies. PURPOSE In this review, we introduce the structure and function of NPCs. Then We described the physiological and pathological effects of each component of NPCs which provide a direction for future clinical applications. METHODS The literatures from PubMed have been reviewed for this article. CONCLUSION This review summarizes current studies on the implications of NPCs in human physiology and pathology, highlighting the mechanistic underpinnings of NPC-associated diseases.
Collapse
Affiliation(s)
- Yuxuan Li
- The Affiliated Lihuili Hospital of Ningbo University, Ningbo, 315040, Zhejiang, China
- Department of Biochemistry and Molecular Biology, and Zhejiang Key Laboratory of Pathophysiology, Health Science Center, Nngbo University, Ningbo, 315211, Zhejiang, China
| | - Jie Zhu
- The Affiliated Lihuili Hospital of Ningbo University, Ningbo, 315040, Zhejiang, China
| | - Fengguang Zhai
- Department of Biochemistry and Molecular Biology, and Zhejiang Key Laboratory of Pathophysiology, Health Science Center, Nngbo University, Ningbo, 315211, Zhejiang, China
| | - Lili Kong
- Department of Biochemistry and Molecular Biology, and Zhejiang Key Laboratory of Pathophysiology, Health Science Center, Nngbo University, Ningbo, 315211, Zhejiang, China
| | - Hong Li
- The Affiliated Lihuili Hospital of Ningbo University, Ningbo, 315040, Zhejiang, China.
- Department of Biochemistry and Molecular Biology, and Zhejiang Key Laboratory of Pathophysiology, Health Science Center, Nngbo University, Ningbo, 315211, Zhejiang, China.
| | - Xiaofeng Jin
- The Affiliated Lihuili Hospital of Ningbo University, Ningbo, 315040, Zhejiang, China.
- Department of Biochemistry and Molecular Biology, and Zhejiang Key Laboratory of Pathophysiology, Health Science Center, Nngbo University, Ningbo, 315211, Zhejiang, China.
| |
Collapse
|
3
|
Halliday C, de Liz LV, Vaughan S, Sunter JD. Disruption of Leishmania flagellum attachment zone architecture causes flagellum loss. Mol Microbiol 2024; 121:53-68. [PMID: 38010644 PMCID: PMC10953051 DOI: 10.1111/mmi.15199] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2023] [Revised: 11/10/2023] [Accepted: 11/13/2023] [Indexed: 11/29/2023]
Abstract
Leishmania are flagellated eukaryotic parasites that cause leishmaniasis and are closely related to the other kinetoplastid parasites such as Trypanosoma brucei. In all these parasites there is a cell membrane invagination at the base of the flagellum called the flagellar pocket, which is tightly associated with and sculpted by cytoskeletal structures including the flagellum attachment zone (FAZ). The FAZ is a complex interconnected structure linking the flagellum to the cell body and has critical roles in cell morphogenesis, function and pathogenicity. However, this structure varies dramatically in size and organisation between these different parasites, suggesting changes in protein localisation and function. Here, we screened the localisation and function of the Leishmania orthologues of T. brucei FAZ proteins identified in the genome-wide protein tagging project TrypTag. We identified 27 FAZ proteins and our deletion analysis showed that deletion of two FAZ proteins in the flagellum, FAZ27 and FAZ34 resulted in a reduction in cell body size, and flagellum loss in some cells. Furthermore, after null mutant generation, we observed distinct and reproducible changes to cell shape, demonstrating the ability of the parasite to adapt to morphological perturbations resulting from gene deletion. This process of adaptation has important implications for the study of Leishmania mutants.
Collapse
Affiliation(s)
- Clare Halliday
- Department of Biological and Medical SciencesOxford Brookes UniversityOxfordUK
| | - Laryssa Vanessa de Liz
- Department of Biological and Medical SciencesOxford Brookes UniversityOxfordUK
- Departamento de Microbiologia, Imunologia e ParasitologiaUniversidade Federal de Santa CatarinaFlorianópolisSCBrazil
| | - Sue Vaughan
- Department of Biological and Medical SciencesOxford Brookes UniversityOxfordUK
| | - Jack D. Sunter
- Department of Biological and Medical SciencesOxford Brookes UniversityOxfordUK
| |
Collapse
|
4
|
Targa A, Larrimore KE, Wong CK, Chong YL, Fung R, Lee J, Choi H, Rancati G. Non-genetic and genetic rewiring underlie adaptation to hypomorphic alleles of an essential gene. EMBO J 2021; 40:e107839. [PMID: 34528284 PMCID: PMC8561638 DOI: 10.15252/embj.2021107839] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2021] [Revised: 08/05/2021] [Accepted: 08/23/2021] [Indexed: 11/17/2022] Open
Abstract
Adaptive evolution to cellular stress is a process implicated in a wide range of biological and clinical phenomena. Two major routes of adaptation have been identified: non-genetic changes, which allow expression of different phenotypes in novel environments, and genetic variation achieved by selection of fitter phenotypes. While these processes are broadly accepted, their temporal and epistatic features in the context of cellular evolution and emerging drug resistance are contentious. In this manuscript, we generated hypomorphic alleles of the essential nuclear pore complex (NPC) gene NUP58. By dissecting early and long-term mechanisms of adaptation in independent clones, we observed that early physiological adaptation correlated with transcriptome rewiring and upregulation of genes known to interact with the NPC; long-term adaptation and fitness recovery instead occurred via focal amplification of NUP58 and restoration of mutant protein expression. These data support the concept that early phenotypic plasticity allows later acquisition of genetic adaptations to a specific impairment. We propose this approach as a genetic model to mimic targeted drug therapy in human cells and to dissect mechanisms of adaptation.
Collapse
Affiliation(s)
- Altea Targa
- Institute of Medical Biology (IMB)Agency for Science, Technology and Research (A*STAR)SingaporeSingapore
- Skin Research Institute of Singapore (SRIS)Agency for Science, Technology and Research (A*STAR)SingaporeSingapore
- School of Biological SciencesNanyang Technological UniversitySingaporeSingapore
| | - Katherine E Larrimore
- Institute of Medical Biology (IMB)Agency for Science, Technology and Research (A*STAR)SingaporeSingapore
- Skin Research Institute of Singapore (SRIS)Agency for Science, Technology and Research (A*STAR)SingaporeSingapore
| | - Cheng Kit Wong
- Institute of Medical Biology (IMB)Agency for Science, Technology and Research (A*STAR)SingaporeSingapore
| | - Yu Lin Chong
- Institute of Medical Biology (IMB)Agency for Science, Technology and Research (A*STAR)SingaporeSingapore
- Skin Research Institute of Singapore (SRIS)Agency for Science, Technology and Research (A*STAR)SingaporeSingapore
| | - Ronald Fung
- Institute of Medical Biology (IMB)Agency for Science, Technology and Research (A*STAR)SingaporeSingapore
| | - Joseph Lee
- Department of MedicineYong Loo Lin School of MedicineNUS and National University Health SystemSingaporeSingapore
| | - Hyungwon Choi
- Department of MedicineYong Loo Lin School of MedicineNUS and National University Health SystemSingaporeSingapore
| | - Giulia Rancati
- Institute of Medical Biology (IMB)Agency for Science, Technology and Research (A*STAR)SingaporeSingapore
- Skin Research Institute of Singapore (SRIS)Agency for Science, Technology and Research (A*STAR)SingaporeSingapore
- School of Biological SciencesNanyang Technological UniversitySingaporeSingapore
| |
Collapse
|