1
|
Peng FZ, Wang C, Chen T, Schussheim B, Vincoff S, Chatterjee P. PTM-Mamba: a PTM-aware protein language model with bidirectional gated Mamba blocks. Nat Methods 2025; 22:945-949. [PMID: 40211004 PMCID: PMC12074982 DOI: 10.1038/s41592-025-02656-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2024] [Accepted: 03/05/2025] [Indexed: 04/12/2025]
Abstract
Current protein language models (LMs) accurately encode protein properties but have yet to represent post-translational modifications (PTMs), which are crucial for proteomic diversity and influence protein structure, function and interactions. To address this gap, we develop PTM-Mamba, a PTM-aware protein LM that integrates PTM tokens using bidirectional Mamba blocks fused with ESM-2 protein LM embeddings via a newly developed gating mechanism. PTM-Mamba uniquely models both wild-type and PTM sequences, enabling downstream tasks such as disease association and druggability prediction, PTM effect prediction on protein-protein interactions and zero-shot PTM discovery. In total, our work establishes PTM-Mamba as a foundational tool for PTM-aware protein modeling and design.
Collapse
Affiliation(s)
| | - Chentong Wang
- School of Life Sciences, Westlake University, Hangzhou, China
| | - Tong Chen
- Department of Biomedical Engineering, Duke University, Durham, NC, USA
| | | | - Sophia Vincoff
- Department of Biomedical Engineering, Duke University, Durham, NC, USA
| | - Pranam Chatterjee
- Department of Biomedical Engineering, Duke University, Durham, NC, USA.
- Department of Computer Science, Duke University, Durham, NC, USA.
- Department of Biostatistics and Bioinformatics, Duke University, Durham, NC, USA.
| |
Collapse
|
2
|
Liu L, Li X, Yang H, Xu F, Dong X. Bioinformatic Analysis of Apoptosis-Related Genes in Preeclampsia Using Public Transcriptomic and Single-Cell RNA Sequencing Datasets. J Inflamm Res 2025; 18:4785-4812. [PMID: 40224388 PMCID: PMC11992479 DOI: 10.2147/jir.s507660] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2024] [Accepted: 03/25/2025] [Indexed: 04/15/2025] Open
Abstract
Purpose Apoptosis, which is crucial in preeclampsia (PE), affects trophoblast survival and placental function. We used transcriptomics and single-cell RNA sequencing (scRNA-seq) to explore apoptosis-related genes (ARGs) and their cellular mechanisms as potential PE biomarkers. Patients and Methods All the data included in this study were sourced from public databases. We used scRNA-seq and differential expression analysis, combined with five algorithms from the CytoHubba plugin, to identify ARGs as PE biomarkers. These were integrated into diagnostic nomograms. Mechanistic studies involved enrichment analysis and immune profiling. Biomarker expression was examined at the single-cell level, and verified in clinical samples by RT-qPCR. Results Cluster of Differentiation 44 (CD44), Macrophage migration inhibitory factor (MIF), PIK3R1, and Toll-like receptor 4 (TLR4) were identified as PE biomarkers. CD44 and TLR4 were down-regulated, while MIF and PIK3R1 were up-regulated. When integrated into the diagnostic nomogram, they showed clinical utility and affected cell functions. In the immune profile of PE, monocytes decreased, resting NK cells increased, and the activities of APC, checkpoint, T-cell co-stimulation, and MHC class I pathways reduced. ScRNA-seq identified 11 cell types, 10 of which were significantly different. Endothelial cell communication with other cell types decreased, while the interaction between common myeloid progenitors (CMP) and villous cytotrophoblasts (VCT) enhanced. The expression levels of CD44, MIF, and PIK3R1 in VCT were significantly different and key to PE. Their decrease in early PE and increase in late PE reflected the placenta's adaptation to adverse pregnancy conditions. Conclusion Four ARGs, CD44, MIF, PIK3R1, and TLR4, identified through comprehensive analyses, served as significant biomarkers for PE and offered insights into PE's cellular mechanisms of PE, providing valuable references for further research.
Collapse
Affiliation(s)
- Lingyan Liu
- Faculty of Life Science and Technology, Kunming University of Science and Technology, Kunming, 650500, People’s Republic of China
- Department of Obstetrics, The First People’s Hospital of Yunnan Province, The Affiliated Hospital of Kunming University of Science and Technology, Kunming, 650500, People’s Republic of China
- Medical School, Kunming University of Science and Technology, Kunming, 650500, People’s Republic of China
| | - Xiuling Li
- Department of Obstetrics, The First People’s Hospital of Yunnan Province, The Affiliated Hospital of Kunming University of Science and Technology, Kunming, 650500, People’s Republic of China
- Medical School, Kunming University of Science and Technology, Kunming, 650500, People’s Republic of China
| | - Hongfen Yang
- Department of Obstetrics, The First People’s Hospital of Yunnan Province, The Affiliated Hospital of Kunming University of Science and Technology, Kunming, 650500, People’s Republic of China
- Medical School, Kunming University of Science and Technology, Kunming, 650500, People’s Republic of China
| | - Fei Xu
- Medical School, Kunming University of Science and Technology, Kunming, 650500, People’s Republic of China
- Department of Pain Management, The First People’s Hospital of Yunnan Province, The Affiliated Hospital of Kunming University of Science and Technology, Kunming, 650500, People’s Republic of China
| | - Xudong Dong
- Department of Obstetrics, The First People’s Hospital of Yunnan Province, The Affiliated Hospital of Kunming University of Science and Technology, Kunming, 650500, People’s Republic of China
- Medical School, Kunming University of Science and Technology, Kunming, 650500, People’s Republic of China
| |
Collapse
|
3
|
Chhichholiya Y, Kumar V, Hlawndo L, Sangave K, Singh S, Munshi A. BreVamiR3': A comprehensive database for breast cancer-associated genetic variations in miRNA and 3' UTR of their target genes. Comput Biol Med 2025; 186:109639. [PMID: 39740512 DOI: 10.1016/j.compbiomed.2024.109639] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2024] [Revised: 12/25/2024] [Accepted: 12/26/2024] [Indexed: 01/02/2025]
Abstract
Breast cancer (BC) is a multifactorial disease where microRNA (miRNA)-mediated dysregulated gene expression plays a pivotal role in tumorigenesis, progression, and clinical outcomes. Genetic variation, particularly SNPs in miRNA sequences and the 3' untranslated regions (3'UTRs) of their target genes, can disrupt miRNA-mRNA interactions, leading to altered gene expression. Despite several existing databases providing insights into various aspects of miRNAs and their target genes in association with the development of the disease. Still, there remains a critical gap in a unified, BC-specific repository that integrates SNPs in miRNAs, their target genes, and associated molecular mechanisms. To address this, we developed BreVamiR3' (Breast cancer-associated genetic Variations in miRNA and 3'UTR of their target genes), a novel, freely accessible database (https://brevamir3.web.app/index.html). BreVamiR3' features comprehensive and curated data on 500 miRNAs and 828 target genes associated with BC, including experimentally validated SNPs, chromosomal loci, expression profiles, BC subtypes, signaling pathways, and classifications of genes i.e. oncogene, tumor suppressor, or both. Unique features of BreVamiR3' include detailed annotations of SNPs disrupting miRNA-target interactions, subtype-specific miRNA expression, visualization of miRNA-chromosomal distributions and their experimental validation. Chromosome 19 harbours the highest number of BC-associated miRNAs, while target genes such as CCND1, KRAS, and ERBB2 demonstrate extensive miRNA regulation. By integrating genetic variations with functional and clinical relevance, BreVamiR3' provides a single platform to explore miRNA-mediated regulatory networks, their genetic variation, and downstream signaling pathways in BC. This database serves as a powerful resource for researchers, enabling exploration of miRNA-gene interactions, genetic variation, and their role in pathogenesis of BC and diverse clinical outcomes.
Collapse
Affiliation(s)
- Yogita Chhichholiya
- Department of Human Genetics and Molecular Medicine, Central University of Punjab, Bathinda, Punjab, India
| | - Vicky Kumar
- Department of Computer Science Engineering, Guru Nanak Dev DSEU Rohini Campus, Delhi Skill and Entrepreneurship University, Dwarka, Delhi, India
| | - Lalnunmawia Hlawndo
- Department of Human Genetics and Molecular Medicine, Central University of Punjab, Bathinda, Punjab, India
| | - Kshitij Sangave
- Department of Human Genetics and Molecular Medicine, Central University of Punjab, Bathinda, Punjab, India
| | - Sandeep Singh
- Department of Human Genetics and Molecular Medicine, Central University of Punjab, Bathinda, Punjab, India
| | - Anjana Munshi
- Department of Human Genetics and Molecular Medicine, Central University of Punjab, Bathinda, Punjab, India.
| |
Collapse
|
4
|
Huang X, Feng Z, Liu D, Gou Y, Chen M, Tang D, Han C, Peng J, Peng D, Xue Y. PTMD 2.0: an updated database of disease-associated post-translational modifications. Nucleic Acids Res 2025; 53:D554-D563. [PMID: 39329270 PMCID: PMC11701619 DOI: 10.1093/nar/gkae850] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2024] [Revised: 09/10/2024] [Accepted: 09/17/2024] [Indexed: 09/28/2024] Open
Abstract
Various post-translational modifications (PTMs) participate in nearly all aspects of biological processes by regulating protein functions, and aberrant states of PTMs are frequently associated with human diseases. Here, we present a comprehensive database of PTMs associated with diseases (PTMD 2.0), including 342 624 PTM-disease associations (PDAs) in 15 105 proteins for 93 types of PTMs and 2083 diseases. Based on the distinct PTM states in diseases, we classified all PDAs into six categories: upregulation (U) or downregulation (D) of PTM levels, absence (A) or presence (P) of PTMs, and creation (C) or disruption (N) of PTM sites. We provided detailed annotations for each PDA and carefully annotated disease-associated proteins by integrating the knowledge from 101 additional resources that covered 13 aspects, including disease-associated information, variation and mutation, protein-protein interaction, protein functional annotation, DNA and RNA element, protein structure, chemical-target relationship, mRNA expression, protein expression/proteomics, subcellular localization, biological pathway annotation, functional domain annotation and physicochemical property. With a data volume of ∼8 GB, we anticipate that PTMD 2.0 will serve as a fundamental resource for further analysing the relationships between PTMs and diseases. The online service of PTMD 2.0 is freely available at https://ptmd.biocuckoo.cn/.
Collapse
Affiliation(s)
- Xinhe Huang
- Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Luoyu Road 1037, Wuhan 430074, China
| | - Zihao Feng
- Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Luoyu Road 1037, Wuhan 430074, China
| | - Dan Liu
- Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Luoyu Road 1037, Wuhan 430074, China
| | - Yujie Gou
- Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Luoyu Road 1037, Wuhan 430074, China
| | - Miaomiao Chen
- Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Luoyu Road 1037, Wuhan 430074, China
| | - Dachao Tang
- Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Luoyu Road 1037, Wuhan 430074, China
| | - Cheng Han
- Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Luoyu Road 1037, Wuhan 430074, China
| | - Jianzhen Peng
- Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Luoyu Road 1037, Wuhan 430074, China
| | - Di Peng
- Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Luoyu Road 1037, Wuhan 430074, China
- Key Laboratory of Molecular Biophysics of Ministry of Education, Hubei Bioinformatics and Molecular Imaging Key Laboratory, College of Life Science and Technology, Huazhong University of Science and Technology, Luoyu Road 1037, Wuhan 430074, China
| | - Yu Xue
- Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Luoyu Road 1037, Wuhan 430074, China
- Key Laboratory of Molecular Biophysics of Ministry of Education, Hubei Bioinformatics and Molecular Imaging Key Laboratory, College of Life Science and Technology, Huazhong University of Science and Technology, Luoyu Road 1037, Wuhan 430074, China
- Nanjing University Institute of Artificial Intelligence Biomedicine, Xianlin Avenue 163, Nanjing 210031, China
| |
Collapse
|
5
|
Chung CR, Tang Y, Chiu YP, Li S, Hsieh WK, Yao L, Chiang YC, Pang Y, Chen GT, Chou KC, Paik YS, Tran P, Lin CP, Kao YM, Chen YJ, Chang WC, Hsu JK, Horng JT, Lee TY. dbPTM 2025 update: comprehensive integration of PTMs and proteomic data for advanced insights into cancer research. Nucleic Acids Res 2025; 53:D377-D386. [PMID: 39526378 PMCID: PMC11701562 DOI: 10.1093/nar/gkae1005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2024] [Revised: 10/08/2024] [Accepted: 10/18/2024] [Indexed: 11/16/2024] Open
Abstract
Post-translational modifications (PTMs) are essential for modulating protein function and influencing stability, activity and signaling processes. The dbPTM 2025 update significantly expands the database to include over 2.79 million PTM sites, of which 2.243 million are experimentally validated from 48 databases and over 80 000 research articles. This version integrates proteomic data from 13 cancer types, with a particular focus on phosphoproteomic data and kinase activity profiles, allowing the exploration of personalized phosphorylation patterns in tumor samples. Integrating kinase-substrate phosphorylations with E3 ligase-substrate interactions, dbPTM 2025 provides a detailed map of PTM regulatory networks, offering insights into cancer-specific post-translational regulations. This update also includes advanced search capabilities, enabling users to efficiently query PTM data across species, PTM types and modified residues. The platform's new features-interactive visualization tools and streamlined data downloads-allow researchers to access and analyze PTM data easily. dbPTM 2025 also enhances functional annotations, regulatory networks and disease associations, broadening its application for cancer research and the study of disease-associated PTMs. Through these enhancements, dbPTM 2025 is a comprehensive, user-friendly resource, facilitating the study of PTMs and their roles in cancer research. The database is now freely accessible at https://biomics.lab.nycu.edu.tw/dbPTM/.
Collapse
Affiliation(s)
- Chia-Ru Chung
- Department of Computer Science and Information Engineering, National Central University, No. 300, Zhongda Rd., Zhongli Dist., Taoyuan City 320317, Taiwan
| | - Yun Tang
- Institute of Bioinformatics and Systems Biology, College of Engineering Bioscience, National Yang Ming Chiao Tung University, No. 1001, Daxue Rd. East Dist., Hsinchu City 300093, Taiwan
| | - Yen-Peng Chiu
- Institute of Data Science and Engineering, College of Computer Science, National Yang Ming Chiao Tung University, No. 1001, Daxue Rd. East Dist., Hsinchu City 300093, Taiwan
| | - Shangfu Li
- Warshel Institute for Computational Biology, School of Medicine, The Chinese University of Hong Kong, Shenzhen, No. 2001, Longxiang Boulevard, Longgang Dist., Shenzhen, Guangdong 518172, China
| | - Wen-Kai Hsieh
- Department of Computer Science and Information Engineering, National Central University, No. 300, Zhongda Rd., Zhongli Dist., Taoyuan City 320317, Taiwan
| | - Lantian Yao
- Kobilka Institute of Innovative Drug Discovery, School of Medicine, The Chinese University of Hong Kong, Shenzhen, No. 2001, Longxiang Boulevard, Longgang Dist., Shenzhen, Guangdong 518172, China
| | - Ying-Chih Chiang
- Kobilka Institute of Innovative Drug Discovery, School of Medicine, The Chinese University of Hong Kong, Shenzhen, No. 2001, Longxiang Boulevard, Longgang Dist., Shenzhen, Guangdong 518172, China
| | - Yuxuan Pang
- Institute of Medical Science, The University of Tokyo, 4-6-1, Shirokanedai, Minato-ku, Tokyo 108-8639, Japan
| | - Guan-Ting Chen
- Institute of Bioinformatics and Systems Biology, College of Engineering Bioscience, National Yang Ming Chiao Tung University, No. 1001, Daxue Rd. East Dist., Hsinchu City 300093, Taiwan
| | - Kai-Chen Chou
- Institute of Bioinformatics and Systems Biology, College of Engineering Bioscience, National Yang Ming Chiao Tung University, No. 1001, Daxue Rd. East Dist., Hsinchu City 300093, Taiwan
| | - You Sheng Paik
- Institute of Bioinformatics and Systems Biology, College of Engineering Bioscience, National Yang Ming Chiao Tung University, No. 1001, Daxue Rd. East Dist., Hsinchu City 300093, Taiwan
| | - Phuong Lam Tran
- Institute of Bioinformatics and Systems Biology, College of Engineering Bioscience, National Yang Ming Chiao Tung University, No. 1001, Daxue Rd. East Dist., Hsinchu City 300093, Taiwan
| | - Cheng-Pei Lin
- Institute of Bioinformatics and Systems Biology, College of Engineering Bioscience, National Yang Ming Chiao Tung University, No. 1001, Daxue Rd. East Dist., Hsinchu City 300093, Taiwan
| | - Yu-Min Kao
- Institute of Bioinformatics and Systems Biology, College of Engineering Bioscience, National Yang Ming Chiao Tung University, No. 1001, Daxue Rd. East Dist., Hsinchu City 300093, Taiwan
| | - Yi-Jie Chen
- Institute of Bioinformatics and Systems Biology, College of Engineering Bioscience, National Yang Ming Chiao Tung University, No. 1001, Daxue Rd. East Dist., Hsinchu City 300093, Taiwan
| | - Wen-Chi Chang
- Institute of Tropical Plant Sciences and Microbiology, National Cheng Kung University, No.1, University Rd., Tainan City 70101, Taiwan
| | - Justin Bo-Kai Hsu
- Department of Computer Science and Engineering, Yuan Ze University, No. 135, Yuandong Rd., Zhongli Dist., Taoyuan City 320315, Taiwan
| | - Jorng-Tzong Horng
- Department of Computer Science and Information Engineering, National Central University, No. 300, Zhongda Rd., Zhongli Dist., Taoyuan City 320317, Taiwan
| | - Tzong-Yi Lee
- Institute of Bioinformatics and Systems Biology, College of Engineering Bioscience, National Yang Ming Chiao Tung University, No. 1001, Daxue Rd. East Dist., Hsinchu City 300093, Taiwan
- Center for Intelligent Drug Systems and Smart Bio-devices (IDS2B), National Yang Ming Chiao Tung University, No. 1001, Daxue Rd. East Dist., Hsinchu City 300093, Taiwan
| |
Collapse
|
6
|
Qin Z, Ren H, Zhao P, Wang K, Liu H, Miao C, Du Y, Li J, Wu L, Chen Z. Current computational tools for protein lysine acylation site prediction. Brief Bioinform 2024; 25:bbae469. [PMID: 39316944 PMCID: PMC11421846 DOI: 10.1093/bib/bbae469] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2024] [Revised: 08/20/2024] [Accepted: 09/07/2024] [Indexed: 09/26/2024] Open
Abstract
As a main subtype of post-translational modification (PTM), protein lysine acylations (PLAs) play crucial roles in regulating diverse functions of proteins. With recent advancements in proteomics technology, the identification of PTM is becoming a data-rich field. A large amount of experimentally verified data is urgently required to be translated into valuable biological insights. With computational approaches, PLA can be accurately detected across the whole proteome, even for organisms with small-scale datasets. Herein, a comprehensive summary of 166 in silico PLA prediction methods is presented, including a single type of PLA site and multiple types of PLA sites. This recapitulation covers important aspects that are critical for the development of a robust predictor, including data collection and preparation, sample selection, feature representation, classification algorithm design, model evaluation, and method availability. Notably, we discuss the application of protein language models and transfer learning to solve the small-sample learning issue. We also highlight the prediction methods developed for functionally relevant PLA sites and species/substrate/cell-type-specific PLA sites. In conclusion, this systematic review could potentially facilitate the development of novel PLA predictors and offer useful insights to researchers from various disciplines.
Collapse
Affiliation(s)
- Zhaohui Qin
- Collaborative Innovation Center of Henan Grain Crops, Henan Key Laboratory of Rice Molecular Breeding and High Efficiency Production, College of Agronomy, Henan Agricultural University, Zhengzhou 450046, China
| | - Haoran Ren
- Collaborative Innovation Center of Henan Grain Crops, Henan Key Laboratory of Rice Molecular Breeding and High Efficiency Production, College of Agronomy, Henan Agricultural University, Zhengzhou 450046, China
| | - Pei Zhao
- State Key Laboratory of Cotton Biology, Institute of Cotton Research of Chinese Academy of Agricultural Sciences (CAAS), Anyang 455000, China
| | - Kaiyuan Wang
- Collaborative Innovation Center of Henan Grain Crops, Henan Key Laboratory of Rice Molecular Breeding and High Efficiency Production, College of Agronomy, Henan Agricultural University, Zhengzhou 450046, China
| | - Huixia Liu
- Collaborative Innovation Center of Henan Grain Crops, Henan Key Laboratory of Rice Molecular Breeding and High Efficiency Production, College of Agronomy, Henan Agricultural University, Zhengzhou 450046, China
| | - Chunbo Miao
- Collaborative Innovation Center of Henan Grain Crops, Henan Key Laboratory of Rice Molecular Breeding and High Efficiency Production, College of Agronomy, Henan Agricultural University, Zhengzhou 450046, China
| | - Yanxiu Du
- Collaborative Innovation Center of Henan Grain Crops, Henan Key Laboratory of Rice Molecular Breeding and High Efficiency Production, College of Agronomy, Henan Agricultural University, Zhengzhou 450046, China
| | - Junzhou Li
- Collaborative Innovation Center of Henan Grain Crops, Henan Key Laboratory of Rice Molecular Breeding and High Efficiency Production, College of Agronomy, Henan Agricultural University, Zhengzhou 450046, China
| | - Liuji Wu
- National Key Laboratory of Wheat and Maize Crop Science, College of Agronomy, Henan Agricultural University, Zhengzhou 450046, China
| | - Zhen Chen
- Collaborative Innovation Center of Henan Grain Crops, Henan Key Laboratory of Rice Molecular Breeding and High Efficiency Production, College of Agronomy, Henan Agricultural University, Zhengzhou 450046, China
| |
Collapse
|
7
|
Gou Y, Liu D, Chen M, Wei Y, Huang X, Han C, Feng Z, Zhang C, Lu T, Peng D, Xue Y. GPS-SUMO 2.0: an updated online service for the prediction of SUMOylation sites and SUMO-interacting motifs. Nucleic Acids Res 2024; 52:W238-W247. [PMID: 38709873 PMCID: PMC11223847 DOI: 10.1093/nar/gkae346] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2024] [Revised: 04/08/2024] [Accepted: 04/18/2024] [Indexed: 05/08/2024] Open
Abstract
Small ubiquitin-like modifiers (SUMOs) are tiny but important protein regulators involved in orchestrating a broad spectrum of biological processes, either by covalently modifying protein substrates or by noncovalently interacting with other proteins. Here, we report an updated server, GPS-SUMO 2.0, for the prediction of SUMOylation sites and SUMO-interacting motifs (SIMs). For predictor training, we adopted three machine learning algorithms, penalized logistic regression (PLR), a deep neural network (DNN), and a transformer, and used 52 404 nonredundant SUMOylation sites in 8262 proteins and 163 SIMs in 102 proteins. To further increase the accuracy of predicting SUMOylation sites, a pretraining model was first constructed using 145 545 protein lysine modification sites, followed by transfer learning to fine-tune the model. GPS-SUMO 2.0 exhibited greater accuracy in predicting SUMOylation sites than did other existing tools. For users, one or multiple protein sequences or identifiers can be input, and the prediction results are shown in a tabular list. In addition to the basic statistics, we integrated knowledge from 35 public resources to annotate SUMOylation sites or SIMs. The GPS-SUMO 2.0 server is freely available at https://sumo.biocuckoo.cn/. We believe that GPS-SUMO 2.0 can serve as a useful tool for further analysis of SUMOylation and SUMO interactions.
Collapse
Affiliation(s)
- Yujie Gou
- Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan430074, China
- Key Laboratory of Molecular Biophysics of Ministry of Education, Hubei Bioinformatics and Molecular Imaging Key Laboratory, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan430074, China
| | - Dan Liu
- Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan430074, China
- Key Laboratory of Molecular Biophysics of Ministry of Education, Hubei Bioinformatics and Molecular Imaging Key Laboratory, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan430074, China
| | - Miaomiao Chen
- Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan430074, China
- Key Laboratory of Molecular Biophysics of Ministry of Education, Hubei Bioinformatics and Molecular Imaging Key Laboratory, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan430074, China
| | - Yuxiang Wei
- Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan430074, China
- Key Laboratory of Molecular Biophysics of Ministry of Education, Hubei Bioinformatics and Molecular Imaging Key Laboratory, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan430074, China
| | - Xinhe Huang
- Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan430074, China
- Key Laboratory of Molecular Biophysics of Ministry of Education, Hubei Bioinformatics and Molecular Imaging Key Laboratory, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan430074, China
| | - Cheng Han
- Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan430074, China
- Key Laboratory of Molecular Biophysics of Ministry of Education, Hubei Bioinformatics and Molecular Imaging Key Laboratory, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan430074, China
| | - Zihao Feng
- Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan430074, China
- Key Laboratory of Molecular Biophysics of Ministry of Education, Hubei Bioinformatics and Molecular Imaging Key Laboratory, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan430074, China
| | - Chi Zhang
- Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan430074, China
- Key Laboratory of Molecular Biophysics of Ministry of Education, Hubei Bioinformatics and Molecular Imaging Key Laboratory, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan430074, China
| | - Teng Lu
- Computer Network Information Center, Chinese Academy of Sciences, Beijing100190, China
| | - Di Peng
- Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan430074, China
- Key Laboratory of Molecular Biophysics of Ministry of Education, Hubei Bioinformatics and Molecular Imaging Key Laboratory, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan430074, China
| | - Yu Xue
- Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan430074, China
- Key Laboratory of Molecular Biophysics of Ministry of Education, Hubei Bioinformatics and Molecular Imaging Key Laboratory, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan430074, China
- Nanjing University Institute of Artificial Intelligence Biomedicine, Nanjing210031, China
| |
Collapse
|
8
|
Esmaili F, Pourmirzaei M, Ramazi S, Shojaeilangari S, Yavari E. A Review of Machine Learning and Algorithmic Methods for Protein Phosphorylation Site Prediction. GENOMICS, PROTEOMICS & BIOINFORMATICS 2023; 21:1266-1285. [PMID: 37863385 PMCID: PMC11082408 DOI: 10.1016/j.gpb.2023.03.007] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/16/2022] [Revised: 01/16/2023] [Accepted: 03/23/2023] [Indexed: 10/22/2023]
Abstract
Post-translational modifications (PTMs) have key roles in extending the functional diversity of proteins and, as a result, regulating diverse cellular processes in prokaryotic and eukaryotic organisms. Phosphorylation modification is a vital PTM that occurs in most proteins and plays a significant role in many biological processes. Disorders in the phosphorylation process lead to multiple diseases, including neurological disorders and cancers. The purpose of this review is to organize this body of knowledge associated with phosphorylation site (p-site) prediction to facilitate future research in this field. At first, we comprehensively review all related databases and introduce all steps regarding dataset creation, data preprocessing, and method evaluation in p-site prediction. Next, we investigate p-site prediction methods, which are divided into two computational groups: algorithmic and machine learning (ML). Additionally, it is shown that there are basically two main approaches for p-site prediction by ML: conventional and end-to-end deep learning methods, both of which are given an overview. Moreover, this review introduces the most important feature extraction techniques, which have mostly been used in p-site prediction. Finally, we create three test sets from new proteins related to the released version of the database of protein post-translational modifications (dbPTM) in 2022 based on general and human species. Evaluating online p-site prediction tools on newly added proteins introduced in the dbPTM 2022 release, distinct from those in the dbPTM 2019 release, reveals their limitations. In other words, the actual performance of these online p-site prediction tools on unseen proteins is notably lower than the results reported in their respective research papers.
Collapse
Affiliation(s)
- Farzaneh Esmaili
- Department of Information Technology, Tarbiat Modares University, Tehran 14115-111, Iran
| | - Mahdi Pourmirzaei
- Department of Information Technology, Tarbiat Modares University, Tehran 14115-111, Iran
| | - Shahin Ramazi
- Department of Biophysics, Faculty of Biological Sciences, Tarbiat Modares University, Tehran 14115-111, Iran.
| | - Seyedehsamaneh Shojaeilangari
- Biomedical Engineering Group, Department of Electrical Engineering and Information Technology, Iranian Research Organization for Science and Technology (IROST), Tehran 33535-111, Iran
| | - Elham Yavari
- Department of Information Technology, Tarbiat Modares University, Tehran 14115-111, Iran
| |
Collapse
|
9
|
Goldtzvik Y, Sen N, Lam SD, Orengo C. Protein diversification through post-translational modifications, alternative splicing, and gene duplication. Curr Opin Struct Biol 2023; 81:102640. [PMID: 37354790 DOI: 10.1016/j.sbi.2023.102640] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2023] [Revised: 05/05/2023] [Accepted: 05/24/2023] [Indexed: 06/26/2023]
Abstract
Proteins provide the basis for cellular function. Having multiple versions of the same protein within a single organism provides a way of regulating its activity or developing novel functions. Post-translational modifications of proteins, by means of adding/removing chemical groups to amino acids, allow for a well-regulated and controlled way of generating functionally distinct protein species. Alternative splicing is another method with which organisms possibly generate new isoforms. Additionally, gene duplication events throughout evolution generate multiple paralogs of the same genes, resulting in multiple versions of the same protein within an organism. In this review, we discuss recent advancements in the study of these three methods of protein diversification and provide illustrative examples of how they affect protein structure and function.
Collapse
Affiliation(s)
- Yonathan Goldtzvik
- Department of Structural and Molecular Biology, University College London, London, United Kingdom
| | - Neeladri Sen
- Department of Structural and Molecular Biology, University College London, London, United Kingdom. https://twitter.com/@NeeladriSen
| | - Su Datt Lam
- Department of Structural and Molecular Biology, University College London, London, United Kingdom; Department of Applied Physics, Faculty of Science and Technology, Universiti Kebangsaan Malaysia, Bangi, Malaysia
| | - Christine Orengo
- Department of Structural and Molecular Biology, University College London, London, United Kingdom.
| |
Collapse
|
10
|
Mohammadi-Shemirani P, Sood T, Paré G. From 'Omics to Multi-omics Technologies: the Discovery of Novel Causal Mediators. Curr Atheroscler Rep 2023; 25:55-65. [PMID: 36595202 PMCID: PMC9807989 DOI: 10.1007/s11883-022-01078-8] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/31/2022] [Indexed: 01/04/2023]
Abstract
PURPOSE OF REVIEW 'Omics studies provide a comprehensive characterisation of a biological entity, such as the genome, epigenome, transcriptome, proteome, metabolome, or microbiome. This review covers the unique properties of these types of 'omics and their roles as causal mediators in cardiovascular disease. Moreover, applications and challenges of integrating multiple types of 'omics data to increase predictive power, improve causal inference, and elucidate biological mechanisms are discussed. RECENT FINDINGS Multi-omics approaches are growing in adoption as they provide orthogonal evidence and overcome the limitations of individual types of 'omics data. Studies with multiple types of 'omics data have improved the diagnosis and prediction of disease states and afforded a deeper understanding of underlying pathophysiological mechanisms, beyond any single type of 'omics data. For instance, disease-associated loci in the genome can be supplemented with other 'omics to prioritise causal genes and understand the function of non-coding variants. Alternatively, techniques, such as Mendelian randomisation, can leverage genetics to provide evidence supporting a causal role for disease-associated molecules, and elucidate their role in disease pathogenesis. As technologies improve, costs for 'omics studies will continue to fall and datasets will become increasingly accessible to researchers. The intrinsically unbiased nature of 'omics data is well-suited to exploratory analyses that discover causal mediators of disease, and multi-omics is an emerging discipline that leverages the strengths of each type of 'omics data to provide insights greater than the sum of its parts.
Collapse
Affiliation(s)
- Pedrum Mohammadi-Shemirani
- Population Health Research Institute, David Braley Cardiac, Vascular and Stroke Research Institute, Hamilton, ON Canada
- Thrombosis and Atherosclerosis Research Institute, David Braley Cardiac, Vascular and Stroke Research Institute, Hamilton, ON Canada
| | - Tushar Sood
- Temerty Faculty of Medicine, University of Toronto, Toronto, Canada
| | - Guillaume Paré
- Population Health Research Institute, David Braley Cardiac, Vascular and Stroke Research Institute, Hamilton, ON Canada
- Thrombosis and Atherosclerosis Research Institute, David Braley Cardiac, Vascular and Stroke Research Institute, Hamilton, ON Canada
- Department of Biochemistry and Biomedical Sciences, McMaster University, Hamilton, ON Canada
- Department of Health Research Methods, Evidence, and Impact, McMaster University, Hamilton, ON Canada
- Department of Pathology and Molecular Medicine, Michael G. DeGroote School of Medicine, McMaster University, Hamilton, ON Canada
| |
Collapse
|
11
|
Deichaite I, Hopper A, Krockenberger L, Sears TJ, Sutton L, Ray X, Sharabi A, Navon A, Sanghvi P, Carter H, Moiseenko V. Germline genetic biomarkers to stratify patients for personalized radiation treatment. J Transl Med 2022; 20:360. [PMID: 35962345 PMCID: PMC9373374 DOI: 10.1186/s12967-022-03561-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2022] [Accepted: 07/29/2022] [Indexed: 11/10/2022] Open
Abstract
Background Precision medicine incorporating genetic profiling is becoming a standard of care in medical oncology. However, in the field of radiation oncology there is limited use of genetic profiling and the impact of germline genetic biomarkers on radiosensitivity, radioresistance, or patient outcomes after radiation therapy is poorly understood. In HNSCC, the toxicity associated with treatment can cause delays or early cessation which has been associated with worse outcomes. Identifying potential biomarkers which can help predict toxicity, as well as response to treatment, is of significant interest. Methods Patients with HNSCC who received RT and underwent next generation sequencing of somatic tumor samples, transcriptome RNA-seq with matched normal tissue samples were included. Patients were then grouped by propensity towards increased late vs. early toxicity (Group A) and those without (Group B), assessed by CTCAE v5.0. The groups were then analyzed for association of specific germline variants with toxicity and clinical outcomes. Results In this study we analyzed 37 patients for correlation between germline variants and toxicity. We observed that TSC2, HLA-A, TET2, GEN1, NCOR2 and other germline variants were significantly associated with long term toxicities. 34 HNSCC patients treated with curative intent were evaluated for clinical outcomes. Group A had significantly improved overall survival as well as improved rates of locoregional recurrence and metastatic disease. Specific variants associated with improved clinical outcomes included TSC2, FANCD2, and PPP1R15A, while the HLA-A and GEN1 variants were not correlated with survival or recurrence. A group of five HLA-DMA/HLA-DMB variants was only found in Group B and was associated with a higher risk of locoregional recurrence. Conclusions This study indicates that germline genetic biomarkers may have utility in predicting toxicity and outcomes after radiation therapy and deserve further investigation in precision radiation medicine approaches.
Collapse
Affiliation(s)
- Ida Deichaite
- Department of Radiation Medicine and Applied Sciences, University of California San Diego, La Jolla, CA, USA. .,Moores Cancer Center, University of California San Diego, La Jolla, CA, USA.
| | - Austin Hopper
- Department of Radiation Medicine and Applied Sciences, University of California San Diego, La Jolla, CA, USA
| | - Lena Krockenberger
- Moores Cancer Center, University of California San Diego, La Jolla, CA, USA
| | - Timothy J Sears
- Bioinformatics and Systems Biology Program, University of California San Diego, La Jolla, CA, USA
| | - Leisa Sutton
- Moores Cancer Center, University of California San Diego, La Jolla, CA, USA
| | - Xenia Ray
- Department of Radiation Medicine and Applied Sciences, University of California San Diego, La Jolla, CA, USA
| | - Andrew Sharabi
- Department of Radiation Medicine and Applied Sciences, University of California San Diego, La Jolla, CA, USA.,Moores Cancer Center, University of California San Diego, La Jolla, CA, USA
| | - Ami Navon
- Department of Biological Regulation, Weizmann Institute of Science, Rehovot, Israel
| | - Parag Sanghvi
- Department of Radiation Medicine and Applied Sciences, University of California San Diego, La Jolla, CA, USA
| | - Hannah Carter
- Moores Cancer Center, University of California San Diego, La Jolla, CA, USA.,Division of Medical Genetics, Department of Medicine, University of California San Diego, La Jolla, CA, USA
| | - Vitali Moiseenko
- Department of Radiation Medicine and Applied Sciences, University of California San Diego, La Jolla, CA, USA
| |
Collapse
|
12
|
Pellegrina D, Bahcheli AT, Krassowski M, Reimand J. Human phospho-signaling networks of SARS-CoV-2 infection are rewired by population genetic variants. Mol Syst Biol 2022; 18:e10823. [PMID: 35579274 PMCID: PMC9112486 DOI: 10.15252/msb.202110823] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2021] [Revised: 04/26/2022] [Accepted: 04/27/2022] [Indexed: 11/23/2022] Open
Abstract
SARS-CoV-2 infection hijacks signaling pathways and induces protein-protein interactions between human and viral proteins. Human genetic variation may impact SARS-CoV-2 infection and COVID-19 pathology; however, the genetic variation in these signaling networks remains uncharacterized. Here, we studied human missense single nucleotide variants (SNVs) altering phosphorylation sites modulated by SARS-CoV-2 infection, using machine learning to identify amino acid substitutions altering kinase-bound sequence motifs. We found 2,033 infrequent phosphorylation-associated SNVs (pSNVs) that are enriched in sequence motif alterations, potentially reflecting the evolution of signaling networks regulating host defenses. Proteins with pSNVs are involved in viral life cycle and host responses, including RNA splicing, interferon response (TRIM28), and glucose homeostasis (TBC1D4) with potential associations with COVID-19 comorbidities. pSNVs disrupt CDK and MAPK substrate motifs and replace these with motifs of Tank Binding Kinase 1 (TBK1) involved in innate immune responses, indicating consistent rewiring of signaling networks. Several pSNVs associate with severe COVID-19 and hospitalization (STARD13, ARFGEF2). Our analysis highlights potential genetic factors contributing to inter-individual variation of SARS-CoV-2 infection and COVID-19 and suggests leads for mechanistic and translational studies.
Collapse
Affiliation(s)
- Diogo Pellegrina
- Computational Biology ProgramOntario Institute for Cancer ResearchTorontoONCanada
| | - Alexander T Bahcheli
- Computational Biology ProgramOntario Institute for Cancer ResearchTorontoONCanada
- Department of Molecular GeneticsUniversity of TorontoTorontoONCanada
| | - Michal Krassowski
- Medical Sciences DivisionNuffield Department of Women's and Reproductive HealthUniversity of OxfordOxfordUK
| | - Jüri Reimand
- Computational Biology ProgramOntario Institute for Cancer ResearchTorontoONCanada
- Department of Molecular GeneticsUniversity of TorontoTorontoONCanada
- Department of Medical BiophysicsUniversity of TorontoTorontoONCanada
| |
Collapse
|
13
|
Xiao D, Kim HJ, Pang I, Yang P. Functional analysis of the stable phosphoproteome reveals cancer vulnerabilities. Bioinformatics 2022; 38:1956-1963. [PMID: 35015814 PMCID: PMC9113330 DOI: 10.1093/bioinformatics/btac015] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2021] [Revised: 12/21/2021] [Accepted: 01/06/2022] [Indexed: 11/29/2022] Open
Abstract
MOTIVATION The advance of mass spectrometry-based technologies enabled the profiling of the phosphoproteomes of a multitude of cell and tissue types. However, current research primarily focused on investigating the phosphorylation dynamics in specific cell types and experimental conditions, whereas the phosphorylation events that are common across cell/tissue types and stable regardless of experimental conditions are, so far, mostly ignored. RESULTS Here, we developed a statistical framework to identify the stable phosphoproteome across 53 human phosphoproteomics datasets, covering 40 cell/tissue types and 194 conditions/treatments. We demonstrate that the stably phosphorylated sites (SPSs) identified from our statistical framework are evolutionarily conserved, functionally important and enriched in a range of core signaling and gene pathways. Particularly, we show that SPSs are highly enriched in the RNA splicing pathway, an essential cellular process in mammalian cells, and frequently disrupted by cancer mutations, suggesting a link between the dysregulation of RNA splicing and cancer development through mutations on SPSs. AVAILABILITY AND IMPLEMENTATION The source code for data analysis in this study is available from Github repository https://github.com/PYangLab/SPSs under the open-source license of GPL-3. The data used in this study are publicly available (see Section 2.8). SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Di Xiao
- Computational Systems Biology Group, Children’s Medical Research Institute, Faculty of Medicine and Health, The University of Sydney, Sydney 2145, Australia
| | - Hani Jieun Kim
- Computational Systems Biology Group, Children’s Medical Research Institute, Faculty of Medicine and Health, The University of Sydney, Sydney 2145, Australia
- Charles Perkins Centre, School of Mathematics and Statistics, The University of Sydney, Sydney 2006, Australia
| | - Ignatius Pang
- Bioinformatics Group, Children’s Medical Research Institute, Faculty of Medicine and Health, The University of Sydney, Sydney 2145, Australia
| | - Pengyi Yang
- Computational Systems Biology Group, Children’s Medical Research Institute, Faculty of Medicine and Health, The University of Sydney, Sydney 2145, Australia
- Charles Perkins Centre, School of Mathematics and Statistics, The University of Sydney, Sydney 2006, Australia
| |
Collapse
|
14
|
Li Z, Li S, Luo M, Jhong JH, Li W, Yao L, Pang Y, Wang Z, Wang R, Ma R, Yu J, Huang Y, Zhu X, Cheng Q, Feng H, Zhang J, Wang C, Hsu JBK, Chang WC, Wei FX, Huang HD, Lee TY. dbPTM in 2022: an updated database for exploring regulatory networks and functional associations of protein post-translational modifications. Nucleic Acids Res 2021; 50:D471-D479. [PMID: 34788852 PMCID: PMC8728263 DOI: 10.1093/nar/gkab1017] [Citation(s) in RCA: 96] [Impact Index Per Article: 24.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2021] [Revised: 10/08/2021] [Accepted: 10/13/2021] [Indexed: 01/02/2023] Open
Abstract
Protein post-translational modifications (PTMs) play an important role in different cellular processes. In view of the importance of PTMs in cellular functions and the massive data accumulated by the rapid development of mass spectrometry (MS)-based proteomics, this paper presents an update of dbPTM with over 2 777 000 PTM substrate sites obtained from existing databases and manual curation of literature, of which more than 2 235 000 entries are experimentally verified. This update has manually curated over 42 new modification types that were not included in the previous version. Due to the increasing number of studies on the mechanism of PTMs in the past few years, a great deal of upstream regulatory proteins of PTM substrate sites have been revealed. The updated dbPTM thus collates regulatory information from databases and literature, and merges them into a protein-protein interaction network. To enhance the understanding of the association between PTMs and molecular functions/cellular processes, the functional annotations of PTMs are curated and integrated into the database. In addition, the existing PTM-related resources, including annotation databases and prediction tools are also renewed. Overall, in this update, we would like to provide users with the most abundant data and comprehensive annotations on PTMs of proteins. The updated dbPTM is now freely accessible at https://awi.cuhk.edu.cn/dbPTM/.
Collapse
Affiliation(s)
- Zhongyan Li
- The Genetics Laboratory, Longgang District Maternity & Child Healthcare Hospital of Shenzhen City, Shenzhen 518172, China.,School of Life and Health Sciences, The Chinese University of Hong Kong, Shenzhen 518172, China.,Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Shangfu Li
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Mengqi Luo
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Jhih-Hua Jhong
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Wenshuo Li
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen 518172, China.,School of Science and Engineering, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Lantian Yao
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen 518172, China.,School of Science and Engineering, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Yuxuan Pang
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen 518172, China.,School of Science and Engineering, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Zhuo Wang
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Rulan Wang
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen 518172, China.,School of Science and Engineering, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Renfei Ma
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Jinhan Yu
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Yuqi Huang
- School of Life and Health Sciences, The Chinese University of Hong Kong, Shenzhen 518172, China.,Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Xiaoning Zhu
- School of Life and Health Sciences, The Chinese University of Hong Kong, Shenzhen 518172, China.,Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Qifan Cheng
- School of Life and Health Sciences, The Chinese University of Hong Kong, Shenzhen 518172, China.,Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Hexiang Feng
- School of Life and Health Sciences, The Chinese University of Hong Kong, Shenzhen 518172, China.,Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Jiahong Zhang
- School of Life and Health Sciences, The Chinese University of Hong Kong, Shenzhen 518172, China.,Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Chunxuan Wang
- School of Life and Health Sciences, The Chinese University of Hong Kong, Shenzhen 518172, China.,Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Justin Bo-Kai Hsu
- Department of Medical Research, Taipei Medical University Hospital, Taipei 110, Taiwan
| | - Wen-Chi Chang
- Institute of Tropical Plant Sciences and Microbiology, National Cheng Kung University, Tainan 701, Taiwan
| | - Feng-Xiang Wei
- The Genetics Laboratory, Longgang District Maternity & Child Healthcare Hospital of Shenzhen City, Shenzhen 518172, China.,Department of Cell Biology, Jiamusi University, Jiamusi 154007, China.,Shenzhen Children's Hospital of China Medical University, Shenzhen 518172, China
| | - Hsien-Da Huang
- The Genetics Laboratory, Longgang District Maternity & Child Healthcare Hospital of Shenzhen City, Shenzhen 518172, China.,School of Life and Health Sciences, The Chinese University of Hong Kong, Shenzhen 518172, China.,Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Tzong-Yi Lee
- School of Life and Health Sciences, The Chinese University of Hong Kong, Shenzhen 518172, China.,Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen 518172, China
| |
Collapse
|