1
|
Zheng X, Jin N, Wu Q, Zhang N, Wu H, Wang Y, Luo R, Liu T, Ding W, Geng Q, Cheng L. Less is more: relative rank is more informative than absolute abundance for compositional NGS data. Brief Funct Genomics 2025; 24:elae045. [PMID: 39568388 PMCID: PMC11735744 DOI: 10.1093/bfgp/elae045] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2024] [Revised: 10/24/2024] [Accepted: 11/08/2024] [Indexed: 11/22/2024] Open
Abstract
High-throughput gene expression data have been extensively generated and utilized in biological mechanism investigations, biomarker detection, disease diagnosis and prognosis. These applications encompass not only bulk transcriptome, but also single cell RNA-seq data. However, extracting reliable biological information from transcriptome data remains challenging due to the constrains of Compositional Data Analysis. Current data preprocessing methods, including dataset normalization and batch effect correction, are insufficient to address these issues and improve data quality for downstream analysis. Alternatively, qualification methods focusing on the relative order of gene expression (ROGER) are more informative than the quantification methods that rely on gene expression abundance. The Pairwise Analysis of Gene expression method is an enhancement of ROGER, designed for data integration in either sample space or feature space. In this review, we summarize the methods applied to transcriptome data analysis and discuss their potentials in predicting clinical outcomes.
Collapse
Affiliation(s)
- Xubin Zheng
- Guangdong Provincial Clinical Research Center for Geriatrics, Shenzhen Clinical Research Center for Geriatrics, Shenzhen People’s Hospital, Luohu District, Shenzhen 518020, China
- Health Data Science Center, Shenzhen People's Hospital (First Affiliated Hospital of Southern University of Science and Technology), Luohu District, Shenzhen 518020, China
- School of Computing and Information Technology, Great Bay University, Dongguan 523000, Guangdong, China
| | - Nana Jin
- Guangdong Provincial Clinical Research Center for Geriatrics, Shenzhen Clinical Research Center for Geriatrics, Shenzhen People’s Hospital, Luohu District, Shenzhen 518020, China
- Health Data Science Center, Shenzhen People's Hospital (First Affiliated Hospital of Southern University of Science and Technology), Luohu District, Shenzhen 518020, China
| | - Qiong Wu
- School of Basic Medicine, North Sichuan Medical College, Nanchong 637000, Sichuan, China
| | - Ning Zhang
- Guangdong Provincial Clinical Research Center for Geriatrics, Shenzhen Clinical Research Center for Geriatrics, Shenzhen People’s Hospital, Luohu District, Shenzhen 518020, China
- Health Data Science Center, Shenzhen People's Hospital (First Affiliated Hospital of Southern University of Science and Technology), Luohu District, Shenzhen 518020, China
| | - Haonan Wu
- Guangdong Provincial Clinical Research Center for Geriatrics, Shenzhen Clinical Research Center for Geriatrics, Shenzhen People’s Hospital, Luohu District, Shenzhen 518020, China
- Health Data Science Center, Shenzhen People's Hospital (First Affiliated Hospital of Southern University of Science and Technology), Luohu District, Shenzhen 518020, China
| | - Yuanhao Wang
- Guangdong Provincial Clinical Research Center for Geriatrics, Shenzhen Clinical Research Center for Geriatrics, Shenzhen People’s Hospital, Luohu District, Shenzhen 518020, China
- Health Data Science Center, Shenzhen People's Hospital (First Affiliated Hospital of Southern University of Science and Technology), Luohu District, Shenzhen 518020, China
| | - Rui Luo
- Department of Systems Engineering, City University of Hong Kong, Kowloon, Hong Kong SAR
| | - Tao Liu
- International Digital Economy Academy (IDEA), Futian District, Shenzhen 518020, China
| | - Wanfu Ding
- Guangdong Provincial Clinical Research Center for Geriatrics, Shenzhen Clinical Research Center for Geriatrics, Shenzhen People’s Hospital, Luohu District, Shenzhen 518020, China
| | - Qingshan Geng
- Guangdong Provincial Clinical Research Center for Geriatrics, Shenzhen Clinical Research Center for Geriatrics, Shenzhen People’s Hospital, Luohu District, Shenzhen 518020, China
| | - Lixin Cheng
- Guangdong Provincial Clinical Research Center for Geriatrics, Shenzhen Clinical Research Center for Geriatrics, Shenzhen People’s Hospital, Luohu District, Shenzhen 518020, China
- Health Data Science Center, Shenzhen People's Hospital (First Affiliated Hospital of Southern University of Science and Technology), Luohu District, Shenzhen 518020, China
| |
Collapse
|
2
|
Li N, Hu Z, Zhang N, Liang Y, Feng Y, Ding W, Cheng L, Zheng Y. Pairwise analysis of gene expression for oral squamous cell carcinoma via a large-scale transcriptome integration. J Cell Mol Med 2024; 28:e70153. [PMID: 39470584 PMCID: PMC11520439 DOI: 10.1111/jcmm.70153] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2024] [Revised: 09/09/2024] [Accepted: 10/01/2024] [Indexed: 10/30/2024] Open
Abstract
Among all cancers occurring in the head and neck region, oral squamous cell carcinoma (OSCC) is the most common oral malignant tumours characterized by its aggressiveness and metastasis. The development of transcriptomics technology has greatly facilitated the diagnosis of various cancers. However, identifying genetic biomarkers is limited by data from a single batch of OSCC samples, and integrating analysis across different platforms remains a great challenge. In this study, we integrated five OSCC transcriptome datasets using an innovative strategy capable of mitigating batch effect, and extracting information from different datasets based on changes in the relative expression of gene pairs. By leveraging a machine learning method, we developed a prediction model including 27 differential gene pairs (DGPs) to discriminate OSCC from control samples, achieving an area under the receiver operating characteristic curve (AUC) of 0.8987 for the training set. Moreover, the model demonstrated commendable performance in four external validation sets, with AUCs of 0.9926, 0.9688, 0.8052 and 0.8565, respectively. Subsequently, a prognostic model was constructed based on six key gene pairs through univariate and multivariate Cox regression analysis. The AUCs of the model at 1-year and 3-year overall survival time prediction were 0.717 and 0.779 in an independent dataset. Our result demonstrates the effectiveness of this new method of integrating data and identifying DGPs. Using DGPs can significantly improve the performance of both diagnostic and prognostic models.
Collapse
Affiliation(s)
- Nan Li
- Department of StomatologyShenzhen People's Hospital (Second Clinical Medical School of Jinan University; First Affiliated Hospital of Southern University of Science and Technology)ShenzhenGuangdongChina
| | - Zunkai Hu
- Department of Critical Care MedicineShenzhen People's Hospital (Second Clinical Medical School of Jinan University; First Affiliated Hospital of Southern University of Science and Technology)ShenzhenGuangdongChina
| | - Ning Zhang
- Department of Critical Care MedicineShenzhen People's Hospital (Second Clinical Medical School of Jinan University; First Affiliated Hospital of Southern University of Science and Technology)ShenzhenGuangdongChina
| | - Yining Liang
- School of MedicineSouthern University of Science and TechnologyShenzhenGuangdongChina
| | - Yating Feng
- School of MedicineSouthern University of Science and TechnologyShenzhenGuangdongChina
| | - Wanfu Ding
- Department of Information and TechnologyShenzhen People's HospitalShenzhenGuangdongChina
| | - Lixin Cheng
- Department of Critical Care MedicineShenzhen People's Hospital (Second Clinical Medical School of Jinan University; First Affiliated Hospital of Southern University of Science and Technology)ShenzhenGuangdongChina
| | - Yuyan Zheng
- Department of StomatologyShenzhen People's Hospital (Second Clinical Medical School of Jinan University; First Affiliated Hospital of Southern University of Science and Technology)ShenzhenGuangdongChina
| |
Collapse
|
3
|
Xie J, Zheng X, Yan J, Li Q, Jin N, Wang S, Zhao P, Li S, Ding W, Cheng L, Geng Q. Deep learning model to discriminate diverse infection types based on pairwise analysis of host gene expression. iScience 2024; 27:109908. [PMID: 38827397 PMCID: PMC11141160 DOI: 10.1016/j.isci.2024.109908] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2023] [Revised: 03/01/2024] [Accepted: 05/03/2024] [Indexed: 06/04/2024] Open
Abstract
Accurate detection of pathogens, particularly distinguishing between Gram-positive and Gram-negative bacteria, could improve disease treatment. Host gene expression can capture the immune system's response to infections caused by various pathogens. Here, we present a deep neural network model, bvnGPS2, which incorporates the attention mechanism based on a large-scale integrated host transcriptome dataset to precisely identify Gram-positive and Gram-negative bacterial infections as well as viral infections. We performed analysis of 4,949 blood samples across 40 cohorts from 10 countries using our previously designed omics data integration method, iPAGE, to select discriminant gene pairs and train the bvnGPS2. The performance of the model was evaluated on six independent cohorts comprising 374 samples. Overall, our deep neural network model shows robust capability to accurately identify specific infections, paving the way for precise medicine strategies in infection treatment and potentially also for identifying subtypes of other diseases.
Collapse
Affiliation(s)
- Jize Xie
- Guangdong Provincial Clinical Research Center for Geriatrics, Shenzhen People’s Hospital (First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University), Shenzhen 518020, China
- John Hopcroft Center for Computer Science, Shanghai Jiao Tong University, Shanghai, China
| | - Xubin Zheng
- Guangdong Provincial Clinical Research Center for Geriatrics, Shenzhen People’s Hospital (First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University), Shenzhen 518020, China
- Great Bay University, Dongguan, China
| | - Jianlong Yan
- Guangdong Provincial Clinical Research Center for Geriatrics, Shenzhen People’s Hospital (First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University), Shenzhen 518020, China
| | - Qizhi Li
- John Hopcroft Center for Computer Science, Shanghai Jiao Tong University, Shanghai, China
| | - Nana Jin
- Guangdong Provincial Clinical Research Center for Geriatrics, Shenzhen People’s Hospital (First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University), Shenzhen 518020, China
- Health Data Science Center, Shenzhen People’s Hospital, Shenzhen 518020, China
| | - Shuojia Wang
- Guangdong Provincial Clinical Research Center for Geriatrics, Shenzhen People’s Hospital (First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University), Shenzhen 518020, China
| | - Pengfei Zhao
- Guangdong Provincial Clinical Research Center for Geriatrics, Shenzhen People’s Hospital (First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University), Shenzhen 518020, China
| | - Shuai Li
- John Hopcroft Center for Computer Science, Shanghai Jiao Tong University, Shanghai, China
| | - Wanfu Ding
- Guangdong Provincial Clinical Research Center for Geriatrics, Shenzhen People’s Hospital (First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University), Shenzhen 518020, China
| | - Lixin Cheng
- Guangdong Provincial Clinical Research Center for Geriatrics, Shenzhen People’s Hospital (First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University), Shenzhen 518020, China
- Health Data Science Center, Shenzhen People’s Hospital, Shenzhen 518020, China
| | - Qingshan Geng
- Guangdong Provincial Clinical Research Center for Geriatrics, Shenzhen People’s Hospital (First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University), Shenzhen 518020, China
| |
Collapse
|
4
|
Zhang N, Yang F, Zhao P, Jin N, Wu H, Liu T, Geng Q, Yang X, Cheng L. MrGPS: an m6A-related gene pair signature to predict the prognosis and immunological impact of glioma patients. Brief Bioinform 2023; 25:bbad498. [PMID: 38171932 PMCID: PMC10782913 DOI: 10.1093/bib/bbad498] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2023] [Revised: 11/17/2023] [Accepted: 12/03/2023] [Indexed: 01/05/2024] Open
Abstract
N6-methyladenosine (m6A) RNA methylation is the predominant epigenetic modification for mRNAs that regulates various cancer-related pathways. However, the prognostic significance of m6A modification regulators remains unclear in glioma. By integrating the TCGA lower-grade glioma (LGG) and glioblastoma multiforme (GBM) gene expression data, we demonstrated that both the m6A regulators and m6A-target genes were associated with glioma prognosis and activated various cancer-related pathways. Then, we paired m6A regulators and their target genes as m6A-related gene pairs (MGPs) using the iPAGE algorithm, among which 122 MGPs were significantly reversed in expression between LGG and GBM. Subsequently, we employed LASSO Cox regression analysis to construct an MGP signature (MrGPS) to evaluate glioma prognosis. MrGPS was independently validated in CGGA and GEO glioma cohorts with high accuracy in predicting overall survival. The average area under the receiver operating characteristic curve (AUC) at 1-, 3- and 5-year intervals were 0.752, 0.853 and 0.831, respectively. Combining clinical factors of age and radiotherapy, the AUC of MrGPS was much improved to around 0.90. Furthermore, CIBERSORT and TIDE algorithms revealed that MrGPS is indicative for the immune infiltration level and the response to immune checkpoint inhibitor therapy in glioma patients. In conclusion, our study demonstrated that m6A methylation is a prognostic factor for glioma and the developed prognostic model MrGPS holds potential as a valuable tool for enhancing patient management and facilitating accurate prognosis assessment in cases of glioma.
Collapse
Affiliation(s)
- Ning Zhang
- Guangdong Provincial Clinical Research Center for Geriatrics, Shenzhen Clinical Research Center for Geriatrics, Shenzhen People's Hospital, Shenzhen, China
- The First Affiliated Hospital of Southern University of Science and Technology, The Second Clinical Medical College of Jinan University
- Neuroscience Center, Shantou University Medical College, Shantou, China
| | - Fengxia Yang
- Guangdong Provincial Clinical Research Center for Geriatrics, Shenzhen Clinical Research Center for Geriatrics, Shenzhen People's Hospital, Shenzhen, China
- The First Affiliated Hospital of Southern University of Science and Technology, The Second Clinical Medical College of Jinan University
- Neuroscience Center, Shantou University Medical College, Shantou, China
| | - Pengfei Zhao
- Guangdong Provincial Clinical Research Center for Geriatrics, Shenzhen Clinical Research Center for Geriatrics, Shenzhen People's Hospital, Shenzhen, China
- The First Affiliated Hospital of Southern University of Science and Technology, The Second Clinical Medical College of Jinan University
| | - Nana Jin
- Guangdong Provincial Clinical Research Center for Geriatrics, Shenzhen Clinical Research Center for Geriatrics, Shenzhen People's Hospital, Shenzhen, China
- The First Affiliated Hospital of Southern University of Science and Technology, The Second Clinical Medical College of Jinan University
| | - Haonan Wu
- Guangdong Provincial Clinical Research Center for Geriatrics, Shenzhen Clinical Research Center for Geriatrics, Shenzhen People's Hospital, Shenzhen, China
- The First Affiliated Hospital of Southern University of Science and Technology, The Second Clinical Medical College of Jinan University
| | - Tao Liu
- International Digital Economy Academy, Shenzhen, China
| | - Qingshan Geng
- Guangdong Provincial Clinical Research Center for Geriatrics, Shenzhen Clinical Research Center for Geriatrics, Shenzhen People's Hospital, Shenzhen, China
- The First Affiliated Hospital of Southern University of Science and Technology, The Second Clinical Medical College of Jinan University
| | - Xiaojun Yang
- Neuroscience Center, Shantou University Medical College, Shantou, China
| | - Lixin Cheng
- Guangdong Provincial Clinical Research Center for Geriatrics, Shenzhen Clinical Research Center for Geriatrics, Shenzhen People's Hospital, Shenzhen, China
- The First Affiliated Hospital of Southern University of Science and Technology, The Second Clinical Medical College of Jinan University
| |
Collapse
|
5
|
Liu X, Hong C, Jiang Y, Li W, Chen Y, Ma Y, Zhao P, Li T, Chen H, Liu X, Cheng L. Co-expression module analysis reveals high expression homogeneity for both coding and non-coding genes in sepsis. BMC Genomics 2023; 24:418. [PMID: 37488493 PMCID: PMC10364430 DOI: 10.1186/s12864-023-09460-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2023] [Accepted: 06/16/2023] [Indexed: 07/26/2023] Open
Abstract
Sepsis is a life-threatening condition characterized by a harmful host response to infection with organ dysfunction. Annually about 20 million people are dead owing to sepsis and its mortality rates is as high as 20%. However, no studies have been carried out to investigate sepsis from the system biology point of view, as previous research predominantly focused on individual genes without considering their interactions and associations. Here, we conducted a comprehensive exploration of genome-wide expression alterations in both mRNAs and long non-coding RNAs (lncRNAs) in sepsis, using six microarray datasets. Co-expression networks were conducted to identify mRNA and lncRNA modules, respectively. Comparing these sepsis modules with normal modules, we observed a homogeneous expression pattern within the mRNA/lncRNA members, with the majority of them displaying consistent expression direction. Moreover, we identified consistent modules across diverse datasets, consisting of 20 common mRNA members and two lncRNAs, namely CHRM3-AS2 and PRKCQ-AS1, which are potential regulators of sepsis. Our results reveal that the up-regulated common mRNAs are mainly involved in the processes of neutrophil mediated immunity, while the down-regulated mRNAs and lncRNAs are significantly overrepresented in T-cell mediated immunity functions. This study sheds light on the co-expression patterns of mRNAs and lncRNAs in sepsis, providing a novel perspective and insight into the sepsis transcriptome, which may facilitate the exploration of candidate therapeutic targets and molecular biomarkers for sepsis.
Collapse
Affiliation(s)
- Xiaojun Liu
- Department of Critical Care, Shenzhen People's Hospital, First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University, Shenzhen, 518020, China
| | - Chengying Hong
- Department of Critical Care, Shenzhen People's Hospital, First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University, Shenzhen, 518020, China
| | - Yichun Jiang
- Department of Critical Care, Shenzhen People's Hospital, First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University, Shenzhen, 518020, China
| | - Wei Li
- Department of Critical Care, Shenzhen People's Hospital, First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University, Shenzhen, 518020, China
| | - Youlian Chen
- Department of Critical Care, Shenzhen People's Hospital, First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University, Shenzhen, 518020, China
| | - Yonghui Ma
- Department of Critical Care, Shenzhen People's Hospital, First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University, Shenzhen, 518020, China
| | - Pengfei Zhao
- Department of Critical Care, Shenzhen People's Hospital, First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University, Shenzhen, 518020, China
| | - Tiyuan Li
- Department of Critical Care, Shenzhen People's Hospital, First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University, Shenzhen, 518020, China
| | - Huaisheng Chen
- Department of Critical Care, Shenzhen People's Hospital, First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University, Shenzhen, 518020, China.
| | - Xueyan Liu
- Department of Critical Care, Shenzhen People's Hospital, First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University, Shenzhen, 518020, China.
| | - Lixin Cheng
- Department of Critical Care, Shenzhen People's Hospital, First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University, Shenzhen, 518020, China.
| |
Collapse
|
6
|
Cheng L, Wu H, Zheng X, Zhang N, Zhao P, Wang R, Wu Q, Liu T, Yang X, Geng Q. GPGPS: a robust prognostic gene pair signature of glioma ensembling IDH mutation and 1p/19q co-deletion. Bioinformatics 2023; 39:6986965. [PMID: 36637205 PMCID: PMC9843586 DOI: 10.1093/bioinformatics/btac850] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2022] [Revised: 12/14/2022] [Indexed: 01/14/2023] Open
Abstract
MOTIVATION Many studies have shown that IDH mutation and 1p/19q co-deletion can serve as prognostic signatures of glioma. Although these genetic variations affect the expression of one or more genes, the prognostic value of gene expression related to IDH and 1p/19q status is still unclear. RESULTS We constructed an ensemble gene pair signature for the risk evaluation and survival prediction of glioma based on the prior knowledge of the IDH and 1p/19q status. First, we separately built two gene pair signatures IDH-GPS and 1p/19q-GPS and elucidated that they were useful transcriptome markers projecting from corresponding genome variations. Then, the gene pairs in these two models were assembled to develop an integrated model named Glioma Prognostic Gene Pair Signature (GPGPS), which demonstrated high area under the curves (AUCs) to predict 1-, 3- and 5-year overall survival (0.92, 0.88 and 0.80) of glioma. GPGPS was superior to the single GPSs and other existing prognostic signatures (avg AUC = 0.70, concordance index = 0.74). In conclusion, the ensemble prognostic signature with 10 gene pairs could serve as an independent predictor for risk stratification and survival prediction in glioma. This study shed light on transferring knowledge from genetic alterations to expression changes to facilitate prognostic studies. AVAILABILITY AND IMPLEMENTATION Codes are available at https://github.com/Kimxbzheng/GPGPS.git. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Lixin Cheng
- To whom correspondence should be addressed. or
| | | | - Xubin Zheng
- Shenzhen People’s Hospital, First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University, Shenzhen 518020, China
- Department of Computer Science and Engineering, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong
| | - Ning Zhang
- Guangdong Provincial Key Laboratory of Infectious Disease and Molecular Immunopathology, Shantou University Medical College, Shantou 515041, China
| | - Pengfei Zhao
- Shenzhen People’s Hospital, First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University, Shenzhen 518020, China
- Department of Geriatrics, Shenzhen Clinical Research Center for Aging, Shenzhen 518020, China
| | - Ran Wang
- Shenzhen People’s Hospital, First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University, Shenzhen 518020, China
- Department of Computer Science and Engineering, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong
| | - Qiong Wu
- Hong Kong Genome Institute, Shatin, New Territories, Hong Kong
| | - Tao Liu
- International Digital Economy Academy, Shenzhen 518020, China
| | - Xiaojun Yang
- Guangdong Provincial Key Laboratory of Infectious Disease and Molecular Immunopathology, Shantou University Medical College, Shantou 515041, China
| | | |
Collapse
|
7
|
Chunikhina E, Logan P, Kovchegov Y, Yambartsev A, Mondal D, Morgun A. The C-SHIFT Algorithm for Normalizing Covariances. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2023; 20:720-730. [PMID: 35167480 DOI: 10.1109/tcbb.2022.3151840] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
Omics technologies are powerful tools for analyzing patterns in gene expression data for thousands of genes. Due to a number of systematic variations in experiments, the raw gene expression data is often obfuscated by undesirable technical noises. Various normalization techniques were designed in an attempt to remove these non-biological errors prior to any statistical analysis. One of the reasons for normalizing data is the need for recovering the covariance matrix used in gene network analysis. In this paper, we introduce a novel normalization technique, called the covariance shift (C-SHIFT) method. This normalization algorithm uses optimization techniques together with the blessing of dimensionality philosophy and energy minimization hypothesis for covariance matrix recovery under additive noise (in biology, known as the bias). Thus, it is perfectly suited for the analysis of logarithmic gene expression data. Numerical experiments on synthetic data demonstrate the method's advantage over the classical normalization techniques. Namely, the comparison is made with Rank, Quantile, cyclic LOESS (locally estimated scatterplot smoothing), and MAD (median absolute deviation) normalization methods. We also evaluate the performance of C-SHIFT algorithm on real biological data.
Collapse
|
8
|
Yang Y, Zhang Y, Li S, Zheng X, Wong MH, Leung KS, Cheng L. A Robust and Generalizable Immune-Related Signature for Sepsis Diagnostics. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:3246-3254. [PMID: 34437068 DOI: 10.1109/tcbb.2021.3107874] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
High-throughput sequencing can detect tens of thousands of genes in parallel, providing opportunities for improving the diagnostic accuracy of multiple diseases including sepsis, which is an aggressive inflammatory response to infection that can cause organ failure and death. Early screening of sepsis is essential in clinic, but no effective diagnostic biomarkers are available yet. Here, we present a novel method, Recurrent Logistic Regression, to identify diagnostic biomarkers for sepsis from the blood transcriptome data. A panel including five immune-related genes, LRRN3, IL2RB, FCER1A, TLR5, and S100A12, are determined as diagnostic biomarkers (LIFTS) for sepsis. LIFTS discriminates patients with sepsis from normal controls in high accuracy (AUROC = 0.9959 on average; IC = [0.9722-1.0]) on nine validation cohorts across three independent platforms, which outperforms existing markers. Our analysis determined an accurate prediction model and reproducible transcriptome biomarkers that can lay a foundation for clinical diagnostic tests and biological mechanistic studies.
Collapse
|
9
|
Xu C, Li W, Li T, Yuan J, Pang X, Liu T, Liang B, Cheng L, Sun X, Dong S. Iron metabolism-related genes reveal predictive value of acute coronary syndrome. Front Pharmacol 2022; 13:1040845. [PMID: 36330096 PMCID: PMC9622999 DOI: 10.3389/fphar.2022.1040845] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2022] [Accepted: 10/04/2022] [Indexed: 11/25/2022] Open
Abstract
Iron deficiency has detrimental effects in patients with acute coronary syndrome (ACS), which is a common nutritional disorder and inflammation-related disease affects up to one-third people worldwide. However, the specific role of iron metabolism in ACS progression is opaque. In this study, we construct an iron metabolism-related genes (IMRGs) based molecular signature of ACS and to identify novel iron metabolism gene markers for early stage of ACS. The IMRGs were mainly collected from Molecular Signatures Database (mSigDB) and two relevant studies. Two blood transcriptome datasets GSE61144 and GSE60993 were used for constructing the prediction model of ACS. After differential analysis, 22 IMRGs were differentially expressed and defined as DEIGs in the training set. Then, the 22 DEIGs were trained by the Elastic Net to build the prediction model. Five genes, PADI4, HLA-DQA1, LCN2, CD7, and VNN1, were determined using multiple Elastic Net calculations and retained to obtain the optimal performance. Finally, the generated model iron metabolism-related gene signature (imSig) was assessed by the validation set GSE60993 using a series of evaluation measurements. Compared with other machine learning methods, the performance of imSig using Elastic Net was superior in the validation set. Elastic Net consistently scores the higher than Lasso and Logistic regression in the validation set in terms of ROC, PRC, Sensitivity, and Specificity. The prediction model based on iron metabolism-related genes may assist in ACS early diagnosis.
Collapse
Affiliation(s)
- Cong Xu
- Shenzhen People’s Hospital, First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University, Shenzhen, China
| | - Wanyang Li
- School of Mathematics, South China University of Technology, Guangzhou, China
| | - Tangzhiming Li
- Shenzhen People’s Hospital, First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University, Shenzhen, China
| | - Jie Yuan
- Shenzhen People’s Hospital, First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University, Shenzhen, China
| | - Xinli Pang
- Shenzhen People’s Hospital, First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University, Shenzhen, China
| | - Tao Liu
- International Digital Economy Academy, Shenzhen, China
| | - Benhui Liang
- Department of Cardiology, Xiangya Hospital, Central South University, Changsha, China
| | - Lixin Cheng
- Shenzhen People’s Hospital, First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University, Shenzhen, China
- *Correspondence: Lixin Cheng, ; Xin Sun, ; Shaohong Dong,
| | - Xin Sun
- Shenzhen People’s Hospital, First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University, Shenzhen, China
- *Correspondence: Lixin Cheng, ; Xin Sun, ; Shaohong Dong,
| | - Shaohong Dong
- Shenzhen People’s Hospital, First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University, Shenzhen, China
- *Correspondence: Lixin Cheng, ; Xin Sun, ; Shaohong Dong,
| |
Collapse
|
10
|
Wu Q, Zheng X, Leung KS, Wong MH, Tsui SKW, Cheng L. meGPS: a multi-omics signature for hepatocellular carcinoma detection integrating methylome and transcriptome data. Bioinformatics 2022; 38:3513-3522. [PMID: 35674358 DOI: 10.1093/bioinformatics/btac379] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2021] [Revised: 05/08/2022] [Accepted: 06/01/2022] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION Hepatocellular carcinoma (HCC) is a primary malignancy with poor prognosis. Recently, multi-omics molecular-level measurement enables HCC diagnosis and prognosis prediction, which is crucial for early intervention of personalized therapy to diminish mortality. Here, we introduce a novel strategy utilizing DNA methylation and RNA expression data to achieve a multi-omics gene pair signature (GPS) for HCC discrimination. RESULTS The immune genes with negative correlations between expression and promoter methylation are enriched in the highly connected cancer-related pathway network, which are considered as the candidates for HCC detection. After that, we separately construct a methylation GPS (mGPS) and an expression GPS (eGPS), and then assemble them as a meGPS with five gene pairs, in which the significant methylation and expression changes occur between HCC tumor and non-tumor groups. Reliable performance has been validated by independent tissue (age, gender, and etiology) and blood datasets. This study proposes a procedure for multi-omics GPS identification and develops a novel HCC signature using both methylome and transcriptome data, suggesting potential molecular targets for the detection and therapy of HCC. AVAILABILITY AND IMPLEMENTATION Models are available at https://github.com/bioinformaticStudy/meGPS.git. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Qiong Wu
- Shenzhen People's Hospital, First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University, Shenzhen, 518020, China.,School of Biomedical Sciences, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong SAR, China.,Department of Paediatrics, Prince of Wales Hospital, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong SAR, China
| | - Xubin Zheng
- Shenzhen People's Hospital, First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University, Shenzhen, 518020, China.,Department of Computer Science and Engineering, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong SAR, China
| | - Kwong-Sak Leung
- Department of Computer Science and Engineering, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong SAR, China
| | - Man-Hon Wong
- Department of Computer Science and Engineering, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong SAR, China
| | - Stephen Kwok-Wing Tsui
- School of Biomedical Sciences, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong SAR, China
| | - Lixin Cheng
- Shenzhen People's Hospital, First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University, Shenzhen, 518020, China
| |
Collapse
|
11
|
Wang R, Zheng X, Wang J, Wan S, Song F, Wong MH, Leung KS, Cheng L. Improving bulk RNA-seq classification by transferring gene signature from single cells in acute myeloid leukemia. Brief Bioinform 2022; 23:6523149. [PMID: 35136933 DOI: 10.1093/bib/bbac002] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2021] [Revised: 12/22/2021] [Accepted: 01/04/2022] [Indexed: 12/13/2022] Open
Abstract
The advances in single-cell RNA sequencing (scRNA-seq) technologies enable the characterization of transcriptomic profiles at the cellular level and demonstrate great promise in bulk sample analysis thereby offering opportunities to transfer gene signature from scRNA-seq to bulk data. However, the gene expression signatures identified from single cells are typically inapplicable to bulk RNA-seq data due to the profiling differences of distinct sequencing technologies. Here, we propose single-cell pair-wise gene expression (scPAGE), a novel method to develop single-cell gene pair signatures (scGPSs) that were beneficial to bulk RNA-seq classification to transfer knowledge across platforms. PAGE was adopted to tackle the challenge of profiling differences. We applied the method to acute myeloid leukemia (AML) and identified the scGPS from mouse scRNA-seq that allowed discriminating between AML and control cells. The scGPS was validated in bulk RNA-seq datasets and demonstrated better performance (average area under the curve [AUC] = 0.96) than the conventional gene expression strategies (average AUC$\le$ 0.88) suggesting its potential in disclosing the molecular mechanism of AML. The scGPS also outperformed its bulk counterpart, which highlighted the benefit of gene signature transfer. Furthermore, we confirmed the utility of scPAGE in sepsis as an example of other disease scenarios. scPAGE leveraged the advantages of single-cell profiles to enhance the analysis of bulk samples revealing great potential of transferring knowledge from single-cell to bulk transcriptome studies.
Collapse
Affiliation(s)
- Ran Wang
- Shenzhen People's Hospital, First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University, Shenzhen 518020, China.,Department of Computer Science and Engineering, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong
| | - Xubin Zheng
- Shenzhen People's Hospital, First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University, Shenzhen 518020, China.,Department of Computer Science and Engineering, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong
| | - Jun Wang
- Shenzhen People's Hospital, First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University, Shenzhen 518020, China
| | - Shibiao Wan
- Center for Applied Bioinformatics, St. Jude Children's Research Hospital, Memphis, Tennessee 38105, USA
| | - Fangda Song
- School of Data Science, The Chinese University of Hong Kong, Shenzhen 518000, China
| | - Man Hon Wong
- Department of Computer Science and Engineering, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong
| | - Kwong Sak Leung
- Department of Computer Science and Engineering, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong
| | - Lixin Cheng
- Shenzhen People's Hospital, First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University, Shenzhen 518020, China
| |
Collapse
|
12
|
Song Y, Zhu S, Zhang N, Cheng L. Blood Circulating miRNA Pairs as a Robust Signature for Early Detection of Esophageal Cancer. Front Oncol 2021; 11:723779. [PMID: 34368003 PMCID: PMC8343071 DOI: 10.3389/fonc.2021.723779] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2021] [Accepted: 07/08/2021] [Indexed: 01/07/2023] Open
Abstract
Esophageal cancer (EC) is a common malignant tumor in the digestive system which is often diagnosed at the middle and late stages. Noninvasive diagnosis using circulating miRNA as biomarkers enables accurate detection of early-stage EC to reduce mortality. We built a diagnostic signature consisting of four miRNA pairs for the early detection of EC using individualized Pairwise Analysis of Gene Expression (iPAGE). Profiling of miRNA expression identified 496 miRNA pairs with significant relative expression change. Four miRNA pairs consistently selected from LASSO were used to construct the final diagnostic model. The performance of the signature was validated using two independent datasets, yielding both AUCs and PRCs over 0.99. Furthermore, precision, recall, and F-score were also evaluated for clinical application, when a fixed threshold is given, resulting in all the scores are larger than 0.92 in the training set, test set, and two validation sets. Our results suggested that the 4-miRNA signature is a new biomarker for the early diagnosis of patients with EC. The clinical use of this signature would have improved the detection of EC for earlier therapy and more favorite prognosis.
Collapse
Affiliation(s)
- Yang Song
- Shenzhen People's Hospital, First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University, Shenzhen, China
| | - Suzhu Zhu
- Shenzhen People's Hospital, First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University, Shenzhen, China
| | - Ning Zhang
- Shenzhen People's Hospital, First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University, Shenzhen, China
| | - Lixin Cheng
- Shenzhen People's Hospital, First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University, Shenzhen, China
| |
Collapse
|
13
|
Zheng X, Wu Q, Wu H, Leung KS, Wong MH, Liu X, Cheng L. Evaluating the Consistency of Gene Methylation in Liver Cancer Using Bisulfite Sequencing Data. Front Cell Dev Biol 2021; 9:671302. [PMID: 33996828 PMCID: PMC8116545 DOI: 10.3389/fcell.2021.671302] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2021] [Accepted: 04/06/2021] [Indexed: 01/07/2023] Open
Abstract
Bisulfite sequencing is considered as the gold standard approach for measuring DNA methylation, which acts as a pivotal part in regulating a variety of biological processes without changes in DNA sequences. In this study, we introduced the most prevalent methods for processing bisulfite sequencing data and evaluated the consistency of the data acquired from different measurements in liver cancer. Firstly, we introduced three commonly used bisulfite sequencing assays, i.e., reduced-representation bisulfite sequencing (RRBS), whole-genome bisulfite sequencing (WGBS), and targeted bisulfite sequencing (targeted BS). Next, we discussed the principles and compared different methods for alignment, quality assessment, methylation level scoring, and differentially methylated region identification. After that, we screened differential methylated genes in liver cancer through the three bisulfite sequencing assays and evaluated the consistency of their results. Ultimately, we compared bisulfite sequencing to 450 k beadchip and assessed the statistical similarity and functional association of differentially methylated genes (DMGs) among the four assays. Our results demonstrated that the DMGs measured by WGBS, RRBS, targeted BS and 450 k beadchip are consistently hypo-methylated in liver cancer with high functional similarity.
Collapse
Affiliation(s)
- Xubin Zheng
- Department of Critical Medicine, Shenzhen People's Hospital, First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University, Shenzhen, China.,Department of Computer Science and Engineering, The Chinese University of Hong Kong, Hong Kong, China
| | - Qiong Wu
- Department of Critical Medicine, Shenzhen People's Hospital, First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University, Shenzhen, China.,School of Biomedical Sciences, The Chinese University of Hong Kong, Hong Kong, China
| | - Haonan Wu
- Department of Critical Medicine, Shenzhen People's Hospital, First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University, Shenzhen, China
| | - Kwong-Sak Leung
- Department of Computer Science and Engineering, The Chinese University of Hong Kong, Hong Kong, China
| | - Man-Hon Wong
- Department of Computer Science and Engineering, The Chinese University of Hong Kong, Hong Kong, China
| | - Xueyan Liu
- Department of Critical Medicine, Shenzhen People's Hospital, First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University, Shenzhen, China
| | - Lixin Cheng
- Department of Critical Medicine, Shenzhen People's Hospital, First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University, Shenzhen, China
| |
Collapse
|
14
|
Zheng X, Leung KS, Wong MH, Cheng L. Long non-coding RNA pairs to assist in diagnosing sepsis. BMC Genomics 2021; 22:275. [PMID: 33863291 PMCID: PMC8050902 DOI: 10.1186/s12864-021-07576-4] [Citation(s) in RCA: 31] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2020] [Accepted: 03/25/2021] [Indexed: 02/07/2023] Open
Abstract
Background Sepsis is the major cause of death in Intensive Care Unit (ICU) globally. Molecular detection enables rapid diagnosis that allows early intervention to minimize the death rate. Recent studies showed that long non-coding RNAs (lncRNAs) regulate proinflammatory genes and are related to the dysfunction of organs in sepsis. Identifying lncRNA signature with absolute abundance is challenging because of the technical variation and the systematic experimental bias. Results Cohorts (n = 768) containing whole blood lncRNA profiling of sepsis patients in the Gene Expression Omnibus (GEO) database were included. We proposed a novel diagnostic strategy that made use of the relative expressions of lncRNA pairs, which are reversed between sepsis patients and normal controls (eg. lncRNAi > lncRNAj in sepsis patients and lncRNAi < lncRNAj in normal controls), to identify 14 lncRNA pairs as a sepsis diagnostic signature. The signature was then applied to independent cohorts (n = 644) to evaluate its predictive performance across different ages and normalization methods. Comparing to common machine learning models and existing signatures, SepSigLnc consistently attains better performance on the validation cohorts from the same age group (AUC = 0.990 & 0.995 in two cohorts) and across different groups (AUC = 0.878 on average), as well as cohorts processed by an alternative normalization method (AUC = 0.953 on average). Functional analysis demonstrates that the lncRNA pairs in SepsigLnc are functionally similar and tend to implicate in the same biological processes including cell fate commitment and cellular response to steroid hormone stimulus. Conclusion Our study identified 14 lncRNA pairs as signature that can facilitate the diagnosis of septic patients at an intervenable point when clinical manifestations are not dramatic. Also, the computational procedure can be generalized to a standard procedure for discovering diagnostic molecule signatures. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-021-07576-4.
Collapse
Affiliation(s)
- Xubin Zheng
- Shenzhen People's Hospital, First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University, Shenzhen, 518020, China.,Department of Computer Science and Engineering, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong
| | - Kwong-Sak Leung
- Department of Computer Science and Engineering, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong
| | - Man-Hon Wong
- Department of Computer Science and Engineering, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong
| | - Lixin Cheng
- Shenzhen People's Hospital, First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University, Shenzhen, 518020, China.
| |
Collapse
|
15
|
Wang L, Chu CY, McCall MN, Slaunwhite C, Holden-Wiltse J, Corbett A, Falsey AR, Topham DJ, Caserta MT, Mariani TJ, Walsh EE, Qiu X. Airway gene-expression classifiers for respiratory syncytial virus (RSV) disease severity in infants. BMC Med Genomics 2021; 14:57. [PMID: 33632195 PMCID: PMC7908785 DOI: 10.1186/s12920-021-00913-2] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2020] [Accepted: 02/19/2021] [Indexed: 02/08/2023] Open
Abstract
BACKGROUND A substantial number of infants infected with RSV develop severe symptoms requiring hospitalization. We currently lack accurate biomarkers that are associated with severe illness. METHOD We defined airway gene expression profiles based on RNA sequencing from nasal brush samples from 106 full-tem previously healthy RSV infected subjects during acute infection (day 1-10 of illness) and convalescence stage (day 28 of illness). All subjects were assigned a clinical illness severity score (GRSS). Using AIC-based model selection, we built a sparse linear correlate of GRSS based on 41 genes (NGSS1). We also built an alternate model based upon 13 genes associated with severe infection acutely but displaying stable expression over time (NGSS2). RESULTS NGSS1 is strongly correlated with the disease severity, demonstrating a naïve correlation (ρ) of ρ = 0.935 and cross-validated correlation of 0.813. As a binary classifier (mild versus severe), NGSS1 correctly classifies disease severity in 89.6% of the subjects following cross-validation. NGSS2 has slightly less, but comparable, accuracy with a cross-validated correlation of 0.741 and classification accuracy of 84.0%. CONCLUSION Airway gene expression patterns, obtained following a minimally-invasive procedure, have potential utility for development of clinically useful biomarkers that correlate with disease severity in primary RSV infection.
Collapse
Affiliation(s)
- Lu Wang
- Department of Biostatistics and Computational Biology, University of Rochester School Medicine, Rochester, NY, USA
| | - Chin-Yi Chu
- Department of Pediatrics, University of Rochester School Medicine, Rochester, NY, USA
| | - Matthew N McCall
- Department of Biostatistics and Computational Biology, University of Rochester School Medicine, Rochester, NY, USA
| | | | - Jeanne Holden-Wiltse
- Department of Biostatistics and Computational Biology, University of Rochester School Medicine, Rochester, NY, USA
| | - Anthony Corbett
- Department of Biostatistics and Computational Biology, University of Rochester School Medicine, Rochester, NY, USA
| | - Ann R Falsey
- Department of Medicine, University of Rochester School Medicine, Rochester, NY, USA
- Department of Medicine, Rochester General Hospital, Rochester, NY, USA
| | - David J Topham
- Department of Microbiology and Immunology, University of Rochester School Medicine, Rochester, NY, USA
| | - Mary T Caserta
- Department of Pediatrics, University of Rochester School Medicine, Rochester, NY, USA
| | - Thomas J Mariani
- Department of Pediatrics, University of Rochester School Medicine, Rochester, NY, USA.
| | - Edward E Walsh
- Department of Medicine, University of Rochester School Medicine, Rochester, NY, USA.
- Department of Medicine, Rochester General Hospital, Rochester, NY, USA.
| | - Xing Qiu
- Department of Biostatistics and Computational Biology, University of Rochester School Medicine, Rochester, NY, USA.
| |
Collapse
|
16
|
Cheng L, Zeng Y, Hu S, Zhang N, Cheung KCP, Li B, Leung KS, Jiang L. Systematic prediction of autophagy-related proteins using Arabidopsis thaliana interactome data. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2021; 105:708-720. [PMID: 33128829 DOI: 10.1111/tpj.15065] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/01/2020] [Revised: 10/09/2020] [Accepted: 10/21/2020] [Indexed: 06/11/2023]
Abstract
Autophagy is a self-degradative process that is crucial for maintaining cellular homeostasis by removing damaged cytoplasmic components and recycling nutrients. Such an evolutionary conserved proteolysis process is regulated by the autophagy-related (Atg) proteins. The incomplete understanding of plant autophagy proteome and the importance of a proteome-wide understanding of the autophagy pathway prompted us to predict Atg proteins and regulators in Arabidopsis. Here, we developed a systems-level algorithm to identify autophagy-related modules (ARMs) based on protein subcellular localization, protein-protein interactions, and known Atg proteins. This generates a detailed landscape of the autophagic modules in Arabidopsis. We found that the newly identified genes in each ARM tend to be upregulated and coexpressed during the senescence stage of Arabidopsis. We also demonstrated that the Golgi apparatus ARM, ARM13, functions in the autophagy process by module clustering and functional analysis. To verify the in silico analysis, the Atg candidates in ARM13 that are functionally similar to the core Atg proteins were selected for experimental validation. Interestingly, two of the previously uncharacterized proteins identified from the ARM analysis, AGD1 and Sec14, exhibited bona fide association with the autophagy protein complex in plant cells, which provides evidence for a cross-talk between intracellular pathways and autophagy. Thus, the computational framework has facilitated the identification and characterization of plant-specific autophagy-related proteins and novel autophagy proteins/regulators in higher eukaryotes.
Collapse
Affiliation(s)
- Lixin Cheng
- School of Life Sciences, Centre for Cell & Developmental Biology and State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong, China
- Shenzhen People's Hospital, First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University, Shenzhen, China
| | - Yonglun Zeng
- School of Life Sciences, Centre for Cell & Developmental Biology and State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong, China
| | - Shuai Hu
- School of Life Sciences, Centre for Cell & Developmental Biology and State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong, China
| | - Ning Zhang
- Shenzhen People's Hospital, First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University, Shenzhen, China
| | - Kenneth C P Cheung
- School of Life Sciences, Centre for Cell & Developmental Biology and State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong, China
| | - Baiying Li
- School of Life Sciences, Centre for Cell & Developmental Biology and State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong, China
| | - Kwong-Sak Leung
- Department of Computer Science and Engineering, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong, China
| | - Liwen Jiang
- School of Life Sciences, Centre for Cell & Developmental Biology and State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong, China
- CUHK Shenzhen Research Institute, Shenzhen, China
| |
Collapse
|
17
|
Nan CC, Zhang N, Cheung KCP, Zhang HD, Li W, Hong CY, Chen HS, Liu XY, Li N, Cheng L. Knockdown of lncRNA MALAT1 Alleviates LPS-Induced Acute Lung Injury via Inhibiting Apoptosis Through the miR-194-5p/FOXP2 Axis. Front Cell Dev Biol 2020; 8:586869. [PMID: 33117815 PMCID: PMC7575725 DOI: 10.3389/fcell.2020.586869] [Citation(s) in RCA: 41] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2020] [Accepted: 09/02/2020] [Indexed: 01/07/2023] Open
Abstract
Purpose We aimed to identify and verify the key genes and lncRNAs associated with acute lung injury (ALI) and explore the pathogenesis of ALI. Research showed that lower expression of the lncRNA metastasis-associated lung carcinoma transcript 1 (MALAT1) alleviates lung injury induced by lipopolysaccharide (LPS). Nevertheless, the mechanisms of MALAT1 on cellular apoptosis remain unclear in LPS-stimulated ALI. We investigated the mechanism of MALAT1 in modulating the apoptosis of LPS-induced human pulmonary alveolar epithelial cells (HPAEpiC). Methods Differentially expressed lncRNAs between the ALI samples and normal controls were identified using gene expression profiles. ALI-related genes were determined by the overlap of differentially expressed genes (DEGs), genes correlated with lung, genes correlated with key lncRNAs, and genes sharing significantly high proportions of microRNA targets with MALAT1. Quantitative real-time PCR (qPCR) was applied to detect the expression of MALAT1, microRNA (miR)-194-5p, and forkhead box P2 (FOXP2) mRNA in 1 μg/ml LPS-treated HPAEpiC. MALAT1 knockdown vectors, miR-194-5p inhibitors, and ov-FOXP2 were constructed and used to transfect HPAEpiC. The influence of MALAT1 knockdown on LPS-induced HPAEpiC proliferation and apoptosis via the miR-194-5p/FOXP2 axis was determined using Cell counting kit-8 (CCK-8) assay, flow cytometry, and Western blotting analysis, respectively. The interactions between MALAT1, miR-194-5p, and FOXP2 were verified using dual-luciferase reporter gene assay. Results We identified a key lncRNA (MALAT1) and three key genes (EYA1, WNT5A, and FOXP2) that are closely correlated with the pathogenesis of ALI. LPS stimulation promoted MALAT1 expression and apoptosis and also inhibited HPAEpiC viability. MALAT1 knockdown significantly improved viability and suppressed the apoptosis of LPS-stimulated HPAEpiC. Moreover, MALAT1 directly targeted miR-194-5p, a downregulated miRNA in LPS-stimulated HPAEpiC, when FOXP2 was overexpressed. MALAT1 knockdown led to the overexpression of miR-194-5p and restrained FOXP2 expression. Furthermore, inhibition of miR-194-5p exerted a rescue effect on MALAT1 knockdown of FOXP2, whereas the overexpression of FOXP2 reversed the effect of MALAT1 knockdown on viability and apoptosis of LPS-stimulated HPAEpiC. Conclusion Our results demonstrated that MALAT1 knockdown alleviated HPAEpiC apoptosis by competitively binding to miR-194-5p and then elevating the inhibitory effect on its target FOXP2. These data provide a novel insight into the role of MALAT1 in the progression of ALI and potential diagnostic and therapeutic strategies for ALI patients.
Collapse
Affiliation(s)
- Chuan-Chuan Nan
- Department of Critical Care Medicine, Shenzhen People's Hospital, Second Clinical Medical College of Jinan University, First Affiliated Hospital of Southern University of Science and Technology, Shenzhen, China
| | - Ning Zhang
- Department of Critical Care Medicine, Shenzhen People's Hospital, Second Clinical Medical College of Jinan University, First Affiliated Hospital of Southern University of Science and Technology, Shenzhen, China.,Department of Stomatology Center, Shenzhen People's Hospital, Second Clinical Medical College of Jinan University, First Affiliated Hospital of Southern University of Science and Technology, Shenzhen, China
| | - Kenneth C P Cheung
- School of Life Sciences, The Chinese University of Hong Kong, Sha Tin, China
| | - Hua-Dong Zhang
- Department of Critical Care Medicine, Shenzhen People's Hospital, Second Clinical Medical College of Jinan University, First Affiliated Hospital of Southern University of Science and Technology, Shenzhen, China
| | - Wei Li
- Department of Critical Care Medicine, Shenzhen People's Hospital, Second Clinical Medical College of Jinan University, First Affiliated Hospital of Southern University of Science and Technology, Shenzhen, China
| | - Cheng-Ying Hong
- Department of Critical Care Medicine, Shenzhen People's Hospital, Second Clinical Medical College of Jinan University, First Affiliated Hospital of Southern University of Science and Technology, Shenzhen, China
| | - Huai-Sheng Chen
- Department of Critical Care Medicine, Shenzhen People's Hospital, Second Clinical Medical College of Jinan University, First Affiliated Hospital of Southern University of Science and Technology, Shenzhen, China
| | - Xue-Yan Liu
- Department of Critical Care Medicine, Shenzhen People's Hospital, Second Clinical Medical College of Jinan University, First Affiliated Hospital of Southern University of Science and Technology, Shenzhen, China
| | - Nan Li
- Department of Stomatology Center, Shenzhen People's Hospital, Second Clinical Medical College of Jinan University, First Affiliated Hospital of Southern University of Science and Technology, Shenzhen, China
| | - Lixin Cheng
- Department of Critical Care Medicine, Shenzhen People's Hospital, Second Clinical Medical College of Jinan University, First Affiliated Hospital of Southern University of Science and Technology, Shenzhen, China
| |
Collapse
|
18
|
Yin R, Liu X, Yu J, Ji Y, Liu J, Cheng L, Zhou J. Up-regulation of autophagy by low concentration of salicylic acid delays methyl jasmonate-induced leaf senescence. Sci Rep 2020; 10:11472. [PMID: 32651431 PMCID: PMC7351724 DOI: 10.1038/s41598-020-68484-3] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2020] [Accepted: 06/25/2020] [Indexed: 11/09/2022] Open
Abstract
Crosstalk between salicylic acid (SA) and jasmonic acid (JA) signaling plays an important role in regulation of plant senescence. Our previous work found that SA could delay methyl jasmonate (MeJA)-induced leaf senescence in a concentration-dependent manner. Here, the effect of low concentration of SA (LCSA) application on MeJA-induced leaf senescence was further assessed. High-throughput sequencing (RNA-Seq) results showed that LCSA did not have dominant effects on the genetic regulatory pathways of basal metabolism like nitrogen metabolism, photosynthesis and glycolysis. The ClusterONE was applied to identify discrete gene modules based on protein-protein interaction (PPI) network. Interestingly, an autophagy-related (ATG) module was identified in the differentially expressed genes (DEGs) that exclusively induced by MeJA together with LCSA. RT-qPCR confirmed that the expression of most of the determined ATG genes were upregulated by LCSA. Remarkably, in contrast to wild type (Col-0), LCSA cannot alleviate the leaf yellowing phenotype in autophagy defective mutants (atg5-1 and atg7-2) upon MeJA treatment. Confocal results showed that LCSA increased the number of autophagic bodies accumulated in the vacuole during MeJA-induced leaf senescence. Collectively, our work revealed up-regulation of autophagy by LCSA as a key regulator to alleviate MeJA-induced leaf senescence.
Collapse
Affiliation(s)
- Runzhu Yin
- MOE Key Laboratory of Laser Life Science and Guangdong Provincial Key Laboratory of Laser Life Science, College of Biophotonics, South China Normal University, Guangzhou, 510631, China
| | - Xueyan Liu
- Department of Critical Care Medicine, Shenzhen People's Hospital, The Second Clinical Medicine College of Jinan University, Shenzhen, 518020, China
| | - Jingfang Yu
- MOE Key Laboratory of Laser Life Science and Guangdong Provincial Key Laboratory of Laser Life Science, College of Biophotonics, South China Normal University, Guangzhou, 510631, China
| | - Yingbin Ji
- MOE Key Laboratory of Laser Life Science and Guangdong Provincial Key Laboratory of Laser Life Science, College of Biophotonics, South China Normal University, Guangzhou, 510631, China
| | - Jian Liu
- Fujian Provincial Key Laboratory of Plant Functional Biology, College of Life Sciences, Fujian Agriculture and Forestry University, Fuzhou, 350002, China
| | - Lixin Cheng
- Department of Critical Care Medicine, Shenzhen People's Hospital, The Second Clinical Medicine College of Jinan University, Shenzhen, 518020, China.
| | - Jun Zhou
- MOE Key Laboratory of Laser Life Science and Guangdong Provincial Key Laboratory of Laser Life Science, College of Biophotonics, South China Normal University, Guangzhou, 510631, China.
| |
Collapse
|
19
|
Cheng L, Nan C, Kang L, Zhang N, Liu S, Chen H, Hong C, Chen Y, Liang Z, Liu X. Whole blood transcriptomic investigation identifies long non-coding RNAs as regulators in sepsis. J Transl Med 2020; 18:217. [PMID: 32471511 PMCID: PMC7257169 DOI: 10.1186/s12967-020-02372-2] [Citation(s) in RCA: 29] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2020] [Accepted: 05/12/2020] [Indexed: 12/21/2022] Open
Abstract
Background Sepsis is a fatal disease referring to the presence of a known or strongly suspected infection coupled with systemic and uncontrolled immune activation causing multiple organ failure. However, current knowledge of the role of lncRNAs in sepsis is still extremely limited. Methods We performed an in silico investigation of the gene coexpression pattern for the patients response to all-cause sepsis in consecutive intensive care unit (ICU) admissions. Sepsis coexpression gene modules were identified using WGCNA and enrichment analysis. lncRNAs were determined as sepsis biomarkers based on the interactions among lncRNAs and the identified modules. Results Twenty-three sepsis modules, including both differentially expressed modules and prognostic modules, were identified from the whole blood RNA expression profiling of sepsis patients. Five lncRNAs, FENDRR, MALAT1, TUG1, CRNDE, and ANCR, were detected as sepsis regulators based on the interactions among lncRNAs and the identified coexpression modules. Furthermore, we found that CRNDE and MALAT1 may act as miRNA sponges of sepsis related miRNAs to regulate the expression of sepsis modules. Ultimately, FENDRR, MALAT1, TUG1, and CRNDE were reannotated using three independent lncRNA expression datasets and validated as differentially expressed lncRNAs. Conclusion The procedure facilitates the identification of prognostic biomarkers and novel therapeutic strategies of sepsis. Our findings highlight the importance of transcriptome modularity and regulatory lncRNAs in the progress of sepsis.
Collapse
Affiliation(s)
- Lixin Cheng
- Department of Critical Care Medicine, Shenzhen People's Hospital, The Second Clinical Medicine College of Jinan University, Shenzhen, China
| | - Chuanchuan Nan
- Department of Critical Care Medicine, Shenzhen People's Hospital, The Second Clinical Medicine College of Jinan University, Shenzhen, China
| | - Lin Kang
- Shenzhen People's Hospital, The Second Clinical Medicine College of Jinan University, Shenzhen, China
| | - Ning Zhang
- Department of Critical Care Medicine, Shenzhen People's Hospital, The Second Clinical Medicine College of Jinan University, Shenzhen, China
| | - Sheng Liu
- Department of Critical Care Medicine, Shenzhen People's Hospital, The Second Clinical Medicine College of Jinan University, Shenzhen, China
| | - Huaisheng Chen
- Department of Critical Care Medicine, Shenzhen People's Hospital, The Second Clinical Medicine College of Jinan University, Shenzhen, China
| | - Chengying Hong
- Department of Critical Care Medicine, Shenzhen People's Hospital, The Second Clinical Medicine College of Jinan University, Shenzhen, China
| | - Youlian Chen
- Department of Critical Care Medicine, Shenzhen People's Hospital, The Second Clinical Medicine College of Jinan University, Shenzhen, China
| | - Zhen Liang
- Shenzhen People's Hospital, The Second Clinical Medicine College of Jinan University, Shenzhen, China.
| | - Xueyan Liu
- Department of Critical Care Medicine, Shenzhen People's Hospital, The Second Clinical Medicine College of Jinan University, Shenzhen, China.
| |
Collapse
|
20
|
Mezencev R, Auerbach SS. The sensitivity of transcriptomics BMD modeling to the methods used for microarray data normalization. PLoS One 2020; 15:e0232955. [PMID: 32413060 PMCID: PMC7228135 DOI: 10.1371/journal.pone.0232955] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2019] [Accepted: 04/25/2020] [Indexed: 11/25/2022] Open
Abstract
Whole-genome expression data generated by microarray studies have shown promise for quantitative human health risk assessment. While numerous approaches have been developed to determine benchmark doses (BMDs) from probeset-level dose responses, sensitivity of the results to methods used for normalization of the data has not yet been systematically investigated. Normalization of microarray data converts raw hybridization signals to expression estimates that are expected to be proportional to the amounts of transcripts in the profiled specimens. Different approaches to normalization have been shown to greatly influence the results of some downstream analyses, including biological interpretation. In this study we evaluate the influence of microarray normalization methods on the transcriptomic BMDs. We demonstrate using in vivo data that the use of alternative pipelines for normalization of Affymetrix microarray data can have a considerable impact on the number of detected differentially expressed genes and pathways (processes) determined to be treatment responsive, which may lead to alternative interpretations of the data. In addition, we found that normalization can have a considerable effect (as much as ~30-fold in this study) on estimation of the minimum biological potency (transcriptomic point of departure). We argue for consideration of alternative normalization methods and their data-informed selection to most effectively interpret microarray data for use in human health risk assessment.
Collapse
Affiliation(s)
- Roman Mezencev
- Center for Public Health and Environmental Assessment, Office of Research and Development, US EPA, Washington DC, United States of America
| | - Scott S. Auerbach
- Division of the National Toxicology Program, National Institute of Environmental Health Sciences, NIH, Research Triangle Park, Durham, NC, United States of America
| |
Collapse
|
21
|
Liu X, Xu Y, Wang R, Liu S, Wang J, Luo Y, Leung KS, Cheng L. A network-based algorithm for the identification of moonlighting noncoding RNAs and its application in sepsis. Brief Bioinform 2020; 22:581-588. [PMID: 32003790 DOI: 10.1093/bib/bbz154] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2019] [Revised: 10/26/2019] [Accepted: 11/01/2019] [Indexed: 12/26/2022] Open
Abstract
Moonlighting proteins provide more options for cells to execute multiple functions without increasing the genome and transcriptome complexity. Although there have long been calls for computational methods for the prediction of moonlighting proteins, no method has been designed for determining moonlighting long noncoding ribonucleicacidz (RNAs) (mlncRNAs). Previously, we developed an algorithm MoonFinder for the identification of mlncRNAs at the genome level based on the functional annotation and interactome data of lncRNAs and proteins. Here, we update MoonFinder to MoonFinder v2.0 by providing an extensive framework for the detection of protein modules and the establishment of RNA-module associations in human. A novel measure, moonlighting coefficient, was also proposed to assess the confidence of an ncRNA acting in a moonlighting manner. Moreover, we explored the expression characteristics of mlncRNAs in sepsis, in which we found that mlncRNAs tend to be upregulated and differentially expressed. Interestingly, the mlncRNAs are mutually exclusive in terms of coexpression when compared to the other lncRNAs. Overall, MoonFinder v2.0 is dedicated to the prediction of human mlncRNAs and thus bears great promise to serve as a valuable R package for worldwide research communities (https://cran.r-project.org/web/packages/MoonFinder/index.html). Also, our analyses provide the first attempt to characterize mlncRNA expression and coexpression properties in adult sepsis patients, which will facilitate the understanding of the interaction and expression patterns of mlncRNAs.
Collapse
Affiliation(s)
- Xueyan Liu
- Critical Care Medici at Shenzhen People's Hospital
| | | | - Ran Wang
- Computer Science at The Chinese University of Hong Kong
| | | | | | | | - Kwong-Sak Leung
- Computer Science at the Chinese University of Hong Kong, Hong Kong, China
| | - Lixin Cheng
- Bioinformatics at Shenzhen People's Hospital, China
| |
Collapse
|
22
|
Liu X, Li N, Liu S, Wang J, Zhang N, Zheng X, Leung KS, Cheng L. Normalization Methods for the Analysis of Unbalanced Transcriptome Data: A Review. Front Bioeng Biotechnol 2019; 7:358. [PMID: 32039167 PMCID: PMC6988798 DOI: 10.3389/fbioe.2019.00358] [Citation(s) in RCA: 62] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2019] [Accepted: 11/11/2019] [Indexed: 12/15/2022] Open
Abstract
Dozens of normalization methods for correcting experimental variation and bias in high-throughput expression data have been developed during the last two decades. Up to 23 methods among them consider the skewness of expression data between sample states, which are even more than the conventional methods, such as loess and quantile. From the perspective of reference selection, we classified the normalization methods for skewed expression data into three categories, data-driven reference, foreign reference, and entire gene set. We separately introduced and summarized these normalization methods designed for gene expression data with global shift between compared conditions, including both microarray and RNA-seq, based on the reference selection strategies. To our best knowledge, this is the most comprehensive review of available preprocessing algorithms for the unbalanced transcriptome data. The anatomy and summarization of these methods shed light on the understanding and appropriate application of preprocessing methods.
Collapse
Affiliation(s)
- Xueyan Liu
- Department of Critical Care Medicine, Shenzhen People's Hospital, The Second Clinical Medicine College of Jinan University, Shenzhen, China
| | - Nan Li
- Department of Stomatology Center, Shenzhen People's Hospital, Second Clinical Medicine College of Jinan University, Shenzhen, China
| | - Sheng Liu
- Department of Critical Care Medicine, Shenzhen People's Hospital, The Second Clinical Medicine College of Jinan University, Shenzhen, China
| | - Jun Wang
- Department of Critical Care Medicine, Shenzhen People's Hospital, The Second Clinical Medicine College of Jinan University, Shenzhen, China
| | - Ning Zhang
- Department of Critical Care Medicine, Shenzhen People's Hospital, The Second Clinical Medicine College of Jinan University, Shenzhen, China
| | - Xubin Zheng
- Department of Computer Science and Engineering, The Chinese University of Hong Kong, Hong Kong, Hong Kong
| | - Kwong-Sak Leung
- Department of Computer Science and Engineering, The Chinese University of Hong Kong, Hong Kong, Hong Kong
| | - Lixin Cheng
- Department of Critical Care Medicine, Shenzhen People's Hospital, The Second Clinical Medicine College of Jinan University, Shenzhen, China
| |
Collapse
|
23
|
Cheng L, Liu P, Wang D, Leung KS. Exploiting locational and topological overlap model to identify modules in protein interaction networks. BMC Bioinformatics 2019; 20:23. [PMID: 30642247 PMCID: PMC6332531 DOI: 10.1186/s12859-019-2598-7] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2018] [Accepted: 01/03/2019] [Indexed: 12/27/2022] Open
Abstract
Background Clustering molecular network is a typical method in system biology, which is effective in predicting protein complexes or functional modules. However, few studies have realized that biological molecules are spatial-temporally regulated to form a dynamic cellular network and only a subset of interactions take place at the same location in cells. Results In this study, considering the subcellular localization of proteins, we first construct a co-localization human protein interaction network (PIN) and systematically investigate the relationship between subcellular localization and biological functions. After that, we propose a Locational and Topological Overlap Model (LTOM) to preprocess the co-localization PIN to identify functional modules. LTOM requires the topological overlaps, the common partners shared by two proteins, to be annotated in the same localization as the two proteins. We observed the model has better correspondence with the reference protein complexes and shows more relevance to cancers based on both human and yeast datasets and two clustering algorithms, ClusterONE and MCL. Conclusion Taking into consideration of protein localization and topological overlap can improve the performance of module detection from protein interaction networks. Electronic supplementary material The online version of this article (10.1186/s12859-019-2598-7) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Lixin Cheng
- Department of Computer Science and Engineering, The Chinese University of Hong Kong, Shatin, Hong Kong. .,Institute of translation medicine, Shenzhen Second People's Hospital, First Affiliated Hospital of Shenzhen University, Shenzhen, China.
| | - Pengfei Liu
- Department of Computer Science and Engineering, The Chinese University of Hong Kong, Shatin, Hong Kong
| | - Dong Wang
- Department of Bioinformatics, School of Basic Medical Sciences, Southern Medical University, Guangzhou, China.
| | - Kwong-Sak Leung
- Department of Computer Science and Engineering, The Chinese University of Hong Kong, Shatin, Hong Kong.
| |
Collapse
|
24
|
Cheng L, Liu P, Leung K. SMILE: a novel procedure for subcellular module identification with localisation expansion. IET Syst Biol 2018. [PMID: 29533218 PMCID: PMC8687326 DOI: 10.1049/iet-syb.2017.0085] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Computational clustering methods help identify functional modules in protein–protein interaction (PPI) network, in which proteins participate in the same biological pathways or specific functions. Subcellular localisation is crucial for proteins to implement biological functions and each compartment accommodates specific portions of the protein interaction structure. However, the importance of protein subcellular localisation is often neglected in the studies of module identification. In this study, the authors propose a novel procedure, subcellular module identification with localisation expansion (SMILE), to identify super modules that consist of several subcellular modules performing specific biological functions among cell compartments. These super modules identified by SMILE are more functionally diverse and have been verified to be more associated with known protein complexes and biological pathways compared with the modules identified from the global PPI networks in both the compartmentalised PPI and InWeb_InBioMap datasets. The authors’ results reveal that subcellular localisation is a principal feature of functional modules and offers important guidance in detecting biologically meaningful results.
Collapse
Affiliation(s)
- Lixin Cheng
- Department of Computer Science & EngineeringChinese University of Hong KongMa Liu ShuiHong Kong
| | - Pengfei Liu
- Department of Computer Science & EngineeringChinese University of Hong KongMa Liu ShuiHong Kong
| | - Kwong‐Sak Leung
- Department of Computer Science & EngineeringChinese University of Hong KongMa Liu ShuiHong Kong
| |
Collapse
|
25
|
Cheng L, Leung KS. Quantification of non-coding RNA target localization diversity and its application in cancers. J Mol Cell Biol 2018; 10:130-138. [DOI: 10.1093/jmcb/mjy006] [Citation(s) in RCA: 31] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2017] [Accepted: 01/24/2018] [Indexed: 12/13/2022] Open
Affiliation(s)
- Lixin Cheng
- Department of Computer Science and Engineering, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong SAR, China
| | - Kwong-Sak Leung
- Department of Computer Science and Engineering, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong SAR, China
| |
Collapse
|