1
|
Rosati D, Palmieri M, Brunelli G, Morrione A, Iannelli F, Frullanti E, Giordano A. Differential gene expression analysis pipelines and bioinformatic tools for the identification of specific biomarkers: A review. Comput Struct Biotechnol J 2024; 23:1154-1168. [PMID: 38510977 PMCID: PMC10951429 DOI: 10.1016/j.csbj.2024.02.018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2023] [Revised: 02/20/2024] [Accepted: 02/20/2024] [Indexed: 03/22/2024] Open
Abstract
In recent years, the role of bioinformatics and computational biology together with omics techniques and transcriptomics has gained tremendous importance in biomedicine and healthcare, particularly for the identification of biomarkers for precision medicine and drug discovery. Differential gene expression (DGE) analysis is one of the most used techniques for RNA-sequencing (RNA-seq) data analysis. This tool, which is typically used in various RNA-seq data processing applications, allows the identification of differentially expressed genes across two or more sample sets. Functional enrichment analyses can then be performed to annotate and contextualize the resulting gene lists. These studies provide valuable information about disease-causing biological processes and can help in identifying molecular targets for novel therapies. This review focuses on differential gene expression (DGE) analysis pipelines and bioinformatic techniques commonly used to identify specific biomarkers and discuss the advantages and disadvantages of these techniques.
Collapse
Affiliation(s)
- Diletta Rosati
- Department of Medical Biotechnologies, University of Siena, 53100 Siena, Italy
- Cancer Genomics & Systems Biology Lab, Dept. of Medical Biotechnologies, University of Siena, 53100 Siena, Italy
- Med Biotech Hub and Competence Center, Department of Medical Biotechnologies, University of Siena, Italy
| | - Maria Palmieri
- Cancer Genomics & Systems Biology Lab, Dept. of Medical Biotechnologies, University of Siena, 53100 Siena, Italy
- Med Biotech Hub and Competence Center, Department of Medical Biotechnologies, University of Siena, Italy
| | - Giulia Brunelli
- Med Biotech Hub and Competence Center, Department of Medical Biotechnologies, University of Siena, Italy
| | - Andrea Morrione
- Sbarro Institute for Cancer Research and Molecular Medicine, Center for Biotechnology, Department of Biology, College of Science and Technology, Temple University, Philadelphia, PA 19122, USA
| | - Francesco Iannelli
- Laboratory of Molecular Microbiology and Biotechnology, Department of Medical Biotechnologies, University of Siena, Siena, Italy
| | - Elisa Frullanti
- Cancer Genomics & Systems Biology Lab, Dept. of Medical Biotechnologies, University of Siena, 53100 Siena, Italy
- Med Biotech Hub and Competence Center, Department of Medical Biotechnologies, University of Siena, Italy
| | - Antonio Giordano
- Department of Medical Biotechnologies, University of Siena, 53100 Siena, Italy
- Sbarro Institute for Cancer Research and Molecular Medicine, Center for Biotechnology, Department of Biology, College of Science and Technology, Temple University, Philadelphia, PA 19122, USA
| |
Collapse
|
2
|
Optimal gene prioritization and disease prediction using knowledge based ontology structure. Biomed Signal Process Control 2023. [DOI: 10.1016/j.bspc.2022.104548] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
|
3
|
Santhanam B, Oikonomou P, Tavazoie S. Systematic assessment of prognostic molecular features across cancers. CELL GENOMICS 2023; 3:100262. [PMID: 36950380 PMCID: PMC10025453 DOI: 10.1016/j.xgen.2023.100262] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/28/2022] [Revised: 09/29/2022] [Accepted: 01/12/2023] [Indexed: 02/05/2023]
Abstract
Precision oncology promises accurate prediction of disease trajectories by utilizing molecular features of tumors. We present a systematic analysis of the prognostic potential of diverse molecular features across large cancer cohorts. We find that the mRNA expression of biologically coherent sets of genes (modules) is substantially more predictive of patient survival than single-locus genomic and transcriptomic aberrations. Extending our analysis beyond existing curated gene modules, we find a large novel class of highly prognostic DNA/RNA cis-regulatory modules associated with dynamic gene expression within cancers. Remarkably, in more than 82% of cancers, modules substantially improve survival stratification compared with conventional clinical factors and prominent genomic aberrations. The prognostic potential of cancer modules generalizes to external cohorts better than conventionally used single-gene features. Finally, a machine-learning framework demonstrates the combined predictive power of multiple modules, yielding prognostic models that perform substantially better than existing histopathological and clinical factors in common use.
Collapse
Affiliation(s)
- Balaji Santhanam
- Department of Biological Sciences, Columbia University, New York, NY 10027, USA
- Department of Systems Biology, Columbia University, New York, NY 10032, USA
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY 10032, USA
- Irving Institute for Cancer Dynamics, Columbia University, New York, NY 10032, USA
| | - Panos Oikonomou
- Department of Biological Sciences, Columbia University, New York, NY 10027, USA
- Department of Systems Biology, Columbia University, New York, NY 10032, USA
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY 10032, USA
- Irving Institute for Cancer Dynamics, Columbia University, New York, NY 10032, USA
| | - Saeed Tavazoie
- Department of Biological Sciences, Columbia University, New York, NY 10027, USA
- Department of Systems Biology, Columbia University, New York, NY 10032, USA
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY 10032, USA
- Irving Institute for Cancer Dynamics, Columbia University, New York, NY 10032, USA
| |
Collapse
|
4
|
Li X, Zhang B, Yu K, Bao Z, Zhang W, Bai Y. Identifying cancer specific signaling pathways based on the dysregulation between genes. Comput Biol Chem 2021; 95:107586. [PMID: 34619555 DOI: 10.1016/j.compbiolchem.2021.107586] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2021] [Revised: 08/10/2021] [Accepted: 09/26/2021] [Indexed: 11/26/2022]
Abstract
A large collection of studies has shown that the occurrence of cancer is related to the functional dysfunction of the pathways. Identification of cancer-related pathways could help researchers understand the mechanisms of complex diseases well. Whereas, most current signaling pathway analysis methods take no account of the gene interaction variations within pathways. Furthermore, considering that some pathways have connection with two or more cancer types, while some are likely to be cancer-type specific pathways. Identifying cancer-type specific pathways contributes to interpreting the different mechanisms of different cancer types. In this study, we first proposed a pathway analysis method named Pathway Analysis of Intergenic Regulation (PAIGR) to identify pathways with dysregulation between genes and compared the performance of this method with four existing methods on four colorectal cancer (CRC) datasets. The results showed that PAIGR could find cancer-related pathways more accurately. Moreover, in order to explore the relationship between the identified pathways and the cancer type, we constructed a pathway interaction network, in which nodes and edges represented pathways and interactions between pathways respectively. Highly connected pathways were considered to play a central role in an extensive range of biological processes, while sparsely connected pathways are considered to have certain specificity. Our results showed that pathways identified by PAIGR had a low nodal degree (i.e., a few numbers of interactions), which suggested that most of these pathways were cancer-type specific.
Collapse
Affiliation(s)
- Xiaohan Li
- State Key Lab of Bioelectronics, School of Biological Science and Medical Engineering, Southeast University, Nanjing, Jiangsu, 210096, China.
| | - Bing Zhang
- State Key Lab of Bioelectronics, School of Biological Science and Medical Engineering, Southeast University, Nanjing, Jiangsu, 210096, China.
| | - Kequan Yu
- State Key Lab of Bioelectronics, School of Biological Science and Medical Engineering, Southeast University, Nanjing, Jiangsu, 210096, China.
| | - Zhenshen Bao
- State Key Lab of Bioelectronics, School of Biological Science and Medical Engineering, Southeast University, Nanjing, Jiangsu, 210096, China.
| | - Weizhong Zhang
- Department of Ophthalmology, First Affiliated Hospital of Nanjing Medical University, Nanjing, China.
| | - Yunfei Bai
- State Key Lab of Bioelectronics, School of Biological Science and Medical Engineering, Southeast University, Nanjing, Jiangsu, 210096, China.
| |
Collapse
|