1
|
López-Sánchez P, Ávila-Moreno F, Hernández-Lemus E, Kuijjer ML, Espinal-Enríquez J. Patient-specific gene co-expression networks reveal novel subtypes and predictive biomarkers in lung adenocarcinoma. NPJ Syst Biol Appl 2025; 11:44. [PMID: 40346136 PMCID: PMC12064794 DOI: 10.1038/s41540-025-00522-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2024] [Accepted: 04/10/2025] [Indexed: 05/11/2025] Open
Abstract
Lung adenocarcinoma (LUAD) is a highly heterogenous and aggressive form of non-small cell lung cancer (NSCLC). The use of genome-wide gene co-expression networks (GCNs) has been paramount to describe changes in the transcriptional regulatory programs found between diseased and healthy states of LUAD. Recently, studies have shown that multiple cancerous phenotypes share a distinct GCN architecture, suggesting that network topology holds promise for understanding disease pathology. However, conventional GCN inference methods struggle to capture the inherent context-specificity within a patient population, thus flattening its heterogeneity. To address this issue, the use of single-sample network (SSN) modelling has emerged as a promising solution into studying heterogeneous traits of cancer through network-based approaches. Here, we reconstructed patient-specific GCNs (n=334) using the LIONESS equation and mutual information as the network inference method. Unsupervised analysis revealed six novel LUAD subtypes based on inter-patient network similarity, each with distinct network motifs reflecting unique biological programs. Supervised analysis, employing regularized Cox regression, identified 12 genes (CHRDL2, SPP2, VAC14, IRF5, GUCY1B1, NCS1, RRM2B, EIF5A2, CCDC62, CTCFL, XG, and TP53INP2) whose weighted degree in SSNs is predictive of patient survival in LUAD. These findings suggest that topological features of SSNs offer valuable insights into the context-specific nature of LUAD malignancy, highlighting the potential of SSN-based approaches for further research.
Collapse
Affiliation(s)
- Patricio López-Sánchez
- Computational Genomics Division, National Institute of Genomic Medicine, Mexico City, Mexico
| | - Federico Ávila-Moreno
- Facultad de Estudios Superiores-Iztacala (FES-Iztacala), Universidad Nacional Autónoma de México (UNAM), Mexico State, Mexico
- Subdirección de Investigación Básica, Instituto Nacional de Cancerología (INCAN), Mexico City, Mexico
| | - Enrique Hernández-Lemus
- Computational Genomics Division, National Institute of Genomic Medicine, Mexico City, Mexico
| | - Marieke L Kuijjer
- Centre for Molecular Medicine Norway (NCMM), University of Oslo, Oslo, Norway
| | - Jesús Espinal-Enríquez
- Computational Genomics Division, National Institute of Genomic Medicine, Mexico City, Mexico.
| |
Collapse
|
2
|
Eicher T, Guebila MB, Quackenbush J. BLOBFISH: Bipartite Limited Subnetworks from Multiple Observations using Breadth-First Search with Constrained Hops. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.03.11.642378. [PMID: 40196640 PMCID: PMC11973938 DOI: 10.1101/2025.03.11.642378] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 04/09/2025]
Abstract
In analyzing gene regulatory network models, a common question is how members of a particular set of genes are connected. For example, one might want to explore network relationship between a set of differentially expressed genes, a gene set previously reported in the literature, or elements of one or more pathways. BLOBFISH uses a breadth-first search algorithm adapted to bipartite graphs to identify a compact subnetwork connecting the members of a pre-specified set of genes, providing a regulatory context that can shed light on specific mechanisms involved in a phenotype and its development. We demonstrate the use of BLOBFISH to extract gene regulatory subnetworks reflecting tissue specificity using publicly available data from the Genotype Tissue Expression (GTEx) project.
Collapse
Affiliation(s)
- Tara Eicher
- Department of Biostatistics, T.H. Chan School of Public Health, Harvard University, Boston, MA, 02115, USA
| | - Marouen Ben Guebila
- Department of Biostatistics, T.H. Chan School of Public Health, Harvard University, Boston, MA, 02115, USA
| | - John Quackenbush
- Department of Biostatistics, T.H. Chan School of Public Health, Harvard University, Boston, MA, 02115, USA
- Channing Division of Network Medicine, Brigham and Women’s Hospital, Boston, MA, 02115, USA
| |
Collapse
|
3
|
Cheng H, Liang M, Gao Y, Zhao W, Guo WF. Multiomics with Evolutionary Computation to Identify Molecular and Module Biomarkers for Early Diagnosis and Treatment of Complex Disease. Genes (Basel) 2025; 16:244. [PMID: 40149396 PMCID: PMC11942451 DOI: 10.3390/genes16030244] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2025] [Revised: 02/15/2025] [Accepted: 02/17/2025] [Indexed: 03/29/2025] Open
Abstract
It is important to identify disease biomarkers (DBs) for early diagnosis and treatment of complex diseases in personalized medicine. However, existing methods integrating intelligence technologies and multiomics to predict key biomarkers are limited by the complex dynamic characteristics of omics data, making it difficult to meet the high-precision requirements for biomarker characterization in large dimensions. This study reviewed current analysis methods of evolutionary computation (EC) by considering the essential characteristics of DB identification problems and the advantages of EC, aiming to explore the complex dynamic characteristics of multiomics. In this study, EC-based biomarker identification strategies were summarized as evolutionary algorithms, swarm intelligence and other EC methods for molecular and module DB identification, respectively. Finally, we pointed out the challenges in current research and future research directions. This study can enrich the application of EC theory and promote interdisciplinary integration between EC and bioinformatics.
Collapse
Affiliation(s)
- Han Cheng
- School of Life Sciences, Zhengzhou University, Zhengzhou 450001, China; (H.C.); (M.L.); (Y.G.); (W.Z.)
| | - Mengyu Liang
- School of Life Sciences, Zhengzhou University, Zhengzhou 450001, China; (H.C.); (M.L.); (Y.G.); (W.Z.)
| | - Yiwen Gao
- School of Life Sciences, Zhengzhou University, Zhengzhou 450001, China; (H.C.); (M.L.); (Y.G.); (W.Z.)
| | - Wenshan Zhao
- School of Life Sciences, Zhengzhou University, Zhengzhou 450001, China; (H.C.); (M.L.); (Y.G.); (W.Z.)
| | - Wei-Feng Guo
- School of Electrical and Information Engineering, Zhengzhou University, Zhengzhou 450001, China
| |
Collapse
|
4
|
Zhang Y, Du X, Zhang M, Sun Y. Constructing mRNA-meth-miRNA single-sample networks to reveal the molecular interaction patterns induced by lunar orbital stressors in rice (Oryzasativa). PLANT PHYSIOLOGY AND BIOCHEMISTRY : PPB 2025; 219:109430. [PMID: 39724765 DOI: 10.1016/j.plaphy.2024.109430] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/22/2024] [Revised: 12/06/2024] [Accepted: 12/17/2024] [Indexed: 12/28/2024]
Abstract
To explore the bio-effects during Moon exploration missions, we utilized the Chang'E 5 probe to carry the seeds of Oryza. Sativa L., which were later returned to Earth after 23 days in lunar orbit and planted in an artificial climate chamber. Compared to the control group, rice seeds that underwent spaceflight showed inhibited growth and development when planted on the ground. Then we collected samples and employed RNA sequencing (RNA-Seq) and whole-genome bisulfite sequencing (WGBS) in the tillering and heading stages of rice. To gain a comprehensive understanding of the dysregulation in molecular interaction patterns during Moon exploration, a bioinformatics pipeline based on mRNA-meth-miRNA Single-Sample Networks (SSNs) was developed. Specifically, we constructed four SSNs for each sample at the mRNA, DNA methylation (promoter and gene bodies), and miRNA levels. By combining with the Protein-Protein Interaction (PPI) network, SSNs can character individual-specific gene interaction patterns. Under spaceflight conditions, distinct interaction patterns emerge across various omics levels. However, the molecules driving changes at each omics level predominantly regulate consistent biological functions, such as metabolic processes, DNA damage and repair, cell cycle, developmental processes, etc. In the tillering stage, pathways such as ubiquitin mediated proteolysis, nucleotide excision repair, and nucleotide metabolism are significantly enriched. Moreover, we identified 18 genes that played key/hub roles in the dysregulation of multi-omics molecular interaction patterns, and observed their involvement in regulating the above biological processes. As aforementioned, our multi-omics SSNs method can reveal the molecular interaction patterns under deep space exploration.
Collapse
Affiliation(s)
- Yan Zhang
- Institute of Environmental Systems Biology, College of Environmental Science and Engineering, Dalian Maritime University, Dalian, 116026, Liaoning, China
| | - Xiaohui Du
- Institute of Environmental Systems Biology, College of Environmental Science and Engineering, Dalian Maritime University, Dalian, 116026, Liaoning, China
| | - Meng Zhang
- Institute of Environmental Systems Biology, College of Environmental Science and Engineering, Dalian Maritime University, Dalian, 116026, Liaoning, China
| | - Yeqing Sun
- Institute of Environmental Systems Biology, College of Environmental Science and Engineering, Dalian Maritime University, Dalian, 116026, Liaoning, China.
| |
Collapse
|
5
|
Khullar S, Huang X, Ramesh R, Svaren J, Wang D. NetREm: Network Regression Embeddings reveal cell-type transcription factor coordination for gene regulation. BIOINFORMATICS ADVANCES 2024; 5:vbae206. [PMID: 40260118 PMCID: PMC12011367 DOI: 10.1093/bioadv/vbae206] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/05/2024] [Revised: 10/22/2024] [Accepted: 12/18/2024] [Indexed: 04/23/2025]
Abstract
Motivation Transcription factor (TF) coordination plays a key role in gene regulation via direct and/or indirect protein-protein interactions (PPIs) and co-binding to regulatory elements on DNA. Single-cell technologies facilitate gene expression measurement for individual cells and cell-type identification, yet the connection between TF-TF coordination and target gene (TG) regulation of various cell types remains unclear. Results To address this, we introduce our innovative computational approach, Network Regression Embeddings (NetREm), to reveal cell-type TF-TF coordination activities for TG regulation. NetREm leverages network-constrained regularization, using prior knowledge of PPIs among TFs, to analyze single-cell gene expression data, uncovering cell-type coordinating TFs and identifying revolutionary TF-TG candidate regulatory network links. NetREm's performance is validated using simulation studies and benchmarked across several datasets in humans, mice, yeast. Further, we showcase NetREm's ability to prioritize valid novel human TF-TF coordination links in 9 peripheral blood mononuclear and 42 immune cell sub-types. We apply NetREm to examine cell-type networks in central and peripheral nerve systems (e.g. neuronal, glial, Schwann cells) and in Alzheimer's disease versus Controls. Top predictions are validated with experimental data from rat, mouse, and human models. Additional functional genomics data helps link genetic variants to our TF-TG regulatory and TF-TF coordination networks. Availability and implementation https://github.com/SaniyaKhullar/NetREm.
Collapse
Affiliation(s)
- Saniya Khullar
- Waisman Center, University of Wisconsin-Madison, Madison, WI 53705, United States
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI 53076, United States
| | - Xiang Huang
- Waisman Center, University of Wisconsin-Madison, Madison, WI 53705, United States
| | - Raghu Ramesh
- Waisman Center, University of Wisconsin-Madison, Madison, WI 53705, United States
- Comparative Biomedical Sciences Training Program, University of Wisconsin-Madison, Madison, WI 53706, United States
| | - John Svaren
- Waisman Center, University of Wisconsin-Madison, Madison, WI 53705, United States
- Department of Comparative Biosciences, School of Veterinary Medicine, University of Wisconsin-Madison, Madison, WI 53706, United States
| | - Daifeng Wang
- Waisman Center, University of Wisconsin-Madison, Madison, WI 53705, United States
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI 53076, United States
- Department of Computer Sciences, University of Wisconsin-Madison, Madison, WI 53706, United States
| |
Collapse
|
6
|
Hao M, Zhang H, Jiang S, Hu Z, Jiang X, Wu J, Li Y, Jin L, Wang X. Metrics of Physiological Network Topology Are Novel Biomarkers to Capture Functional Disability and Health. J Gerontol A Biol Sci Med Sci 2024; 80:glae268. [PMID: 39500737 DOI: 10.1093/gerona/glae268] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2024] [Indexed: 12/22/2024] Open
Abstract
BACKGROUND Physiological networks are highly complex, integrating connections among multiple organ systems and their dynamic changes underlying human aging. It is unknown whether individual-level network could serve as robust biomarkers for health and aging. METHODS We used personalized network analysis to construct a single-sample network and examine the associations between network properties and functional disability in the Rugao Longevity and Aging Study (RuLAS), the China Health and Retirement Longitudinal Study (CHARLS), the Chinese Longitudinal Healthy Longevity Survey (CLHLS), and the National Health and Nutrition Examination Survey (NHANES). RESULTS We observed impairments in interconnected physiological systems among long-lived adults in RuLAS. Single-sample network analysis was applied to reflect the co-occurrence of these multisystem impairments at the individual level. The activities of daily living (ADL)-disabled individuals' networks exhibited notably increased connectivity among various biomarkers. Significant associations were found between network topology and functional disability across RuLAS, CHARLS, CLHLS, and NHANES. Additionally, network topology served as a novel biomarker to capture risks of incident ADL disability in CHARLS. Furthermore, these metrics of physiological network topology predicted mortality across 4 cohorts. Sensitivity analysis demonstrated that the prediction performance of network topology remained robust, regardless of the chosen biomarkers and parameters. CONCLUSIONS These findings showed that metrics of network topology were sensitive and robust biomarkers to capture risks of functional disability and mortality, highlighting the role of single-sample physiological networks as novel biomarkers for health and aging.
Collapse
Affiliation(s)
- Meng Hao
- Department of Geriatrics, Huadong Hospital, Shanghai Medical College, Fudan University, Human Phenome Institute, Fudan University, Shanghai, China
- Fudan Zhangjiang Institute, Shanghai, China
| | - Hui Zhang
- Department of Geriatrics, Huadong Hospital, Shanghai Medical College, Fudan University, Human Phenome Institute, Fudan University, Shanghai, China
- School of Global Health, Chinese Center for Tropical Diseases Research, Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Shuai Jiang
- Department of Vascular Surgery, Shanghai Key Laboratory of Vascular Lesion Regulation and Remodeling, Pudong Medical Center, Shanghai Pudong Hospital, Fudan University, Shanghai, China
| | - Zixin Hu
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, Shanghai, China
| | - Xiaoyan Jiang
- State Key Laboratory of Cardiology, Department of Pathology and Pathophysiology, School of Medicine, Tongji University, Shanghai, China
| | - Jingyi Wu
- Department of Geriatrics, Huadong Hospital, Shanghai Medical College, Fudan University, Human Phenome Institute, Fudan University, Shanghai, China
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, Shanghai, China
| | - Yi Li
- Department of Geriatrics, Huadong Hospital, Shanghai Medical College, Fudan University, Human Phenome Institute, Fudan University, Shanghai, China
- Department of Vascular Surgery, Shanghai Key Laboratory of Vascular Lesion Regulation and Remodeling, Pudong Medical Center, Shanghai Pudong Hospital, Fudan University, Shanghai, China
| | - Li Jin
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, Shanghai, China
| | - Xiaofeng Wang
- Department of Geriatrics, Huadong Hospital, Shanghai Medical College, Fudan University, Human Phenome Institute, Fudan University, Shanghai, China
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, Shanghai, China
| |
Collapse
|
7
|
Kersting J, Lazareva O, Louadi Z, Baumbach J, Blumenthal DB, List M. DysRegNet: Patient-specific and confounder-aware dysregulated network inference towards precision therapeutics. Br J Pharmacol 2024. [PMID: 39631757 DOI: 10.1111/bph.17395] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2024] [Revised: 09/09/2024] [Accepted: 10/05/2024] [Indexed: 12/07/2024] Open
Abstract
BACKGROUND AND PURPOSE Gene regulation is frequently altered in diseases in unique and patient-specific ways. Hence, personalised strategies have been proposed to infer patient-specific gene-regulatory networks. However, existing methods do not scale well because they often require recomputing the entire network per sample. Moreover, they do not account for clinically important confounding factors such as age, sex or treatment history. Finally, a user-friendly implementation for the analysis and interpretation of such networks is missing. EXPERIMENTAL APPROACH We present DysRegNet, a method for inferring patient-specific regulatory alterations (dysregulations) from bulk gene expression profiles. We compared DysRegNet to the well-known SSN method, considering patient clustering, promoter methylation, mutations and cancer-stage data. KEY RESULTS We demonstrate that both SSN and DysRegNet produce interpretable and biologically meaningful networks across various cancer types. In contrast to SSN, DysRegNet can scale to arbitrary sample numbers and highlights the importance of confounders in network inference, revealing an age-specific bias in gene regulation in breast cancer. DysRegNet is available as a Python package (https://github.com/biomedbigdata/DysRegNet_package), and analysis results for 11 TCGA cancer types are available through an interactive web interface (https://exbio.wzw.tum.de/dysregnet). CONCLUSION AND IMPLICATIONS DysRegNet introduces a novel bioinformatics tool enabling confounder-aware and patient-specific network analysis to unravel regulatory alteration in complex diseases.
Collapse
Affiliation(s)
- Johannes Kersting
- Data Science in Systems Biology, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Olga Lazareva
- Chair of Experimental Bioinformatics, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
- European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany
- Division of Computational Genomics and Systems Genetics, German Cancer Research Center (DKFZ), Heidelberg, Germany
- Junior Clinical Cooperation Unit Multiparametric Methods for Early Detection of Prostate Cancer, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Zakaria Louadi
- Chair of Experimental Bioinformatics, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
- Institute for Computational Systems Biology, University of Hamburg, Hamburg, Germany
| | - Jan Baumbach
- Institute for Computational Systems Biology, University of Hamburg, Hamburg, Germany
- Department of Mathematics and Computer Science, University of Southern Denmark, Odense, Denmark
| | - David B Blumenthal
- Department Artificial Intelligence in Biomedical Engineering (AIBE), Friedrich-Alexander-Universität Erlangen-Nürnberg (FAU), Erlangen, Germany
| | - Markus List
- Data Science in Systems Biology, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
- Munich Data Science Institute, Technical University of Munich, Garching, Germany
| |
Collapse
|
8
|
Saha E, Fanfani V, Mandros P, Ben Guebila M, Fischer J, Shutta KH, DeMeo DL, Lopes-Ramos CM, Quackenbush J. Bayesian inference of sample-specific coexpression networks. Genome Res 2024; 34:1397-1410. [PMID: 39134413 PMCID: PMC11529861 DOI: 10.1101/gr.279117.124] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2024] [Accepted: 07/31/2024] [Indexed: 08/28/2024]
Abstract
Gene regulatory networks (GRNs) are effective tools for inferring complex interactions between molecules that regulate biological processes and hence can provide insights into drivers of biological systems. Inferring coexpression networks is a critical element of GRN inference, as the correlation between expression patterns may indicate that genes are coregulated by common factors. However, methods that estimate coexpression networks generally derive an aggregate network representing the mean regulatory properties of the population and so fail to fully capture population heterogeneity. Bayesian optimized networks obtained by assimilating omic data (BONOBO) is a scalable Bayesian model for deriving individual sample-specific coexpression matrices that recognizes variations in molecular interactions across individuals. For each sample, BONOBO assumes a Gaussian distribution on the log-transformed centered gene expression and a conjugate prior distribution on the sample-specific coexpression matrix constructed from all other samples in the data. Combining the sample-specific gene coexpression with the prior distribution, BONOBO yields a closed-form solution for the posterior distribution of the sample-specific coexpression matrices, thus allowing the analysis of large data sets. We demonstrate BONOBO's utility in several contexts, including analyzing gene regulation in yeast transcription factor knockout studies, the prognostic significance of miRNA-mRNA interaction in human breast cancer subtypes, and sex differences in gene regulation within human thyroid tissue. We find that BONOBO outperforms other methods that have been used for sample-specific coexpression network inference and provides insight into individual differences in the drivers of biological processes.
Collapse
Affiliation(s)
- Enakshi Saha
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts 02115, USA
| | - Viola Fanfani
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts 02115, USA
| | - Panagiotis Mandros
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts 02115, USA
| | - Marouen Ben Guebila
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts 02115, USA
| | - Jonas Fischer
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts 02115, USA
| | - Katherine H Shutta
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts 02115, USA
- Channing Division of Network Medicine, Brigham and Women's Hospital, Boston, Massachusetts 02115, USA
| | - Dawn L DeMeo
- Channing Division of Network Medicine, Brigham and Women's Hospital, Boston, Massachusetts 02115, USA
- Department of Medicine, Harvard Medical School, Boston, Massachusetts 02115, USA
| | - Camila M Lopes-Ramos
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts 02115, USA
- Channing Division of Network Medicine, Brigham and Women's Hospital, Boston, Massachusetts 02115, USA
- Department of Medicine, Harvard Medical School, Boston, Massachusetts 02115, USA
| | - John Quackenbush
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts 02115, USA;
- Channing Division of Network Medicine, Brigham and Women's Hospital, Boston, Massachusetts 02115, USA
- Department of Data Science, Dana-Farber Cancer Institute, Boston, Massachusetts 02215, USA
| |
Collapse
|
9
|
Zhang J, Liu L, Wei X, Zhao C, Luo Y, Li J, Le TD. Scanning sample-specific miRNA regulation from bulk and single-cell RNA-sequencing data. BMC Biol 2024; 22:218. [PMID: 39334271 PMCID: PMC11438147 DOI: 10.1186/s12915-024-02020-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2024] [Accepted: 09/24/2024] [Indexed: 09/30/2024] Open
Abstract
BACKGROUND RNA-sequencing technology provides an effective tool for understanding miRNA regulation in complex human diseases, including cancers. A large number of computational methods have been developed to make use of bulk and single-cell RNA-sequencing data to identify miRNA regulations at the resolution of multiple samples (i.e. group of cells or tissues). However, due to the heterogeneity of individual samples, there is a strong need to infer miRNA regulation specific to individual samples to uncover miRNA regulation at the single-sample resolution level. RESULTS Here, we develop a framework, Scan, for scanning sample-specific miRNA regulation. Since a single network inference method or strategy cannot perform well for all types of new data, Scan incorporates 27 network inference methods and two strategies to infer tissue-specific or cell-specific miRNA regulation from bulk or single-cell RNA-sequencing data. Results on bulk and single-cell RNA-sequencing data demonstrate the effectiveness of Scan in inferring sample-specific miRNA regulation. Moreover, we have found that incorporating the prior information of miRNA targets can generally improve the accuracy of miRNA target prediction. In addition, Scan can contribute to construct cell/tissue correlation networks and recover aggregate miRNA regulatory networks. Finally, the comparison results have shown that the performance of network inference methods is likely to be data-specific, and selecting optimal network inference methods is required for more accurate prediction of miRNA targets. CONCLUSIONS Scan provides a useful method to help infer sample-specific miRNA regulation for new data, benchmark new network inference methods and deepen the understanding of miRNA regulation at the resolution of individual samples.
Collapse
Affiliation(s)
- Junpeng Zhang
- School of Engineering, Dali University, Dali, 671003, Yunnan, China.
| | - Lin Liu
- UniSA STEM, University of South Australia, Mawson Lakes, SA, 5095, Australia
| | - Xuemei Wei
- School of Engineering, Dali University, Dali, 671003, Yunnan, China
| | - Chunwen Zhao
- School of Engineering, Dali University, Dali, 671003, Yunnan, China
| | - Yanbi Luo
- School of Engineering, Dali University, Dali, 671003, Yunnan, China
| | - Jiuyong Li
- UniSA STEM, University of South Australia, Mawson Lakes, SA, 5095, Australia
| | - Thuc Duy Le
- UniSA STEM, University of South Australia, Mawson Lakes, SA, 5095, Australia.
| |
Collapse
|
10
|
Chang LY, Hao TY, Wang WJ, Lin CY. Inference of single-cell network using mutual information for scRNA-seq data analysis. BMC Bioinformatics 2024; 25:292. [PMID: 39237886 PMCID: PMC11378379 DOI: 10.1186/s12859-024-05895-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2022] [Accepted: 08/08/2024] [Indexed: 09/07/2024] Open
Abstract
BACKGROUND With the advance in single-cell RNA sequencing (scRNA-seq) technology, deriving inherent biological system information from expression profiles at a single-cell resolution has become possible. It has been known that network modeling by estimating the associations between genes could better reveal dynamic changes in biological systems. However, accurately constructing a single-cell network (SCN) to capture the network architecture of each cell and further explore cell-to-cell heterogeneity remains challenging. RESULTS We introduce SINUM, a method for constructing the SIngle-cell Network Using Mutual information, which estimates mutual information between any two genes from scRNA-seq data to determine whether they are dependent or independent in a specific cell. Experiments on various scRNA-seq datasets with different cell numbers based on eight performance indexes (e.g., adjusted rand index and F-measure index) validated the accuracy and robustness of SINUM in cell type identification, superior to the state-of-the-art SCN inference method. Additionally, the SINUM SCNs exhibit high overlap with the human interactome and possess the scale-free property. CONCLUSIONS SINUM presents a view of biological systems at the network level to detect cell-type marker genes/gene pairs and investigate time-dependent changes in gene associations during embryo development. Codes for SINUM are freely available at https://github.com/SysMednet/SINUM .
Collapse
Affiliation(s)
- Lan-Yun Chang
- Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu, 300, Taiwan
| | - Ting-Yi Hao
- Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu, 300, Taiwan
| | - Wei-Jie Wang
- Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu, 300, Taiwan
| | - Chun-Yu Lin
- Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu, 300, Taiwan.
- Department of Biological Science and Technology, National Yang Ming Chiao Tung University, Hsinchu, 300, Taiwan.
- Institute of Data Science and Engineering, National Yang Ming Chiao Tung University, Hsinchu, 300, Taiwan.
- Center for Intelligent Drug Systems and Smart Bio-Devices, National Yang Ming Chiao Tung University, Hsinchu, 300, Taiwan.
- Cancer and Immunology Research Center, National Yang Ming Chiao Tung University, Taipei, 112, Taiwan.
- School of Dentistry, Kaohsiung Medical University, Kaohsiung, 807, Taiwan.
| |
Collapse
|
11
|
Arici MK, Tuncbag N. Unveiling hidden connections in omics data via pyPARAGON: an integrative hybrid approach for disease network construction. Brief Bioinform 2024; 25:bbae399. [PMID: 39163205 PMCID: PMC11334722 DOI: 10.1093/bib/bbae399] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2023] [Revised: 06/26/2024] [Accepted: 08/07/2024] [Indexed: 08/22/2024] Open
Abstract
Network inference or reconstruction algorithms play an integral role in successfully analyzing and identifying causal relationships between omics hits for detecting dysregulated and altered signaling components in various contexts, encompassing disease states and drug perturbations. However, accurate representation of signaling networks and identification of context-specific interactions within sparse omics datasets in complex interactomes pose significant challenges in integrative approaches. To address these challenges, we present pyPARAGON (PAgeRAnk-flux on Graphlet-guided network for multi-Omic data integratioN), a novel tool that combines network propagation with graphlets. pyPARAGON enhances accuracy and minimizes the inclusion of nonspecific interactions in signaling networks by utilizing network rather than relying on pairwise connections among proteins. Through comprehensive evaluations on benchmark signaling pathways, we demonstrate that pyPARAGON outperforms state-of-the-art approaches in node propagation and edge inference. Furthermore, pyPARAGON exhibits promising performance in discovering cancer driver networks. Notably, we demonstrate its utility in network-based stratification of patient tumors by integrating phosphoproteomic data from 105 breast cancer tumors with the interactome and demonstrating tumor-specific signaling pathways. Overall, pyPARAGON is a novel tool for analyzing and integrating multi-omic data in the context of signaling networks. pyPARAGON is available at https://github.com/netlab-ku/pyPARAGON.
Collapse
Affiliation(s)
- Muslum Kaan Arici
- Graduate School of Informatics, Middle East Technical University, Ankara 06800, Turkey
| | - Nurcan Tuncbag
- Chemical and Biological Engineering, College of Engineering, Koc University, Istanbul 34450, Turkey
- School of Medicine, Koc University, Istanbul 34450, Turkey
- Koc University Research Center for Translational Medicine (KUTTAM), Koc University, Istanbul 34450, Turkey
| |
Collapse
|
12
|
Zhang Y, Zhao L, Sun Y. Using single-sample networks to identify the contrasting patterns of gene interactions and reveal the radiation dose-dependent effects in multiple tissues of spaceflight mice. NPJ Microgravity 2024; 10:45. [PMID: 38575629 PMCID: PMC10995210 DOI: 10.1038/s41526-024-00383-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2023] [Accepted: 03/08/2024] [Indexed: 04/06/2024] Open
Abstract
Transcriptome profiles are sensitive to space stressors and serve as valuable indicators of the biological effects during spaceflight. Herein, we transformed the expression profiles into gene interaction patterns by single-sample networks (SSNs) and performed the integrated analysis on the 301 spaceflight and 290 ground control samples, which were obtained from the GeneLab platform. Specifically, an individual SSN was established for each sample. Based on the topological structures of 591 SSNs, the differentially interacted genes (DIGs) were identified between spaceflights and ground controls. The results showed that spaceflight disrupted the gene interaction patterns in mice and resulted in significant enrichment of biological processes such as protein/amino acid metabolism and nucleic acid (DNA/RNA) metabolism (P-value < 0.05). We observed that the mice exposed to radiation doses within the three intervals (4.66-7.14, 7.592-8.295, 8.49-22.099 mGy) exhibited similar gene interaction patterns. Low and medium doses resulted in changes to the circadian rhythm, while the damaging effects on genetic material became more pronounced in higher doses. The gene interaction patterns in response to space stressors varied among different tissues, with the spleen, lung, and skin being the most responsive to space radiation (P-value < 0.01). The changes observed in gene networks during spaceflight conditions might contribute to the development of various diseases, such as mental disorders, depression, and metabolic disorders, among others. Additionally, organisms activated specific gene networks in response to virus reactivation. We identified several hub genes that were associated with circadian rhythms, suggesting that spaceflight could lead to substantial circadian rhythm dysregulation.
Collapse
Affiliation(s)
- Yan Zhang
- Institute of Environmental Systems Biology, College of Environmental Science and Engineering, Dalian Maritime University, 116026, Dalian, Liaoning, China
| | - Lei Zhao
- Institute of Environmental Systems Biology, College of Environmental Science and Engineering, Dalian Maritime University, 116026, Dalian, Liaoning, China.
| | - Yeqing Sun
- Institute of Environmental Systems Biology, College of Environmental Science and Engineering, Dalian Maritime University, 116026, Dalian, Liaoning, China.
| |
Collapse
|
13
|
Wang S, Lee D. Community cohesion looseness in gene networks reveals individualized drug targets and resistance. Brief Bioinform 2024; 25:bbae175. [PMID: 38622359 PMCID: PMC11018546 DOI: 10.1093/bib/bbae175] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2024] [Revised: 03/19/2024] [Accepted: 04/02/2024] [Indexed: 04/17/2024] Open
Abstract
Community cohesion plays a critical role in the determination of an individual's health in social science. Intriguingly, a community structure of gene networks indicates that the concept of community cohesion could be applied between the genes as well to overcome the limitations of single gene-based biomarkers for precision oncology. Here, we develop community cohesion scores which precisely quantify the community ability to retain the interactions between the genes and their cellular functions in each individualized gene network. Using breast cancer as a proof-of-concept study, we measure the community cohesion score profiles of 950 case samples and predict the individualized therapeutic targets in 2-fold. First, we prioritize them by finding druggable genes present in the community with the most and relatively decreased scores in each individual. Then, we pinpoint more individualized therapeutic targets by discovering the genes which greatly contribute to the community cohesion looseness in each individualized gene network. Compared with the previous approaches, the community cohesion scores show at least four times higher performance in predicting effective individualized chemotherapy targets based on drug sensitivity data. Furthermore, the community cohesion scores successfully discover the known breast cancer subtypes and we suggest new targeted therapy targets for triple negative breast cancer (e.g. KIT and GABRP). Lastly, we demonstrate that the community cohesion scores can predict tamoxifen responses in ER+ breast cancer and suggest potential combination therapies (e.g. NAMPT and RXRA inhibitors) to reduce endocrine therapy resistance based on individualized characteristics. Our method opens new perspectives for the biomarker development in precision oncology.
Collapse
Affiliation(s)
- Seunghyun Wang
- Department of Bio and Brain Engineering, KAIST, Daejeon, Republic of Korea
| | - Doheon Lee
- Department of Bio and Brain Engineering, KAIST, Daejeon, Republic of Korea
| |
Collapse
|
14
|
Deschildre J, Vandemoortele B, Loers JU, De Preter K, Vermeirssen V. Evaluation of single-sample network inference methods for precision oncology. NPJ Syst Biol Appl 2024; 10:18. [PMID: 38360881 PMCID: PMC10869342 DOI: 10.1038/s41540-024-00340-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2023] [Accepted: 01/17/2024] [Indexed: 02/17/2024] Open
Abstract
A major challenge in precision oncology is to detect targetable cancer vulnerabilities in individual patients. Modeling high-throughput omics data in biological networks allows identifying key molecules and processes of tumorigenesis. Traditionally, network inference methods rely on many samples to contain sufficient information for learning, resulting in aggregate networks. However, to implement patient-tailored approaches in precision oncology, we need to interpret omics data at the level of individual patients. Several single-sample network inference methods have been developed that infer biological networks for an individual sample from bulk RNA-seq data. However, only a limited comparison of these methods has been made and many methods rely on 'normal tissue' samples as reference, which are not always available. Here, we conducted an evaluation of the single-sample network inference methods SSN, LIONESS, SWEET, iENA, CSN and SSPGI using transcriptomic profiles of lung and brain cancer cell lines from the CCLE database. The methods constructed functional gene networks with distinct network characteristics. Hub gene analyses revealed different degrees of subtype-specificity across methods. Single-sample networks were able to distinguish between tumor subtypes, as exemplified by node strength clustering, enrichment of known subtype-specific driver genes among hubs and differential node strength. We also showed that single-sample networks correlated better to other omics data from the same cell line as compared to aggregate networks. We conclude that single-sample network inference methods can reflect sample-specific biology when 'normal tissue' samples are absent and we point out peculiarities of each method.
Collapse
Affiliation(s)
- Joke Deschildre
- Lab for Computational Biology, Integromics and Gene Regulation (CBIGR), Cancer Research Institute Ghent (CRIG), Ghent, Belgium
- Department of Biomedical Molecular Biology, Ghent University, Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, Ghent, Belgium
| | - Boris Vandemoortele
- Lab for Computational Biology, Integromics and Gene Regulation (CBIGR), Cancer Research Institute Ghent (CRIG), Ghent, Belgium
- Department of Biomedical Molecular Biology, Ghent University, Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, Ghent, Belgium
| | - Jens Uwe Loers
- Lab for Computational Biology, Integromics and Gene Regulation (CBIGR), Cancer Research Institute Ghent (CRIG), Ghent, Belgium
- Department of Biomedical Molecular Biology, Ghent University, Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, Ghent, Belgium
| | - Katleen De Preter
- Department of Biomolecular Medicine, Ghent University, Ghent, Belgium
- Lab of Translational Onco-genomics and Bio-informatics, Center for Medical Biotechnology (VIB-UGent), Cancer Research Institute Ghent (CRIG), Ghent, Belgium
| | - Vanessa Vermeirssen
- Lab for Computational Biology, Integromics and Gene Regulation (CBIGR), Cancer Research Institute Ghent (CRIG), Ghent, Belgium.
- Department of Biomedical Molecular Biology, Ghent University, Ghent, Belgium.
- Department of Biomolecular Medicine, Ghent University, Ghent, Belgium.
| |
Collapse
|
15
|
Chang LY, Lee MZ, Wu Y, Lee WK, Ma CL, Chang JM, Chen CW, Huang TC, Lee CH, Lee JC, Tseng YY, Lin CY. Gene set correlation enrichment analysis for interpreting and annotating gene expression profiles. Nucleic Acids Res 2024; 52:e17. [PMID: 38096046 PMCID: PMC10853793 DOI: 10.1093/nar/gkad1187] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2022] [Revised: 11/17/2023] [Accepted: 11/29/2023] [Indexed: 02/10/2024] Open
Abstract
Pathway analysis, including nontopology-based (non-TB) and topology-based (TB) methods, is widely used to interpret the biological phenomena underlying differences in expression data between two phenotypes. By considering dependencies and interactions between genes, TB methods usually perform better than non-TB methods in identifying pathways that include closely relevant or directly causative genes for a given phenotype. However, most TB methods may be limited by incomplete pathway data used as the reference network or by difficulties in selecting appropriate reference networks for different research topics. Here, we propose a gene set correlation enrichment analysis method, Gscore, based on an expression dataset-derived coexpression network to examine whether a differentially expressed gene (DEG) list (or each of its DEGs) is associated with a known gene set. Gscore is better able to identify target pathways in 89 human disease expression datasets than eight other state-of-the-art methods and offers insight into how disease-wide and pathway-wide associations reflect clinical outcomes. When applied to RNA-seq data from COVID-19-related cells and patient samples, Gscore provided a means for studying how DEGs are implicated in COVID-19-related pathways. In summary, Gscore offers a powerful analytical approach for annotating individual DEGs, DEG lists, and genome-wide expression profiles based on existing biological knowledge.
Collapse
Affiliation(s)
- Lan-Yun Chang
- Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu 300, Taiwan
| | - Meng-Zhan Lee
- Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu 300, Taiwan
| | - Yujia Wu
- Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu 300, Taiwan
| | - Wen-Kai Lee
- Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu 300, Taiwan
| | - Chia-Liang Ma
- Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu 300, Taiwan
| | - Jun-Mao Chang
- Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu 300, Taiwan
| | - Ciao-Wen Chen
- Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu 300, Taiwan
| | - Tzu-Chun Huang
- Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu 300, Taiwan
| | - Chia-Hwa Lee
- School of Medical Laboratory Science and Biotechnology, College of Medical Science and Technology, Taipei Medical University, New Taipei City 235, Taiwan
- Center for Intelligent Drug Systems and Smart Bio-devices (IDSB), National Yang Ming Chiao Tung University, Hsinchu 300, Taiwan
- TMU Research Center of Cancer Translational Medicine, Taipei Medical University, Taipei 110, Taiwan
- Ph.D. Program in Medical Biotechnology, College of Medical Science and Technology, Taipei Medical University, New Taipei City 235, Taiwan
| | - Jih-Chin Lee
- Department of Otolaryngology-Head and Neck Surgery, Tri-Service General Hospital, National Defense Medical Center, Taipei 110, Taiwan
| | - Yu-Yao Tseng
- Department of Food Science, Nutrition, and Nutraceutical Biotechnology, Shih Chien University, Taipei 104, Taiwan
| | - Chun-Yu Lin
- Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu 300, Taiwan
- Center for Intelligent Drug Systems and Smart Bio-devices (IDSB), National Yang Ming Chiao Tung University, Hsinchu 300, Taiwan
- Department of Biological Science and Technology, National Yang Ming Chiao Tung University, Hsinchu 300, Taiwan
- Cancer and Immunology Research Center, National Yang Ming Chiao Tung University, Taipei 112, Taiwan
- Institute of Data Science and Engineering, National Yang Ming Chiao Tung University, Hsinchu 300, Taiwan
- School of Dentistry, Kaohsiung Medical University, Kaohsiung 807, Taiwan
| |
Collapse
|
16
|
De Marzio M, Glass K, Kuijjer ML. Single-sample network modeling on omics data. BMC Biol 2023; 21:296. [PMID: 38155351 PMCID: PMC10755944 DOI: 10.1186/s12915-023-01783-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2023] [Accepted: 11/27/2023] [Indexed: 12/30/2023] Open
Affiliation(s)
- Margherita De Marzio
- Channing Division of Network Medicine, Brigham and Women's Hospital and Harvard Medicine School, Boston, MA, USA
| | - Kimberly Glass
- Channing Division of Network Medicine, Brigham and Women's Hospital and Harvard Medicine School, Boston, MA, USA.
- Biostatistics Department, Harvard Chan School of Public Health, Boston, MA, USA.
| | - Marieke L Kuijjer
- Centre for Molecular Medicine Norway (NCMM), Nordic EMBL Partnership, University of Oslo, Oslo, Norway.
| |
Collapse
|