1
|
Gao Y, Huang Q, Qin Y, Bao X, Pan Y, Mo J, Ning S. A prognostic model related to necrotizing apoptosis of breast cancer based on biorthogonal constrained depth semi-supervised nonnegative matrix decomposition and single-cell sequencing analysis. Am J Cancer Res 2023; 13:3875-3897. [PMID: 37818066 PMCID: PMC10560928] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2023] [Accepted: 07/31/2023] [Indexed: 10/12/2023] Open
Abstract
Breast cancer (BC) is one of the most common malignant tumours in women, and its prognosis is poor. The prognosis of BC patients can be improved by immunotherapy. However, due to the heterogeneity of BC, the identification of new biomarkers is urgently needed to improve the prognosis of BC patients. Necrotic apoptosis has been shown to play an essential role in many cancers. First, this study proposed a novel clustering algorithm called biorthogonal constrained depth semisupervised nonnegative matrix factorization (DO-DSNMF). The DO-DSNMF algorithm added multilayer nonlinear transformation to the coefficient matrix obtained after decomposition, which was used to mine the nonlinear relationship between samples. In addition, we also added orthogonal constraints on the basis matrix and coefficient matrix to reduce the influence of redundant features and samples on the results. We applied the DO-DSNMF algorithm and analysed the differences in survival and immunity between the subtypes. Then, we used prognosis analysis to construct the prognosis model. Finally, we analysed single cells using single-cell sequencing (scRNA-seq) data from the GSE75688 dataset in the GEO database. We identified two BC subtypes based on the BC transcriptome data in the TCGA database. Immune infiltration analysis showed that the necrotizing apoptosis-related genes of BC were related to various immune cells and immune functions. Necrotizing apoptosis was found to play a role in BC progression and immunity. The role of prognosis-related NRGs in BC was also verified by cell experiments. This study proposed a novel clustering algorithm to analyse BC subtypes and constructed an NRG prognostic model for BC. The prognosis and immune landscape of BC patients were evaluated by this model. The cell experiment supported its role in BC, which provides a potential therapeutic target for the treatment of BC.
Collapse
Affiliation(s)
- Yuan Gao
- Department of Head and Neck Radiotherapy, Harbin Medical University Cancer Hospital Harbin 150000, Heilongjiang, China
| | - Qinghua Huang
- Department of Breast Surgery, Wuzhou Red Cross Hospital Wuzhou 543000, Guangxi, China
| | - Yuling Qin
- Department of Clinical Laboratory, Guangxi Medical University Cancer Hospital Nanning 530000, Guangxi, China
| | - Xianhui Bao
- Department of Neurology, Harbin The First Hospital Harbin 150000, Heilongjiang, China
| | - You Pan
- Department of Breast Surgery, Guangxi Medical University Cancer Hospital Nanning 530000, Guangxi, China
| | - Jianlan Mo
- Department of Anesthesiology, The Maternal and Child Health Hospital of Guangxi Zhuang Autonomous Region Nanning 530000, Guangxi, China
| | - Shipeng Ning
- Department of Breast Surgery, Guangxi Medical University Cancer Hospital Nanning 530000, Guangxi, China
| |
Collapse
|
2
|
Fratello M, Cattelani L, Federico A, Pavel A, Scala G, Serra A, Greco D. Unsupervised Algorithms for Microarray Sample Stratification. Methods Mol Biol 2022; 2401:121-146. [PMID: 34902126 DOI: 10.1007/978-1-0716-1839-4_9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
The amount of data made available by microarrays gives researchers the opportunity to delve into the complexity of biological systems. However, the noisy and extremely high-dimensional nature of this kind of data poses significant challenges. Microarrays allow for the parallel measurement of thousands of molecular objects spanning different layers of interactions. In order to be able to discover hidden patterns, the most disparate analytical techniques have been proposed. Here, we describe the basic methodologies to approach the analysis of microarray datasets that focus on the task of (sub)group discovery.
Collapse
Affiliation(s)
- Michele Fratello
- Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland
- BioMediTech Institute, Tampere University, Tampere, Finland
- Finnish Hub for Development and Validation of Integrated Approaches (FHAIVE), Tampere University, Tampere, Finland
| | - Luca Cattelani
- Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland
- BioMediTech Institute, Tampere University, Tampere, Finland
- Finnish Hub for Development and Validation of Integrated Approaches (FHAIVE), Tampere University, Tampere, Finland
| | - Antonio Federico
- Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland
- BioMediTech Institute, Tampere University, Tampere, Finland
- Finnish Hub for Development and Validation of Integrated Approaches (FHAIVE), Tampere University, Tampere, Finland
| | - Alisa Pavel
- Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland
- BioMediTech Institute, Tampere University, Tampere, Finland
- Finnish Hub for Development and Validation of Integrated Approaches (FHAIVE), Tampere University, Tampere, Finland
| | - Giovanni Scala
- Department of Biology, University of Naples Federico II, Naples, Italy
| | - Angela Serra
- Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland
- BioMediTech Institute, Tampere University, Tampere, Finland
- Finnish Hub for Development and Validation of Integrated Approaches (FHAIVE), Tampere University, Tampere, Finland
| | - Dario Greco
- Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland.
- BioMediTech Institute, Tampere University, Tampere, Finland.
- Finnish Hub for Development and Validation of Integrated Approaches (FHAIVE), Tampere University, Tampere, Finland.
- Institute of Biotechnology, University of Helsinki, Helsinki, Finland.
| |
Collapse
|
3
|
Stein-O'Brien GL, Arora R, Culhane AC, Favorov AV, Garmire LX, Greene CS, Goff LA, Li Y, Ngom A, Ochs MF, Xu Y, Fertig EJ. Enter the Matrix: Factorization Uncovers Knowledge from Omics. Trends Genet 2018; 34:790-805. [PMID: 30143323 PMCID: PMC6309559 DOI: 10.1016/j.tig.2018.07.003] [Citation(s) in RCA: 100] [Impact Index Per Article: 16.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2018] [Revised: 06/01/2018] [Accepted: 07/16/2018] [Indexed: 12/20/2022]
Abstract
Omics data contain signals from the molecular, physical, and kinetic inter- and intracellular interactions that control biological systems. Matrix factorization (MF) techniques can reveal low-dimensional structure from high-dimensional data that reflect these interactions. These techniques can uncover new biological knowledge from diverse high-throughput omics data in applications ranging from pathway discovery to timecourse analysis. We review exemplary applications of MF for systems-level analyses. We discuss appropriate applications of these methods, their limitations, and focus on the analysis of results to facilitate optimal biological interpretation. The inference of biologically relevant features with MF enables discovery from high-throughput data beyond the limits of current biological knowledge - answering questions from high-dimensional data that we have not yet thought to ask.
Collapse
Affiliation(s)
- Genevieve L Stein-O'Brien
- Department of Oncology, Division of Biostatistics and Bioinformatics, Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins School of Medicine, Baltimore, MD, USA; Department of Neuroscience, Johns Hopkins School of Medicine, Baltimore, MD, USA; McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins School of Medicine, Baltimore, MD, USA
| | - Raman Arora
- Department of Computer Science, Institute for Data Intensive Engineering and Science, Johns Hopkins University, Baltimore, MD, USA
| | - Aedin C Culhane
- Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute, Boston, MA, USA; Department of Biostatistics, Harvard TH Chan School of Public Health, Boston, MA, USA
| | - Alexander V Favorov
- Department of Oncology, Division of Biostatistics and Bioinformatics, Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins School of Medicine, Baltimore, MD, USA; Vavilov Institute of General Genetics, Moscow, Russia
| | | | - Casey S Greene
- Department of Systems Pharmacology and Translational Therapeutics, Perelman School of Medicine, University of Pennsylvania, PA, USA; Childhood Cancer Data Lab, Alex's Lemonade Stand Foundation, PA, USA
| | - Loyal A Goff
- Department of Neuroscience, Johns Hopkins School of Medicine, Baltimore, MD, USA; McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins School of Medicine, Baltimore, MD, USA
| | - Yifeng Li
- Digital Technologies Research Centre, National Research Council of Canada, Ottawa, ON, Canada
| | - Aloune Ngom
- School of Computer Science, University of Windsor, Windsor, ON, Canada
| | - Michael F Ochs
- Department of Mathematics and Statistics, The College of New Jersey, Ewing, NJ, USA
| | - Yanxun Xu
- Department of Applied Mathematics and Statistics, Whiting School of Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Elana J Fertig
- Department of Oncology, Division of Biostatistics and Bioinformatics, Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins School of Medicine, Baltimore, MD, USA.
| |
Collapse
|
4
|
Stein-O'Brien GL, Carey JL, Lee WS, Considine M, Favorov AV, Flam E, Guo T, Li S, Marchionni L, Sherman T, Sivy S, Gaykalova DA, McKay RD, Ochs MF, Colantuoni C, Fertig EJ. PatternMarkers & GWCoGAPS for novel data-driven biomarkers via whole transcriptome NMF. Bioinformatics 2018; 33:1892-1894. [PMID: 28174896 DOI: 10.1093/bioinformatics/btx058] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2016] [Accepted: 01/27/2017] [Indexed: 12/24/2022] Open
Abstract
Summary Non-negative Matrix Factorization (NMF) algorithms associate gene expression with biological processes (e.g. time-course dynamics or disease subtypes). Compared with univariate associations, the relative weights of NMF solutions can obscure biomarkers. Therefore, we developed a novel patternMarkers statistic to extract genes for biological validation and enhanced visualization of NMF results. Finding novel and unbiased gene markers with patternMarkers requires whole-genome data. Therefore, we also developed Genome-Wide CoGAPS Analysis in Parallel Sets (GWCoGAPS), the first robust whole genome Bayesian NMF using the sparse, MCMC algorithm, CoGAPS. Additionally, a manual version of the GWCoGAPS algorithm contains analytic and visualization tools including patternMatcher, a Shiny web application. The decomposition in the manual pipeline can be replaced with any NMF algorithm, for further generalization of the software. Using these tools, we find granular brain-region and cell-type specific signatures with corresponding biomarkers in GTEx data, illustrating GWCoGAPS and patternMarkers ascertainment of data-driven biomarkers from whole-genome data. Availability and Implementation PatternMarkers & GWCoGAPS are in the CoGAPS Bioconductor package (3.5) under the GPL license. Contact gsteinobrien@jhmi.edu or ccolantu@jhmi.edu or ejfertig@jhmi.edu. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Genevieve L Stein-O'Brien
- McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins School of Medicine, Baltimore, MD, USA.,Lieber Institute for Brain Development, Baltimore, MD, USA
| | - Jacob L Carey
- Department of Oncology and Division of Biostatistics and Bioinformatics, Johns Hopkins School of Medicine, Baltimore, MD, USA
| | - Wai Shing Lee
- Department of Oncology and Division of Biostatistics and Bioinformatics, Johns Hopkins School of Medicine, Baltimore, MD, USA
| | - Michael Considine
- Department of Oncology and Division of Biostatistics and Bioinformatics, Johns Hopkins School of Medicine, Baltimore, MD, USA
| | - Alexander V Favorov
- Department of Oncology and Division of Biostatistics and Bioinformatics, Johns Hopkins School of Medicine, Baltimore, MD, USA.,Vavilov Institute of General Genetics, Moscow, Russia.,Research Institute of Genetics and Selection of Industrial Microorganisms, Moscow, Russia
| | - Emily Flam
- Department of Otolaryngology-Head and Neck Surgery, Johns Hopkins School of Medicine, Baltimore, MD, USA
| | - Theresa Guo
- Department of Otolaryngology-Head and Neck Surgery, Johns Hopkins School of Medicine, Baltimore, MD, USA
| | - Sijia Li
- Department of Otolaryngology-Head and Neck Surgery, Johns Hopkins School of Medicine, Baltimore, MD, USA
| | - Luigi Marchionni
- Department of Otolaryngology-Head and Neck Surgery, Johns Hopkins School of Medicine, Baltimore, MD, USA
| | - Thomas Sherman
- Department of Mathematics and Statistics, The College of New Jersey, Ewing Township, NJ, USA
| | - Shawn Sivy
- Department of Mathematics and Statistics, The College of New Jersey, Ewing Township, NJ, USA
| | - Daria A Gaykalova
- Department of Otolaryngology-Head and Neck Surgery, Johns Hopkins School of Medicine, Baltimore, MD, USA
| | - Ronald D McKay
- Lieber Institute for Brain Development, Baltimore, MD, USA
| | - Michael F Ochs
- Department of Mathematics and Statistics, The College of New Jersey, Ewing Township, NJ, USA
| | - Carlo Colantuoni
- Department of Neurology and Department of Neuroscience, Johns Hopkins School of Medicine, Baltimore, MD, USA.,Institute for Genome Sciences, University of Maryland School of Medicine
| | - Elana J Fertig
- Department of Oncology and Division of Biostatistics and Bioinformatics, Johns Hopkins School of Medicine, Baltimore, MD, USA
| |
Collapse
|
5
|
Li Y, Min W, Li M, Han G, Dai D, Zhang L, Chen X, Wang X, Zhang Y, Yue Z, Liu J. Identification of hub genes and regulatory factors of glioblastoma multiforme subgroups by RNA-seq data analysis. Int J Mol Med 2016; 38:1170-8. [PMID: 27572852 PMCID: PMC5029949 DOI: 10.3892/ijmm.2016.2717] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2015] [Accepted: 08/04/2016] [Indexed: 11/24/2022] Open
Abstract
Glioblastoma multiforme (GBM) is the most common malignant brain tumor. This study aimed to identify the hub genes and regulatory factors of GBM subgroups by RNA sequencing (RNA-seq) data analysis, in order to explore the possible mechanisms responsbile for the progression of GBM. The dataset RNASeqV2 was downloaded by TCGA-Assembler, containing 169 GBM and 5 normal samples. Gene expression was calculated by the reads per kilobase per million reads measurement, and nor malized with tag count comparison. Following subgroup classification by the non-negative matrix factorization, the differentially expressed genes (DEGs) were screened in 4 GBM subgroups using the method of significance analysis of microarrays. Functional enrichment analysis was performed by DAVID, and the protein-protein interaction (PPI) network was constructed based on the HPRD database. The subgroup-related microRNAs (miRNAs or miRs), transcription factors (TFs) and small molecule drugs were predicted with predefined criteria. A cohort of 19,515 DEGs between the GBM and control samples was screened, which were predominantly enriched in cell cycle- and immunoreaction-related pathways. In the PPI network, lymphocyte cytosolic protein 2 (LCP2), breast cancer 1 (BRCA1), specificity protein 1 (Sp1) and chromodomain-helicase-DNA-binding protein 3 (CHD3) were the hub nodes in subgroups 1–4, respectively. Paired box 5 (PAX5), adipocyte protein 2 (aP2), E2F transcription factor 1 (E2F1) and cAMP-response element-binding protein-1 (CREB1) were the specific TFs in subgroups 1–4, respectively. miR-147b, miR-770-5p, miR-220a and miR-1247 were the particular miRNAs in subgroups 1–4, respectively. Natalizumab was the predicted small molecule drug in subgroup 2. In conclusion, the molecular regulatory mechanisms of GBM pathogenesis were distinct in the different subgroups. Several crucial genes, TFs, miRNAs and small molecules in the different GBM subgroups were identified, which may be used as potential markers. However, further experimental validations may be required.
Collapse
Affiliation(s)
- Yanan Li
- Department of Neurosurgery, Changhai Hospital, The Second Military Medical University, Shanghai 200433, P.R. China
| | - Weijie Min
- Department of Neurosurgery, Changhai Hospital, The Second Military Medical University, Shanghai 200433, P.R. China
| | - Mengmeng Li
- Department of Rheumatology and Immunology, Shanghai Changzheng Hospital, The Second Military Medical University, Shanghai 200003, P.R. China
| | - Guosheng Han
- Department of Neurosurgery, Changhai Hospital, The Second Military Medical University, Shanghai 200433, P.R. China
| | - Dongwei Dai
- Department of Neurosurgery, Changhai Hospital, The Second Military Medical University, Shanghai 200433, P.R. China
| | - Lei Zhang
- Department of Neurosurgery, Changhai Hospital, The Second Military Medical University, Shanghai 200433, P.R. China
| | - Xin Chen
- Department of Neurosurgery, Changhai Hospital, The Second Military Medical University, Shanghai 200433, P.R. China
| | - Xinglai Wang
- Department of Neurosurgery, Changhai Hospital, The Second Military Medical University, Shanghai 200433, P.R. China
| | - Yuhui Zhang
- Department of Neurosurgery, Changhai Hospital, The Second Military Medical University, Shanghai 200433, P.R. China
| | - Zhijian Yue
- Department of Neurosurgery, Changhai Hospital, The Second Military Medical University, Shanghai 200433, P.R. China
| | - Jianmin Liu
- Department of Neurosurgery, Changhai Hospital, The Second Military Medical University, Shanghai 200433, P.R. China
| |
Collapse
|