1
|
Tiong KL, Sintupisut N, Lin MC, Cheng CH, Woolston A, Lin CH, Ho M, Lin YW, Padakanti S, Yeang CH. An integrated analysis of the cancer genome atlas data discovers a hierarchical association structure across thirty three cancer types. PLOS DIGITAL HEALTH 2022; 1:e0000151. [PMID: 36812605 PMCID: PMC9931374 DOI: 10.1371/journal.pdig.0000151] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/01/2022] [Accepted: 10/31/2022] [Indexed: 06/18/2023]
Abstract
Cancer cells harbor molecular alterations at all levels of information processing. Genomic/epigenomic and transcriptomic alterations are inter-related between genes, within and across cancer types and may affect clinical phenotypes. Despite the abundant prior studies of integrating cancer multi-omics data, none of them organizes these associations in a hierarchical structure and validates the discoveries in extensive external data. We infer this Integrated Hierarchical Association Structure (IHAS) from the complete data of The Cancer Genome Atlas (TCGA) and compile a compendium of cancer multi-omics associations. Intriguingly, diverse alterations on genomes/epigenomes from multiple cancer types impact transcriptions of 18 Gene Groups. Half of them are further reduced to three Meta Gene Groups enriched with (1) immune and inflammatory responses, (2) embryonic development and neurogenesis, (3) cell cycle process and DNA repair. Over 80% of the clinical/molecular phenotypes reported in TCGA are aligned with the combinatorial expressions of Meta Gene Groups, Gene Groups, and other IHAS subunits. Furthermore, IHAS derived from TCGA is validated in more than 300 external datasets including multi-omics measurements and cellular responses upon drug treatments and gene perturbations in tumors, cancer cell lines, and normal tissues. To sum up, IHAS stratifies patients in terms of molecular signatures of its subunits, selects targeted genes or drugs for precision cancer therapy, and demonstrates that associations between survival times and transcriptional biomarkers may vary with cancer types. These rich information is critical for diagnosis and treatments of cancers.
Collapse
Affiliation(s)
- Khong-Loon Tiong
- Institute of Statistical Science, Academia Sinica, Section 2, Taipei, Taiwan
| | - Nardnisa Sintupisut
- Institute of Statistical Science, Academia Sinica, Section 2, Taipei, Taiwan
| | - Min-Chin Lin
- Institute of Statistical Science, Academia Sinica, Section 2, Taipei, Taiwan
- Psomagen, Rockville, Maryland, United States of America
| | - Chih-Hung Cheng
- Institute of Statistical Science, Academia Sinica, Section 2, Taipei, Taiwan
| | - Andrew Woolston
- Institute of Statistical Science, Academia Sinica, Section 2, Taipei, Taiwan
- Translational Cancer Immunotherapy & Genomics Lab, Barts Cancer Institute, Charterhouse Square, London, United Kingdom
| | - Chih-Hsu Lin
- Institute of Statistical Science, Academia Sinica, Section 2, Taipei, Taiwan
- C3.ai, Redwood City, California, United States of America
| | - Mirrian Ho
- Institute of Statistical Science, Academia Sinica, Section 2, Taipei, Taiwan
| | - Yu-Wei Lin
- Institute of Statistical Science, Academia Sinica, Section 2, Taipei, Taiwan
- AiLife Diagnostics, Pearland, Texas, United States of America
| | - Sridevi Padakanti
- Institute of Statistical Science, Academia Sinica, Section 2, Taipei, Taiwan
| | - Chen-Hsiang Yeang
- Institute of Statistical Science, Academia Sinica, Section 2, Taipei, Taiwan
| |
Collapse
|
2
|
Jackson CA, Vogel C. New horizons in the stormy sea of multimodal single-cell data integration. Mol Cell 2022; 82:248-259. [PMID: 35063095 PMCID: PMC8830781 DOI: 10.1016/j.molcel.2021.12.012] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2021] [Revised: 12/08/2021] [Accepted: 12/13/2021] [Indexed: 01/22/2023]
Abstract
While measurements of RNA expression have dominated the world of single-cell analyses, new single-cell techniques increasingly allow collection of different data modalities, measuring different molecules, structural connections, and intermolecular interactions. Integrating the resulting multimodal single-cell datasets is a new bioinformatics challenge. Equally important, it is a new experimental design challenge for the bench scientist, who is not only choosing from a myriad of techniques for each data modality but also faces new challenges in experimental design. The ultimate goal is to design, execute, and analyze multimodal single-cell experiments that are more than just descriptive but enable the learning of new causal and mechanistic biology. This objective requires strict consideration of the goals behind the analysis, which might range from mapping the heterogeneity of a cellular population to assembling system-wide causal networks that can further our understanding of cellular functions and eventually lead to models of tissues and organs. We review steps and challenges toward this goal. Single-cell transcriptomics is now a mature technology, and methods to measure proteins, lipids, small-molecule metabolites, and other molecular phenotypes at the single-cell level are rapidly developing. Integrating these single-cell readouts so that each cell has measurements of multiple types of data, e.g., transcriptomes, proteomes, and metabolomes, is expected to allow identification of highly specific cellular subpopulations and to provide the basis for inferring causal biological mechanisms.
Collapse
Affiliation(s)
- Christopher A Jackson
- New York University, Department of Biology, Center for Genomics and Systems Biology, New York, NY, USA.
| | - Christine Vogel
- New York University, Department of Biology, Center for Genomics and Systems Biology, New York, NY, USA
| |
Collapse
|
3
|
Ho JJD, Man JHS, Schatz JH, Marsden PA. Translational remodeling by RNA-binding proteins and noncoding RNAs. WILEY INTERDISCIPLINARY REVIEWS-RNA 2021; 12:e1647. [PMID: 33694288 DOI: 10.1002/wrna.1647] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/21/2020] [Revised: 02/09/2021] [Accepted: 02/10/2021] [Indexed: 12/14/2022]
Abstract
Responsible for generating the proteome that controls phenotype, translation is the ultimate convergence point for myriad upstream signals that influence gene expression. System-wide adaptive translational reprogramming has recently emerged as a pillar of cellular adaptation. As classic regulators of mRNA stability and translation efficiency, foundational studies established the concept of collaboration and competition between RNA-binding proteins (RBPs) and noncoding RNAs (ncRNAs) on individual mRNAs. Fresh conceptual innovations now highlight stress-activated, evolutionarily conserved RBP networks and ncRNAs that increase the translation efficiency of populations of transcripts encoding proteins that participate in a common cellular process. The discovery of post-transcriptional functions for long noncoding RNAs (lncRNAs) was particularly intriguing given their cell-type-specificity and historical definition as nuclear-functioning epigenetic regulators. The convergence of RBPs, lncRNAs, and microRNAs on functionally related mRNAs to enable adaptive protein synthesis is a newer biological paradigm that highlights their role as "translatome (protein output) remodelers" and reinvigorates the paradigm of "RNA operons." Together, these concepts modernize our understanding of cellular stress adaptation and strategies for therapeutic development. This article is categorized under: RNA Interactions with Proteins and Other Molecules > Protein-RNA Interactions: Functional Implications Translation > Translation Regulation Regulatory RNAs/RNAi/Riboswitches > Regulatory RNAs.
Collapse
Affiliation(s)
- J J David Ho
- Sylvester Comprehensive Cancer Center, Miller School of Medicine, University of Miami, Miami, Florida, USA.,Division of Hematology, Department of Medicine, Miller School of Medicine, University of Miami, Miami, Florida, USA
| | - Jeffrey H S Man
- Keenan Research Centre, Li Ka Shing Knowledge Institute, St. Michael's Hospital, Toronto, Ontario, Canada.,Department of Medicine, University of Toronto, Toronto, Ontario, Canada.,Department of Respirology, University Health Network, Latner Thoracic Research Laboratories, University of Toronto, Toronto, Ontario, Canada
| | - Jonathan H Schatz
- Sylvester Comprehensive Cancer Center, Miller School of Medicine, University of Miami, Miami, Florida, USA.,Division of Hematology, Department of Medicine, Miller School of Medicine, University of Miami, Miami, Florida, USA
| | - Philip A Marsden
- Keenan Research Centre, Li Ka Shing Knowledge Institute, St. Michael's Hospital, Toronto, Ontario, Canada.,Department of Medicine, University of Toronto, Toronto, Ontario, Canada
| |
Collapse
|
4
|
Wu C, Zhou F, Ren J, Li X, Jiang Y, Ma S. A Selective Review of Multi-Level Omics Data Integration Using Variable Selection. High Throughput 2019; 8:E4. [PMID: 30669303 PMCID: PMC6473252 DOI: 10.3390/ht8010004] [Citation(s) in RCA: 122] [Impact Index Per Article: 20.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2018] [Revised: 12/24/2018] [Accepted: 01/10/2019] [Indexed: 01/02/2023] Open
Abstract
High-throughput technologies have been used to generate a large amount of omics data. In the past, single-level analysis has been extensively conducted where the omics measurements at different levels, including mRNA, microRNA, CNV and DNA methylation, are analyzed separately. As the molecular complexity of disease etiology exists at all different levels, integrative analysis offers an effective way to borrow strength across multi-level omics data and can be more powerful than single level analysis. In this article, we focus on reviewing existing multi-omics integration studies by paying special attention to variable selection methods. We first summarize published reviews on integrating multi-level omics data. Next, after a brief overview on variable selection methods, we review existing supervised, semi-supervised and unsupervised integrative analyses within parallel and hierarchical integration studies, respectively. The strength and limitations of the methods are discussed in detail. No existing integration method can dominate the rest. The computation aspects are also investigated. The review concludes with possible limitations and future directions for multi-level omics data integration.
Collapse
Affiliation(s)
- Cen Wu
- Department of Statistics, Kansas State University, Manhattan, KS 66506, USA.
| | - Fei Zhou
- Department of Statistics, Kansas State University, Manhattan, KS 66506, USA.
| | - Jie Ren
- Department of Statistics, Kansas State University, Manhattan, KS 66506, USA.
| | - Xiaoxi Li
- Department of Statistics, Kansas State University, Manhattan, KS 66506, USA.
| | - Yu Jiang
- Division of Epidemiology, Biostatistics and Environmental Health, School of Public Health, University of Memphis, Memphis, TN 38152, USA.
| | - Shuangge Ma
- Department of Biostatistics, School of Public Health, Yale University, New Haven, CT 06510, USA.
| |
Collapse
|