1
|
Chakraborty S, Sharma G, Karmakar S, Banerjee S. Multi-OMICS approaches in cancer biology: New era in cancer therapy. Biochim Biophys Acta Mol Basis Dis 2024; 1870:167120. [PMID: 38484941 DOI: 10.1016/j.bbadis.2024.167120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2024] [Revised: 03/06/2024] [Accepted: 03/06/2024] [Indexed: 04/01/2024]
Abstract
Innovative multi-omics frameworks integrate diverse datasets from the same patients to enhance our understanding of the molecular and clinical aspects of cancers. Advanced omics and multi-view clustering algorithms present unprecedented opportunities for classifying cancers into subtypes, refining survival predictions and treatment outcomes, and unravelling key pathophysiological processes across various molecular layers. However, with the increasing availability of cost-effective high-throughput technologies (HTT) that generate vast amounts of data, analyzing single layers often falls short of establishing causal relations. Integrating multi-omics data spanning genomes, epigenomes, transcriptomes, proteomes, metabolomes, and microbiomes offers unique prospects to comprehend the underlying biology of complex diseases like cancer. This discussion explores algorithmic frameworks designed to uncover cancer subtypes, disease mechanisms, and methods for identifying pivotal genomic alterations. It also underscores the significance of multi-omics in tumor classifications, diagnostics, and prognostications. Despite its unparalleled advantages, the integration of multi-omics data has been slow to find its way into everyday clinics. A major hurdle is the uneven maturity of different omics approaches and the widening gap between the generation of large datasets and the capacity to process this data. Initiatives promoting the standardization of sample processing and analytical pipelines, as well as multidisciplinary training for experts in data analysis and interpretation, are crucial for translating theoretical findings into practical applications.
Collapse
Affiliation(s)
- Sohini Chakraborty
- Department of Biotechnology, School of Biosciences and Technology, Vellore Institute of Technology, Vellore 632014, Tamil Nadu, India
| | - Gaurav Sharma
- Department of Biotechnology, School of Biosciences and Technology, Vellore Institute of Technology, Vellore 632014, Tamil Nadu, India
| | - Sricheta Karmakar
- Department of Biotechnology, School of Biosciences and Technology, Vellore Institute of Technology, Vellore 632014, Tamil Nadu, India
| | - Satarupa Banerjee
- Department of Biotechnology, School of Biosciences and Technology, Vellore Institute of Technology, Vellore 632014, Tamil Nadu, India.
| |
Collapse
|
2
|
Baird T, Roychoudhuri R. GS-TCGA: Gene Set-Based Analysis of The Cancer Genome Atlas. J Comput Biol 2024; 31:229-240. [PMID: 38436570 DOI: 10.1089/cmb.2023.0278] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/05/2024] Open
Abstract
Most tools for analyzing large gene expression datasets, including The Cancer Genome Atlas (TCGA), have focused on analyzing the expression of individual genes or inference of the abundance of specific cell types from whole transcriptome information. While these methods provide useful insights, they can overlook crucial process-based information that may enhance our understanding of cancer biology. In this study, we describe three novel tools incorporated into an online resource; gene set-based analysis of The Cancer Genome Atlas (GS-TCGA). GS-TCGA is designed to enable user-friendly exploration of TCGA data using gene set-based analysis, leveraging gene sets from the Molecular Signatures Database. GS-TCGA includes three unique tools: GS-Surv determines the association between the expression of gene sets and survival in human cancers. Co-correlative gene set enrichment analysis (CC-GSEA) utilizes interpatient heterogeneity in cancer gene expression to infer functions of specific genes based on GSEA of coregulated genes in TCGA. GS-Corr utilizes interpatient heterogeneity in cancer gene expression profiles to identify genes coregulated with the expression of specific gene sets in TCGA. Users are also able to upload custom gene sets for analysis with each tool. These tools empower researchers to perform survival analysis linked to gene set expression, explore the functional implications of gene coexpression, and identify potential gene regulatory mechanisms.
Collapse
Affiliation(s)
- Tarrion Baird
- Department of Pathology, University of Cambridge, Cambridge, United Kingdom
| | - Rahul Roychoudhuri
- Department of Pathology, University of Cambridge, Cambridge, United Kingdom
| |
Collapse
|
3
|
Romero-Garmendia I, Garcia-Etxebarria K. From Omic Layers to Personalized Medicine in Colorectal Cancer: The Road Ahead. Genes (Basel) 2023; 14:1430. [PMID: 37510334 PMCID: PMC10379575 DOI: 10.3390/genes14071430] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2023] [Revised: 07/05/2023] [Accepted: 07/10/2023] [Indexed: 07/30/2023] Open
Abstract
Colorectal cancer is a major health concern since it is a highly diagnosed cancer and the second cause of death among cancers. Thus, the most suitable biomarkers for its diagnosis, prognosis, and treatment have been studied to improve and personalize the prevention and clinical management of colorectal cancer. The emergence of omic techniques has provided a great opportunity to better study CRC and make personalized medicine feasible. In this review, we will try to summarize how the analysis of the omic layers can be useful for personalized medicine and the existing difficulties. We will discuss how single and multiple omic layer analyses have been used to improve the prediction of the risk of CRC and its outcomes and how to overcome the challenges in the use of omic layers in personalized medicine.
Collapse
Affiliation(s)
- Irati Romero-Garmendia
- Department of Genetics, Physical Anthropology and Animal Physiology, University of the Basque Country (Universidad del País Vasco/Euskal Herriko Unibertsitatea), 48940 Leioa, Spain
| | - Koldo Garcia-Etxebarria
- Biodonostia, Gastrointestinal Genetics Group, 20014 San Sebastián, Spain
- Centro de Investigación Biomédica en Red de Enfermedades Hepáticas y Digestivas (CIBERehd), 08036 Barcelona, Spain
| |
Collapse
|
4
|
Murali A, Sarkar RR. Mechano-immunology in microgravity. Life Sci Space Res (Amst) 2023; 37:50-64. [PMID: 37087179 DOI: 10.1016/j.lssr.2023.03.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/14/2022] [Revised: 02/16/2023] [Accepted: 03/05/2023] [Indexed: 05/03/2023]
Abstract
Life on Earth has evolved to thrive in the Earth's natural gravitational field; however, as space technology advances, we must revisit and investigate the effects of unnatural conditions on human health, such as gravitational change. Studies have shown that microgravity has a negative impact on various systemic parts of humans, with the effects being more severe in the human immune system. Increasing costs, limited experimental time, and sample handling issues hampered our understanding of this field. To address the existing knowledge gap and provide confidence in modelling the phenomena, in this review, we highlight experimental works in mechano-immunology under microgravity and different computational modelling approaches that can be used to address the existing problems.
Collapse
Affiliation(s)
- Anirudh Murali
- Chemical Engineering and Process Development, CSIR - National Chemical Laboratory, Pune, India; Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, 201002, India
| | - Ram Rup Sarkar
- Chemical Engineering and Process Development, CSIR - National Chemical Laboratory, Pune, India; Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, 201002, India.
| |
Collapse
|
5
|
Tan MS, Cheah PL, Chin AV, Looi LM, Chang SW. A review on omics-based biomarkers discovery for Alzheimer's disease from the bioinformatics perspectives: Statistical approach vs machine learning approach. Comput Biol Med 2021; 139:104947. [PMID: 34678481 DOI: 10.1016/j.compbiomed.2021.104947] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2021] [Revised: 10/12/2021] [Accepted: 10/12/2021] [Indexed: 12/26/2022]
Abstract
Alzheimer's Disease (AD) is a neurodegenerative disease that affects cognition and is the most common cause of dementia in the elderly. As the number of elderly individuals increases globally, the incidence and prevalence of AD are expected to increase. At present, AD is diagnosed clinically, according to accepted criteria. The essential elements in the diagnosis of AD include a patients history, a physical examination and neuropsychological testing, in addition to appropriate investigations such as neuroimaging. The omics-based approach is an emerging field of study that may not only aid in the diagnosis of AD but also facilitate the exploration of factors that influence the development of the disease. Omics techniques, including genomics, transcriptomics, proteomics and metabolomics, may reveal the pathways that lead to neuronal death and identify biomolecular markers associated with AD. This will further facilitate an understanding of AD neuropathology. In this review, omics-based approaches that were implemented in studies on AD were assessed from a bioinformatics perspective. Current state-of-the-art statistical and machine learning approaches used in the single omics analysis of AD were compared based on correlations of variants, differential expression, functional analysis and network analysis. This was followed by a review of the approaches used in the integration and analysis of multi-omics of AD. The strengths and limitations of multi-omics analysis methods were explored and the issues and challenges associated with omics studies of AD were highlighted. Lastly, future studies in this area of research were justified.
Collapse
Affiliation(s)
- Mei Sze Tan
- Bioinformatics Programme, Institute of Biological Sciences, Faculty of Science, University of Malaya, Kuala Lumpur, Malaysia
| | - Phaik-Leng Cheah
- Department of Pathology, Faculty of Medicine, University of Malaya, Kuala Lumpur, Malaysia
| | - Ai-Vyrn Chin
- Division of Geriatric Medicine, Department of Medicine, Faculty of Medicine, University of Malaya, Kuala Lumpur, Malaysia
| | - Lai-Meng Looi
- Department of Pathology, Faculty of Medicine, University of Malaya, Kuala Lumpur, Malaysia
| | - Siow-Wee Chang
- Bioinformatics Programme, Institute of Biological Sciences, Faculty of Science, University of Malaya, Kuala Lumpur, Malaysia.
| |
Collapse
|
6
|
Abstract
Tendons heal by fibrosis, which hinders function and increases re-injury risk. Yet the biology that leads to degeneration and regeneration of tendons is not completely understood. Improved understanding of the metabolic nuances that cause diverse outcomes in tendinopathies is required to solve these problems. 'Omics methods are increasingly used to characterize phenotypes in tissues. Multiomics integrates 'omic datasets to identify coherent relationships and provide insight into differences in molecular and metabolic pathways between anatomic locations, and disease stages. This work reviews the current literature pertaining to multiomics in tendon and the potential of these platforms to improve tendon regeneration. We assessed the literature and identified areas where 'omics platforms contribute to the field: (1) Tendon biology where their hierarchical complexity and demographic factors are studied. (2) Tendon degeneration and healing, where comparisons across tendon pathologies are analyzed. (3) The in vitro engineered tendon phenotype, where we compare the engineered phenotype to relevant native tissues. (4) Finally, we review regenerative and therapeutic approaches. We identified gaps in current knowledge and opportunities for future study: (1) The need to increase the diversity of human subjects and cell sources. (2) Opportunities to improve understanding of tendon heterogeneity. (3) The need to use these improvements to inform new engineered and regenerative therapeutic approaches. (4) The need to increase understanding of the development of tendon pathology. Together, the expanding use of various 'omics platforms and data analysis resulting from these platforms could substantially contribute to major advances in the tendon tissue engineering and regenerative medicine field.
Collapse
|
7
|
Sindhu KJ, Venkatesan N, Karunagaran D. MicroRNA Interactome Multiomics Characterization for Cancer Research and Personalized Medicine: An Expert Review. OMICS 2021; 25:545-566. [PMID: 34448651 DOI: 10.1089/omi.2021.0087] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
MicroRNAs (miRNAs) that are mutually modulated by their interacting partners (interactome) are being increasingly noted for their significant role in pathogenesis and treatment of various human cancers. Recently, miRNA interactome dissected with multiomics approaches has been the subject of focus since individual tools or methods failed to provide the necessary comprehensive clues on the complete interactome. Even though single-omics technologies such as proteomics can uncover part of the interactome, the biological and clinical understanding still remain incomplete. In this study, we present an expert review of studies involving multiomics approaches to identification of miRNA interactome and its application in mechanistic characterization, classification, and therapeutic target identification in a variety of cancers, and with a focus on proteomics. We also discuss individual or multiple miRNA-based interactome identification in various pathological conditions of relevance to clinical medicine. Various new single-omics methods that can be integrated into multiomics cancer research and the computational approaches to analyze and predict miRNA interactome are also highlighted in this review. In all, we contextulize the power of multiomics approaches and the importance of the miRNA interactome to achieve the vision and practice of predictive, preventive, and personalized medicine in cancer research and clinical oncology.
Collapse
Affiliation(s)
- K J Sindhu
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai, India
| | - Nalini Venkatesan
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai, India
| | - Devarajan Karunagaran
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai, India
| |
Collapse
|
8
|
Mallafré-Muro C, Llambrich M, Cumeras R, Pardo A, Brezmes J, Marco S, Gumà J. Comprehensive Volatilome and Metabolome Signatures of Colorectal Cancer in Urine: A Systematic Review and Meta-Analysis. Cancers (Basel) 2021; 13:2534. [PMID: 34064065 PMCID: PMC8196698 DOI: 10.3390/cancers13112534] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2021] [Revised: 05/13/2021] [Accepted: 05/17/2021] [Indexed: 01/22/2023] Open
Abstract
To increase compliance with colorectal cancer screening programs and to reduce the recommended screening age, cheaper and easy non-invasiveness alternatives to the fecal immunochemical test should be provided. Following the PRISMA procedure of studies that evaluated the metabolome and volatilome signatures of colorectal cancer in human urine samples, an exhaustive search in PubMed, Web of Science, and Scopus found 28 studies that met the required criteria. There were no restrictions on the query for the type of study, leading to not only colorectal cancer samples versus control comparison but also polyps versus control and prospective studies of surgical effects, CRC staging and comparisons of CRC with other cancers. With this systematic review, we identified up to 244 compounds in urine samples (3 shared compounds between the volatilome and metabolome), and 10 of them were relevant in more than three articles. In the meta-analysis, nine studies met the criteria for inclusion, and the results combining the case-control and the pre-/post-surgery groups, eleven compounds were found to be relevant. Four upregulated metabolites were identified, 3-hydroxybutyric acid, L-dopa, L-histidinol, and N1, N12-diacetylspermine and seven downregulated compounds were identified, pyruvic acid, hydroquinone, tartaric acid, and hippuric acid as metabolites and butyraldehyde, ether, and 1,1,6-trimethyl-1,2-dihydronaphthalene as volatiles.
Collapse
Affiliation(s)
- Celia Mallafré-Muro
- Department of Electronics and Biomedical Engineering, University of Barcelona, 08028 Barcelona, Spain; (C.M.-M.); (A.P.); (S.M.)
- Signal and Information Processing for Sensing Systems Group, Institute for Bioengineering of Catalonia, The Barcelona Institute of Science and Technology, 08028 Barcelona, Spain
| | - Maria Llambrich
- Metabolomics Interdisciplinary Group (MiL@b), Department of Electrical Electronic Engineering and Automation, Universitat Rovira i Virgili (URV), IISPV, CERCA, 43007 Tarragona, Spain; (M.L.); (J.B.)
| | - Raquel Cumeras
- Metabolomics Interdisciplinary Group (MiL@b), Department of Electrical Electronic Engineering and Automation, Universitat Rovira i Virgili (URV), IISPV, CERCA, 43007 Tarragona, Spain; (M.L.); (J.B.)
- Biomedical Research Centre, Diabetes and Associated Metabolic Disorders (CIBERDEM), ISCIII, 28029 Madrid, Spain
- Fiehn Lab, NIH West Coast Metabolomics Center, University of California Davis, Davis, CA 95616, USA
| | - Antonio Pardo
- Department of Electronics and Biomedical Engineering, University of Barcelona, 08028 Barcelona, Spain; (C.M.-M.); (A.P.); (S.M.)
| | - Jesús Brezmes
- Metabolomics Interdisciplinary Group (MiL@b), Department of Electrical Electronic Engineering and Automation, Universitat Rovira i Virgili (URV), IISPV, CERCA, 43007 Tarragona, Spain; (M.L.); (J.B.)
- Biomedical Research Centre, Diabetes and Associated Metabolic Disorders (CIBERDEM), ISCIII, 28029 Madrid, Spain
| | - Santiago Marco
- Department of Electronics and Biomedical Engineering, University of Barcelona, 08028 Barcelona, Spain; (C.M.-M.); (A.P.); (S.M.)
- Signal and Information Processing for Sensing Systems Group, Institute for Bioengineering of Catalonia, The Barcelona Institute of Science and Technology, 08028 Barcelona, Spain
| | - Josep Gumà
- Oncology Department, Hospital Universitari Sant Joan de Reus, Institut d’Investigació Sanitària Pere Virgili (IISPV), Universitat Rovira i Virgili (URV), 43204 Reus, Spain;
| |
Collapse
|
9
|
Nikmanesh F, Sarhadi S, Dadashpour M, Asgari Y, Zarghami N. Omics Integration Analysis Unravel the Landscape of Driving Mechanisms of Colorectal Cancer. Asian Pac J Cancer Prev 2020; 21:3539-3549. [PMID: 33369450 PMCID: PMC8046321 DOI: 10.31557/apjcp.2020.21.12.3539] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2020] [Accepted: 11/30/2020] [Indexed: 02/06/2023] Open
Abstract
Colorectal cancer (CRC) is one of the most malignant cancers and results in a substantial rate of morbidity and mortality. Diagnosis of this malignancy in early stages increases the chance of effective treatment. High-throughput data analyses reveal omics signatures and also provide the possibility of developing computational models for early detection of this disease. Such models would be able to use as complementary tools for early detection of different types of cancers including CRC. In this study, using gene expression data, the Flux balance analysis (FBA) applied to decode metabolic fluxes in cancer and normal cells. Moreover, transcriptome and genome analyses revealed driver agents of CRC in a biological network scheme. By applying comprehensive publicly available data from TCGA, different aspect of CRC regulome including the regulatory effect of gene expression, methylation, microRNA, copy number aberration and point mutation profile over protein levels investigated and the results provide a regulatory picture underlying CRC. Compiling omics profiles indicated snapshots of changes in different omics levels and flux rate of CRC. In conclusion, considering obtained CRC signatures and their role in biological operating systems of cells, the results suggest reliable driver regulatory modules that could potentially serve as biomarkers and therapeutic targets and furthermore expand our understanding of driving mechanisms of this disease. .
Collapse
Affiliation(s)
- Fatemeh Nikmanesh
- Stem Cell Research Center, Tabriz University of Medical Sciences, Tabriz, Iran.
- Department of Medical Biotechnology, Faculty of Advanced Medical Sciences, Tabriz University of Medical Sciences, Tabriz, Iran.
- Iranian Blood Transfusion Organization-Research Center, Iranian Blood Transfusion Organization, IBTO blg., Hemmat Exp. Way, Teheran, Iran.
| | - Shamim Sarhadi
- Stem Cell Research Center, Tabriz University of Medical Sciences, Tabriz, Iran.
- Department of Medical Biotechnology, School of Advanced Technologies in Medicine, Tehran University of Medical Sciences, Tehran, Iran.
| | - Mehdi Dadashpour
- Stem Cell Research Center, Tabriz University of Medical Sciences, Tabriz, Iran.
| | - Yazdan Asgari
- Iranian Blood Transfusion Organization-Research Center, Iranian Blood Transfusion Organization, IBTO blg., Hemmat Exp. Way, Teheran, Iran.
| | - Nosratollah Zarghami
- Stem Cell Research Center, Tabriz University of Medical Sciences, Tabriz, Iran.
- Department of Clinical Biochemistry and Laboratory Medicine, Faculty of Medicine, Tabriz University of Medical Sciences, Tabriz, Iran.
| |
Collapse
|
10
|
Biswas N, Chakrabarti S. Artificial Intelligence (AI)-Based Systems Biology Approaches in Multi-Omics Data Analysis of Cancer. Front Oncol 2020; 10:588221. [PMID: 33154949 PMCID: PMC7591760 DOI: 10.3389/fonc.2020.588221] [Citation(s) in RCA: 33] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2020] [Accepted: 09/21/2020] [Indexed: 12/13/2022] Open
Abstract
Cancer is the manifestation of abnormalities of different physiological processes involving genes, DNAs, RNAs, proteins, and other biomolecules whose profiles are reflected in different omics data types. As these bio-entities are very much correlated, integrative analysis of different types of omics data, multi-omics data, is required to understanding the disease from the tumorigenesis to the disease progression. Artificial intelligence (AI), specifically machine learning algorithms, has the ability to make decisive interpretation of "big"-sized complex data and, hence, appears as the most effective tool for the analysis and understanding of multi-omics data for patient-specific observations. In this review, we have discussed about the recent outcomes of employing AI in multi-omics data analysis of different types of cancer. Based on the research trends and significance in patient treatment, we have primarily focused on the AI-based analysis for determining cancer subtypes, disease prognosis, and therapeutic targets. We have also discussed about AI analysis of some non-canonical types of omics data as they have the capability of playing the determiner role in cancer patient care. Additionally, we have briefly discussed about the data repositories because of their pivotal role in multi-omics data storing, processing, and analysis.
Collapse
Affiliation(s)
- Nupur Biswas
- Structural Biology and Bioinformatics Division, CSIR-Indian Institute of Chemical Biology, IICB TRUE Campus, Kolkata, India
| | - Saikat Chakrabarti
- Structural Biology and Bioinformatics Division, CSIR-Indian Institute of Chemical Biology, IICB TRUE Campus, Kolkata, India
| |
Collapse
|
11
|
Zhang Z, Li H, Jiang S, Li R, Li W, Chen H, Bo X. A survey and evaluation of Web-based tools/databases for variant analysis of TCGA data. Brief Bioinform 2020; 20:1524-1541. [PMID: 29617727 PMCID: PMC6781580 DOI: 10.1093/bib/bby023] [Citation(s) in RCA: 34] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2017] [Revised: 02/22/2018] [Indexed: 12/28/2022] Open
Abstract
The Cancer Genome Atlas (TCGA) is a publicly funded project that aims to catalog and discover major cancer-causing genomic alterations with the goal of creating a comprehensive ‘atlas’ of cancer genomic profiles. The availability of this genome-wide information provides an unprecedented opportunity to expand our knowledge of tumourigenesis. Computational analytics and mining are frequently used as effective tools for exploring this byzantine series of biological and biomedical data. However, some of the more advanced computational tools are often difficult to understand or use, thereby limiting their application by scientists who do not have a strong computational background. Hence, it is of great importance to build user-friendly interfaces that allow both computational scientists and life scientists without a computational background to gain greater biological and medical insights. To that end, this survey was designed to systematically present available Web-based tools and facilitate the use TCGA data for cancer research.
Collapse
Affiliation(s)
- Zhuo Zhang
- Beijing Institute of Radiation Medicine, Beijing 100850, China
| | - Hao Li
- Beijing Institute of Radiation Medicine, Beijing 100850, China
| | - Shuai Jiang
- Beijing Institute of Radiation Medicine, Beijing 100850, China
| | - Ruijiang Li
- Beijing Institute of Radiation Medicine, Beijing 100850, China
| | - Wanying Li
- Beijing Institute of Radiation Medicine, Beijing 100850, China
| | - Hebing Chen
- Beijing Institute of Radiation Medicine, Beijing 100850, China
| | - Xiaochen Bo
- Beijing Institute of Radiation Medicine, Beijing 100850, China
| |
Collapse
|
12
|
Mak TD, Goudarzi M, Laiakis EC, Stein SE. Disparate Metabolomics Data Reassembler: A Novel Algorithm for Agglomerating Incongruent LC-MS Metabolomics Datasets. Anal Chem 2020; 92:5231-5239. [PMID: 32118408 PMCID: PMC10926180 DOI: 10.1021/acs.analchem.9b05763] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
In the past decade, the field of LC-MS-based metabolomics has transformed from an obscure specialty into a major "-omics" platform for studying metabolic processes and biomolecular characterization. However, as a whole the field is still very fractured, as the nature of the instrumentation and the information produced by the platform essentially creates incompatible "islands" of datasets. This lack of data coherency results in the inability to accumulate a critical mass of metabolomics data that has enabled other -omics platforms to make impactful discoveries and meaningful advances. As such, we have developed a novel algorithm, called Disparate Metabolomics Data Reassembler (DIMEDR), which attempts to bridge the inconsistencies between incongruent LC-MS metabolomics datasets of the same biological sample type. A single "primary" dataset is postprocessed via traditional means of peak identification, alignment, and grouping. DIMEDR utilizes this primary dataset as a progenitor template by which data from subsequent disparate datasets are reassembled and integrated into a unified framework that maximizes spectral feature similarity across all samples. This is accomplished by a novel procedure for universal retention time correction and comparison via identification of ubiquitous features in the initial primary dataset, which are subsequently utilized as endogenous internal standards during integration. For demonstration purposes, two human and two mouse urine metabolomics datasets from four unrelated studies acquired over 4 years were unified via DIMEDR, which enabled meaningful analysis across otherwise incomparable and unrelated datasets.
Collapse
Affiliation(s)
- Tytus D. Mak
- Mass Spectrometry Data Center, National Institute of Standards and Technology, 100 Bureau Drive, Gaithersburg, MD 20899-8632
- Lombardi Comprehensive Cancer Center, Georgetown University Medical Center, New Research Building E504/508, 3970 Reservoir Rd, NW, Washington, DC 20057
| | - Maryam Goudarzi
- Department of Cellular & Molecular Medicine, Cleveland Clinic Lerner Research Institute, Building NN1, Room 28, 9500 Euclid Ave, Cleveland, OH 44195
| | - Evagelia C. Laiakis
- Lombardi Comprehensive Cancer Center, Georgetown University Medical Center, New Research Building E504/508, 3970 Reservoir Rd, NW, Washington, DC 20057
| | - Stephen E. Stein
- Mass Spectrometry Data Center, National Institute of Standards and Technology, 100 Bureau Drive, Gaithersburg, MD 20899-8632
| |
Collapse
|
13
|
Darst B, Engelman CD, Tian Y, Lorenzo Bermejo J. Data mining and machine learning approaches for the integration of genome-wide association and methylation data: methodology and main conclusions from GAW20. BMC Genet 2018; 19:76. [PMID: 30255774 PMCID: PMC6157271 DOI: 10.1186/s12863-018-0646-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Multiple layers of genetic and epigenetic variability are being simultaneously explored in an increasing number of health studies. We summarize here different approaches applied in the Data Mining and Machine Learning group at the GAW20 to integrate genome-wide genotype and methylation array data. RESULTS We provide a non-intimidating introduction to some frequently used methods to investigate high-dimensional molecular data and compare the different approaches tried by group members: random forest, deep learning, cluster analysis, mixed models, and gene-set enrichment analysis. Group contributions were quite heterogeneous regarding investigated data sets (real vs simulated), conducted data quality control and assessed phenotypes (eg, metabolic syndrome vs relative differences of log-transformed triglyceride concentrations before and after fenofibrate treatment). However, some common technical issues were detected, leading to practical recommendations. CONCLUSIONS Different sources of correlation were identified by group members, including population stratification, family structure, batch effects, linkage disequilibrium and correlation of methylation values at neighboring cytosine-phosphate-guanine (CpG) sites, and the majority of applied approaches were able to take into account identified correlation structures. The ability to efficiently deal with high-dimensional omics data, and the model free nature of the approaches that did not require detailed model specifications were clearly recognized as the main strengths of applied methods. A limitation of random forest is its sensitivity to highly correlated variables. The parameter setup and the interpretation of results from deep learning methods, in particular deep neural networks, can be extremely challenging. Cluster analysis and mixed models may need some predimension reduction based on existing literature, data filtering, and supplementary statistical methods, and gene-set enrichment analysis requires biological insight.
Collapse
Affiliation(s)
- Burcu Darst
- Department of Population Health Sciences, School of Medicine and Public Health, University of Wisconsin, 610 Walnut St. 1007 WARF, Madison, WI 53726 USA
| | - Corinne D. Engelman
- Department of Population Health Sciences, School of Medicine and Public Health, University of Wisconsin, 610 Walnut St. 1007 WARF, Madison, WI 53726 USA
| | - Ye Tian
- Department of Biochemistry and Medical Genetics, University of Manitoba, 745 Bannatyne Ave, Winnipeg, MB R3E 0J9 Canada
- Department of Electrical and Computer Engineering, University of Manitoba, 745 Bannatyne Ave, Winnipeg, MB R3E 0J9 Canada
| | - Justo Lorenzo Bermejo
- Institute of Medical Biometry and Informatics, University of Heidelberg, Im Neuenheimer Feld 130.3, 69120 Heidelberg, Germany
| |
Collapse
|
14
|
Song L, Bhuvaneshwar K, Wang Y, Feng Y, Shih IM, Madhavan S, Gusev Y. CINdex: A Bioconductor Package for Analysis of Chromosome Instability in DNA Copy Number Data. Cancer Inform 2017; 16:1176935117746637. [PMID: 29343938 PMCID: PMC5761903 DOI: 10.1177/1176935117746637] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2017] [Accepted: 10/26/2017] [Indexed: 01/10/2023] Open
Abstract
The CINdex Bioconductor package addresses an important area of high-throughput genomic analysis. It calculates the chromosome instability (CIN) index, a novel measurement that quantitatively characterizes genome-wide copy number alterations (CNAs) as a measure of CIN. The advantage of this package is an ability to compare CIN index values between several groups for patients (case and control groups), which is a typical use case in translational research. The differentially changed cytobands or chromosomes can then be linked to genes located in the affected genomic regions, as well as pathways. This enables in-depth systems biology-based network analysis and assessment of the impact of CNA on various biological processes or clinical outcomes. This package was successfully applied to analysis of DNA copy number data in colorectal cancer as a part of multi-omics integrative study as well as for analysis of several other cancer types. The source code, along with an end-to-end tutorial, and example data are freely available in Bioconductor at http://bioconductor.org/packages/CINdex/.
Collapse
Affiliation(s)
- Lei Song
- Innovation Center for Biomedical Informatics, Georgetown University, Washington, DC, USA
| | - Krithika Bhuvaneshwar
- Innovation Center for Biomedical Informatics, Georgetown University, Washington, DC, USA
| | - Yue Wang
- The Bradley Department of Electrical and Computer Engineering, Virginia Polytechnic Institute and State University, Arlington, VA, USA
| | - Yuanjian Feng
- The Bradley Department of Electrical and Computer Engineering, Virginia Polytechnic Institute and State University, Arlington, VA, USA
| | - Ie-Ming Shih
- Department of Gynecology and Obstetrics, The Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Subha Madhavan
- Innovation Center for Biomedical Informatics, Georgetown University, Washington, DC, USA
| | - Yuriy Gusev
- Innovation Center for Biomedical Informatics, Georgetown University, Washington, DC, USA
| |
Collapse
|
15
|
Safonov A, Jiang T, Bianchini G, Győrffy B, Karn T, Hatzis C, Pusztai L. Immune Gene Expression Is Associated with Genomic Aberrations in Breast Cancer. Cancer Res 2017; 77:3317-3324. [PMID: 28428277 DOI: 10.1158/0008-5472.can-16-3478] [Citation(s) in RCA: 97] [Impact Index Per Article: 13.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2016] [Revised: 03/13/2017] [Accepted: 04/14/2017] [Indexed: 12/27/2022]
Abstract
The presence of tumor-infiltrating lymphocytes (TIL) is a favorable prognostic factor in breast cancer, but what drives immune infiltration remains unknown. Here we examine if clonal heterogeneity, total mutation load, neoantigen load, copy number variations (CNV), gene- or pathway-level somatic mutations, or germline polymorphisms (SNP) are associated with immune metagene expression in breast cancer subtypes. Thirteen published immune metagenes correlated separately with genomic metrics in the three major breast cancer subtypes. We analyzed RNA-Seq, DNA copy number, mutation and germline SNP data of 627 ER+, 207 HER2+, and 191 triple-negative (TNBC) cancers from The Cancer Genome Atlas. P-values were adjusted for multiple comparisons, and permutation testing was used to assess false discovery rates. Increased immune metagene expression associated significantly with lower clonal heterogeneity estimated by MATH score in all subtypes and with a trend for lower overall mutation, neoantigen, and CNV loads in TNBC and HER2+ cancers. In ER+ cancers, mutation load, neoantigen load, and CNV load weakly but positively associated with immune infiltration, which reached significance for overall mutation load only. No highly recurrent single gene or pathway level mutations associated with immune infiltration. High immune gene expression and lower clonal heterogeneity in TNBC and HER2+ cancers suggest an immune pruning effect and equilibrium between immune surveillance and clonal expansion. Thus, immune checkpoint inhibitors may tip the balance in favor of immune surveillance in these cancers. Cancer Res; 77(12); 3317-24. ©2017 AACR.
Collapse
Affiliation(s)
- Anton Safonov
- Breast Medical Oncology, Yale Cancer Center, Yale School of Medicine, New Haven, Connecticut
| | - Tingting Jiang
- Breast Medical Oncology, Yale Cancer Center, Yale School of Medicine, New Haven, Connecticut
| | | | - Balázs Győrffy
- MTA TTK Lendület Cancer Biomarker Research Group & Semmelweis University Second Department of Pediatrics, Budapest, Hungary
| | - Thomas Karn
- Department of Obstetrics and Gynecology, Goethe-University Frankfurt, Frankfurt, Germany
| | - Christos Hatzis
- Breast Medical Oncology, Yale Cancer Center, Yale School of Medicine, New Haven, Connecticut
| | - Lajos Pusztai
- Breast Medical Oncology, Yale Cancer Center, Yale School of Medicine, New Haven, Connecticut.
| |
Collapse
|
16
|
Abstract
BACKGROUND Of late, high-throughput microarray and sequencing data have been extensively used to monitor biomarkers and biological processes related to many diseases. Under this circumstance, the support vector machine (SVM) has been popularly used and been successful for gene selection in many applications. Despite surpassing benefits of the SVMs, single data analysis using small- and mid-size of data inevitably runs into the problem of low reproducibility and statistical power. To address this problem, we propose a meta-analytic support vector machine (Meta-SVM) that can accommodate multiple omics data, making it possible to detect consensus genes associated with diseases across studies. RESULTS Experimental studies show that the Meta-SVM is superior to the existing meta-analysis method in detecting true signal genes. In real data applications, diverse omics data of breast cancer (TCGA) and mRNA expression data of lung disease (idiopathic pulmonary fibrosis; IPF) were applied. As a result, we identified gene sets consistently associated with the diseases across studies. In particular, the ascertained gene set of TCGA omics data was found to be significantly enriched in the ABC transporters pathways well known as critical for the breast cancer mechanism. CONCLUSION The Meta-SVM effectively achieves the purpose of meta-analysis as jointly leveraging multiple omics data, and facilitates identifying potential biomarkers and elucidating the disease process.
Collapse
Affiliation(s)
- SungHwan Kim
- Department of Statistics, Korea University, Anam-dong, Seoul, 136-701 South Korea.,Department of Statistics, Keimyung University, Dalseoku, Daegu, 42601 South Korea
| | - Jae-Hwan Jhong
- Department of Statistics, Korea University, Anam-dong, Seoul, 136-701 South Korea
| | - JungJun Lee
- Department of Statistics, Korea University, Anam-dong, Seoul, 136-701 South Korea
| | - Ja-Yong Koo
- Department of Statistics, Korea University, Anam-dong, Seoul, 136-701 South Korea
| |
Collapse
|
17
|
Bhuvaneshwar K, Belouali A, Singh V, Johnson RM, Song L, Alaoui A, Harris MA, Clarke R, Weiner LM, Gusev Y, Madhavan S. G-DOC Plus - an integrative bioinformatics platform for precision medicine. BMC Bioinformatics 2016; 17:193. [PMID: 27130330 PMCID: PMC4851789 DOI: 10.1186/s12859-016-1010-0] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2015] [Accepted: 04/04/2016] [Indexed: 01/30/2023] Open
Abstract
BACKGROUND G-DOC Plus is a data integration and bioinformatics platform that uses cloud computing and other advanced computational tools to handle a variety of biomedical BIG DATA including gene expression arrays, NGS and medical images so that they can be analyzed in the full context of other omics and clinical information. RESULTS G-DOC Plus currently holds data from over 10,000 patients selected from private and public resources including Gene Expression Omnibus (GEO), The Cancer Genome Atlas (TCGA) and the recently added datasets from REpository for Molecular BRAin Neoplasia DaTa (REMBRANDT), caArray studies of lung and colon cancer, ImmPort and the 1000 genomes data sets. The system allows researchers to explore clinical-omic data one sample at a time, as a cohort of samples; or at the level of population, providing the user with a comprehensive view of the data. G-DOC Plus tools have been leveraged in cancer and non-cancer studies for hypothesis generation and validation; biomarker discovery and multi-omics analysis, to explore somatic mutations and cancer MRI images; as well as for training and graduate education in bioinformatics, data and computational sciences. Several of these use cases are described in this paper to demonstrate its multifaceted usability. CONCLUSION G-DOC Plus can be used to support a variety of user groups in multiple domains to enable hypothesis generation for precision medicine research. The long-term vision of G-DOC Plus is to extend this translational bioinformatics platform to stay current with emerging omics technologies and analysis methods to continue supporting novel hypothesis generation, analysis and validation for integrative biomedical research. By integrating several aspects of the disease and exposing various data elements, such as outpatient lab workup, pathology, radiology, current treatments, molecular signatures and expected outcomes over a web interface, G-DOC Plus will continue to strengthen precision medicine research. G-DOC Plus is available at: https://gdoc.georgetown.edu .
Collapse
Affiliation(s)
- Krithika Bhuvaneshwar
- />Innovation Center for Biomedical Informatics, Georgetown University Medical Center, Washington, DC USA
| | - Anas Belouali
- />Innovation Center for Biomedical Informatics, Georgetown University Medical Center, Washington, DC USA
| | - Varun Singh
- />Innovation Center for Biomedical Informatics, Georgetown University Medical Center, Washington, DC USA
| | - Robert M. Johnson
- />Innovation Center for Biomedical Informatics, Georgetown University Medical Center, Washington, DC USA
| | - Lei Song
- />Innovation Center for Biomedical Informatics, Georgetown University Medical Center, Washington, DC USA
| | - Adil Alaoui
- />Innovation Center for Biomedical Informatics, Georgetown University Medical Center, Washington, DC USA
| | - Michael A. Harris
- />Innovation Center for Biomedical Informatics, Georgetown University Medical Center, Washington, DC USA
| | - Robert Clarke
- />Department of Oncology, Lombardi Comprehensive Cancer Center, Georgetown University, Washington, DC USA
| | - Louis M. Weiner
- />Department of Oncology, Lombardi Comprehensive Cancer Center, Georgetown University, Washington, DC USA
| | - Yuriy Gusev
- />Innovation Center for Biomedical Informatics, Georgetown University Medical Center, Washington, DC USA
- />Department of Oncology, Lombardi Comprehensive Cancer Center, Georgetown University, Washington, DC USA
| | - Subha Madhavan
- />Innovation Center for Biomedical Informatics, Georgetown University Medical Center, Washington, DC USA
- />Department of Oncology, Lombardi Comprehensive Cancer Center, Georgetown University, Washington, DC USA
| |
Collapse
|
18
|
Huang C, Lee C, Yang S, Chien C, Huang C, Yang R, Chang C. Upregulation of the growth arrest-specific-2 in recurrent colorectal cancers, and its susceptibility to chemotherapy in a model cell system. Biochim Biophys Acta Mol Basis Dis 2016; 1862:1345-53. [PMID: 27085973 DOI: 10.1016/j.bbadis.2016.04.010] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2015] [Revised: 03/04/2016] [Accepted: 04/11/2016] [Indexed: 02/07/2023]
Abstract
Colorectal cancer (CRC) is one of the most common life-threatening malignances worldwide. CRC relapse markedly decreases the 5-year survival of patients following surgery. Aberrant expression of genes involved in pathways regulating the cell cycle, cell proliferation, or cell death are frequently reported in CRC tumorigenesis. We hypothesized that genes involved in CRC relapse might serve as prognostic indicators. We first evaluated the significance of gene sequences in the feces of patients with CRC relapse by consulting a public database. Tumorigenesis of target tissues was tested through tumor cell growth, cell cycle regulation, and chemotherapeutic efficacy. We found a highly significant correlation between CRC relapse and growth arrest-specific 2 (GAS2) gene expression. Based on cell models, the overexpressed GAS2 was associated with cellular growth rate, cell cycle regulation, and with chemotherapeutic sensitivity. Cell division was impaired by treating cells with 2-[4-(7-chloro-2-quinoxalinyloxy)phenoxy]-propionic acid (XK469), even when the cells were overexpressing GAS2. Thus, downregulation of GAS2 expression might control CRC relapse after curative resection. GAS2 could serve as a noninvasive marker from the feces of patients with prediagnosed CRC. Our findings suggest that GAS2 could have potential clinical applications for predicting early CRC relapse after radical resection, and that XK469 might impair tumor cell division by reducing GAS2 expression or blocking its cellular translocation. This will help in selecting the best therapeutic option, 5-fluorouracil in combination with XK469, for patients overexpressing GAS2 in CRC cells. Thus, GAS2 might act as a prognostic biomolecule and potential therapeutic target in patients with CRC relapse.
Collapse
|
19
|
Hing B, Ramos E, Braun P, McKane M, Jancic D, Tamashiro KLK, Lee RS, Michaelson JJ, Druley TE, Potash JB. Adaptation of the targeted capture Methyl-Seq platform for the mouse genome identifies novel tissue-specific DNA methylation patterns of genes involved in neurodevelopment. Epigenetics 2016; 10:581-96. [PMID: 25985232 DOI: 10.1080/15592294.2015.1045179] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
Methyl-Seq was recently developed as a targeted approach to assess DNA methylation (DNAm) at a genome-wide level in human. We adapted it for mouse and sought to examine DNAm differences across liver and 2 brain regions: cortex and hippocampus. A custom hybridization array was designed to isolate 99 Mb of CpG islands, shores, shelves, and regulatory elements in the mouse genome. This was followed by bisulfite conversion and sequencing on the Illumina HiSeq2000. The majority of differentially methylated cytosines (DMCs) were present at greater than expected frequency in introns, intergenic regions, near CpG islands, and transcriptional enhancers. Liver-specific enhancers were observed to be methylated in cortex, while cortex specific enhancers were methylated in the liver. Interestingly, commonly shared enhancers were differentially methylated between the liver and cortex. Gene ontology and pathway analysis showed that genes that were hypomethylated in the cortex and hippocampus were enriched for neuronal components and neuronal function. In contrast, genes that were hypomethylated in the liver were enriched for cellular components important for liver function. Bisulfite-pyrosequencing validation of 75 DMCs from 19 different loci showed a correlation of r = 0.87 with Methyl-Seq data. We also identified genes involved in neurodevelopment that were not previously reported to be differentially methylated across brain regions. This platform constitutes a valuable tool for future genome-wide studies involving mouse models of disease.
Collapse
Key Words
- Apcdd1, Adenomatous Polyposis Coli Down-Regulated 1
- ChIP, Chromatin immunoprecipitation
- DMCs, Differentially methylated cytosines (DMCs)
- DMRs, Differentially methylated regions
- DNA methylation
- DNAm, DNA methylation
- FDR, False discovery rate
- GFAP, Glial fibrillary acidic protein
- GO, Gene ontology
- Gb, Gigabases
- H3K27ac, Histone 3 lysine 27 acetylation
- H3K4me1, Histone marks histone 3 lysine 4 monomethylation
- KEGG, Kyoto Encyclopedia of Genes and Genomes
- MAP, Mitogen activated protein
- Msx1, msh homeobox1
- PAVIS, Peak Annotation and Visualization
- RV, Range of variation
- TFBS, Transcription factor binding sites
- UTR, Untranslated regions.
- brain
- epigenetics
- genome-wide
- methylation array
Collapse
Affiliation(s)
- Benjamin Hing
- a Department of Psychiatry; University of Iowa Carver College of Medicine ; Iowa City , IA , USA
| | | | | | | | | | | | | | | | | | | |
Collapse
|
20
|
Affiliation(s)
| | - Chu-Xia Deng
- Faculty of Health Sciences, University of Macau, Macau
| |
Collapse
|
21
|
Zhang J, Yan B, Späth SS, Qun H, Cornelius S, Guan D, Shao J, Hagiwara K, Van Waes C, Chen Z, Su X, Bi Y. Integrated transcriptional profiling and genomic analyses reveal RPN2 and HMGB1 as promising biomarkers in colorectal cancer. Cell Biosci 2015; 5:53. [PMID: 26388988 PMCID: PMC4574027 DOI: 10.1186/s13578-015-0043-9] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2015] [Accepted: 08/16/2015] [Indexed: 02/07/2023] Open
Abstract
Colorectal cancer (CRC) is a heterogeneous disease that is associated with a gradual accumulation of genetic and epigenetic alterations. Among all CRC stages, stage II tumors are highly heterogeneous with a high relapse rate in about 20–25 % of stage II CRC patients following surgery. Thus, a comprehensive analysis of gene signatures to identify aggressive and metastatic phenotypes in stage II CRC is desired for a more accurate disease classification and outcome prediction. By utilizing a Cancer Array, containing 440 oncogenes and tumor suppressors to profile mRNA expression, we identified a larger number of differentially expressed genes in poorly differentiated stage II colorectal adenocarcinoma tissues, compared to their matched normal tissues. Ontology and Ingenuity Pathway Analysis (IPA) indicated that these genes are involved in functional mechanisms associated with several transcription factors. Genomic alterations of these genes were also investigated through The Cancer Genome Atlas (TCGA) database, utilizing 195 published CRC specimens. The percentage of genomic alterations in these genes was ranked based on their mRNA expression, copy number variations and mutations. This data was further combined with published microarray studies from a large set of CRC tumors classified based on prognostic features. This led to the identification of eight candidate genes including RPN2, HMGB1, AARS, IGFBP3, STAT1, HYOU1, NQO1 and PEA15 that were associated with the progressive phenotype. In particular, RPN2 and HMGB1 displayed a higher genomic alteration frequency in CRC, compared to eight other major solid cancers. Immunohistochemistry was performed on additional 78 stage I–IV CRC samples, where RPN2 protein immunostaining exhibited a significant association with stage III/IV tumors, distant metastasis, and poor differentiation, indicating that RPN2 expression is associated with poor prognosis. Further, our study revealed significant transcriptional regulatory mechanisms, networks and gene signatures, underlying CRC malignant progression and phenotype warranting future clinical investigations.
Collapse
Affiliation(s)
- Jialing Zhang
- School of Public Health, Wuhan University, Wuhan, China ; Clinical Medicine Research Center, The Affiliated Hospital, Inner Mongolia Medical University, Hohhot, China ; Tumor Biology Section, Head and Neck Surgery Branch, National Institute on Deafness and Other Communication Disorders, NIH, Bethesda, USA
| | - Bin Yan
- Laboratory for Food Safety and Environmental Technology, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
| | - Stephan Stanislaw Späth
- Pediatric Endocrinology Unit, Department of Women's and Children's Health, Karolinska Institute and University Hospital, Stockholm, Sweden
| | - Hu Qun
- Department of Oncology, The Affiliated Hospital, Inner Mongolia Medical University, Hohhot, China
| | - Shaleeka Cornelius
- Tumor Biology Section, Head and Neck Surgery Branch, National Institute on Deafness and Other Communication Disorders, NIH, Bethesda, USA
| | - Daogang Guan
- Department of Biology, Hong Kong Baptist University, Hong Kong, China
| | - Jiaofang Shao
- Department of Bioinformatics, School of Basic Medical Sciences, Nanjing Medical University, Nanjing, China
| | - Koichi Hagiwara
- Department of Respiratory Medicine, Saitama Medical Center, Jichi Medical University, Saitama, Japan
| | - Carter Van Waes
- Tumor Biology Section, Head and Neck Surgery Branch, National Institute on Deafness and Other Communication Disorders, NIH, Bethesda, USA
| | - Zhong Chen
- Tumor Biology Section, Head and Neck Surgery Branch, National Institute on Deafness and Other Communication Disorders, NIH, Bethesda, USA
| | - Xiulan Su
- Clinical Medicine Research Center, The Affiliated Hospital, Inner Mongolia Medical University, Hohhot, China
| | - Yongyi Bi
- School of Public Health, Wuhan University, Wuhan, China
| |
Collapse
|
22
|
Mak TD, Laiakis EC, Goudarzi M, Fornace AJ. Selective paired ion contrast analysis: a novel algorithm for analyzing postprocessed LC-MS metabolomics data possessing high experimental noise. Anal Chem 2015; 87:3177-86. [PMID: 25683158 DOI: 10.1021/ac504012a] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
Abstract
One of the consequences in analyzing biological data from noisy sources, such as human subjects, is the sheer variability of experimentally irrelevant factors that cannot be controlled for. This holds true especially in metabolomics, the global study of small molecules in a particular system. While metabolomics can offer deep quantitative insight into the metabolome via easy-to-acquire biofluid samples such as urine and blood, the aforementioned confounding factors can easily overwhelm attempts to extract relevant information. This can mar potentially crucial applications such as biomarker discovery. As such, a new algorithm, called Selective Paired Ion Contrast (SPICA), has been developed with the intent of extracting potentially biologically relevant information from the noisiest of metabolomic data sets. The basic idea of SPICA is built upon redefining the fundamental unit of statistical analysis. Whereas the vast majority of algorithms analyze metabolomics data on a single-ion basis, SPICA relies on analyzing ion-pairs. A standard metabolomic data set is reinterpreted by exhaustively considering all possible ion-pair combinations. Statistical comparisons between sample groups are made only by analyzing the differences in these pairs, which may be crucial in situations where no single metabolite can be used for normalization. With SPICA, human urine data sets from patients undergoing total body irradiation (TBI) and from a colorectal cancer (CRC) relapse study were analyzed in a statistically rigorous manner not possible with conventional methods. In the TBI study, 3530 statistically significant ion-pairs were identified, from which numerous putative radiation specific metabolite-pair biomarkers that mapped to potentially perturbed metabolic pathways were elucidated. In the CRC study, SPICA identified 6461 statistically significant ion-pairs, several of which putatively mapped to folic acid biosynthesis, a key pathway in colorectal cancer. Utilizing support vector machines (SVMs), SPICA was also able to unequivocally outperform binary classifiers built from classical single-ion feature based SVMs.
Collapse
Affiliation(s)
| | | | | | - Albert J Fornace
- §Center of Excellence in Genomic Medicine Research (CEGMR), King Abdulaziz University, Jeddah 22254, Saudi Arabia
| |
Collapse
|
23
|
Abstract
The Cancer Genome Atlas (TCGA) is a public funded project that aims to catalogue and discover major cancer-causing genomic alterations to create a comprehensive "atlas" of cancer genomic profiles. So far, TCGA researchers have analysed large cohorts of over 30 human tumours through large-scale genome sequencing and integrated multi-dimensional analyses. Studies of individual cancer types, as well as comprehensive pan-cancer analyses have extended current knowledge of tumorigenesis. A major goal of the project was to provide publicly available datasets to help improve diagnostic methods, treatment standards, and finally to prevent cancer. This review discusses the current status of TCGA Research Network structure, purpose, and achievements.
Collapse
|
24
|
Hing B, Gardner C, Potash JB. Effects of negative stressors on DNA methylation in the brain: implications for mood and anxiety disorders. Am J Med Genet B Neuropsychiatr Genet 2014; 165B:541-54. [PMID: 25139739 PMCID: PMC5096645 DOI: 10.1002/ajmg.b.32265] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/21/2014] [Accepted: 07/18/2014] [Indexed: 01/31/2023]
Abstract
Stress is a major contributor to anxiety and mood disorders. The recent discovery of epigenetic changes in the brain resulting from stress has enhanced our understanding of the mechanism by which stress is able to promote these disorders. Although epigenetics encompasses chemical modifications that occur at both DNA and histones, much attention has been focused on stress-induced DNA methylation changes on behavior. Here, we review the effect of stress-induced DNA methylation changes on physiological mechanisms that govern behavior and cognition, dysregulation of which can be harmful to mental health. A literature review was performed in the areas of DNA methylation, stress, and their impact on the brain and psychiatric illness. Key findings center on genes involved in the hypothalamic-pituitary-adrenal axis, neurotransmission and neuroplasticity. Using animal models of different stress paradigms and clinical studies, we detail how DNA methylation changes to these genes can alter physiological mechanisms that influence behavior. Appropriate levels of gene expression in the brain play an important role in mental health. This dynamic control can be disrupted by stress-induced changes to DNA methylation patterns. Advancement in other areas of epigenetics, such as histone modifications and the discovery of the novel DNA epigenetic mark, 5-hydroxymethylcytosine, could provide additional avenues to consider when determining the epigenetic effects of stress on the brain.
Collapse
Affiliation(s)
- Benjamin Hing
- Department of Psychiatry, Carver College of Medicine, University of Iowa, Iowa City, Iowa,Correspondence to: Dr Benjamin Hing, 25 South Grand Ave, Medical Laboratories, B002, Iowa City, Iowa, USA 52242.
| | - Caleb Gardner
- Department of Psychiatry and Behavioral Sciences, Johns Hopkins University School of Medicine, Baltimore, Maryland
| | - James B. Potash
- Department of Psychiatry, Carver College of Medicine, University of Iowa, Iowa City, Iowa
| |
Collapse
|