Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Geistlinger L, Csaba G, Santarelli M, Ramos M, Schiffer L, Turaga N, Law C, Davis S, Carey V, Morgan M, Zimmer R, Waldron L. Toward a gold standard for benchmarking gene set enrichment analysis. Brief Bioinform 2020;22:545-556. [PMID: 32026945 PMCID: PMC7820859 DOI: 10.1093/bib/bbz158] [Citation(s) in RCA: 52] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2019] [Revised: 10/11/2019] [Accepted: 11/09/2019] [Indexed: 12/22/2022] Open

For:	Geistlinger L, Csaba G, Santarelli M, Ramos M, Schiffer L, Turaga N, Law C, Davis S, Carey V, Morgan M, Zimmer R, Waldron L. Toward a gold standard for benchmarking gene set enrichment analysis. Brief Bioinform 2020;22:545-556. [PMID: 32026945 PMCID: PMC7820859 DOI: 10.1093/bib/bbz158] [Citation(s) in RCA: 52] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2019] [Revised: 10/11/2019] [Accepted: 11/09/2019] [Indexed: 12/22/2022] Open

Number

Cited by Other Article(s)

Tang N, Zhou Q, Liu S, Sun H, Li H, Zhang Q, Hao J, Qi C. GSEA analysis identifies potential drug targets and their interaction networks in coronary microcirculation disorders. SLAS Technol 2024:100152. [PMID: 38823582 DOI: 10.1016/j.slast.2024.100152] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2024] [Revised: 05/20/2024] [Accepted: 05/29/2024] [Indexed: 06/03/2024]

Abstract

Coronary microcirculation dysfunction (CMD) is one of the main causes of cardiovascular disease. Traditional treatment methods lack specificity, making it difficult to fully consider the differences in patient conditions and achieve effective treatment and intervention. The complexity and diversity of CMD require more standardized diagnosis and treatment plans to clarify the best treatment strategy and long-term outcomes. The existing treatment measures mainly focus on symptom management, including medication treatment, lifestyle intervention, and psychological therapy. However, the efficacy of these methods is not consistent for all patients, and the long-term efficacy is not yet clear. GSEA is a bioinformatics method used to interpret gene expression data, particularly for identifying the enrichment of predefined gene sets in gene expression data. In order to achieve personalized treatment and improve the quality and effectiveness of interventions, this article combined GSEA (Gene Set Enrichment Analysis) technology to conduct in-depth research on potential drug targets and their interaction networks in coronary microcirculation dysfunctions. This article first utilized the Coremine medical database, GeneCards, and DrugBank public databases to collect gene data. Then, filtering methods were used to preprocess the data, and GSEA was used to analyze the preprocessed gene expression data to identify and calculate pathways and enrichment scores related to CMD. Finally, protein sequence features were extracted through the calculation of autocorrelation features. To verify the effectiveness of GSEA, this article conducted experimental analysis from four aspects: precision, receiver operating characteristic (ROC) curve, correlation, and potential drug targets, and compared them with Gene Regulatory Networks (GRN) and Random Forest (RF) methods. The results showed that compared to the GRN and RF methods, the average precision of GSEA improved by 0.11. The conclusion indicated that GSEA helped identify and explore potential drug targets and their interaction networks, providing new ideas for personalized quality of CMD.

Collapse

Candia J, Ferrucci L. Assessment of Gene Set Enrichment Analysis using curated RNA-seq-based benchmarks. PLoS One 2024;19:e0302696. [PMID: 38753612 PMCID: PMC11098418 DOI: 10.1371/journal.pone.0302696] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2023] [Accepted: 04/09/2024] [Indexed: 05/18/2024] Open

Abstract

Pathway enrichment analysis is a ubiquitous computational biology method to interpret a list of genes (typically derived from the association of large-scale omics data with phenotypes of interest) in terms of higher-level, predefined gene sets that share biological function, chromosomal location, or other common features. Among many tools developed so far, Gene Set Enrichment Analysis (GSEA) stands out as one of the pioneering and most widely used methods. Although originally developed for microarray data, GSEA is nowadays extensively utilized for RNA-seq data analysis. Here, we quantitatively assessed the performance of a variety of GSEA modalities and provide guidance in the practical use of GSEA in RNA-seq experiments. We leveraged harmonized RNA-seq datasets available from The Cancer Genome Atlas (TCGA) in combination with large, curated pathway collections from the Molecular Signatures Database to obtain cancer-type-specific target pathway lists across multiple cancer types. We carried out a detailed analysis of GSEA performance using both gene-set and phenotype permutations combined with four different choices for the Kolmogorov-Smirnov enrichment statistic. Based on our benchmarks, we conclude that the classic/unweighted gene-set permutation approach offered comparable or better sensitivity-vs-specificity tradeoffs across cancer types compared with other, more complex and computationally intensive permutation methods. Finally, we analyzed other large cohorts for thyroid cancer and hepatocellular carcinoma. We utilized a new consensus metric, the Enrichment Evidence Score (EES), which showed a remarkable agreement between pathways identified in TCGA and those from other sources, despite differences in cancer etiology. This finding suggests an EES-based strategy to identify a core set of pathways that may be complemented by an expanded set of pathways for downstream exploratory analysis. This work fills the existing gap in current guidelines and benchmarks for the use of GSEA with RNA-seq data and provides a framework to enable detailed benchmarking of other RNA-seq-based pathway analysis tools.

Collapse

Geistlinger L, Mirzayi C, Zohra F, Azhar R, Elsafoury S, Grieve C, Wokaty J, Gamboa-Tuz SD, Sengupta P, Hecht I, Ravikrishnan A, Gonçalves RS, Franzosa E, Raman K, Carey V, Dowd JB, Jones HE, Davis S, Segata N, Huttenhower C, Waldron L. BugSigDB captures patterns of differential abundance across a broad range of host-associated microbial signatures. Nat Biotechnol 2024;42:790-802. [PMID: 37697152 PMCID: PMC11098749 DOI: 10.1038/s41587-023-01872-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2022] [Accepted: 06/20/2023] [Indexed: 09/13/2023]

Affiliation(s)

Ludwig Geistlinger Center for Computational Biomedicine, Harvard Medical School, Boston, MA, USA
Chloe Mirzayi Institute for Implementation Science in Population Health, City University of New York School of Public Health, New York, NY, USA Department of Epidemiology and Biostatistics, City University of New York School of Public Health, New York, NY, USA
Fatima Zohra Institute for Implementation Science in Population Health, City University of New York School of Public Health, New York, NY, USA Department of Epidemiology and Biostatistics, City University of New York School of Public Health, New York, NY, USA
Rimsha Azhar Institute for Implementation Science in Population Health, City University of New York School of Public Health, New York, NY, USA Department of Epidemiology and Biostatistics, City University of New York School of Public Health, New York, NY, USA
Shaimaa Elsafoury Institute for Implementation Science in Population Health, City University of New York School of Public Health, New York, NY, USA Department of Epidemiology and Biostatistics, City University of New York School of Public Health, New York, NY, USA
Clare Grieve Institute for Implementation Science in Population Health, City University of New York School of Public Health, New York, NY, USA Department of Epidemiology and Biostatistics, City University of New York School of Public Health, New York, NY, USA
Jennifer Wokaty Institute for Implementation Science in Population Health, City University of New York School of Public Health, New York, NY, USA Department of Epidemiology and Biostatistics, City University of New York School of Public Health, New York, NY, USA
Samuel David Gamboa-Tuz Institute for Implementation Science in Population Health, City University of New York School of Public Health, New York, NY, USA Department of Epidemiology and Biostatistics, City University of New York School of Public Health, New York, NY, USA
Pratyay Sengupta Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology (IIT) Madras, Chennai, India Robert Bosch Centre for Data Science and Artificial Intelligence, Indian Institute of Technology (IIT) Madras, Chennai, India Centre for Integrative Biology and Systems mEdicine (IBSE), Indian Institute of Technology (IIT) Madras, Chennai, India
Issac Hecht WikiWorks, Boca Raton, FL, USA
Aarthi Ravikrishnan Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), Singapore, Republic of Singapore
Rafael S Gonçalves Center for Computational Biomedicine, Harvard Medical School, Boston, MA, USA
Eric Franzosa Infectious Disease and Microbiome Program, Broad Institute of MIT and Harvard, Cambridge, MA, USA Department of Biostatistics, Harvard T. H. Chan School of Public Health, Boston, MA, USA Harvard Chan Microbiome in Public Health Center, Harvard T. H. Chan School of Public Health, Boston, MA, USA
Karthik Raman Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology (IIT) Madras, Chennai, India Robert Bosch Centre for Data Science and Artificial Intelligence, Indian Institute of Technology (IIT) Madras, Chennai, India Centre for Integrative Biology and Systems mEdicine (IBSE), Indian Institute of Technology (IIT) Madras, Chennai, India
Vincent Carey Channing Division of Network Medicine, Mass General Brigham, Harvard Medical School, Boston, MA, USA
Jennifer B Dowd Leverhulme Centre for Demographic Science, University of Oxford, Oxford, UK
Heidi E Jones Institute for Implementation Science in Population Health, City University of New York School of Public Health, New York, NY, USA Department of Epidemiology and Biostatistics, City University of New York School of Public Health, New York, NY, USA
Sean Davis Departments of Biomedical Informatics and Medicine, University of Colorado Anschutz School of Medicine, Denver, CO, USA
Nicola Segata Department CIBIO, University of Trento, Trento, Italy Istituto Europeo di Oncologia (IEO) IRCSS, Milan, Italy
Curtis Huttenhower Infectious Disease and Microbiome Program, Broad Institute of MIT and Harvard, Cambridge, MA, USA Department of Biostatistics, Harvard T. H. Chan School of Public Health, Boston, MA, USA Harvard Chan Microbiome in Public Health Center, Harvard T. H. Chan School of Public Health, Boston, MA, USA
Levi Waldron Institute for Implementation Science in Population Health, City University of New York School of Public Health, New York, NY, USA. Department of Epidemiology and Biostatistics, City University of New York School of Public Health, New York, NY, USA. Department CIBIO, University of Trento, Trento, Italy.

Collapse

Frost HR. Reconstruction Set Test (RESET): A computationally efficient method for single sample gene set testing based on randomized reduced rank reconstruction error. PLoS Comput Biol 2024;20:e1012084. [PMID: 38683883 PMCID: PMC11081506 DOI: 10.1371/journal.pcbi.1012084] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2023] [Revised: 05/09/2024] [Accepted: 04/17/2024] [Indexed: 05/02/2024] Open

Peng C, Chen Q, Tan S, Shen X, Jiang C. Generalized reporter score-based enrichment analysis for omics data. Brief Bioinform 2024;25:bbae116. [PMID: 38546324 PMCID: PMC10976918 DOI: 10.1093/bib/bbae116] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2023] [Revised: 01/25/2024] [Accepted: 03/01/2024] [Indexed: 06/15/2024] Open

Buzzao D, Castresana-Aguirre M, Guala D, Sonnhammer ELL. Benchmarking enrichment analysis methods with the disease pathway network. Brief Bioinform 2024;25:bbae069. [PMID: 38436561 PMCID: PMC10939300 DOI: 10.1093/bib/bbae069] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2023] [Revised: 01/10/2024] [Accepted: 02/03/2024] [Indexed: 03/05/2024] Open

Lardelli M, Baer L, Hin N, Allen A, Pederson SM, Barthelson K. The Use of Zebrafish in Transcriptome Analysis of the Early Effects of Mutations Causing Early Onset Familial Alzheimer's Disease and Other Inherited Neurodegenerative Conditions. J Alzheimers Dis 2024;99:S367-S381. [PMID: 37742650 DOI: 10.3233/jad-230522] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/26/2023]

Jablonski KP, Beerenwinkel N. Coherent pathway enrichment estimation by modeling inter-pathway dependencies using regularized regression. Bioinformatics 2023;39:btad522. [PMID: 37610338 PMCID: PMC10471899 DOI: 10.1093/bioinformatics/btad522] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2022] [Revised: 07/04/2023] [Accepted: 08/22/2023] [Indexed: 08/24/2023] Open

Frost HR. Reconstruction Set Test (RESET): a computationally efficient method for single sample gene set testing based on randomized reduced rank reconstruction error. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.04.03.535366. [PMID: 37066315 PMCID: PMC10104009 DOI: 10.1101/2023.04.03.535366] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/18/2023]

Angel-Velez D, Meese T, Hedia M, Fernandez-Montoro A, De Coster T, Pascottini OB, Van Nieuwerburgh F, Govaere J, Van Soom A, Pavani K, Smits K. Transcriptomics Reveal Molecular Differences in Equine Oocytes Vitrified before and after In Vitro Maturation. Int J Mol Sci 2023;24:ijms24086915. [PMID: 37108081 PMCID: PMC10138936 DOI: 10.3390/ijms24086915] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2023] [Revised: 03/27/2023] [Accepted: 04/04/2023] [Indexed: 04/29/2023] Open

Affiliation(s)

Daniel Angel-Velez Department of Internal Medicine, Reproduction and Population Medicine, Faculty of Veterinary Medicine, Ghent University, Salisburylaan 133, 9820 Merelbeke, Belgium Research Group in Animal Sciences-INCA-CES, Universidad CES, Medellin 050021, Colombia
Tim Meese Laboratory for Pharmaceutical Biotechnology, Faculty of Pharmaceutical Science, Ghent University, 9000 Ghent, Belgium
Mohamed Hedia Department of Internal Medicine, Reproduction and Population Medicine, Faculty of Veterinary Medicine, Ghent University, Salisburylaan 133, 9820 Merelbeke, Belgium Department of Theriogenology, Faculty of Veterinary Medicine, Cairo University, Giza 12211, Egypt
Andrea Fernandez-Montoro Department of Internal Medicine, Reproduction and Population Medicine, Faculty of Veterinary Medicine, Ghent University, Salisburylaan 133, 9820 Merelbeke, Belgium
Tine De Coster Department of Internal Medicine, Reproduction and Population Medicine, Faculty of Veterinary Medicine, Ghent University, Salisburylaan 133, 9820 Merelbeke, Belgium
Osvaldo Bogado Pascottini Department of Internal Medicine, Reproduction and Population Medicine, Faculty of Veterinary Medicine, Ghent University, Salisburylaan 133, 9820 Merelbeke, Belgium
Filip Van Nieuwerburgh Laboratory for Pharmaceutical Biotechnology, Faculty of Pharmaceutical Science, Ghent University, 9000 Ghent, Belgium
Jan Govaere Department of Internal Medicine, Reproduction and Population Medicine, Faculty of Veterinary Medicine, Ghent University, Salisburylaan 133, 9820 Merelbeke, Belgium
Ann Van Soom Department of Internal Medicine, Reproduction and Population Medicine, Faculty of Veterinary Medicine, Ghent University, Salisburylaan 133, 9820 Merelbeke, Belgium
Krishna Pavani Department of Internal Medicine, Reproduction and Population Medicine, Faculty of Veterinary Medicine, Ghent University, Salisburylaan 133, 9820 Merelbeke, Belgium Department for Reproductive Medicine, Ghent University Hospital, Corneel Heymanslaan 10, 9000 Gent, Belgium
Katrien Smits Department of Internal Medicine, Reproduction and Population Medicine, Faculty of Veterinary Medicine, Ghent University, Salisburylaan 133, 9820 Merelbeke, Belgium

Collapse

Zhao K, Rhee SY. Interpreting omics data with pathway enrichment analysis. Trends Genet 2023;39:308-319. [PMID: 36750393 DOI: 10.1016/j.tig.2023.01.003] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2022] [Revised: 11/24/2022] [Accepted: 01/13/2023] [Indexed: 02/09/2023]

Whittaker CA, Kucukural A, Gates C, Wilkins OM, Bell GW, Hutchinson JN, Polson SW, Dragon J. Functional Annotation Routines Used by ABRF Bioinformatics Core Facilities - Observations, Comparisons, and Considerations. J Biomol Tech 2023;34:3fc1f5fe.0b74b9db. [PMID: 37089874 PMCID: PMC10121236 DOI: 10.7171/3fc1f5fe.0b74b9db] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/30/2023]

Ye J, Feng JW, Wu WX, Qi GF, Wang F, Hu J, Hong LZ, Liu SY, Jiang Y. Microarray profiling identifies hsa_circ_0082003 as a novel tumor promoter for papillary thyroid carcinoma. J Endocrinol Invest 2023;46:509-522. [PMID: 36115894 DOI: 10.1007/s40618-022-01922-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/07/2022] [Accepted: 09/11/2022] [Indexed: 11/30/2022]

Lu Y, Pang Z, Xia J. Comprehensive investigation of pathway enrichment methods for functional interpretation of LC-MS global metabolomics data. Brief Bioinform 2023;24:bbac553. [PMID: 36572652 PMCID: PMC9851290 DOI: 10.1093/bib/bbac553] [Citation(s) in RCA: 21] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2022] [Revised: 10/31/2022] [Accepted: 11/15/2022] [Indexed: 12/28/2022] Open

Chen JW, Shrestha L, Green G, Leier A, Marquez-Lago TT. The hitchhikers' guide to RNA sequencing and functional analysis. Brief Bioinform 2023;24:bbac529. [PMID: 36617463 PMCID: PMC9851315 DOI: 10.1093/bib/bbac529] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2022] [Revised: 10/18/2022] [Accepted: 11/07/2022] [Indexed: 01/10/2023] Open

Cousins H, Hall T, Guo Y, Tso L, Tzeng KTH, Cong L, Altman RB. Gene set proximity analysis: expanding gene set enrichment analysis through learned geometric embeddings, with drug-repurposing applications in COVID-19. Bioinformatics 2023;39:btac735. [PMID: 36394254 PMCID: PMC9805577 DOI: 10.1093/bioinformatics/btac735] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2022] [Revised: 09/27/2022] [Accepted: 11/16/2022] [Indexed: 11/18/2022] Open

Liu Z, Gao J, Gu R, Shi Y, Hu H, Liu J, Huang J, Zhong C, Zhou W, Yang Y, Gong C. Comprehensive Analysis of Transcriptomics and Genetic Alterations Identifies Potential Mechanisms Underlying Anthracycline Therapy Resistance in Breast Cancer. Biomolecules 2022;12:biom12121834. [PMID: 36551262 PMCID: PMC9775906 DOI: 10.3390/biom12121834] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2022] [Revised: 12/01/2022] [Accepted: 12/06/2022] [Indexed: 12/14/2022] Open

Affiliation(s)

Zihao Liu Breast Tumor Center, Sun Yat-sen Memorial Hospital, Sun Yat-sen University, Guangzhou 510120, China Department of Breast and Thyroid Surgery, The Second Clinical Medical College of Jinan University, The First Affiliated Hospital of Southern University of Science and Technology, Shenzhen People’s Hospital, Shenzhen 518020, China
Jingbo Gao Breast Tumor Center, Sun Yat-sen Memorial Hospital, Sun Yat-sen University, Guangzhou 510120, China
Ran Gu Breast Tumor Center, Sun Yat-sen Memorial Hospital, Sun Yat-sen University, Guangzhou 510120, China
Yu Shi Breast Tumor Center, Sun Yat-sen Memorial Hospital, Sun Yat-sen University, Guangzhou 510120, China
Hong Hu Department of Breast and Thyroid Surgery, The Second Clinical Medical College of Jinan University, The First Affiliated Hospital of Southern University of Science and Technology, Shenzhen People’s Hospital, Shenzhen 518020, China
Jianlan Liu Department of Pathology, The Second Clinical Medical College of Jinan University, The First Affiliated Hospital of Southern University of Science and Technology, Shenzhen People’s Hospital, Shenzhen 518020, China
Jiefeng Huang Department of Breast and Thyroid Surgery, The Second Clinical Medical College of Jinan University, The First Affiliated Hospital of Southern University of Science and Technology, Shenzhen People’s Hospital, Shenzhen 518020, China
Caineng Zhong Department of Breast and Thyroid Surgery, The Second Clinical Medical College of Jinan University, The First Affiliated Hospital of Southern University of Science and Technology, Shenzhen People’s Hospital, Shenzhen 518020, China
Wenbin Zhou Department of Breast and Thyroid Surgery, The Second Clinical Medical College of Jinan University, The First Affiliated Hospital of Southern University of Science and Technology, Shenzhen People’s Hospital, Shenzhen 518020, China
Yaping Yang Breast Tumor Center, Sun Yat-sen Memorial Hospital, Sun Yat-sen University, Guangzhou 510120, China Correspondence: (Y.Y.); or (C.G.)
Chang Gong Breast Tumor Center, Sun Yat-sen Memorial Hospital, Sun Yat-sen University, Guangzhou 510120, China Correspondence: (Y.Y.); or (C.G.)

Collapse

Zhou CD, Pettersson A, Plym A, Tyekucheva S, Penney KL, Sesso HD, Kantoff PW, Mucci LA, Stopsack KH. Differences in Prostate Cancer Transcriptomes by Age at Diagnosis: Are Primary Tumors from Older Men Inherently Different? Cancer Prev Res (Phila) 2022;15:815-825. [PMID: 36125434 PMCID: PMC9722523 DOI: 10.1158/1940-6207.capr-22-0212] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2022] [Revised: 08/03/2022] [Accepted: 09/01/2022] [Indexed: 01/31/2023]

Zeng L, Yang K, Zhang T, Zhu X, Hao W, Chen H, Ge J. Research progress of single-cell transcriptome sequencing in autoimmune diseases and autoinflammatory disease: A review. J Autoimmun 2022;133:102919. [PMID: 36242821 DOI: 10.1016/j.jaut.2022.102919] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2022] [Revised: 09/16/2022] [Accepted: 09/19/2022] [Indexed: 12/07/2022]

Abstract

Autoimmunity refers to the phenomenon that the body's immune system produces antibodies or sensitized lymphocytes to its own tissues to cause an immune response. Immune disorders caused by autoimmunity can mediate autoimmune diseases. Autoimmune diseases have complicated pathogenesis due to the many types of cells involved, and the mechanism is still unclear. The emergence of single-cell research technology can solve the problem that ordinary transcriptome technology cannot be accurate to cell type. It provides unbiased results through independent analysis of cells in tissues and provides more mRNA information for identifying cell subpopulations, which provides a novel approach to study disruption of immune tolerance and disturbance of pro-inflammatory pathways on a cellular basis. It may fundamentally change the understanding of molecular pathways in the pathogenesis of autoimmune diseases and develop targeted drugs. Single-cell transcriptome sequencing (scRNA-seq) has been widely applied in autoimmune diseases, which provides a powerful tool for demonstrating the cellular heterogeneity of tissues involved in various immune inflammations, identifying pathogenic cell populations, and revealing the mechanism of disease occurrence and development. This review describes the principles of scRNA-seq, introduces common sequencing platforms and practical procedures, and focuses on the progress of scRNA-seq in 41 autoimmune diseases, which include 9 systemic autoimmune diseases and autoinflammatory diseases (rheumatoid arthritis, systemic lupus erythematosus, etc.) and 32 organ-specific autoimmune diseases (5 Skin diseases, 3 Nervous system diseases, 4 Eye diseases, 2 Respiratory system diseases, 2 Circulatory system diseases, 6 Liver, Gallbladder and Pancreas diseases, 2 Gastrointestinal system diseases, 3 Muscle, Bones and joint diseases, 3 Urinary system diseases, 2 Reproductive system diseases). This review also prospects the molecular mechanism targets of autoimmune diseases from the multi-molecular level and multi-dimensional analysis combined with single-cell multi-omics sequencing technology (such as scRNA-seq, Single cell ATAC-seq and single cell immune group library sequencing), which provides a reference for further exploring the pathogenesis and marker screening of autoimmune diseases and autoimmune inflammatory diseases in the future.

Collapse

Wieder C, Lai RPJ, Ebbels TMD. Single sample pathway analysis in metabolomics: performance evaluation and application. BMC Bioinformatics 2022;23:481. [PMID: 36376837 PMCID: PMC9664704 DOI: 10.1186/s12859-022-05005-1] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2022] [Accepted: 10/25/2022] [Indexed: 11/15/2022] Open

Jiménez‐Santos MJ, García‐Martín S, Fustero‐Torre C, Di Domenico T, Gómez‐López G, Al‐Shahrour F. Bioinformatics roadmap for therapy selection in cancer genomics. Mol Oncol 2022;16:3881-3908. [PMID: 35811332 PMCID: PMC9627786 DOI: 10.1002/1878-0261.13286] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2022] [Revised: 06/22/2022] [Accepted: 07/08/2022] [Indexed: 12/24/2022] Open

Mishra BH, Sievänen H, Raitoharju E, Mononen N, Viikari J, Juonala M, Laaksonen M, Hutri-Kähönen N, Kähönen M, Raitakari OT, Lehtimäki T, Mishra PP. Gene set analysis of transcriptomics data identifies new biological processes associated with early markers of atherosclerosis but not with those of osteoporosis: Atherosclerosis-osteoporosis co/multimorbidity study in the Young Finns Study. Atherosclerosis 2022;361:1-9. [PMID: 36252457 DOI: 10.1016/j.atherosclerosis.2022.10.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/13/2022] [Revised: 10/06/2022] [Accepted: 10/06/2022] [Indexed: 12/15/2022]

Affiliation(s)

Binisha H Mishra Department of Clinical Chemistry, Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland; Finnish Cardiovascular Research Center Tampere, Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland; Department of Clinical Chemistry, Fimlab Laboratories, Tampere, Finland.
Harri Sievänen The UKK Institute for Health Promotion Research, Tampere, Finland
Emma Raitoharju Molecular Epidemiology, Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland; Tampere University Hospital, Tampere, Finland
Nina Mononen Department of Clinical Chemistry, Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland; Finnish Cardiovascular Research Center Tampere, Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland; Department of Clinical Chemistry, Fimlab Laboratories, Tampere, Finland
Jorma Viikari Department of Medicine, University of Turku, Turku, Finland; Division of Medicine, Turku University Hospital, Turku, Finland
Markus Juonala Department of Medicine, University of Turku, Turku, Finland; Division of Medicine, Turku University Hospital, Turku, Finland
Marika Laaksonen Fazer Lab Research, Oy Karl Fazer Ab, Helsinki, Finland
Nina Hutri-Kähönen Department of Paediatrics, Tampere University Hospital, Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland
Mika Kähönen Finnish Cardiovascular Research Center Tampere, Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland; Department of Clinical Physiology, Tampere University Hospital, Tampere, Finland
Olli T Raitakari Research Centre of Applied and Preventive Cardiovascular Medicine, University of Turku, Turku, Finland; Department of Clinical Physiology and Nuclear Medicine, Turku University Hospital, Turku, Finland; Centre for Population Health Research, University of Turku and Turku University Hospital, Turku, Finland
Terho Lehtimäki Department of Clinical Chemistry, Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland; Finnish Cardiovascular Research Center Tampere, Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland; Department of Clinical Chemistry, Fimlab Laboratories, Tampere, Finland
Pashupati P Mishra Department of Clinical Chemistry, Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland; Finnish Cardiovascular Research Center Tampere, Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland; Department of Clinical Chemistry, Fimlab Laboratories, Tampere, Finland

Collapse

Makrooni MA, O’Shea D, Geeleher P, Seoighe C. Random-effects meta-analysis of effect sizes as a unified framework for gene set analysis. PLoS Comput Biol 2022;18:e1010278. [PMID: 36197939 PMCID: PMC9576052 DOI: 10.1371/journal.pcbi.1010278] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2022] [Revised: 10/17/2022] [Accepted: 09/18/2022] [Indexed: 11/06/2022] Open

Abstract

Gene set analysis (GSA) remains a common step in genome-scale studies because it can reveal insights that are not apparent from results obtained for individual genes. Many different computational tools are applied for GSA, which may be sensitive to different types of signals; however, most methods implicitly test whether there are differences in the distribution of the effect of some experimental condition between genes in gene sets of interest. We have developed a unifying framework for GSA that first fits effect size distributions, and then tests for differences in these distributions between gene sets. These differences can be in the proportions of genes that are perturbed or in the sign or size of the effects. Inspired by statistical meta-analysis, we take into account the uncertainty in effect size estimates by reducing the influence of genes with greater uncertainty on the estimation of distribution parameters. We demonstrate, using simulation and by application to real data, that this approach provides significant gains in performance over existing methods. Furthermore, the statistical tests carried out are defined in terms of effect sizes, rather than the results of prior statistical tests measuring these changes, which leads to improved interpretability and greater robustness to variation in sample sizes.

The role of gene set analysis is to identify groups of genes that are perturbed in a genomics experiment. There are many tools available for this task and they do not all test for the same types of changes. Here we propose a new way to carry out gene set analysis that involves first working out the distribution of the group effect in the gene set and then comparing this distribution to the equivalent distribution in other genes. Tests performed by existing tools for gene set analysis can be related to different comparisons in these distributions of group effects. A unified framework for gene set analysis provides for more explicit null hypotheses against which to test sets of genes for different types of responses to the experimental conditions. These results are more interpretable, because the group effect distributions can be compared visually, providing an indication of how the experimental effect differs between the gene sets.

Collapse

Lee AJ, Mould DL, Crawford J, Hu D, Powers RK, Doing G, Costello JC, Hogan DA, Greene CS. SOPHIE: Generative Neural Networks Separate Common and Specific Transcriptional Responses. GENOMICS, PROTEOMICS & BIOINFORMATICS 2022;20:912-927. [PMID: 36216026 PMCID: PMC10025681 DOI: 10.1016/j.gpb.2022.09.011] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/22/2022] [Revised: 09/09/2022] [Accepted: 09/30/2022] [Indexed: 11/06/2022]

Datasets for gene expression profiles of head and neck squamous cell carcinoma and lung cancer treated or not by PD1/PD-L1 inhibitors. Data Brief 2022;44:108556. [PMID: 36111282 PMCID: PMC9467865 DOI: 10.1016/j.dib.2022.108556] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2022] [Revised: 08/19/2022] [Accepted: 08/22/2022] [Indexed: 11/22/2022] Open

Xu S, Chen Z, Ge L, Ma C, He Q, Liu W, Zhang L, Zhou L. Identification of potential biomarkers and pathogenesis in neutrophil-predominant severe asthma: A comprehensive bioinformatics analysis. Medicine (Baltimore) 2022;101:e30661. [PMID: 36197221 PMCID: PMC9509178 DOI: 10.1097/md.0000000000030661] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/09/2023] Open

Abstract

BACKGROUND

Airway neutrophilia has been associated with asthma severity and asthma exacerbations. This study attempted to identify biomarkers, pathogenesis, and therapeutic molecular targets for severe asthma in neutrophils using bioinformatics analysis.

METHODS

Fifteen healthy controls and 3 patients with neutrophilic severe asthma were screened from the Gene Expression Omnibus (GEO) database. Based on the analysis of differentially expressed genes (DEGs), functional and pathway enrichment analyses, gene set enrichment analysis, protein-protein interaction network construction, and analysis were performed. Moreover, small-molecule drug candidates have also been identified.

RESULTS

Three hundred and three upregulated and 59 downregulated genes were identified. Gene ontology function enrichment analyses were primarily related to inflammatory response, immune response, leukocyte migration, neutrophil chemotaxis, mitogen-activated protein kinase cascade, Jun N-terminal kinase cascade, I-kappaB kinase/nuclear factor-κB, and MyD88-dependent toll-like receptor signaling pathway. Pathway enrichment analyses and gene set enrichment analysis were mainly involved in cytokine-cytokine receptor interaction, the TNF signaling pathway, leukocyte transendothelial migration, and the NOD-like receptor signaling pathway. Furthermore, 1 important module and 10 hub genes (CXCL8, TLR2, CXCL1, ICAM1, CXCR4, FPR2, SELL, PTEN, TREM1, and LEP) were identified in the protein-protein interaction network. Moreover, indoprofen, mimosine, STOCK1N-35874, trapidil, iloprost, aminoglutethimide, ajmaline, levobunolol, ethionamide, cefaclor, dimenhydrinate, and bethanechol are potential drugs for the treatment of neutrophil-predominant severe asthma.

CONCLUSION

This study identified potential biomarkers, pathogenesis, and therapeutic molecular targets for neutrophil-predominant severe asthma.

Collapse

Zhong H, Wang Z, Wei X, Liu Y, Huang X, Mo X, Tang W. Prognostic and immunological role of SERPINH1 in pan-cancer. Front Genet 2022;13:900495. [PMID: 36105106 PMCID: PMC9465257 DOI: 10.3389/fgene.2022.900495] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2022] [Accepted: 07/05/2022] [Indexed: 11/13/2022] Open

Kagiwada H, Motono C, Horimoto K, Fukui K. Phosprof: pathway analysis database of drug response based on phosphorylation activity measurements. Database (Oxford) 2022;2022:baac072. [PMID: 35994309 PMCID: PMC9394491 DOI: 10.1093/database/baac072] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2022] [Revised: 07/19/2022] [Accepted: 08/17/2022] [Indexed: 06/15/2023]

Androulakis IP. Towards a comprehensive assessment of QSP models: what would it take? J Pharmacokinet Pharmacodyn 2022:10.1007/s10928-022-09820-0. [PMID: 35962928 PMCID: PMC9922790 DOI: 10.1007/s10928-022-09820-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2022] [Accepted: 07/15/2022] [Indexed: 10/15/2022]

Abstract

Quantitative Systems Pharmacology (QSP) has emerged as a powerful ensemble of approaches aiming at developing integrated mathematical and computational models elucidating the complex interactions between pharmacology, physiology, and disease. As the field grows and matures its applications expand beyond the boundaries of research and development and slowly enter the decision making and regulatory arenas. However, widespread acceptance and eventual adoption of a new modeling approach requires assessment criteria and quantifiable metrics that establish credibility and increase confidence in model predictions. QSP aims to provide an integrated understanding of pathology in the context of therapeutic interventions. Because of its ambitious nature and the fact that QSP emerged in an uncoordinated manner as a result of activities distributed across organizations and academic institutions, high entropy characterizes the tools, methods, and computational methodologies and approaches used. The eventual acceptance of QSP model predictions as supporting material for an application to a regulatory agency will require that two key aspects are considered: (1) increase confidence in the QSP framework, which drives standardization and assessment; and (2) careful articulation of the expectations. Both rely heavily on our ability to rigorously and consistently assess QSP models. In this manuscript, we wish to discuss the meaning and purpose of such an assessment in the context of QSP model development and elaborate on the differentiating features of QSP that render such an endeavor challenging. We argue that QSP establishes a conceptual, integrative framework rather than a specific and well-defined computational methodology. QSP elicits the use of a wide variety of modeling and computational methodologies optimized with respect to specific applications and available data modalities, which exceed the data structures employed by chemometrics and PK/PD models. While the range of options fosters creativity and promises to substantially advance our ability to design pharmaceutical interventions rationally and optimally, our expectations of QSP models need to be clearly articulated and agreed on, with assessment emphasizing the scope of QSP studies rather than the methods used. Nevertheless, QSP should not be considered an independent approach, rather one of many in the broader continuum of computational models.

Collapse

Oh S, Geistlinger L, Ramos M, Blankenberg D, van den Beek M, Taroni JN, Carey VJ, Greene CS, Waldron L, Davis S. GenomicSuperSignature facilitates interpretation of RNA-seq experiments through robust, efficient comparison to public databases. Nat Commun 2022;13:3695. [PMID: 35760813 PMCID: PMC9237024 DOI: 10.1038/s41467-022-31411-3] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2021] [Accepted: 06/14/2022] [Indexed: 02/04/2023] Open

Cerulo L, Pagnotta SM. massiveGST: A Mann-Whitney-Wilcoxon Gene-Set Test Tool That Gives Meaning to Gene-Set Enrichment Analysis. ENTROPY 2022;24:e24050739. [PMID: 35626622 PMCID: PMC9140214 DOI: 10.3390/e24050739] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/11/2022] [Revised: 05/16/2022] [Accepted: 05/19/2022] [Indexed: 01/27/2023]

Mubeen S, Tom Kodamullil A, Hofmann-Apitius M, Domingo-Fernández D. On the influence of several factors on pathway enrichment analysis. Brief Bioinform 2022;23:bbac143. [PMID: 35453140 PMCID: PMC9116215 DOI: 10.1093/bib/bbac143] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2022] [Revised: 03/21/2022] [Accepted: 03/30/2022] [Indexed: 02/01/2023] Open

Nguyen QP, Hoen AG, Frost HR. CBEA: Competitive balances for taxonomic enrichment analysis. PLoS Comput Biol 2022;18:e1010091. [PMID: 35584140 PMCID: PMC9154102 DOI: 10.1371/journal.pcbi.1010091] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2021] [Revised: 05/31/2022] [Accepted: 04/08/2022] [Indexed: 12/15/2022] Open

Jessica A. C, Rocío L. C. Differential gene expression in cancer: An overrated analysis? Curr Bioinform 2022. [DOI: 10.2174/1574893617666220422134525] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]

Functional Enrichment Analysis of Regulatory Elements. Biomedicines 2022;10:biomedicines10030590. [PMID: 35327392 PMCID: PMC8945021 DOI: 10.3390/biomedicines10030590] [Citation(s) in RCA: 43] [Impact Index Per Article: 21.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2022] [Revised: 02/22/2022] [Accepted: 02/25/2022] [Indexed: 01/27/2023] Open

Lycopene Supplementation to Serum-Free Maturation Medium Improves In Vitro Bovine Embryo Development and Quality and Modulates Embryonic Transcriptomic Profile. Antioxidants (Basel) 2022;11:antiox11020344. [PMID: 35204226 PMCID: PMC8868338 DOI: 10.3390/antiox11020344] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2022] [Revised: 02/02/2022] [Accepted: 02/08/2022] [Indexed: 02/08/2023] Open

Identification of MAD2L1 as a Potential Biomarker in Hepatocellular Carcinoma via Comprehensive Bioinformatics Analysis. BIOMED RESEARCH INTERNATIONAL 2022;2022:9868022. [PMID: 35132379 PMCID: PMC8817109 DOI: 10.1155/2022/9868022] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/30/2021] [Revised: 11/19/2021] [Accepted: 01/15/2022] [Indexed: 11/17/2022]

Huang JB, Hu BB, He R, He L, Zou C, Man CF, Fan Y. Analysis of N6-Methyladenosine Methylome in Adenocarcinoma of Esophagogastric Junction. Front Genet 2022;12:787800. [PMID: 35140740 PMCID: PMC8820482 DOI: 10.3389/fgene.2021.787800] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2021] [Accepted: 12/30/2021] [Indexed: 11/21/2022] Open

Marczyk M, Macioszek A, Tobiasz J, Polanska J, Zyla J. Importance of SNP Dependency Correction and Association Integration for Gene Set Analysis in Genome-Wide Association Studies. Front Genet 2021;12:767358. [PMID: 34956320 PMCID: PMC8696167 DOI: 10.3389/fgene.2021.767358] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2021] [Accepted: 11/10/2021] [Indexed: 11/13/2022] Open

Abstract

A typical genome-wide association study (GWAS) analyzes millions of single-nucleotide polymorphisms (SNPs), several of which are in a region of the same gene. To conduct gene set analysis (GSA), information from SNPs needs to be unified at the gene level. A widely used practice is to use only the most relevant SNP per gene; however, there are other methods of integration that could be applied here. Also, the problem of nonrandom association of alleles at two or more loci is often neglected. Here, we tested the impact of incorporation of different integrations and linkage disequilibrium (LD) correction on the performance of several GSA methods. Matched normal and breast cancer samples from The Cancer Genome Atlas database were used to evaluate the performance of six GSA algorithms: Coincident Extreme Ranks in Numerical Observations (CERNO), Gene Set Enrichment Analysis (GSEA), GSEA-SNP, improved GSEA for GWAS (i-GSEA4GWAS), Meta-Analysis Gene-set Enrichment of variaNT Associations (MAGENTA), and Over-Representation Analysis (ORA). Association of SNPs to phenotype was calculated using modified McNemar's test. Results for SNPs mapped to the same gene were integrated using Fisher and Stouffer methods and compared with the minimum p-value method. Four common measures were used to quantify the performance of all combinations of methods. Results of GSA analysis on GWAS were compared to the one performed on gene expression data. Comparing all evaluation metrics across different GSA algorithms, integrations, and LD correction, we highlighted CERNO, and MAGENTA with Stouffer as the most efficient. Applying LD correction increased prioritization and specificity of enrichment outcomes for all tested algorithms. When Fisher or Stouffer were used with LD, sensitivity and reproducibility were also better. Using any integration method was beneficial in comparison with a minimum p-value method in specific combinations. The correlation between GSA results from genomic and transcriptomic level was the highest when Stouffer integration was combined with LD correction. We thoroughly evaluated different approaches to GSA in GWAS in terms of performance to guide others to select the most effective combinations. We showed that LD correction and Stouffer integration could increase the performance of enrichment analysis and encourage the usage of these techniques.

Collapse

Marini F, Ludt A, Linke J, Strauch K. GeneTonic: an R/Bioconductor package for streamlining the interpretation of RNA-seq data. BMC Bioinformatics 2021;22:610. [PMID: 34949163 PMCID: PMC8697502 DOI: 10.1186/s12859-021-04461-5] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2021] [Accepted: 10/26/2021] [Indexed: 01/22/2023] Open

Abstract

BACKGROUND

The interpretation of results from transcriptome profiling experiments via RNA sequencing (RNA-seq) can be a complex task, where the essential information is distributed among different tabular and list formats-normalized expression values, results from differential expression analysis, and results from functional enrichment analyses. A number of tools and databases are widely used for the purpose of identification of relevant functional patterns, yet often their contextualization within the data and results at hand is not straightforward, especially if these analytic components are not combined together efficiently.

RESULTS

We developed the GeneTonic software package, which serves as a comprehensive toolkit for streamlining the interpretation of functional enrichment analyses, by fully leveraging the information of expression values in a differential expression context. GeneTonic is implemented in R and Shiny, leveraging packages that enable HTML-based interactive visualizations for executing drilldown tasks seamlessly, viewing the data at a level of increased detail. GeneTonic is integrated with the core classes of existing Bioconductor workflows, and can accept the output of many widely used tools for pathway analysis, making this approach applicable to a wide range of use cases. Users can effectively navigate interlinked components (otherwise available as flat text or spreadsheet tables), bookmark features of interest during the exploration sessions, and obtain at the end a tailored HTML report, thus combining the benefits of both interactivity and reproducibility.

CONCLUSION

GeneTonic is distributed as an R package in the Bioconductor project ( https://bioconductor.org/packages/GeneTonic/ ) under the MIT license. Offering both bird's-eye views of the components of transcriptome data analysis and the detailed inspection of single genes, individual signatures, and their relationships, GeneTonic aims at simplifying the process of interpretation of complex and compelling RNA-seq datasets for many researchers with different expertise profiles.

Collapse

Mubeen S, Bharadhwaj VS, Gadiya Y, Hofmann-Apitius M, Kodamullil AT, Domingo-Fernández D. DecoPath: a web application for decoding pathway enrichment analysis. NAR Genom Bioinform 2021;3:lqab087. [PMID: 34568823 PMCID: PMC8459727 DOI: 10.1093/nargab/lqab087] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2021] [Revised: 08/31/2021] [Accepted: 09/14/2021] [Indexed: 12/16/2022] Open

Ramos M, Geistlinger L, Oh S, Schiffer L, Azhar R, Kodali H, de Bruijn I, Gao J, Carey VJ, Morgan M, Waldron L. Multiomic Integration of Public Oncology Databases in Bioconductor. JCO Clin Cancer Inform 2021;4:958-971. [PMID: 33119407 PMCID: PMC7608653 DOI: 10.1200/cci.19.00119] [Citation(s) in RCA: 37] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023] Open

Abstract

PURPOSE

Investigations of the molecular basis for the development, progression, and treatment of cancer increasingly use complementary genomic assays to gather multiomic data, but management and analysis of such data remain complex. The cBioPortal for cancer genomics currently provides multiomic data from > 260 public studies, including The Cancer Genome Atlas (TCGA) data sets, but integration of different data types remains challenging and error prone for computational methods and tools using these resources. Recent advances in data infrastructure within the Bioconductor project enable a novel and powerful approach to creating fully integrated representations of these multiomic, pan-cancer databases.

METHODS

We provide a set of R/Bioconductor packages for working with TCGA legacy data and cBioPortal data, with special considerations for loading time; efficient representations in and out of memory; analysis platform; and an integrative framework, such as MultiAssayExperiment. Large methylation data sets are provided through out-of-memory data representation to provide responsive loading times and analysis capabilities on machines with limited memory.

RESULTS

We developed the curatedTCGAData and cBioPortalData R/Bioconductor packages to provide integrated multiomic data sets from the TCGA legacy database and the cBioPortal web application programming interface using the MultiAssayExperiment data structure. This suite of tools provides coordination of diverse experimental assays with clinicopathological data with minimal data management burden, as demonstrated through several greatly simplified multiomic and pan-cancer analyses.

CONCLUSION

These integrated representations enable analysts and tool developers to apply general statistical and plotting methods to extensive multiomic data through user-friendly commands and documented examples.

Collapse

Affiliation(s)

Marcel Ramos Graduate School of Public Health and Health Policy, City University of New York, New York, NY.,Institute for Implementation Science and Population Health, City University of New York, New York, NY.,Roswell Park Comprehensive Cancer Center, Buffalo, NY
Ludwig Geistlinger Graduate School of Public Health and Health Policy, City University of New York, New York, NY.,Institute for Implementation Science and Population Health, City University of New York, New York, NY
Sehyun Oh Graduate School of Public Health and Health Policy, City University of New York, New York, NY.,Institute for Implementation Science and Population Health, City University of New York, New York, NY
Lucas Schiffer Graduate School of Public Health and Health Policy, City University of New York, New York, NY.,Institute for Implementation Science and Population Health, City University of New York, New York, NY.,Section of Computational Biomedicine, Boston University School of Medicine, Boston, MA
Rimsha Azhar Graduate School of Public Health and Health Policy, City University of New York, New York, NY.,Institute for Implementation Science and Population Health, City University of New York, New York, NY.,Department of Healthcare Policy and Research, Weill Cornell Medicine, New York, NY
Hanish Kodali Graduate School of Public Health and Health Policy, City University of New York, New York, NY.,Institute for Implementation Science and Population Health, City University of New York, New York, NY
Ino de Bruijn Marie-Josée and Henry R. Kravis Center for Molecular Oncology, Memorial Sloan Kettering Cancer Center, New York, NY
Jianjiong Gao Marie-Josée and Henry R. Kravis Center for Molecular Oncology, Memorial Sloan Kettering Cancer Center, New York, NY.,Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY
Vincent J Carey Channing Division of Network Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA
Martin Morgan Roswell Park Comprehensive Cancer Center, Buffalo, NY
Levi Waldron Graduate School of Public Health and Health Policy, City University of New York, New York, NY.,Institute for Implementation Science and Population Health, City University of New York, New York, NY

Collapse

Application of Bioinformatics Methods to Identify Key Genes and Functions in Chronic Pelvic Pain. EVIDENCE-BASED COMPLEMENTARY AND ALTERNATIVE MEDICINE 2021;2021:7257405. [PMID: 34381521 PMCID: PMC8352682 DOI: 10.1155/2021/7257405] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/30/2021] [Accepted: 07/19/2021] [Indexed: 11/17/2022]

Gene expression analysis method integration and co-expression module detection applied to rare glucide metabolism disorders using ExpHunterSuite. Sci Rep 2021;11:15062. [PMID: 34301987 PMCID: PMC8302605 DOI: 10.1038/s41598-021-94343-w] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2021] [Accepted: 07/09/2021] [Indexed: 12/13/2022] Open

Bu D, Luo H, Huo P, Wang Z, Zhang S, He Z, Wu Y, Zhao L, Liu J, Guo J, Fang S, Cao W, Yi L, Zhao Y, Kong L. KOBAS-i: intelligent prioritization and exploratory visualization of biological functions for gene enrichment analysis. Nucleic Acids Res 2021;49:W317-W325. [PMID: 34086934 PMCID: PMC8265193 DOI: 10.1093/nar/gkab447] [Citation(s) in RCA: 676] [Impact Index Per Article: 225.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2021] [Revised: 04/24/2021] [Accepted: 05/09/2021] [Indexed: 12/20/2022] Open

Cheng X, Yan J, Liu Y, Wang J, Taubert S. eVITTA: a web-based visualization and inference toolbox for transcriptome analysis. Nucleic Acids Res 2021;49:W207-W215. [PMID: 34019643 PMCID: PMC8218201 DOI: 10.1093/nar/gkab366] [Citation(s) in RCA: 36] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2021] [Revised: 04/12/2021] [Accepted: 05/04/2021] [Indexed: 12/12/2022] Open

Affiliation(s)

Xuanjin Cheng Centre for Molecular Medicine and Therapeutics, The University of British Columbia, Vancouver, British Columbia, Canada.,British Columbia Children's Hospital Research Institute, The University of British Columbia, Vancouver, British Columbia, Canada.,Department of Medical Genetics, The University of British Columbia, Vancouver, British Columbia, Canada
Junran Yan Centre for Molecular Medicine and Therapeutics, The University of British Columbia, Vancouver, British Columbia, Canada.,British Columbia Children's Hospital Research Institute, The University of British Columbia, Vancouver, British Columbia, Canada.,Graduate Program for Cell and Developmental Biology, The University of British Columbia, Vancouver, British Columbia, Canada
Yongxing Liu Centre for Molecular Medicine and Therapeutics, The University of British Columbia, Vancouver, British Columbia, Canada.,British Columbia Children's Hospital Research Institute, The University of British Columbia, Vancouver, British Columbia, Canada.,Department of Medical Genetics, The University of British Columbia, Vancouver, British Columbia, Canada
Jiahe Wang Centre for Molecular Medicine and Therapeutics, The University of British Columbia, Vancouver, British Columbia, Canada.,British Columbia Children's Hospital Research Institute, The University of British Columbia, Vancouver, British Columbia, Canada.,Department of Medical Genetics, The University of British Columbia, Vancouver, British Columbia, Canada
Stefan Taubert Centre for Molecular Medicine and Therapeutics, The University of British Columbia, Vancouver, British Columbia, Canada.,British Columbia Children's Hospital Research Institute, The University of British Columbia, Vancouver, British Columbia, Canada.,Department of Medical Genetics, The University of British Columbia, Vancouver, British Columbia, Canada.,Graduate Program for Cell and Developmental Biology, The University of British Columbia, Vancouver, British Columbia, Canada

Collapse

Angeloni M, Thievessen I, Engel FB, Magni P, Ferrazzi F. Functional genomics meta-analysis to identify gene set enrichment networks in cardiac hypertrophy. Biol Chem 2021;402:953-972. [PMID: 33951759 DOI: 10.1515/hsz-2020-0378] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2020] [Accepted: 04/19/2021] [Indexed: 12/28/2022]

Xie C, Jauhari S, Mora A. Popularity and performance of bioinformatics software: the case of gene set analysis. BMC Bioinformatics 2021;22:191. [PMID: 33858350 PMCID: PMC8050894 DOI: 10.1186/s12859-021-04124-5] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2020] [Accepted: 04/08/2021] [Indexed: 11/22/2022] Open

Katz S, Song J, Webb KP, Lounsbury NW, Bryant CE, Fraser IDC. SIGNAL: A web-based iterative analysis platform integrating pathway and network approaches optimizes hit selection from genome-scale assays. Cell Syst 2021;12:338-352.e5. [PMID: 33894945 DOI: 10.1016/j.cels.2021.03.001] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2020] [Revised: 11/25/2020] [Accepted: 03/03/2021] [Indexed: 01/13/2023]

Risso D, Pagnotta SM. Per-sample standardization and asymmetric winsorization lead to accurate clustering of RNA-seq expression profiles. Bioinformatics 2021;37:2356-2364. [PMID: 33560368 PMCID: PMC8388024 DOI: 10.1093/bioinformatics/btab091] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2020] [Revised: 01/27/2021] [Accepted: 02/05/2021] [Indexed: 11/13/2022] Open