Reference Citation Analysis

For: Jiang Z, Gentleman R. Extensions to gene set enrichment. Bioinformatics 2006;23:306-13. [PMID: 17127676 DOI: 10.1093/bioinformatics/btl599] [Citation(s) in RCA: 172] [Impact Index Per Article: 9.6] [Indexed: 11/13/2022]

Cited by Other Article(s)

Riquelme-Perez M, Perez-Sanz F, Deleuze JF, Escartin C, Bonnet E, Brohard S. DEVEA: an interactive shiny application for Differential Expression analysis, data Visualization and Enrichment Analysis of transcriptomics data. F1000Res 2023;11:711. [PMID: 36999088 PMCID: PMC10043628.2 DOI: 10.12688/f1000research.122949.2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Accepted: 02/21/2023] [Indexed: 03/29/2023]

Riquelme-Perez M, Perez-Sanz F, Deleuze JF, Escartin C, Bonnet E, Brohard S. DEVEA: an interactive shiny application for Differential Expression analysis, data Visualization and Enrichment Analysis of transcriptomics data. F1000Res 2022;11:711. [PMID: 36999088 PMCID: PMC10043628 DOI: 10.12688/f1000research.122949.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Accepted: 06/20/2022] [Indexed: 11/20/2022]

Park C, Kim B, Park T. DeepHisCoM: deep learning pathway analysis using hierarchical structural component models. Brief Bioinform 2022;23:6590446. [DOI: 10.1093/bib/bbac171] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/15/2021] [Revised: 04/04/2022] [Accepted: 04/18/2022] [Indexed: 11/13/2022]

NMR in Metabolomics: From Conventional Statistics to Machine Learning and Neural Network Approaches. Applied Sciences (Basel) 2022. [DOI: 10.3390/app12062824] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Indexed: 12/17/2022]

Nguyen H, Tran D, Galazka JM, Costes SV, Beheshti A, Petereit J, Draghici S, Nguyen T. CPA: a web-based platform for consensus pathway analysis and interactive visualization. Nucleic Acids Res 2021;49:W114-W124. [PMID: 34037798 PMCID: PMC8262702 DOI: 10.1093/nar/gkab421] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Received: 03/09/2021] [Revised: 04/16/2021] [Accepted: 05/05/2021] [Indexed: 01/06/2023]

Bi G, Bian Y, Liang J, Yin J, Li R, Zhao M, Huang Y, Lu T, Zhan C, Fan H, Wang Q. Pan-cancer characterization of metabolism-related biomarkers identifies potential therapeutic targets. J Transl Med 2021;19:219. [PMID: 34030708 PMCID: PMC8142489 DOI: 10.1186/s12967-021-02889-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 02/12/2021] [Accepted: 05/17/2021] [Indexed: 02/07/2023]

Maleki F, Ovens K, Hogan DJ, Kusalik AJ. Gene Set Analysis: Challenges, Opportunities, and Future Research. Front Genet 2020;11:654. [PMID: 32695141 PMCID: PMC7339292 DOI: 10.3389/fgene.2020.00654] [Citation(s) in RCA: 90] [Impact Index Per Article: 22.5] [Received: 02/01/2020] [Accepted: 05/29/2020] [Indexed: 12/14/2022]

Tripathi H, Mukhopadhyay S, Mohapatra SK. Sepsis-associated pathways segregate cancer groups. BMC Cancer 2020;20:309. [PMID: 32293345 PMCID: PMC7160985 DOI: 10.1186/s12885-020-06774-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Received: 06/20/2019] [Accepted: 03/23/2020] [Indexed: 12/27/2022]

Abstract

BACKGROUND

Sepsis and cancer are both leading causes of death, and the occurrence of either one increases the likelihood of the other. While cancer patients are susceptible to sepsis, survivors of sepsis are also susceptible to developing certain cancers. This mutual susceptibility suggests shared biology between the two disease categories. An earlier analysis had revealed a cancer-related pathway to be up-regulated in Septic Shock (SS), an advanced stage of sepsis. This motivated a more comprehensive comparison of the transcriptomes of SS and cancer.

METHODS

Gene Set Enrichment Analysis was performed to detect the pathways enriched in SS and cancer. Thereafter, hierarchical clustering was applied to identify relative segregation of 17 cancer types into two groups vis-a-vis SS. Biological significance of the selected pathways was explored by network analysis. Clinical significance of the pathways was tested by survival analysis. A robust classifier of cancer groups was developed based on machine learning.

RESULTS

A total of 66 pathways were observed to be enriched in both SS and cancer. However, clustering segregated cancer types into two categories based on the direction of transcriptomic change. In general, there was up-regulation in SS and one group of cancers (termed Sepsis-Like Cancer, or SLC), but not in other cancers (termed Cancer Alone, or CA). The SLC group mainly consisted of malignancies of the gastrointestinal tract (head and neck, oesophagus, stomach, liver and biliary system), which are often associated with infection. A machine learning classifier segregated the two cancer groups with high accuracy (> 98%). Additionally, pathway up-regulation was observed to be associated with survival in the SLC group of cancers.

CONCLUSION

A transcriptome-based systems biology approach segregates cancers into two groups (SLC and CA) based on similarity with SS. Host response to infection plays a key role in the pathogenesis of SS and SLC. However, we hypothesize that some component of the host response is protective in both SS and SLC.

Fifteen Years of Gene Set Analysis for High-Throughput Genomic Data: A Review of Statistical Approaches and Future Challenges. Entropy 2020;22:e22040427. [PMID: 33286201 PMCID: PMC7516904 DOI: 10.3390/e22040427] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Received: 02/24/2020] [Revised: 03/18/2020] [Accepted: 04/03/2020] [Indexed: 12/22/2022]

Nguyen TM, Shafi A, Nguyen T, Draghici S. Identifying significantly impacted pathways: a comprehensive review and assessment. Genome Biol 2019;20:203. [PMID: 31597578 PMCID: PMC6784345 DOI: 10.1186/s13059-019-1790-4] [Citation(s) in RCA: 90] [Impact Index Per Article: 18.0] [Received: 02/15/2019] [Accepted: 08/13/2019] [Indexed: 01/01/2023]

Understanding Statistical Hypothesis Testing: The Logic of Statistical Inference. Machine Learning and Knowledge Extraction 2019. [DOI: 10.3390/make1030054] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Indexed: 11/17/2022]

Powers RK, Goodspeed A, Pielke-Lombardo H, Tan AC, Costello JC. GSEA-InContext: identifying novel and common patterns in expression experiments. Bioinformatics 2019;34:i555-i564. [PMID: 29950010 PMCID: PMC6022535 DOI: 10.1093/bioinformatics/bty271] [Citation(s) in RCA: 134] [Impact Index Per Article: 26.8] [Indexed: 12/20/2022]

Li Y, Wu Y, Zhang X, Bai Y, Akthar LM, Lu X, Shi M, Zhao J, Jiang Q, Li Y. SCIA: A Novel Gene Set Analysis Applicable to Data With Different Characteristics. Front Genet 2019;10:598. [PMID: 31293623 PMCID: PMC6603225 DOI: 10.3389/fgene.2019.00598] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Received: 05/05/2019] [Accepted: 06/05/2019] [Indexed: 01/06/2023]

Alaimo S, Micale G, La Ferlita A, Ferro A, Pulvirenti A. Computational Methods to Investigate the Impact of miRNAs on Pathways. Methods Mol Biol 2019;1970:183-209. [PMID: 30963494 DOI: 10.1007/978-1-4939-9207-2_11] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Indexed: 01/28/2023]

Wang S, Yuan M. Combined Hypothesis Testing on Graphs With Applications to Gene Set Enrichment Analysis. J Am Stat Assoc 2018;114:1320-1338. [DOI: 10.1080/01621459.2018.1497501] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Indexed: 10/28/2022]

Jambusaria A, Klomp J, Hong Z, Rafii S, Dai Y, Malik AB, Rehman J. A computational approach to identify cellular heterogeneity and tissue-specific gene regulatory networks. BMC Bioinformatics 2018;19:217. [PMID: 29940845 PMCID: PMC6019795 DOI: 10.1186/s12859-018-2190-6] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Received: 10/13/2017] [Accepted: 05/04/2018] [Indexed: 01/26/2023]

Abstract

Background

The heterogeneity of cells across tissue types represents a major challenge for studying biological mechanisms as well as for therapeutic targeting of distinct tissues. Computational prediction of tissue-specific gene regulatory networks may provide important insights into the mechanisms underlying cellular heterogeneity in distinct organs and tissues.

Results

Using three pathway analysis techniques, namely gene set enrichment analysis (GSEA), parametric analysis of gene set enrichment (PGSEA), and our novel model (HeteroPath), which assesses heterogeneously upregulated and downregulated genes within the context of pathways, we generated distinct tissue-specific gene regulatory networks. We analyzed gene expression data derived from freshly isolated heart, brain, and lung endothelial cells and from populations of neurons in the hippocampus, cingulate cortex, and amygdala. In both datasets, we found that HeteroPath segregated the distinct cellular populations by identifying regulatory pathways that were not identified by GSEA or PGSEA. On simulated datasets, HeteroPath demonstrated robustness comparable to that of existing gene set enrichment methods. Furthermore, we generated tissue-specific gene regulatory networks involved in vascular heterogeneity and neuronal heterogeneity by performing motif enrichment of the heterogeneous genes identified by HeteroPath and linking the enriched motifs to regulatory transcription factors in the ENCODE database.

Conclusions

HeteroPath assesses contextual bidirectional gene expression within pathways and thus allows for transcriptomic assessment of cellular heterogeneity. Unraveling tissue-specific heterogeneity of gene expression can lead to a better understanding of the molecular underpinnings of tissue-specific phenotypes.

Electronic supplementary material

The online version of this article (10.1186/s12859-018-2190-6) contains supplementary material, which is available to authorized users.

Zhang Y, Topham DJ, Thakar J, Qiu X. FUNNEL-GSEA: FUNctioNal ELastic-net regression in time-course gene set enrichment analysis. Bioinformatics 2018;33:1944-1952. [PMID: 28334094 PMCID: PMC5939227 DOI: 10.1093/bioinformatics/btx104] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Received: 09/26/2016] [Accepted: 02/17/2017] [Indexed: 01/26/2023]

POST: A framework for set-based association analysis in high-dimensional data. Methods 2018;145:76-81. [PMID: 29777750 DOI: 10.1016/j.ymeth.2018.05.011] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Received: 01/09/2018] [Revised: 05/11/2018] [Accepted: 05/13/2018] [Indexed: 01/08/2023]

Abstract

Evaluating the differential expression of a set of genes belonging to a common biological process or ontology has proven to be a very useful tool for biological discovery. However, existing gene-set association methods are limited to applications that evaluate differential expression across k⩾2 treatment groups or biological categories. This limitation precludes researchers from most effectively evaluating the association with other phenotypes that may be more clinically meaningful, such as quantitative variables or censored survival time variables. Projection onto the Orthogonal Space Testing (POST) is proposed as a general procedure that can robustly evaluate the association of a gene-set with several different types of phenotypic data (categorical, ordinal, continuous, or censored). For each gene-set, POST transforms the gene profiles into a set of eigenvectors and then uses statistical modeling to compute a set of z-statistics that measure the association of each eigenvector with the phenotype. The overall gene-set statistic is the sum of squared z-statistics weighted by the corresponding eigenvalues. Finally, bootstrapping is used to compute a p-value. POST may evaluate associations with or without adjustment for covariates. In simulation studies, it is shown that the performance of POST in evaluating the association with a categorical phenotype is similar to or exceeds that of existing methods. In evaluating the association of 875 biological processes with the time to relapse of pediatric acute myeloid leukemia, POST identified the well-known oncogenic WNT signaling pathway as its top hit. These results indicate that POST can be a very useful tool for evaluating the association of a gene-set with a variety of different phenotypes. We have developed an R package named POST which is freely available in Bioconductor.
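
The procedure described in this abstract (project onto eigenvectors, compute a z-statistic per eigenvector, sum the squared z-statistics weighted by eigenvalue) can be illustrated with a minimal Python sketch. This is not the POST package's implementation. It assumes a continuous phenotype, uses a Fisher-transformed correlation as a stand-in for the "statistical modeling" step, and swaps in a permutation null for the bootstrap named in the abstract, purely for brevity. The function names `post_statistic` and `post_pvalue` are hypothetical.

```python
import numpy as np


def post_statistic(X, y):
    """Eigenvalue-weighted sum of squared z-statistics for one gene set.

    X: (samples x genes) expression matrix, at least 2 genes and more than 3 samples.
    y: continuous phenotype, one value per sample (an assumption of this sketch).
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    n = X.shape[0]
    Xc = X - X.mean(axis=0)
    yc = y - y.mean()

    # Eigen-decomposition of the gene-gene covariance matrix.
    eigvals, eigvecs = np.linalg.eigh(np.cov(Xc, rowvar=False))
    keep = eigvals > 1e-10 * eigvals.max()  # drop numerically null directions
    eigvals, eigvecs = eigvals[keep], eigvecs[:, keep]

    # Project the gene profiles onto each retained eigenvector.
    scores = Xc @ eigvecs

    # z-statistic per eigenvector: Fisher-transformed Pearson correlation
    # with the phenotype (a simple stand-in for a fitted model).
    r = scores.T @ yc / (np.linalg.norm(scores, axis=0) * np.linalg.norm(yc))
    r = np.clip(r, -0.999999, 0.999999)
    z = np.arctanh(r) * np.sqrt(n - 3)

    return float(np.sum(eigvals * z ** 2))


def post_pvalue(X, y, n_perm=500, seed=0):
    """Permutation p-value (swapped in for the bootstrap used by POST)."""
    rng = np.random.default_rng(seed)
    y = np.asarray(y, dtype=float)
    observed = post_statistic(X, y)
    null = np.array([post_statistic(X, rng.permutation(y)) for _ in range(n_perm)])
    p = (1 + np.sum(null >= observed)) / (n_perm + 1)
    return observed, float(p)


if __name__ == "__main__":
    rng = np.random.default_rng(1)
    X = rng.normal(size=(40, 10))                         # simulated gene set
    y = X[:, :3].sum(axis=1) + 0.1 * rng.normal(size=40)  # phenotype tied to 3 genes
    print(post_pvalue(X, y, n_perm=200))
```

Weighting by eigenvalue means directions that carry more of the gene set's variance contribute more to the overall statistic. Covariate adjustment and censored or categorical phenotypes, which the abstract says POST supports, are outside the scope of this sketch.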

Alaimo S, Giugno R, Acunzo M, Veneziano D, Ferro A, Pulvirenti A. Post-transcriptional knowledge in pathway analysis increases the accuracy of phenotypes classification. Oncotarget 2018;7:54572-54582. [PMID: 27275538 PMCID: PMC5342365 DOI: 10.18632/oncotarget.9788] [Citation(s) in RCA: 26] [Impact Index Per Article: 4.3] [Received: 10/21/2015] [Accepted: 05/11/2016] [Indexed: 01/27/2023]

Statistical Approach for Gene Set Analysis with Trait Specific Quantitative Trait Loci. Sci Rep 2018;8:2391. [PMID: 29402907 PMCID: PMC5799309 DOI: 10.1038/s41598-018-19736-w] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Received: 05/03/2017] [Accepted: 12/06/2017] [Indexed: 11/20/2022]

Wei W, Sun Z, da Silveira WA, Yu Z, Lawson A, Hardiman G, Kelemen LE, Chung D. Semi-supervised identification of cancer subgroups using survival outcomes and overlapping grouping information. Stat Methods Med Res 2018;28:2137-2149. [PMID: 29336210 DOI: 10.1177/0962280217752980] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Indexed: 01/01/2023]

Brightbill HD, Suto E, Blaquiere N, Ramamoorthi N, Sujatha-Bhaskar S, Gogol EB, Castanedo GM, Jackson BT, Kwon YC, Haller S, Lesch J, Bents K, Everett C, Kohli PB, Linge S, Christian L, Barrett K, Jaochico A, Berezhkovskiy LM, Fan PW, Modrusan Z, Veliz K, Townsend MJ, DeVoss J, Johnson AR, Godemann R, Lee WP, Austin CD, McKenzie BS, Hackney JA, Crawford JJ, Staben ST, Alaoui Ismaili MH, Wu LC, Ghilardi N. NF-κB inducing kinase is a therapeutic target for systemic lupus erythematosus. Nat Commun 2018;9:179. [PMID: 29330524 PMCID: PMC5766581 DOI: 10.1038/s41467-017-02672-0] [Citation(s) in RCA: 84] [Impact Index Per Article: 14.0] [Received: 09/11/2017] [Accepted: 12/18/2017] [Indexed: 02/06/2023]

Affiliation(s)

Hans D Brightbill Department of Immunology Discovery, Genentech, 1 DNA Way, South San Francisco, CA-94080, USA
Eric Suto Department of Translational Immunology, Genentech, 1 DNA Way, South San Francisco, CA-94080, USA
Nicole Blaquiere Department of Discovery Chemistry, Genentech, 1 DNA Way, South San Francisco, CA-94080, USA
Nandhini Ramamoorthi Department of Biomarker Discovery, Genentech, 1 DNA Way, South San Francisco, CA-94080, USA
Swathi Sujatha-Bhaskar Department of Immunology Discovery, Genentech, 1 DNA Way, South San Francisco, CA-94080, USA
Emily B Gogol Department of Immunology Discovery, Genentech, 1 DNA Way, South San Francisco, CA-94080, USA
Georgette M Castanedo Department of Discovery Chemistry, Genentech, 1 DNA Way, South San Francisco, CA-94080, USA
Benjamin T Jackson Department of Immunology Discovery, Genentech, 1 DNA Way, South San Francisco, CA-94080, USA
Youngsu C Kwon Department of Translational Immunology, Genentech, 1 DNA Way, South San Francisco, CA-94080, USA
Susan Haller Department of Pathology, Genentech, 1 DNA Way, South San Francisco, CA-94080, USA
Justin Lesch Department of Translational Immunology, Genentech, 1 DNA Way, South San Francisco, CA-94080, USA
Karin Bents Evotec, Inc., Essener Bogen 7, Hamburg, 22419, Germany
Christine Everett Department of Biochemical and Cellular Pharmacology, Genentech, 1 DNA Way, South San Francisco, CA-94080, USA
Pawan Bir Kohli Department of Biochemical and Cellular Pharmacology, Genentech, 1 DNA Way, South San Francisco, CA-94080, USA
Sandra Linge Evotec, Inc., Essener Bogen 7, Hamburg, 22419, Germany
Laura Christian Department of Immunology Discovery, Genentech, 1 DNA Way, South San Francisco, CA-94080, USA
Kathy Barrett Department of Biochemical and Cellular Pharmacology, Genentech, 1 DNA Way, South San Francisco, CA-94080, USA
Allan Jaochico Department of Drug Metabolism and Pharmacokinetics, Genentech, 1 DNA Way, South San Francisco, CA-94080, USA
Leonid M Berezhkovskiy Department of Drug Metabolism and Pharmacokinetics, Genentech, 1 DNA Way, South San Francisco, CA-94080, USA
Peter W Fan Department of Drug Metabolism and Pharmacokinetics, Genentech, 1 DNA Way, South San Francisco, CA-94080, USA
Zora Modrusan Department of Molecular Biology, Genentech, 1 DNA Way, South San Francisco, CA-94080, USA
Kelli Veliz Department of Laboratory Animal Resources, Genentech, 1 DNA Way, South San Francisco, CA-94080, USA
Michael J Townsend Department of Biomarker Discovery, Genentech, 1 DNA Way, South San Francisco, CA-94080, USA
Jason DeVoss Department of Translational Immunology, Genentech, 1 DNA Way, South San Francisco, CA-94080, USA
Adam R Johnson Department of Biochemical and Cellular Pharmacology, Genentech, 1 DNA Way, South San Francisco, CA-94080, USA
Robert Godemann Evotec, Inc., Essener Bogen 7, Hamburg, 22419, Germany
Wyne P Lee Department of Translational Immunology, Genentech, 1 DNA Way, South San Francisco, CA-94080, USA
Cary D Austin Department of Pathology, Genentech, 1 DNA Way, South San Francisco, CA-94080, USA
Brent S McKenzie Department of Translational Immunology, Genentech, 1 DNA Way, South San Francisco, CA-94080, USA
Jason A Hackney Department of Bioinformatics and Computational Biology, Genentech, 1 DNA Way, South San Francisco, CA-94080, USA
James J Crawford Department of Discovery Chemistry, Genentech, 1 DNA Way, South San Francisco, CA-94080, USA
Steven T Staben Department of Discovery Chemistry, Genentech, 1 DNA Way, South San Francisco, CA-94080, USA
Moulay H Alaoui Ismaili Department of Biochemical and Cellular Pharmacology, Genentech, 1 DNA Way, South San Francisco, CA-94080, USA
Lawren C Wu Department of Immunology Discovery, Genentech, 1 DNA Way, South San Francisco, CA-94080, USA
Nico Ghilardi Department of Immunology Discovery, Genentech, 1 DNA Way, South San Francisco, CA-94080, USA.

Deconvolution of Transcriptional Networks in Post-Traumatic Stress Disorder Uncovers Master Regulators Driving Innate Immune System Function. Sci Rep 2017;7:14486. [PMID: 29101382 PMCID: PMC5670244 DOI: 10.1038/s41598-017-15221-y] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Received: 07/13/2017] [Accepted: 10/23/2017] [Indexed: 01/05/2023]

Lavallée-Adam M, Cloutier P, Coulombe B, Blanchette M. Functional 5' UTR motif discovery with LESMoN: Local Enrichment of Sequence Motifs in biological Networks. Nucleic Acids Res 2017;45:10415-10427. [PMID: 28977652 PMCID: PMC5737372 DOI: 10.1093/nar/gkx751] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Received: 04/22/2016] [Accepted: 08/17/2017] [Indexed: 01/09/2023]

A knowledge-based T2-statistic to perform pathway analysis for quantitative proteomic data. PLoS Comput Biol 2017. [PMID: 28622336 PMCID: PMC5493430 DOI: 10.1371/journal.pcbi.1005601] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Indexed: 01/17/2023]

Abstract

Approaches to identify significant pathways from high-throughput quantitative data have been developed in recent years. Still, the analysis of proteomic data remains difficult because of limited sample size. This limitation also leads to the common practice of using a competitive null, which fundamentally treats genes or proteins as independent units. The independence assumption ignores the associations among biomolecules with similar functions or cellular localization, as well as the interactions among them manifested as changes in expression ratios. Consequently, these methods often underestimate the associations among biomolecules and cause false positives in practice. Some studies incorporate the sample covariance matrix into the calculation to address this issue. However, the sample covariance may not be a precise estimate if the sample size is very limited, which is usually the case for data produced by mass spectrometry. In this study, we introduce a multivariate test under a self-contained null to perform pathway analysis for quantitative proteomic data. The covariance matrix used in the test statistic is constructed from the confidence scores retrieved from the STRING database or the HitPredict database. We also design an integrating procedure to retain pathways of sufficient evidence as a pathway group. The performance of the proposed T²-statistic is demonstrated using five published experimental datasets: the T-cell activation, the cAMP/PKA signaling, the myoblast differentiation, and the effect of dasatinib on the BCR-ABL pathway are proteomic datasets produced by mass spectrometry; and the protective effect of myocilin via the MAPK signaling pathway is a gene expression dataset of limited sample size. Compared with other popular statistics, the proposed T²-statistic yields more accurate descriptions in agreement with the discussion of the original publication. We implemented the T²-statistic in an R package, T2GA, which is available at https://github.com/roqe/T2GA.

Pathway analysis is a common approach to quickly assess the pathways being regulated in an experiment. There are numerous statistics for pathway analysis; most of them assume that the genes or proteins are independent of each other for statistical ease. This assumption, however, is unrealistic for real biological systems and may cause false positives in practice. A standard way to address this issue is to measure the associations among genes or proteins. Unfortunately, estimating these associations requires a sufficient sample size, which is usually not available for proteomic data produced by mass spectrometry. In this study, we propose a T²-statistic, which estimates the associations among gene products, to perform pathway analysis for quantitative proteomic data. Instead of calculating the associations directly from data, we use the confidence scores retrieved from protein-protein interaction databases. We also design an integrating procedure to retain pathways of sufficient evidence as a regulated pathway group. We compare the proposed T²-statistic to other popular statistics using five published experimental datasets, and the T²-statistic yields more accurate descriptions in agreement with the discussion of the original papers.
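
To make the idea of a knowledge-based covariance concrete, here is a minimal Python sketch of a Hotelling-style quadratic form, T² = xᵀ Σ⁻¹ x, in which the off-diagonal entries of Σ come from interaction confidence scores and not from the sample. The abstract does not give the exact construction used in T2GA, so everything here is an assumption made for illustration: the function name `t2_statistic`, the use of raw confidence scores in [0, 1] as correlations, the unit diagonal, and the standardized input vector.

```python
import numpy as np


def t2_statistic(x, conf):
    """Quadratic-form statistic x^T Sigma^{-1} x for one pathway.

    x:    standardized expression changes, one per protein (hypothetical input).
    conf: symmetric matrix of pairwise interaction confidence scores in [0, 1],
          used directly as off-diagonal correlations (an assumption of this sketch).
    """
    x = np.asarray(x, dtype=float)
    sigma = np.array(conf, dtype=float)  # copy so the caller's matrix is untouched
    np.fill_diagonal(sigma, 1.0)         # unit variance for every protein

    # A matrix assembled from database scores need not be positive definite.
    if np.linalg.eigvalsh(sigma).min() <= 0:
        raise ValueError("confidence-based covariance is not positive definite")

    return float(x @ np.linalg.solve(sigma, x))


if __name__ == "__main__":
    changes = [2.0, 2.0]
    independent = [[0.0, 0.0], [0.0, 0.0]]
    interacting = [[0.0, 0.8], [0.8, 0.0]]
    print(t2_statistic(changes, independent))  # treats the proteins as unrelated
    print(t2_statistic(changes, interacting))  # discounts concordant, linked changes
```

In this toy case, two proteins that change in the same direction contribute less evidence when they are known to interact than when they are treated as independent, which is the kind of correction for false positives the abstract describes.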

Alhamdoosh M, Ng M, Wilson NJ, Sheridan JM, Huynh H, Wilson MJ, Ritchie ME. Combining multiple tools outperforms individual methods in gene set enrichment analyses. Bioinformatics 2017;33:414-424. [PMID: 27694195 PMCID: PMC5408797 DOI: 10.1093/bioinformatics/btw623] [Citation(s) in RCA: 88] [Impact Index Per Article: 12.6] [Received: 03/10/2016] [Accepted: 09/23/2016] [Indexed: 12/22/2022]

Hayashi N, Iwamoto T, Qi Y, Niikura N, Santarpia L, Yamauchi H, Nakamura S, Hortobagyi GN, Pusztai L, Symmans WF, Ueno NT. Bone metastasis-related signaling pathways in breast cancers stratified by estrogen receptor status. J Cancer 2017;8:1045-1052. [PMID: 28529618 PMCID: PMC5436258 DOI: 10.7150/jca.13690] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Indexed: 01/21/2023]

Abstract

Background: Breast cancer bone metastasis (BCBM)-specific genes have been reported without considering biological differences based on estrogen receptor (ER) status. The aims of this study were to identify BCBM-specific genes using our patient dataset and validate previously reported BCBM-specific genes, and to determine whether ER-status-related biological differences matter in identification of BCBM-specific genes.

Methods: We used Affymetrix GeneChips to analyze 365 primary human epidermal growth factor receptor 2 (HER2)-negative invasive breast cancer specimens. Genes that were differentially expressed between patients who developed bone metastasis and those who developed non-bone metastasis were identified using a Cox proportional hazards model, and differential expression of gene sets was assessed using gene set analysis. We performed gene set analysis to determine whether the biological functions associated with bone metastasis differed by ER status, using 2,246 functionally annotated gene sets assembled from the Gene Ontology database.

Results: Among 16,712 probe sets, 592 were overexpressed in the bone metastasis cohort compared to the non-bone-metastasis cohort (false discovery rate ≤ 0.05). However, no BCBM-specific genes met our significance tests when the cancers were stratified by ER status. In ER-positive and ER-negative breast cancers, 151 and 125 gene sets, respectively, were overexpressed for BCBM, and the majority of BCBM-related pathways were different. Of the significant gene sets, only 13 overlapped between the ER-positive and ER-negative cohorts.

Conclusion: ER-positive and ER-negative breast cancers have different biological pathways in BCBM development. We have yet to explore BCBM-related biomarkers and targets considering the biological features associated with BCBM depending on the ER status.

Katewa A, Wang Y, Hackney JA, Huang T, Suto E, Ramamoorthi N, Austin CD, Bremer M, Chen JZ, Crawford JJ, Currie KS, Blomgren P, DeVoss J, DiPaolo JA, Hau J, Johnson A, Lesch J, DeForge LE, Lin Z, Liimatta M, Lubach JW, McVay S, Modrusan Z, Nguyen A, Poon C, Wang J, Liu L, Lee WP, Wong H, Young WB, Townsend MJ, Reif K. Btk-specific inhibition blocks pathogenic plasma cell signatures and myeloid cell-associated damage in IFNα-driven lupus nephritis. JCI Insight 2017;2:e90111. [PMID: 28405610 DOI: 10.1172/jci.insight.90111] [Citation(s) in RCA: 47] [Impact Index Per Article: 6.7] [Indexed: 01/17/2023]

Simillion C, Liechti R, Lischer HEL, Ioannidis V, Bruggmann R. Avoiding the pitfalls of gene set enrichment analysis with SetRank. BMC Bioinformatics 2017;18:151. [PMID: 28259142 PMCID: PMC5336655 DOI: 10.1186/s12859-017-1571-6] [Citation(s) in RCA: 68] [Impact Index Per Article: 9.7] [Received: 09/08/2016] [Accepted: 02/24/2017] [Indexed: 02/06/2023]

Ren X, Hu Q, Liu S, Wang J, Miecznikowski JC. Gene set analysis controlling for length bias in RNA-seq experiments. BioData Min 2017;10:5. [PMID: 28184252 PMCID: PMC5294840 DOI: 10.1186/s13040-017-0125-9] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Received: 03/23/2016] [Accepted: 01/11/2017] [Indexed: 01/29/2023]

PerSubs: A Graph-Based Algorithm for the Identification of Perturbed Subpathways Caused by Complex Diseases. Advances in Experimental Medicine and Biology 2017;988:215-224. [DOI: 10.1007/978-3-319-56246-9_17] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Indexed: 01/11/2023]

Haddick PCG, Larson JL, Rathore N, Bhangale TR, Phung QT, Srinivasan K, Hansen DV, Lill JR, Pericak-Vance MA, Haines J, Farrer LA, Kauwe JS, Schellenberg GD, Cruchaga C, Goate AM, Behrens TW, Watts RJ, Graham RR, Kaminker JS, van der Brug M. A Common Variant of IL-6R is Associated with Elevated IL-6 Pathway Activity in Alzheimer's Disease Brains. J Alzheimers Dis 2017;56:1037-1054. [PMID: 28106546 PMCID: PMC5667357 DOI: 10.3233/jad-160524] [Citation(s) in RCA: 30] [Impact Index Per Article: 4.3] [Indexed: 02/07/2023]

Affiliation(s)

Patrick C G Haddick: Department of Diagnostic Discovery, Genentech Inc., South San Francisco, CA, USA
Jessica L Larson: Department of Bioinformatics and Computational Biology, Genentech Inc., South San Francisco, CA, USA
Nisha Rathore: Department of Human Genetics, Genentech Inc., South San Francisco, CA, USA
Tushar R Bhangale: Department of Human Genetics, Genentech Inc., South San Francisco, CA, USA
Qui T Phung: Department of Protein Chemistry, Genentech Inc., South San Francisco, CA, USA
Karpagam Srinivasan: Department of Neuroscience, Genentech Inc., South San Francisco, CA, USA
David V Hansen: Department of Neuroscience, Genentech Inc., South San Francisco, CA, USA
Jennie R Lill: Department of Protein Chemistry, Genentech Inc., South San Francisco, CA, USA
Margaret A Pericak-Vance: The John P. Hussman Institute for Human Genomics, University of Miami, Miami, FL, USA; Dr. John T. Macdonald Foundation Department of Human Genetics, University of Miami, Miami, FL, USA
Jonathan Haines: Department of Epidemiology and Biostatistics, Case Western Reserve University, Cleveland, OH, USA
Lindsay A Farrer: Department of Medicine (Biomedical Genetics), Boston University Schools of Medicine and Public Health, Boston, MA, USA; Department of Neurology, Boston University Schools of Medicine and Public Health, Boston, MA, USA; Department of Ophthalmology, Boston University Schools of Medicine and Public Health, Boston, MA, USA; Department of Epidemiology, Boston University Schools of Medicine and Public Health, Boston, MA, USA; Department of Biostatistics, Boston University Schools of Medicine and Public Health, Boston, MA, USA
John S Kauwe: Department of Biology, Brigham Young University, Provo, UT, USA
Gerard D Schellenberg: Department of Pathology and Laboratory Medicine, University of Pennsylvania School of Medicine, Philadelphia, PA, USA
Carlos Cruchaga: Department of Psychiatry, Washington University School of Medicine, St. Louis, MO, USA; Hope Center for Neurological Disorders, Washington University School of Medicine, St. Louis, MO, USA
Alison M Goate: Department of Neuroscience, Icahn School of Medicine at Mount Sinai, New York City, NY, USA; Ronald M. Loeb Center for Alzheimer's Disease, Icahn School of Medicine at Mount Sinai, New York City, NY, USA
Timothy W Behrens: Department of Human Genetics, Genentech Inc., South San Francisco, CA, USA
Ryan J Watts: Department of Neuroscience, Genentech Inc., South San Francisco, CA, USA
Robert R Graham: Department of Human Genetics, Genentech Inc., South San Francisco, CA, USA
Joshua S Kaminker: Department of Bioinformatics and Computational Biology, Genentech Inc., South San Francisco, CA, USA
Marcel van der Brug: Department of Diagnostic Discovery, Genentech Inc., South San Francisco, CA, USA

Lee J, Jo K, Lee S, Kang J, Kim S. Prioritizing biological pathways by recognizing context in time-series gene expression data. BMC Bioinformatics 2016;17:477. [PMID: 28155707 PMCID: PMC5259824 DOI: 10.1186/s12859-016-1335-8] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Indexed: 11/29/2022] Open

Abstract

Background

The primary goal of pathway analysis using transcriptome data is to find significantly perturbed pathways. However, pathway analysis is not always successful in identifying pathways that are truly relevant to the context under study. A major reason for this difficulty is that a single gene can be involved in multiple pathways. In the KEGG pathway database, there are 146 genes, each of which is involved in more than 20 pathways. Thus, activation of even a single gene can result in activation of many pathways. This complex relationship often makes pathway analysis very difficult. While much more powerful pathway analysis methods are needed, a readily available alternative is to incorporate literature information.
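The multi-membership problem described above can be made concrete with a small sketch. It counts, for each gene, how many pathways list it, and reports genes above a threshold. The pathway names, gene identifiers, and the function name `genes_in_many_pathways` are hypothetical and chosen only for illustration; none of the data comes from KEGG or from the article.

```python
from collections import Counter


def genes_in_many_pathways(pathways, min_pathways):
    """Return {gene: count} for genes listed in more than `min_pathways` pathways.

    `pathways` maps a pathway name to an iterable of gene identifiers.
    Duplicate genes within one pathway are counted once.
    """
    counts = Counter(
        gene for genes in pathways.values() for gene in set(genes)
    )
    return {gene: n for gene, n in counts.items() if n > min_pathways}


# Toy data (hypothetical, not taken from KEGG).
toy_pathways = {
    "pathway_A": ["g1", "g2", "g3"],
    "pathway_B": ["g1", "g3"],
    "pathway_C": ["g1", "g4"],
}

shared = genes_in_many_pathways(toy_pathways, min_pathways=1)
print(shared)
```

With a real pathway collection, the same count with `min_pathways=20` would correspond to the kind of statistic quoted in the abstract, assuming the collection is loaded into the same name-to-genes mapping.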

Results

In this study, we propose a novel approach for prioritizing pathways by combining results from both pathway analysis tools and literature information. The basic idea is as follows. Whenever there are enough articles that provide evidence on which pathways are relevant to the context, we can be assured that the pathways are indeed related to the context; this is termed relevance in this paper. However, if few or no articles are reported, then we should rely on the results from the pathway analysis tools; this is termed significance in this paper. We realized this concept as an algorithm by introducing a Context Score and an Impact Score and then combining the two into a single score. Our method ranked truly relevant pathways significantly higher than existing pathway analysis tools in experiments with two data sets.

Conclusions

Our novel framework was implemented as ContextTRAP by utilizing two existing tools, TRAP and BEST. ContextTRAP will be a useful tool for the pathway based analysis of gene expression data since the user can specify the context of the biological experiment in a set of keywords. The web version of ContextTRAP is available at http://biohealth.snu.ac.kr/software/contextTRAP.

Electronic supplementary material

The online version of this article (doi:10.1186/s12859-016-1335-8) contains supplementary material, which is available to authorized users.

Du J, Li M, Yuan Z, Guo M, Song J, Xie X, Chen Y. A decision analysis model for KEGG pathway analysis. BMC Bioinformatics 2016;17:407. [PMID: 27716040 PMCID: PMC5053338 DOI: 10.1186/s12859-016-1285-1] [Citation(s) in RCA: 38] [Impact Index Per Article: 4.8] [Received: 12/24/2015] [Accepted: 09/28/2016] [Indexed: 11/18/2022] Open

Abstract

Background

The knowledge base-driven pathway analysis is becoming the first choice for many investigators, in that it not only can reduce the complexity of functional analysis by grouping thousands of genes into just several hundred pathways, but also can increase the explanatory power for the experiment by identifying active pathways in different conditions. However, current approaches are designed to analyze a biological system assuming that each pathway is independent of the other pathways.

Results

A decision analysis model is developed in this article that accounts for dependence among pathways in time-course experiments and multiple-treatment experiments. This model introduces a decision coefficient, a designed index, to identify the most relevant pathways in a given experiment by taking into account not only the direct determination factor of each Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway itself, but also the indirect determination factors from its related pathways. Meanwhile, the direct and indirect determination factors of each pathway are employed to demonstrate the regulation mechanisms among KEGG pathways, and the sign of the decision coefficient can be used to preliminarily estimate the impact direction of each KEGG pathway. A simulation study demonstrated the application of the decision analysis model to KEGG pathway analysis.

Conclusions

A microarray dataset from bovine mammary tissue over the entire lactation cycle was used to further illustrate our strategy. The results showed that the decision analysis model can provide promising and more biologically meaningful results. Therefore, the decision analysis model is an initial attempt at optimizing pathway analysis methodology.

Electronic supplementary material

The online version of this article (doi:10.1186/s12859-016-1285-1) contains supplementary material, which is available to authorized users.

Sugimoto M. Metabolomic pathway visualization tool outsourcing editing function. Annu Int Conf IEEE Eng Med Biol Soc 2016;2015:7659-62. [PMID: 26738066 DOI: 10.1109/embc.2015.7320166] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Indexed: 11/10/2022]

Lee S, Choi S, Kim YJ, Kim BJ, Hwang H, Park T. Pathway-based approach using hierarchical components of collapsed rare variants. Bioinformatics 2016;32:i586-i594. [PMID: 27587678 PMCID: PMC5013912 DOI: 10.1093/bioinformatics/btw425] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.8] [Indexed: 02/06/2023] Open

Alvarez MJ, Shen Y, Giorgi FM, Lachmann A, Ding BB, Ye BH, Califano A. Functional characterization of somatic mutations in cancer using network-based inference of protein activity. Nat Genet 2016;48:838-47. [PMID: 27322546 PMCID: PMC5040167 DOI: 10.1038/ng.3593] [Citation(s) in RCA: 493] [Impact Index Per Article: 61.6] [Received: 02/21/2016] [Accepted: 05/23/2016] [Indexed: 01/05/2023]

Pham LM, Carvalho L, Schaus S, Kolaczyk ED. Perturbation Detection Through Modeling of Gene Expression on a Latent Biological Pathway Network: A Bayesian hierarchical approach. J Am Stat Assoc 2016;111:73-92. [PMID: 27647944 DOI: 10.1080/01621459.2015.1110523] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Indexed: 10/22/2022]

García-Marqués F, Trevisan-Herraz M, Martínez-Martínez S, Camafeita E, Jorge I, Lopez JA, Méndez-Barbero N, Méndez-Ferrer S, Del Pozo MA, Ibáñez B, Andrés V, Sánchez-Madrid F, Redondo JM, Bonzon-Kulichenko E, Vázquez J. A Novel Systems-Biology Algorithm for the Analysis of Coordinated Protein Responses Using Quantitative Proteomics. Mol Cell Proteomics 2016;15:1740-60. [PMID: 26893027 DOI: 10.1074/mcp.m115.055905] [Citation(s) in RCA: 64] [Impact Index Per Article: 8.0] [Received: 09/30/2015] [Indexed: 11/06/2022] Open

Hsueh HM, Tsai CA. Gene set analysis using sufficient dimension reduction. BMC Bioinformatics 2016;17:74. [PMID: 26852017 PMCID: PMC4744442 DOI: 10.1186/s12859-016-0928-6] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Received: 09/29/2015] [Accepted: 02/01/2016] [Indexed: 01/31/2023] Open

Liu Z, Roy NC, Guo Y, Jia H, Ryan L, Samuelsson L, Thomas A, Plowman J, Clerens S, Day L, Young W. Human Breast Milk and Infant Formulas Differentially Modify the Intestinal Microbiota in Human Infants and Host Physiology in Rats. J Nutr 2016;146:191-9. [PMID: 26674765 DOI: 10.3945/jn.115.223552] [Citation(s) in RCA: 38] [Impact Index Per Article: 4.8] [Received: 09/08/2015] [Accepted: 11/11/2015] [Indexed: 12/30/2022] Open

Abstract

BACKGROUND

In the absence of human breast milk, infant and follow-on formulas can still promote efficient growth and development. However, infant formulas can differ in their nutritional value.

OBJECTIVE

The objective of this study was to compare the effects of human milk (HM) and infant formulas in human infants and a weanling rat model.

METHODS

In a 3 wk clinical randomized controlled trial, babies (7- to 90-d-old, male-to-female ratio 1:1) were exclusively breastfed (BF), exclusively fed Synlait Pure Canterbury Stage 1 infant formula (SPCF), or fed assorted standard formulas (SFs) purchased by their parents. We also compared feeding HM or SPCF in weanling male Sprague-Dawley rats for 28 d. We examined the effects of HM and infant formulas on fecal short chain fatty acids (SCFAs) and bacterial composition in human infants, and intestinal SCFAs, the microbiota, and host physiology in weanling rats.

RESULTS

Fecal Bifidobacterium concentrations (mean log copy number ± SEM) were higher (P = 0.003) in BF (8.17 ± 0.3) and SPCF-fed infants (8.29 ± 0.3) compared with those fed the SFs (6.94 ± 0.3). Fecal acetic acid (mean ± SEM) was also higher (P = 0.007) in the BF (5.5 ± 0.2 mg/g) and SPCF (5.3 ± 2.4 mg/g) groups compared with SF-fed babies (4.3 ± 0.2 mg/g). Colonic SCFAs did not differ between HM- and SPCF-fed rats. However, cecal acetic acid concentrations were higher (P = 0.001) in rats fed HM (42.6 ± 2.6 mg/g) than in those fed SPCF (30.6 ± 0.8 mg/g). Cecal transcriptome, proteome, and plasma metabolite analyses indicated that the growth and maturation of intestinal tissue was more highly promoted by HM than SPCF.

CONCLUSIONS

Fecal bacterial composition and SCFA concentrations were similar in babies fed SPCF or HM. However, results from the rat study showed substantial differences in host physiology between rats fed HM and SPCF. This trial was registered at Shanghai Jiao Tong University School of Medicine as XHEC-C-2012-024.

Tew GW, Hackney JA, Gibbons D, Lamb CA, Luca D, Egen JG, Diehl L, Eastham Anderson J, Vermeire S, Mansfield JC, Feagan BG, Panes J, Baumgart DC, Schreiber S, Dotan I, Sandborn WJ, Kirby JA, Irving PM, De Hertogh G, Van Assche GA, Rutgeerts P, O'Byrne S, Hayday A, Keir ME. Association Between Response to Etrolizumab and Expression of Integrin αE and Granzyme A in Colon Biopsies of Patients With Ulcerative Colitis. Gastroenterology 2016;150:477-87.e9. [PMID: 26522261 DOI: 10.1053/j.gastro.2015.10.041] [Citation(s) in RCA: 107] [Impact Index Per Article: 13.4] [Received: 06/22/2015] [Revised: 10/05/2015] [Accepted: 10/22/2015] [Indexed: 12/13/2022]

Abstract

BACKGROUND & AIMS

Etrolizumab is a humanized monoclonal antibody against the β7 integrin subunit that has shown efficacy vs placebo in patients with moderate to severely active ulcerative colitis (UC). Patients with colon tissues that expressed high levels of the integrin αE gene (ITGAE) appeared to have the best response. We compared differences in colonic expression of ITGAE and other genes between patients who achieved clinical remission with etrolizumab vs those who did not.

METHODS

We performed a retrospective analysis of data collected from 110 patients with UC who participated in a phase 2 placebo-controlled trial of etrolizumab, as well as from 21 patients with UC or without inflammatory bowel disease (controls) enrolled in an observational study at a separate site. Colon biopsies were collected from patients in both studies and analyzed by immunohistochemistry and gene expression profiling. Mononuclear cells were isolated and analyzed by flow cytometry. We identified biomarkers associated with response to etrolizumab. In the placebo-controlled trial, clinical remission was defined as total Mayo Clinic Score ≤2, with no individual subscore >1, and mucosal healing was defined as endoscopic score ≤1.

RESULTS

Colon tissues collected at baseline from patients who had a clinical response to etrolizumab expressed higher levels of T-cell-associated genes than patients who did not respond (P < .05). Colonic CD4(+) integrin αE(+) cells from patients with UC expressed higher levels of granzyme A messenger RNA (GZMA mRNA) than CD4(+) αE(-) cells (P < .0001); granzyme A and integrin αE protein were detected in the same cells. Of patients receiving 100 mg etrolizumab, a higher proportion of those with high levels of GZMA mRNA (41%) or ITGAE mRNA (38%) than those with low levels of GZMA (6%) or ITGAE mRNA (13%) achieved clinical remission (P < .05) and mucosal healing (41% GZMA(high) vs 19% GZMA(low) and 44% ITGAE(high) vs 19% ITGAE(low)). Compared with ITGAE(low) and GZMA(low) patients, patients with ITGAE(high) and GZMA(high) had higher baseline numbers of epithelial crypt-associated integrin αE(+) cells (P < .01 for both), but a smaller number of crypt-associated integrin αE(+) cells after etrolizumab treatment (P < .05 for both). After 10 weeks of etrolizumab treatment, expression of genes associated with T-cell activation and genes encoding inflammatory cytokines decreased by 40%-80% from baseline (P < .05) in patients with colon tissues expressing high levels of GZMA at baseline.

CONCLUSIONS

Levels of GZMA and ITGAE mRNAs in colon tissues can identify patients with UC who are most likely to benefit from etrolizumab; expression levels decrease with etrolizumab administration in biomarker(high) patients. Larger, prospective studies of markers are needed to assess their clinical value.

Affiliation(s)

Gaik W Tew: Genentech Research and Early Development, South San Francisco, California
Jason A Hackney: Genentech Research and Early Development, South San Francisco, California
Deena Gibbons: King's College, London, United Kingdom
Christopher A Lamb: Newcastle University, Newcastle upon Tyne, United Kingdom
Diana Luca: Genentech Research and Early Development, South San Francisco, California
Jackson G Egen: Genentech Research and Early Development, South San Francisco, California
Lauri Diehl: Genentech Research and Early Development, South San Francisco, California
Jeff Eastham Anderson: Genentech Research and Early Development, South San Francisco, California
Severine Vermeire: University of Leuven, Leuven, Belgium
John C Mansfield: Newcastle University, Newcastle upon Tyne, United Kingdom
Brian G Feagan: University of Western Ontario, London, Ontario, Canada
Julian Panes: Hospital Clinic de Barcelona, Institut d'Investigacions Biomèdiques August Pi i Sunyer, Centro de Investigación Biomédica en Red de Enfermedades Hepáticas y Digestivas, Barcelona, Spain
Daniel C Baumgart: Charité Medical School, Humboldt-University of Berlin, Germany
Stefan Schreiber: Department of Medicine I, University Hospital Schleswig-Holstein, Christian Albrechts University, Kiel, Germany
Iris Dotan: Inflammatory Bowel Disease Center, Department of Gastroenterology and Liver Diseases, Tel Aviv Medical Center and Sackler Faculty of Medicine, Tel Aviv, Israel
William J Sandborn: University of California San Diego, La Jolla, California
John A Kirby: Newcastle University, Newcastle upon Tyne, United Kingdom
Peter M Irving: King's College, London, United Kingdom
Gert De Hertogh: University of Leuven, Leuven, Belgium
Gert A Van Assche: University of Leuven, Leuven, Belgium; University of Toronto, Toronto, Ontario, Canada
Paul Rutgeerts: University of Leuven, Leuven, Belgium
Sharon O'Byrne: Genentech Research and Early Development, South San Francisco, California
Adrian Hayday: King's College, London, United Kingdom
Mary E Keir: Genentech Research and Early Development, South San Francisco, California

García-Campos MA, Espinal-Enríquez J, Hernández-Lemus E. Pathway Analysis: State of the Art. Front Physiol 2015;6:383. [PMID: 26733877 PMCID: PMC4681784 DOI: 10.3389/fphys.2015.00383] [Citation(s) in RCA: 151] [Impact Index Per Article: 16.8] [Received: 09/28/2015] [Accepted: 11/26/2015] [Indexed: 12/02/2022] Open

Hamzić E, Buitenhuis B, Hérault F, Hawken R, Abrahamsen MS, Servin B, Elsen JM, Pinard-van der Laan MH, Bed'Hom B. Genome-wide association study and biological pathway analysis of the Eimeria maxima response in broilers. Genet Sel Evol 2015;47:91. [PMID: 26607727 PMCID: PMC4659166 DOI: 10.1186/s12711-015-0170-0] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Received: 05/23/2015] [Accepted: 11/05/2015] [Indexed: 02/22/2023] Open

Abstract

Background

Coccidiosis is the most common and costly disease in the poultry industry and is caused by protozoans of the Eimeria genus. The current control of coccidiosis, based on the use of anticoccidial drugs and vaccination, faces serious obstacles such as drug resistance and the high costs for the development of efficient vaccines, respectively. Therefore, the current control programs must be expanded with complementary approaches such as the use of genetics to improve the host response to Eimeria infections. Recently, we have performed a large-scale challenge study on Cobb500 broilers using E. maxima for which we investigated variability among animals in response to the challenge. As a follow-up to this challenge study, we performed a genome-wide association study (GWAS) to identify genomic regions underlying variability of the measured traits in the response to Eimeria maxima in broilers. Furthermore, we conducted a post-GWAS functional analysis to increase our biological understanding of the underlying response to Eimeria maxima challenge.

Results

In total, we identified 22 single nucleotide polymorphisms (SNPs) with q value <0.1 distributed across five chromosomes. The highly significant SNPs were associated with body weight gain (three SNPs on GGA5, one SNP on GGA1 and one SNP on GGA3), plasma coloration measured as optical density at wavelengths in the range 465–510 nm (10 SNPs and all on GGA10) and the percentage of β2-globulin in blood plasma (15 SNPs on GGA1 and one SNP on GGA2). Biological pathways related to metabolic processes, cell proliferation, and primary innate immune processes were among the most frequent significantly enriched biological pathways. Furthermore, the network-based analysis produced two networks of high confidence, with one centered on large tumor suppressor kinase 1 (LATS1) and 2 (LATS2) and the second involving the myosin heavy chain 6 (MYH6).

Conclusions

We identified several strong candidate genes and genomic regions associated with traits measured in response to Eimeria maxima in broilers. Furthermore, the post-GWAS functional analysis indicates that biological pathways and networks involved in tissue proliferation and repair along with the primary innate immune response may play the most important role during the early stage of Eimeria maxima infection in broilers.

Electronic supplementary material

The online version of this article (doi:10.1186/s12711-015-0170-0) contains supplementary material, which is available to authorized users.

Yu X, Zeng T, Li G. Integrative enrichment analysis: a new computational method to detect dysregulated pathways in heterogeneous samples. BMC Genomics 2015;16:918. [PMID: 26556243 PMCID: PMC4641376 DOI: 10.1186/s12864-015-2188-7] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Received: 08/08/2015] [Accepted: 11/02/2015] [Indexed: 12/27/2022] Open

Abstract

BACKGROUND

Pathway enrichment analysis is a useful tool for studying biology and biomedicine, because it screens function at the level of well-defined biological processes rather than separate molecules. Measuring the malfunction of a pathway under a phenotype change, e.g., from normal to diseased, is the key issue when applying enrichment analysis to a pathway. Conventional analysis focuses on differentially expressed genes (DEGs), which assumes highly pure samples. However, disease samples are usually heterogeneous, so genes with large differential expression variance (DEVGs) are becoming attractive and important indicators of the specific state of a biological system. In the context of differential expression variance, it is still a challenge to measure the enrichment or status of a pathway. To address this issue, we proposed Integrative Enrichment Analysis (IEA) based on a novel enrichment measurement.

RESULTS

The main competitive ability of IEA is to identify dysregulated pathways containing DEGs and DEVGs simultaneously, which are usually under-scored by other methods. Next, IEA provides two additional assistant approaches to investigate such dysregulated pathways. One is to infer the association among identified dysregulated pathways and expected target pathways by estimating pathway crosstalks. The other one is to recognize subtype-factors as dysregulated pathways associated to particular clinical indices according to the DEVGs' relative expressions rather than conventional raw expressions. Based on a previously established evaluation scheme, we found that, in particular cohorts (i.e., a group of real gene expression datasets from human patients), a few target disease pathways can be significantly high-ranked by IEA, which is more effective than other state-of-the-art methods. Furthermore, we present a proof-of-concept study on Diabetes to indicate: IEA rather than conventional ORA or GSEA can capture the under-estimated dysregulated pathways full of DEVGs and DEGs; these newly identified pathways could be significantly linked to prior-known disease pathways by estimated crosstalks; and many candidate subtype-factors recognized by IEA also have significant relation with the risk of subtypes of genotype-phenotype associations.

CONCLUSIONS

Overall, IEA supplies a new tool for carrying out enrichment analysis in the complicated context of clinical application (i.e., heterogeneity of disease), as a necessary complementary and cooperative approach to conventional ones.

Bayerlová M, Jung K, Kramer F, Klemm F, Bleckmann A, Beißbarth T. Comparative study on gene set and pathway topology-based enrichment methods. BMC Bioinformatics 2015;16:334. [PMID: 26489510 PMCID: PMC4618947 DOI: 10.1186/s12859-015-0751-5] [Citation(s) in RCA: 53] [Impact Index Per Article: 5.9] [Received: 08/20/2015] [Accepted: 09/29/2015] [Indexed: 01/08/2023] Open

Abstract

Background

Enrichment analysis is a popular approach to identify pathways or sets of genes that are significantly enriched in the context of differentially expressed genes. The traditional gene set enrichment approach considers a pathway as a simple gene list, disregarding any knowledge of gene or protein interactions. In contrast, the new group of so-called pathway topology-based methods integrates the topological structure of a pathway into the analysis.
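As a rough illustration of the traditional gene set approach described above, where a pathway is treated as a plain gene list, the sketch below computes an over-representation p-value with a one-sided hypergeometric test. This is a generic textbook formulation, not the specific methods compared in the article; the function name `ora_p_value` and all numbers are assumptions made for the example.

```python
from math import comb


def ora_p_value(n_universe, n_in_set, n_de, n_overlap):
    """One-sided hypergeometric p-value for gene set over-representation.

    n_universe: number of genes measured in total
    n_in_set:   number of those genes that belong to the gene set
    n_de:       number of differentially expressed (DE) genes
    n_overlap:  number of DE genes that belong to the gene set

    Returns P(X >= n_overlap), where X is the overlap obtained when
    n_de genes are drawn at random without replacement.
    """
    upper = min(n_in_set, n_de)
    total = comb(n_universe, n_de)
    tail = sum(
        comb(n_in_set, k) * comb(n_universe - n_in_set, n_de - k)
        for k in range(n_overlap, upper + 1)
    )
    return tail / total


# Illustrative numbers only: 10 genes measured, 3 in the set, 3 DE genes,
# all 3 of which fall in the set.
p = ora_p_value(n_universe=10, n_in_set=3, n_de=3, n_overlap=3)
print(p)
```

Note that this test uses only set membership. It ignores pathway topology entirely, which is the contrast the paragraph above draws with topology-based methods.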

Methods

We comparatively investigated gene set and pathway topology-based enrichment approaches, considering three gene set and four topological methods. These methods were compared in two extensive simulation studies and on a benchmark of 36 real datasets, providing the same pathway input data for all methods.

Results

In the benchmark data analysis, both types of methods showed a comparable ability to detect enriched pathways. The first simulation study was conducted with KEGG pathways, which showed considerable gene overlaps between each other. In this study with original KEGG pathways, none of the topology-based methods outperformed the gene set approach. Therefore, a second simulation study was performed on non-overlapping pathways created by unique gene IDs. Here, methods accounting for pathway topology reached higher accuracy than the gene set methods; however, their sensitivity was lower.

Conclusions

We conducted one of the first comprehensive comparative works on evaluating gene set against pathway topology-based enrichment methods. The topological methods showed better performance in the simulation scenarios with non-overlapping pathways; however, they were not conclusively better in the other scenarios. This suggests that a simple gene set approach might be sufficient to detect an enriched pathway under realistic circumstances. Nevertheless, more extensive studies and further benchmark data are needed to systematically evaluate these methods and to assess what gain and cost pathway topology information introduces into enrichment analysis. Both types of methods for enrichment analysis require further improvements in order to deal with the problem of pathway overlaps.

Electronic supplementary material

The online version of this article (doi:10.1186/s12859-015-0751-5) contains supplementary material, which is available to authorized users.

Meijer RJ, Goeman JJ. Multiple Testing of Gene Sets from Gene Ontology: Possibilities and Pitfalls. Brief Bioinform 2015;17:808-18. [DOI: 10.1093/bib/bbv091] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Received: 08/06/2015] [Indexed: 11/14/2022] Open

Turner JA, Bolen CR, Blankenship DM. Quantitative gene set analysis generalized for repeated measures, confounder adjustment, and continuous covariates. BMC Bioinformatics 2015;16:272. [PMID: 26316107 PMCID: PMC4551517 DOI: 10.1186/s12859-015-0707-9] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Received: 03/20/2015] [Accepted: 08/17/2015] [Indexed: 12/20/2022] Open

Abstract

Background

Gene set analysis (GSA) of gene expression data can be highly powerful when the biological signal is weak compared to other sources of variability in the data. However, many gene set analysis approaches utilize permutation tests which are not appropriate for complex study designs. For example, the correlation of subjects is broken when comparing time points within a longitudinal study. Linear mixed models provide a method to analyze longitudinal studies as well as adjust for potential confounding factors and account for sources of variability that are not of primary interest. Currently, there are no known gene set analysis approaches that fully account for these study design and analysis aspects. In order to do so, we generalize the QuSAGE gene set analysis algorithm, denoted Q-Gen, and provide the necessary estimation adjustments to incorporate linear mixed model analyses.

Results

We assessed the performance of our generalized method in comparison to the original QuSAGE method in settings such as longitudinal repeated measures analysis and accounting for potential confounders. We demonstrate that the original QuSAGE method cannot control type-I error when these complexities exist. In addition to statistical appropriateness, analysis of a longitudinal influenza study suggests Q-Gen can allow for greater sensitivity when exploring a large number of gene sets.

Conclusions

Q-Gen is an extension to the gene set analysis method of QuSAGE, and allows for linear mixed models to be applied appropriately within a gene set analysis framework. It provides GSA an added layer of flexibility that was not previously available. This flexibility allows for more appropriate statistical modeling of complex data structures that are inherent to many microarray study designs and can provide more sensitivity.

Electronic supplementary material

The online version of this article (doi:10.1186/s12859-015-0707-9) contains supplementary material, which is available to authorized users.

Metabolite profiling stratifies pancreatic ductal adenocarcinomas into subtypes with distinct sensitivities to metabolic inhibitors. Proc Natl Acad Sci U S A 2015. [PMID: 26216984 DOI: 10.1073/pnas.1501605112] [Citation(s) in RCA: 231] [Impact Index Per Article: 25.7] [Indexed: 12/13/2022] Open

MacNeil SM, Johnson WE, Li DY, Piccolo SR, Bild AH. Inferring pathway dysregulation in cancers from multiple types of omic data. Genome Med 2015;7:61. [PMID: 26170901 PMCID: PMC4499940 DOI: 10.1186/s13073-015-0189-4] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Received: 10/03/2014] [Accepted: 06/16/2015] [Indexed: 11/10/2022] Open