Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Park CY, Wong AK, Greene CS, Rowland J, Guan Y, Bongo LA, Burdine RD, Troyanskaya OG. Functional knowledge transfer for high-accuracy prediction of under-studied biological processes. PLoS Comput Biol 2013;9:e1002957. [PMID: 23516347 PMCID: PMC3597527 DOI: 10.1371/journal.pcbi.1002957] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2012] [Accepted: 01/15/2013] [Indexed: 11/19/2022] Open

For:	Park CY, Wong AK, Greene CS, Rowland J, Guan Y, Bongo LA, Burdine RD, Troyanskaya OG. Functional knowledge transfer for high-accuracy prediction of under-studied biological processes. PLoS Comput Biol 2013;9:e1002957. [PMID: 23516347 PMCID: PMC3597527 DOI: 10.1371/journal.pcbi.1002957] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2012] [Accepted: 01/15/2013] [Indexed: 11/19/2022] Open

Number

Cited by Other Article(s)

Yuan H, Mancuso CA, Johnson K, Braasch I, Krishnan A. Computational strategies for cross-species knowledge transfer and translational biomedicine. ARXIV 2024:arXiv:2408.08503v1. [PMID: 39184546 PMCID: PMC11343225] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Subscribe] [Scholar Register] [Indexed: 08/27/2024]

Mancuso CA, Johnson KA, Liu R, Krishnan A. Joint representation of molecular networks from multiple species improves gene classification. PLoS Comput Biol 2024;20:e1011773. [PMID: 38198480 PMCID: PMC10805316 DOI: 10.1371/journal.pcbi.1011773] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2023] [Revised: 01/23/2024] [Accepted: 12/20/2023] [Indexed: 01/12/2024] Open

Li L, Dannenfelser R, Zhu Y, Hejduk N, Segarra S, Yao V. Joint embedding of biological networks for cross-species functional alignment. Bioinformatics 2023;39:btad529. [PMID: 37632792 PMCID: PMC10477935 DOI: 10.1093/bioinformatics/btad529] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2022] [Revised: 07/12/2023] [Accepted: 08/24/2023] [Indexed: 08/28/2023] Open

Ding K, Wang S, Luo Y. Supervised biological network alignment with graph neural networks. Bioinformatics 2023;39:i465-i474. [PMID: 37387160 PMCID: PMC10311300 DOI: 10.1093/bioinformatics/btad241] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/01/2023] Open

Abstract

MOTIVATION

Despite the advances in sequencing technology, massive proteins with known sequences remain functionally unannotated. Biological network alignment (NA), which aims to find the node correspondence between species' protein-protein interaction (PPI) networks, has been a popular strategy to uncover missing annotations by transferring functional knowledge across species. Traditional NA methods assumed that topologically similar proteins in PPIs are functionally similar. However, it was recently reported that functionally unrelated proteins can be as topologically similar as functionally related pairs, and a new data-driven or supervised NA paradigm has been proposed, which uses protein function data to discern which topological features correspond to functional relatedness.

RESULTS

Here, we propose GraNA, a deep learning framework for the supervised NA paradigm for the pairwise NA problem. Employing graph neural networks, GraNA utilizes within-network interactions and across-network anchor links for learning protein representations and predicting functional correspondence between across-species proteins. A major strength of GraNA is its flexibility to integrate multi-faceted non-functional relationship data, such as sequence similarity and ortholog relationships, as anchor links to guide the mapping of functionally related proteins across species. Evaluating GraNA on a benchmark dataset composed of several NA tasks between different pairs of species, we observed that GraNA accurately predicted the functional relatedness of proteins and robustly transferred functional annotations across species, outperforming a number of existing NA methods. When applied to a case study on a humanized yeast network, GraNA also successfully discovered functionally replaceable human-yeast protein pairs that were documented in previous studies.

AVAILABILITY AND IMPLEMENTATION

The code of GraNA is available at https://github.com/luo-group/GraNA.

Collapse

Lachmann A, Rizzo KA, Bartal A, Jeon M, Clarke DJB, Ma’ayan A. PrismEXP: gene annotation prediction from stratified gene-gene co-expression matrices. PeerJ 2023;11:e14927. [PMID: 36874981 PMCID: PMC9979837 DOI: 10.7717/peerj.14927] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2022] [Accepted: 01/30/2023] [Indexed: 03/03/2023] Open

Abstract

Background

Gene-gene co-expression correlations measured by mRNA-sequencing (RNA-seq) can be used to predict gene annotations based on the co-variance structure within these data. In our prior work, we showed that uniformly aligned RNA-seq co-expression data from thousands of diverse studies is highly predictive of both gene annotations and protein-protein interactions. However, the performance of the predictions varies depending on whether the gene annotations and interactions are cell type and tissue specific or agnostic. Tissue and cell type-specific gene-gene co-expression data can be useful for making more accurate predictions because many genes perform their functions in unique ways in different cellular contexts. However, identifying the optimal tissues and cell types to partition the global gene-gene co-expression matrix is challenging.

Results

Here we introduce and validate an approach called PRediction of gene Insights from Stratified Mammalian gene co-EXPression (PrismEXP) for improved gene annotation predictions based on RNA-seq gene-gene co-expression data. Using uniformly aligned data from ARCHS4, we apply PrismEXP to predict a wide variety of gene annotations including pathway membership, Gene Ontology terms, as well as human and mouse phenotypes. Predictions made with PrismEXP outperform predictions made with the global cross-tissue co-expression correlation matrix approach on all tested domains, and training using one annotation domain can be used to predict annotations in other domains.

Conclusions

By demonstrating the utility of PrismEXP predictions in multiple use cases we show how PrismEXP can be used to enhance unsupervised machine learning methods to better understand the roles of understudied genes and proteins. To make PrismEXP accessible, it is provided via a user-friendly web interface, a Python package, and an Appyter. AVAILABILITY. The PrismEXP web-based application, with pre-computed PrismEXP predictions, is available from: https://maayanlab.cloud/prismexp; PrismEXP is also available as an Appyter: https://appyters.maayanlab.cloud/PrismEXP/; and as Python package: https://github.com/maayanlab/prismexp.

Collapse

Mancuso CA, Bills PS, Krum D, Newsted J, Liu R, Krishnan A. GenePlexus: a web-server for gene discovery using network-based machine learning. Nucleic Acids Res 2022;50:W358-W366. [PMID: 35580053 PMCID: PMC9252732 DOI: 10.1093/nar/gkac335] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2022] [Revised: 04/13/2022] [Accepted: 04/30/2022] [Indexed: 11/28/2022] Open

Joblin-Mills A, Wu Z, Fraser K, Jones B, Yip W, Lim JJ, Lu L, Sequeira I, Poppitt S. The impact of ethnicity and intra-pancreatic fat on the postprandial metabolome response to whey protein in overweight Asian Chinese and European Caucasian women with prediabetes. FRONTIERS IN CLINICAL DIABETES AND HEALTHCARE 2022;3:980856. [PMID: 36992769 PMCID: PMC10012149 DOI: 10.3389/fcdhc.2022.980856] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/07/2022] [Accepted: 07/27/2022] [Indexed: 03/31/2023]

Abstract

The "Thin on the Outside Fat on the Inside" TOFI_Asia study found Asian Chinese to be more susceptible to Type 2 Diabetes (T2D) compared to European Caucasians matched for gender and body mass index (BMI). This was influenced by degree of visceral adipose deposition and ectopic fat accumulation in key organs, including liver and pancreas, leading to altered fasting plasma glucose, insulin resistance, and differences in plasma lipid and metabolite profiles. It remains unclear how intra-pancreatic fat deposition (IPFD) impacts TOFI phenotype-related T2D risk factors associated with Asian Chinese. Cow's milk whey protein isolate (WPI) is an insulin secretagogue which can suppress hyperglycemia in prediabetes. In this dietary intervention, we used untargeted metabolomics to characterize the postprandial WPI response in 24 overweight women with prediabetes. Participants were classified by ethnicity (Asian Chinese, n=12; European Caucasian, n=12) and IPFD (low IPFD < 4.66%, n=10; high IPFD ≥ 4.66%, n=10). Using a cross-over design participants were randomized to consume three WPI beverages on separate occasions; 0 g (water control), 12.5 g (low protein, LP) and 50 g (high protein, HP), consumed when fasted. An exclusion pipeline for isolating metabolites with temporal (T_0-240mins) WPI responses was implemented, and a support vector machine-recursive feature elimination (SVM-RFE) algorithm was used to model relevant metabolites by ethnicity and IPFD classes. Metabolic network analysis identified glycine as a central hub in both ethnicity and IPFD WPI response networks. A depletion of glycine relative to WPI concentration was detected in Chinese and high IPFD participants independent of BMI. Urea cycle metabolites were highly represented among the ethnicity WPI metabolome model, implicating a dysregulation in ammonia and nitrogen metabolism among Chinese participants. Uric acid and purine synthesis pathways were enriched within the high IPFD cohort's WPI metabolome response, implicating adipogenesis and insulin resistance pathways. In conclusion, the discrimination of ethnicity from WPI metabolome profiles was a stronger prediction model than IPFD in overweight women with prediabetes. Each models' discriminatory metabolites enriched different metabolic pathways that help to further characterize prediabetes in Asian Chinese women and women with increased IPFD, independently.

Collapse

Yu G, Zhou G, Zhang X, Domeniconi C, Guo M. DMIL-IsoFun: predicting isoform function using deep multi-instance learning. Bioinformatics 2021;37:4818-4825. [PMID: 34282449 DOI: 10.1093/bioinformatics/btab532] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2021] [Revised: 06/20/2021] [Accepted: 07/16/2021] [Indexed: 11/14/2022] Open

Zhao Y, Wang J, Guo M, Zhang X, Yu G. Cross-Species Protein Function Prediction with Asynchronous-Random Walk. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2021;18:1439-1450. [PMID: 31562099 DOI: 10.1109/tcbb.2019.2943342] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]

Karbalayghareh A, Qian X, Dougherty ER. Optimal Bayesian Transfer Learning for Count Data. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2021;18:644-655. [PMID: 31180899 DOI: 10.1109/tcbb.2019.2920981] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]

Liu R, Mancuso CA, Yannakopoulos A, Johnson KA, Krishnan A. Supervised learning is an accurate method for network-based gene classification. Bioinformatics 2020;36:3457-3465. [PMID: 32129827 PMCID: PMC7267831 DOI: 10.1093/bioinformatics/btaa150] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2019] [Revised: 12/01/2019] [Accepted: 02/27/2020] [Indexed: 12/22/2022] Open

Abstract

Background

Assigning every human gene to specific functions, diseases and traits is a grand challenge in modern genetics. Key to addressing this challenge are computational methods, such as supervised learning and label propagation, that can leverage molecular interaction networks to predict gene attributes. In spite of being a popular machine-learning technique across fields, supervised learning has been applied only in a few network-based studies for predicting pathway-, phenotype- or disease-associated genes. It is unknown how supervised learning broadly performs across different networks and diverse gene classification tasks, and how it compares to label propagation, the widely benchmarked canonical approach for this problem.

Results

In this study, we present a comprehensive benchmarking of supervised learning for network-based gene classification, evaluating this approach and a classic label propagation technique on hundreds of diverse prediction tasks and multiple networks using stringent evaluation schemes. We demonstrate that supervised learning on a gene’s full network connectivity outperforms label propagaton and achieves high prediction accuracy by efficiently capturing local network properties, rivaling label propagation’s appeal for naturally using network topology. We further show that supervised learning on the full network is also superior to learning on node embeddings (derived using node2vec), an increasingly popular approach for concisely representing network connectivity. These results show that supervised learning is an accurate approach for prioritizing genes associated with diverse functions, diseases and traits and should be considered a staple of network-based gene classification workflows.

Availability and implementation

The datasets and the code used to reproduce the results and add new gene classification methods have been made freely available.

Contact

arjun@msu.edu

Supplementary information

Supplementary data are available at Bioinformatics online.

Collapse

Lack of a site-specific phosphorylation of Presenilin 1 disrupts microglial gene networks and progenitors during development. PLoS One 2020;15:e0237773. [PMID: 32822378 PMCID: PMC7444478 DOI: 10.1371/journal.pone.0237773] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2020] [Accepted: 08/03/2020] [Indexed: 12/27/2022] Open

Selective Neuronal Vulnerability in Alzheimer's Disease: A Network-Based Analysis. Neuron 2020;107:821-835.e12. [PMID: 32603655 DOI: 10.1016/j.neuron.2020.06.010] [Citation(s) in RCA: 115] [Impact Index Per Article: 23.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2019] [Revised: 04/23/2020] [Accepted: 06/05/2020] [Indexed: 12/17/2022]

Liu R, Mancuso CA, Yannakopoulos A, Johnson KA, Krishnan A. Supervised learning is an accurate method for network-based gene classification. BIOINFORMATICS (OXFORD, ENGLAND) 2020;36:3457-3465. [PMID: 32129827 DOI: 10.1101/721423] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/26/2019] [Revised: 12/01/2019] [Accepted: 02/27/2020] [Indexed: 05/26/2023]

Abstract

BACKGROUND

RESULTS

In this study, we present a comprehensive benchmarking of supervised learning for network-based gene classification, evaluating this approach and a classic label propagation technique on hundreds of diverse prediction tasks and multiple networks using stringent evaluation schemes. We demonstrate that supervised learning on a gene's full network connectivity outperforms label propagaton and achieves high prediction accuracy by efficiently capturing local network properties, rivaling label propagation's appeal for naturally using network topology. We further show that supervised learning on the full network is also superior to learning on node embeddings (derived using node2vec), an increasingly popular approach for concisely representing network connectivity. These results show that supervised learning is an accurate approach for prioritizing genes associated with diverse functions, diseases and traits and should be considered a staple of network-based gene classification workflows.

AVAILABILITY AND IMPLEMENTATION

The datasets and the code used to reproduce the results and add new gene classification methods have been made freely available.

CONTACT

arjun@msu.edu.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

Collapse

Zhao Y, Wang J, Chen J, Zhang X, Guo M, Yu G. A Literature Review of Gene Function Prediction by Modeling Gene Ontology. Front Genet 2020;11:400. [PMID: 32391061 PMCID: PMC7193026 DOI: 10.3389/fgene.2020.00400] [Citation(s) in RCA: 48] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2020] [Accepted: 03/30/2020] [Indexed: 12/14/2022] Open

Zhou J, Schor IE, Yao V, Theesfeld CL, Marco-Ferreres R, Tadych A, Furlong EEM, Troyanskaya OG. Accurate genome-wide predictions of spatio-temporal gene expression during embryonic development. PLoS Genet 2019;15:e1008382. [PMID: 31553718 PMCID: PMC6779412 DOI: 10.1371/journal.pgen.1008382] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2018] [Revised: 10/07/2019] [Accepted: 08/22/2019] [Indexed: 11/18/2022] Open

Abstract

Comprehensive information on the timing and location of gene expression is fundamental to our understanding of embryonic development and tissue formation. While high-throughput in situ hybridization projects provide invaluable information about developmental gene expression patterns for model organisms like Drosophila, the output of these experiments is primarily qualitative, and a high proportion of protein coding genes and most non-coding genes lack any annotation. Accurate data-centric predictions of spatio-temporal gene expression will therefore complement current in situ hybridization efforts. Here, we applied a machine learning approach by training models on all public gene expression and chromatin data, even from whole-organism experiments, to provide genome-wide, quantitative spatio-temporal predictions for all genes. We developed structured in silico nano-dissection, a computational approach that predicts gene expression in >200 tissue-developmental stages. The algorithm integrates expression signals from a compendium of 6,378 genome-wide expression and chromatin profiling experiments in a cell lineage-aware fashion. We systematically evaluated our performance via cross-validation and experimentally confirmed 22 new predictions for four different embryonic tissues. The model also predicts complex, multi-tissue expression and developmental regulation with high accuracy. We further show the potential of applying these genome-wide predictions to extract tissue specificity signals from non-tissue-dissected experiments, and to prioritize tissues and stages for disease modeling. This resource, together with the exploratory tools are freely available at our webserver http://find.princeton.edu, which provides a valuable tool for a range of applications, from predicting spatio-temporal expression patterns to recognizing tissue signatures from differential gene expression profiles.

Collapse

Proost S, Mutwil M. CoNekT: an open-source framework for comparative genomic and transcriptomic network analyses. Nucleic Acids Res 2019;46:W133-W140. [PMID: 29718322 PMCID: PMC6030989 DOI: 10.1093/nar/gky336] [Citation(s) in RCA: 71] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2018] [Accepted: 04/18/2018] [Indexed: 12/22/2022] Open

Wong AK, Krishnan A, Troyanskaya OG. GIANT 2.0: genome-scale integrated analysis of gene networks in tissues. Nucleic Acids Res 2019;46:W65-W70. [PMID: 29800226 PMCID: PMC6030827 DOI: 10.1093/nar/gky408] [Citation(s) in RCA: 48] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2018] [Accepted: 05/07/2018] [Indexed: 01/09/2023] Open

Ferrari C, Proost S, Ruprecht C, Mutwil M. PhytoNet: comparative co-expression network analyses across phytoplankton and land plants. Nucleic Acids Res 2019;46:W76-W83. [PMID: 29718316 PMCID: PMC6030924 DOI: 10.1093/nar/gky298] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2018] [Accepted: 04/11/2018] [Indexed: 11/15/2022] Open

Guala D, Ogris C, Müller N, Sonnhammer ELL. Genome-wide functional association networks: background, data & state-of-the-art resources. Brief Bioinform 2019;21:1224-1237. [PMID: 31281921 PMCID: PMC7373183 DOI: 10.1093/bib/bbz064] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2019] [Revised: 04/29/2019] [Accepted: 05/04/2019] [Indexed: 02/06/2023] Open

Lee YS, Wong AK, Tadych A, Hartmann BM, Park CY, DeJesus VA, Ramos I, Zaslavsky E, Sealfon SC, Troyanskaya OG. Interpretation of an individual functional genomics experiment guided by massive public data. Nat Methods 2018;15:1049-1052. [PMID: 30478325 PMCID: PMC6941785 DOI: 10.1038/s41592-018-0218-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2018] [Accepted: 09/27/2018] [Indexed: 12/11/2022]

Yao V, Kaletsky R, Keyes W, Mor DE, Wong AK, Sohrabi S, Murphy CT, Troyanskaya OG. An integrative tissue-network approach to identify and test human disease genes. Nat Biotechnol 2018;36:nbt.4246. [PMID: 30346941 PMCID: PMC7021177 DOI: 10.1038/nbt.4246] [Citation(s) in RCA: 44] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2018] [Accepted: 08/08/2018] [Indexed: 01/09/2023]

Enabling Precision Medicine through Integrative Network Models. J Mol Biol 2018;430:2913-2923. [DOI: 10.1016/j.jmb.2018.07.004] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2018] [Revised: 06/15/2018] [Accepted: 07/03/2018] [Indexed: 11/17/2022]

Ching T, Himmelstein DS, Beaulieu-Jones BK, Kalinin AA, Do BT, Way GP, Ferrero E, Agapow PM, Zietz M, Hoffman MM, Xie W, Rosen GL, Lengerich BJ, Israeli J, Lanchantin J, Woloszynek S, Carpenter AE, Shrikumar A, Xu J, Cofer EM, Lavender CA, Turaga SC, Alexandari AM, Lu Z, Harris DJ, DeCaprio D, Qi Y, Kundaje A, Peng Y, Wiley LK, Segler MHS, Boca SM, Swamidass SJ, Huang A, Gitter A, Greene CS. Opportunities and obstacles for deep learning in biology and medicine. J R Soc Interface 2018;15:20170387. [PMID: 29618526 PMCID: PMC5938574 DOI: 10.1098/rsif.2017.0387] [Citation(s) in RCA: 905] [Impact Index Per Article: 129.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2017] [Accepted: 03/07/2018] [Indexed: 11/12/2022] Open

Affiliation(s)

Travers Ching Molecular Biosciences and Bioengineering Graduate Program, University of Hawaii at Manoa, Honolulu, HI, USA
Daniel S Himmelstein Department of Systems Pharmacology and Translational Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
Brett K Beaulieu-Jones Genomics and Computational Biology Graduate Group, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
Alexandr A Kalinin Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI, USA
Brian T Do Harvard Medical School, Boston, MA, USA
Gregory P Way Department of Systems Pharmacology and Translational Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
Enrico Ferrero Computational Biology and Stats, Target Sciences, GlaxoSmithKline, Stevenage, UK
Paul-Michael Agapow Data Science Institute, Imperial College London, London, UK
Michael Zietz Department of Systems Pharmacology and Translational Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
Michael M Hoffman Princess Margaret Cancer Centre, Toronto, Ontario, Canada Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada Department of Computer Science, University of Toronto, Toronto, Ontario, Canada
Wei Xie Electrical Engineering and Computer Science, Vanderbilt University, Nashville, TN, USA
Gail L Rosen Ecological and Evolutionary Signal-processing and Informatics Laboratory, Department of Electrical and Computer Engineering, Drexel University, Philadelphia, PA, USA
Benjamin J Lengerich Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA
Johnny Israeli Biophysics Program, Stanford University, Stanford, CA, USA
Jack Lanchantin Department of Computer Science, University of Virginia, Charlottesville, VA, USA
Stephen Woloszynek Ecological and Evolutionary Signal-processing and Informatics Laboratory, Department of Electrical and Computer Engineering, Drexel University, Philadelphia, PA, USA
Anne E Carpenter Imaging Platform, Broad Institute of Harvard and MIT, Cambridge, MA, USA
Avanti Shrikumar Department of Computer Science, Stanford University, Stanford, CA, USA
Jinbo Xu Toyota Technological Institute at Chicago, Chicago, IL, USA
Evan M Cofer Department of Computer Science, Trinity University, San Antonio, TX, USA Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ, USA
Christopher A Lavender Integrative Bioinformatics, National Institute of Environmental Health Sciences, National Institutes of Health, Research Triangle Park, NC, USA
Srinivas C Turaga Howard Hughes Medical Institute, Janelia Research Campus, Ashburn, VA, USA
Amr M Alexandari Department of Computer Science, Stanford University, Stanford, CA, USA
Zhiyong Lu National Center for Biotechnology Information and National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
David J Harris Department of Wildlife Ecology and Conservation, University of Florida, Gainesville, FL, USA
Dave DeCaprio ClosedLoop.ai, Austin, TX, USA
Yanjun Qi Department of Computer Science, University of Virginia, Charlottesville, VA, USA
Anshul Kundaje Department of Computer Science, Stanford University, Stanford, CA, USA Department of Genetics, Stanford University, Stanford, CA, USA
Yifan Peng National Center for Biotechnology Information and National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
Laura K Wiley Division of Biomedical Informatics and Personalized Medicine, University of Colorado School of Medicine, Aurora, CO, USA
Marwin H S Segler Institute of Organic Chemistry, Westfälische Wilhelms-Universität Münster, Münster, Germany
Simina M Boca Innovation Center for Biomedical Informatics, Georgetown University Medical Center, Washington, DC, USA
S Joshua Swamidass Department of Pathology and Immunology, Washington University in Saint Louis, St Louis, MO, USA
Austin Huang Department of Medicine, Brown University, Providence, RI, USA
Anthony Gitter Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI, USA Morgridge Institute for Research, Madison, WI, USA
Casey S Greene Department of Systems Pharmacology and Translational Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA

Collapse

Sibout R, Proost S, Hansen BO, Vaid N, Giorgi FM, Ho-Yue-Kuang S, Legée F, Cézart L, Bouchabké-Coussa O, Soulhat C, Provart N, Pasha A, Le Bris P, Roujol D, Hofte H, Jamet E, Lapierre C, Persson S, Mutwil M. Expression atlas and comparative coexpression network analyses reveal important genes involved in the formation of lignified cell wall in Brachypodium distachyon. THE NEW PHYTOLOGIST 2017;215:1009-1025. [PMID: 28617955 DOI: 10.1111/nph.14635] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/28/2017] [Accepted: 04/26/2017] [Indexed: 05/08/2023]

Affiliation(s)

Richard Sibout Institut Jean-Pierre Bourgin, UMR 1318, INRA, AgroParisTech, CNRS, Université Paris-Saclay, RD10, Versailles Cedex, 78026, France
Sebastian Proost Max Planck Institute of Molecular Plant Physiology, Am Muehlenberg 1, Potsdam, 14476, Germany
Bjoern Oest Hansen Max Planck Institute of Molecular Plant Physiology, Am Muehlenberg 1, Potsdam, 14476, Germany
Neha Vaid Max Planck Institute of Molecular Plant Physiology, Am Muehlenberg 1, Potsdam, 14476, Germany
Federico M Giorgi Cancer Research UK, Cambridge Institute, Robinson Way, Cambridge, CB2 0RE, UK
Severine Ho-Yue-Kuang Institut Jean-Pierre Bourgin, UMR 1318, INRA, AgroParisTech, CNRS, Université Paris-Saclay, RD10, Versailles Cedex, 78026, France
Frédéric Legée Institut Jean-Pierre Bourgin, UMR 1318, INRA, AgroParisTech, CNRS, Université Paris-Saclay, RD10, Versailles Cedex, 78026, France
Laurent Cézart Institut Jean-Pierre Bourgin, UMR 1318, INRA, AgroParisTech, CNRS, Université Paris-Saclay, RD10, Versailles Cedex, 78026, France
Oumaya Bouchabké-Coussa Institut Jean-Pierre Bourgin, UMR 1318, INRA, AgroParisTech, CNRS, Université Paris-Saclay, RD10, Versailles Cedex, 78026, France
Camille Soulhat Institut Jean-Pierre Bourgin, UMR 1318, INRA, AgroParisTech, CNRS, Université Paris-Saclay, RD10, Versailles Cedex, 78026, France
Nicholas Provart Department of Cell and Systems Biology, Centre for the Analysis of Genome Evolution and Function, University of Toronto, 25 Willcocks St., Toronto, ON, M5S 3B2, Canada
Asher Pasha Department of Cell and Systems Biology, Centre for the Analysis of Genome Evolution and Function, University of Toronto, 25 Willcocks St., Toronto, ON, M5S 3B2, Canada
Philippe Le Bris Institut Jean-Pierre Bourgin, UMR 1318, INRA, AgroParisTech, CNRS, Université Paris-Saclay, RD10, Versailles Cedex, 78026, France
David Roujol Laboratoire de Recherche en Sciences Végétales, Université de Toulouse, CNRS, UPS, Castanet-Tolosan, France
Herman Hofte Institut Jean-Pierre Bourgin, UMR 1318, INRA, AgroParisTech, CNRS, Université Paris-Saclay, RD10, Versailles Cedex, 78026, France
Elisabeth Jamet Laboratoire de Recherche en Sciences Végétales, Université de Toulouse, CNRS, UPS, Castanet-Tolosan, France
Catherine Lapierre Institut Jean-Pierre Bourgin, UMR 1318, INRA, AgroParisTech, CNRS, Université Paris-Saclay, RD10, Versailles Cedex, 78026, France
Staffan Persson School of Biosciences, University of Melbourne, Parkville, Vic., 3010, Australia
Marek Mutwil Max Planck Institute of Molecular Plant Physiology, Am Muehlenberg 1, Potsdam, 14476, Germany

Collapse

Ruprecht C, Proost S, Hernandez-Coronado M, Ortiz-Ramirez C, Lang D, Rensing SA, Becker JD, Vandepoele K, Mutwil M. Phylogenomic analysis of gene co-expression networks reveals the evolution of functional modules. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2017;90:447-465. [PMID: 28161902 DOI: 10.1111/tpj.13502] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/19/2016] [Revised: 01/05/2017] [Accepted: 01/25/2017] [Indexed: 05/08/2023]

Ruprecht C, Vaid N, Proost S, Persson S, Mutwil M. Beyond Genomics: Studying Evolution with Gene Coexpression Networks. TRENDS IN PLANT SCIENCE 2017;22:298-307. [PMID: 28126286 DOI: 10.1016/j.tplants.2016.12.011] [Citation(s) in RCA: 47] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/02/2016] [Revised: 12/06/2016] [Accepted: 12/22/2016] [Indexed: 05/08/2023]

Proost S, Mutwil M. PlaNet: Comparative Co-Expression Network Analyses for Plants. Methods Mol Biol 2017;1533:213-227. [PMID: 27987173 DOI: 10.1007/978-1-4939-6658-5_12] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]

Krishnan A, Taroni JN, Greene CS. Integrative Networks Illuminate Biological Factors Underlying Gene–Disease Associations. CURRENT GENETIC MEDICINE REPORTS 2016. [DOI: 10.1007/s40142-016-0102-5] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]

Dolinski K, Troyanskaya OG. Implications of Big Data for cell biology. Mol Biol Cell 2016;26:2575-8. [PMID: 26174066 PMCID: PMC4501356 DOI: 10.1091/mbc.e13-12-0756] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open

Guan Y, Martini S, Mariani LH. Genes Caught In Flagranti: Integrating Renal Transcriptional Profiles With Genotypes and Phenotypes. Semin Nephrol 2016. [PMID: 26215861 DOI: 10.1016/j.semnephrol.2015.04.003] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]

A Network of Splice Isoforms for the Mouse. Sci Rep 2016;6:24507. [PMID: 27079421 PMCID: PMC4832266 DOI: 10.1038/srep24507] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2015] [Accepted: 03/30/2016] [Indexed: 01/08/2023] Open

Ruprecht C, Mendrinna A, Tohge T, Sampathkumar A, Klie S, Fernie AR, Nikoloski Z, Persson S, Mutwil M. FamNet: A Framework to Identify Multiplied Modules Driving Pathway Expansion in Plants. PLANT PHYSIOLOGY 2016;170:1878-94. [PMID: 26754669 PMCID: PMC4775111 DOI: 10.1104/pp.15.01281] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/14/2015] [Accepted: 01/07/2016] [Indexed: 05/07/2023]

Affiliation(s)

Colin Ruprecht Max Planck Institute for Molecular Plant Physiology, 14476 Potsdam, Germany (C.R., T.T, S.K., A.R.F., Z.N., M.M.), School of Biosciences and Australian Research Council Centre of Excellence in Plant Cell Walls, University of Melbourne, Parkville, Victoria 3010, Australia (A.M., S.P.); andDivision of Biology and Biological Engineering, California Institute of Technology, Pasadena, California 91125 (A.S.)
Amelie Mendrinna Max Planck Institute for Molecular Plant Physiology, 14476 Potsdam, Germany (C.R., T.T, S.K., A.R.F., Z.N., M.M.), School of Biosciences and Australian Research Council Centre of Excellence in Plant Cell Walls, University of Melbourne, Parkville, Victoria 3010, Australia (A.M., S.P.); andDivision of Biology and Biological Engineering, California Institute of Technology, Pasadena, California 91125 (A.S.)
Takayuki Tohge Max Planck Institute for Molecular Plant Physiology, 14476 Potsdam, Germany (C.R., T.T, S.K., A.R.F., Z.N., M.M.), School of Biosciences and Australian Research Council Centre of Excellence in Plant Cell Walls, University of Melbourne, Parkville, Victoria 3010, Australia (A.M., S.P.); andDivision of Biology and Biological Engineering, California Institute of Technology, Pasadena, California 91125 (A.S.)
Arun Sampathkumar Max Planck Institute for Molecular Plant Physiology, 14476 Potsdam, Germany (C.R., T.T, S.K., A.R.F., Z.N., M.M.), School of Biosciences and Australian Research Council Centre of Excellence in Plant Cell Walls, University of Melbourne, Parkville, Victoria 3010, Australia (A.M., S.P.); andDivision of Biology and Biological Engineering, California Institute of Technology, Pasadena, California 91125 (A.S.)
Sebastian Klie Max Planck Institute for Molecular Plant Physiology, 14476 Potsdam, Germany (C.R., T.T, S.K., A.R.F., Z.N., M.M.), School of Biosciences and Australian Research Council Centre of Excellence in Plant Cell Walls, University of Melbourne, Parkville, Victoria 3010, Australia (A.M., S.P.); andDivision of Biology and Biological Engineering, California Institute of Technology, Pasadena, California 91125 (A.S.)
Alisdair R Fernie Max Planck Institute for Molecular Plant Physiology, 14476 Potsdam, Germany (C.R., T.T, S.K., A.R.F., Z.N., M.M.), School of Biosciences and Australian Research Council Centre of Excellence in Plant Cell Walls, University of Melbourne, Parkville, Victoria 3010, Australia (A.M., S.P.); andDivision of Biology and Biological Engineering, California Institute of Technology, Pasadena, California 91125 (A.S.)
Zoran Nikoloski Max Planck Institute for Molecular Plant Physiology, 14476 Potsdam, Germany (C.R., T.T, S.K., A.R.F., Z.N., M.M.), School of Biosciences and Australian Research Council Centre of Excellence in Plant Cell Walls, University of Melbourne, Parkville, Victoria 3010, Australia (A.M., S.P.); andDivision of Biology and Biological Engineering, California Institute of Technology, Pasadena, California 91125 (A.S.)
Staffan Persson Max Planck Institute for Molecular Plant Physiology, 14476 Potsdam, Germany (C.R., T.T, S.K., A.R.F., Z.N., M.M.), School of Biosciences and Australian Research Council Centre of Excellence in Plant Cell Walls, University of Melbourne, Parkville, Victoria 3010, Australia (A.M., S.P.); andDivision of Biology and Biological Engineering, California Institute of Technology, Pasadena, California 91125 (A.S.)
Marek Mutwil Max Planck Institute for Molecular Plant Physiology, 14476 Potsdam, Germany (C.R., T.T, S.K., A.R.F., Z.N., M.M.), School of Biosciences and Australian Research Council Centre of Excellence in Plant Cell Walls, University of Melbourne, Parkville, Victoria 3010, Australia (A.M., S.P.); andDivision of Biology and Biological Engineering, California Institute of Technology, Pasadena, California 91125 (A.S.)

Collapse

ADAGE-Based Integration of Publicly Available Pseudomonas aeruginosa Gene Expression Data with Denoising Autoencoders Illuminates Microbe-Host Interactions. mSystems 2016;1:mSystems00025-15. [PMID: 27822512 PMCID: PMC5069748 DOI: 10.1128/msystems.00025-15] [Citation(s) in RCA: 76] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2015] [Accepted: 12/08/2015] [Indexed: 12/21/2022] Open

Abstract

The increasing number of genome-wide assays of gene expression available from public databases presents opportunities for computational methods that facilitate hypothesis generation and biological interpretation of these data. We present an unsupervised machine learning approach, ADAGE (analysis using denoising autoencoders of gene expression), and apply it to the publicly available gene expression data compendium for Pseudomonas aeruginosa. In this approach, the machine-learned ADAGE model contained 50 nodes which we predicted would correspond to gene expression patterns across the gene expression compendium. While no biological knowledge was used during model construction, cooperonic genes had similar weights across nodes, and genes with similar weights across nodes were significantly more likely to share KEGG pathways. By analyzing newly generated and previously published microarray and transcriptome sequencing data, the ADAGE model identified differences between strains, modeled the cellular response to low oxygen, and predicted the involvement of biological processes based on low-level gene expression differences. ADAGE compared favorably with traditional principal component analysis and independent component analysis approaches in its ability to extract validated patterns, and based on our analyses, we propose that these approaches differ in the types of patterns they preferentially identify. We provide the ADAGE model with analysis of all publicly available P. aeruginosa GeneChip experiments and open source code for use with other species and settings. Extraction of consistent patterns across large-scale collections of genomic data using methods like ADAGE provides the opportunity to identify general principles and biologically important patterns in microbial biology. This approach will be particularly useful in less-well-studied microbial species. IMPORTANCE The quantity and breadth of genome-scale data sets that examine RNA expression in diverse bacterial and eukaryotic species are increasing more rapidly than for curated knowledge. Our ADAGE method integrates such data without requiring gene function, gene pathway, or experiment labeling, making practical its application to any large gene expression compendium. We built a Pseudomonas aeruginosa ADAGE model from a diverse set of publicly available experiments without any prespecified biological knowledge, and this model was accurate and predictive. We provide ADAGE results for the complete P. aeruginosa GeneChip compendium for use by researchers studying P. aeruginosa and source code that facilitates ADAGE's application to other species and data types. Author Video: An author video summary of this article is available.

Collapse

Gonzalez GH, Tahsin T, Goodale BC, Greene AC, Greene CS. Recent Advances and Emerging Applications in Text and Data Mining for Biomedical Discovery. Brief Bioinform 2015;17:33-42. [PMID: 26420781 PMCID: PMC4719073 DOI: 10.1093/bib/bbv087] [Citation(s) in RCA: 73] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2015] [Indexed: 02/06/2023] Open

Gui J, Greene CS, Sullivan C, Taylor W, Moore JH, Kim C. Testing multiple hypotheses through IMP weighted FDR based on a genetic functional network with application to a new zebrafish transcriptome study. BioData Min 2015;8:17. [PMID: 26097506 PMCID: PMC4474579 DOI: 10.1186/s13040-015-0050-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2014] [Accepted: 06/08/2015] [Indexed: 11/10/2022] Open

Wong AK, Krishnan A, Yao V, Tadych A, Troyanskaya OG. IMP 2.0: a multi-species functional genomics portal for integration, visualization and prediction of protein functions and networks. Nucleic Acids Res 2015;43:W128-33. [PMID: 25969450 PMCID: PMC4489318 DOI: 10.1093/nar/gkv486] [Citation(s) in RCA: 65] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2015] [Accepted: 05/02/2015] [Indexed: 01/08/2023] Open

Understanding multicellular function and disease with human tissue-specific networks. Nat Genet 2015;47:569-76. [PMID: 25915600 PMCID: PMC4828725 DOI: 10.1038/ng.3259] [Citation(s) in RCA: 594] [Impact Index Per Article: 59.4] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2014] [Accepted: 03/06/2015] [Indexed: 12/17/2022]

Greene AC, Giffin KA, Greene CS, Moore JH. Adapting bioinformatics curricula for big data. Brief Bioinform 2015;17:43-50. [PMID: 25829469 PMCID: PMC4719066 DOI: 10.1093/bib/bbv018] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2014] [Indexed: 12/16/2022] Open

Wangler MF, Yamamoto S, Bellen HJ. Fruit flies in biomedical research. Genetics 2015;199:639-653. [PMID: 25624315 PMCID: PMC4349060 DOI: 10.1534/genetics.114.171785] [Citation(s) in RCA: 119] [Impact Index Per Article: 11.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2014] [Accepted: 12/09/2014] [Indexed: 12/13/2022] Open

Park CY, Krishnan A, Zhu Q, Wong AK, Lee YS, Troyanskaya OG. Tissue-aware data integration approach for the inference of pathway interactions in metazoan organisms. ACTA ACUST UNITED AC 2014;31:1093-101. [PMID: 25431329 DOI: 10.1093/bioinformatics/btu786] [Citation(s) in RCA: 69] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2014] [Accepted: 11/20/2014] [Indexed: 11/12/2022]

Affiliation(s)

Christopher Y Park Department of Computer Science, Princeton University, Princeton, NJ 08544, USA, Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08540, USA and Simons Center for Data Analysis, Simons Foundation, New York, NY, 10010, USA Department of Computer Science, Princeton University, Princeton, NJ 08544, USA, Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08540, USA and Simons Center for Data Analysis, Simons Foundation, New York, NY, 10010, USA
Arjun Krishnan Department of Computer Science, Princeton University, Princeton, NJ 08544, USA, Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08540, USA and Simons Center for Data Analysis, Simons Foundation, New York, NY, 10010, USA
Qian Zhu Department of Computer Science, Princeton University, Princeton, NJ 08544, USA, Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08540, USA and Simons Center for Data Analysis, Simons Foundation, New York, NY, 10010, USA Department of Computer Science, Princeton University, Princeton, NJ 08544, USA, Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08540, USA and Simons Center for Data Analysis, Simons Foundation, New York, NY, 10010, USA
Aaron K Wong Department of Computer Science, Princeton University, Princeton, NJ 08544, USA, Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08540, USA and Simons Center for Data Analysis, Simons Foundation, New York, NY, 10010, USA Department of Computer Science, Princeton University, Princeton, NJ 08544, USA, Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08540, USA and Simons Center for Data Analysis, Simons Foundation, New York, NY, 10010, USA
Young-Suk Lee Department of Computer Science, Princeton University, Princeton, NJ 08544, USA, Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08540, USA and Simons Center for Data Analysis, Simons Foundation, New York, NY, 10010, USA Department of Computer Science, Princeton University, Princeton, NJ 08544, USA, Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08540, USA and Simons Center for Data Analysis, Simons Foundation, New York, NY, 10010, USA
Olga G Troyanskaya Department of Computer Science, Princeton University, Princeton, NJ 08544, USA, Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08540, USA and Simons Center for Data Analysis, Simons Foundation, New York, NY, 10010, USA Department of Computer Science, Princeton University, Princeton, NJ 08544, USA, Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08540, USA and Simons Center for Data Analysis, Simons Foundation, New York, NY, 10010, USA Department of Computer Science, Princeton University, Princeton, NJ 08544, USA, Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08540, USA and Simons Center for Data Analysis, Simons Foundation, New York, NY, 10010, USA

Collapse

Li HD, Menon R, Omenn GS, Guan Y. Revisiting the identification of canonical splice isoforms through integration of functional genomics and proteomics evidence. Proteomics 2014;14:2709-18. [PMID: 25265570 DOI: 10.1002/pmic.201400170] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2014] [Revised: 08/11/2014] [Accepted: 09/23/2014] [Indexed: 01/08/2023]

Joice R, Yasuda K, Shafquat A, Morgan XC, Huttenhower C. Determining microbial products and identifying molecular targets in the human microbiome. Cell Metab 2014;20:731-741. [PMID: 25440055 PMCID: PMC4254638 DOI: 10.1016/j.cmet.2014.10.003] [Citation(s) in RCA: 71] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]

Yu J, Wu H, Wen Y, Liu Y, Zhou T, Ni B, Lin Y, Dong J, Zhou Z, Hu Z, Guo X, Sha J, Tong C. Identification of seven genes essential for male fertility through a genome-wide association study of non-obstructive azoospermia and RNA interference-mediated large-scale functional screening in Drosophila. Hum Mol Genet 2014;24:1493-503. [DOI: 10.1093/hmg/ddu557] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open

Zhu F, Shi L, Li H, Eksi R, Engel JD, Guan Y. Modeling dynamic functional relationship networks and application to ex vivo human erythroid differentiation. ACTA ACUST UNITED AC 2014;30:3325-33. [PMID: 25115705 DOI: 10.1093/bioinformatics/btu542] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]

Affiliation(s)

Fan Zhu Department of Computational Medicine and Bioinformatics, Department of Cell and Developmental Biology, Department of Internal Medicine and Department of Computer Science and Engineering, University of Michigan, MI48109, USA
Lihong Shi Department of Computational Medicine and Bioinformatics, Department of Cell and Developmental Biology, Department of Internal Medicine and Department of Computer Science and Engineering, University of Michigan, MI48109, USA
Hongdong Li Department of Computational Medicine and Bioinformatics, Department of Cell and Developmental Biology, Department of Internal Medicine and Department of Computer Science and Engineering, University of Michigan, MI48109, USA
Ridvan Eksi Department of Computational Medicine and Bioinformatics, Department of Cell and Developmental Biology, Department of Internal Medicine and Department of Computer Science and Engineering, University of Michigan, MI48109, USA
James Douglas Engel Department of Computational Medicine and Bioinformatics, Department of Cell and Developmental Biology, Department of Internal Medicine and Department of Computer Science and Engineering, University of Michigan, MI48109, USA
Yuanfang Guan Department of Computational Medicine and Bioinformatics, Department of Cell and Developmental Biology, Department of Internal Medicine and Department of Computer Science and Engineering, University of Michigan, MI48109, USA Department of Computational Medicine and Bioinformatics, Department of Cell and Developmental Biology, Department of Internal Medicine and Department of Computer Science and Engineering, University of Michigan, MI48109, USA Department of Computational Medicine and Bioinformatics, Department of Cell and Developmental Biology, Department of Internal Medicine and Department of Computer Science and Engineering, University of Michigan, MI48109, USA

Collapse

Selecting biologically informative genes in co-expression networks with a centrality score. Biol Direct 2014;9:12. [PMID: 24947308 PMCID: PMC4079186 DOI: 10.1186/1745-6150-9-12] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2014] [Accepted: 06/11/2014] [Indexed: 11/10/2022] Open

Penrod NM, Greene CS, Moore JH. Predicting targeted drug combinations based on Pareto optimal patterns of coexpression network connectivity. Genome Med 2014;6:33. [PMID: 24944582 PMCID: PMC4062052 DOI: 10.1186/gm550] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2013] [Accepted: 04/22/2014] [Indexed: 01/05/2023] Open

Abstract

Background

Molecularly targeted drugs promise a safer and more effective treatment modality than conventional chemotherapy for cancer patients. However, tumors are dynamic systems that readily adapt to these agents activating alternative survival pathways as they evolve resistant phenotypes. Combination therapies can overcome resistance but finding the optimal combinations efficiently presents a formidable challenge. Here we introduce a new paradigm for the design of combination therapy treatment strategies that exploits the tumor adaptive process to identify context-dependent essential genes as druggable targets.

Methods

We have developed a framework to mine high-throughput transcriptomic data, based on differential coexpression and Pareto optimization, to investigate drug-induced tumor adaptation. We use this approach to identify tumor-essential genes as druggable candidates. We apply our method to a set of ER⁺ breast tumor samples, collected before (n = 58) and after (n = 60) neoadjuvant treatment with the aromatase inhibitor letrozole, to prioritize genes as targets for combination therapy with letrozole treatment. We validate letrozole-induced tumor adaptation through coexpression and pathway analyses in an independent data set (n = 18).

Results

We find pervasive differential coexpression between the untreated and letrozole-treated tumor samples as evidence of letrozole-induced tumor adaptation. Based on patterns of coexpression, we identify ten genes as potential candidates for combination therapy with letrozole including EPCAM, a letrozole-induced essential gene and a target to which drugs have already been developed as cancer therapeutics. Through replication, we validate six letrozole-induced coexpression relationships and confirm the epithelial-to-mesenchymal transition as a process that is upregulated in the residual tumor samples following letrozole treatment.

Conclusions

To derive the greatest benefit from molecularly targeted drugs it is critical to design combination treatment strategies rationally. Incorporating knowledge of the tumor adaptation process into the design provides an opportunity to match targeted drugs to the evolving tumor phenotype and surmount resistance.

Collapse

Li L, Cui X, Yu S, Zhang Y, Luo Z, Yang H, Zhou Y, Zheng X. PSSP-RFE: accurate prediction of protein structural class by recursive feature extraction from PSI-BLAST profile, physical-chemical property and functional annotations. PLoS One 2014;9:e92863. [PMID: 24675610 PMCID: PMC3968047 DOI: 10.1371/journal.pone.0092863] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2013] [Accepted: 02/27/2014] [Indexed: 02/05/2023] Open