1
|
Abstract
BACKGROUND Biomolecular pathways and networks are dynamic and complex, and the perturbations to them which cause disease are often multiple, heterogeneous and contingent. Pathway and network visualizations, rendered on a computer or published on paper, however, tend to be static, lacking in detail, and ill-equipped to explore the variety and quantities of data available today, and the complex causes we seek to understand. RESULTS RCytoscape integrates R (an open-ended programming environment rich in statistical power and data-handling facilities) and Cytoscape (powerful network visualization and analysis software). RCytoscape extends Cytoscape's functionality beyond what is possible with the Cytoscape graphical user interface. To illustrate the power of RCytoscape, a portion of the Glioblastoma multiforme (GBM) data set from the Cancer Genome Atlas (TCGA) is examined. Network visualization reveals previously unreported patterns in the data suggesting heterogeneous signaling mechanisms active in GBM Proneural tumors, with possible clinical relevance. CONCLUSIONS Progress in bioinformatics and computational biology depends upon exploratory and confirmatory data analysis, upon inference, and upon modeling. These activities will eventually permit the prediction and control of complex biological systems. Network visualizations--molecular maps--created from an open-ended programming environment rich in statistical power and data-handling facilities, such as RCytoscape, will play an essential role in this progression.
Collapse
Affiliation(s)
- Paul T Shannon
- Fred Hutchison Cancer Research Institute, Seattle Washington, and the Institute for Systems Biology, 401 Terry Ave. N, Seattle, WA, USA
- Institute for Systems Biology, 401 Terry Ave. N, Seattle, WA, USA
| | - Mark Grimes
- Division of Biological Sciences, Center for Structural and Functional Neuroscience, University of Montana, Missoula, MT, USA
| | - Burak Kutlu
- Institute for Systems Biology, 401 Terry Ave. N, Seattle, WA, USA
| | - Jan J Bot
- Delft University of Technology, Delft Bioinformatics Lab, Delft, The Netherlands
| | - David J Galas
- Pacific Northwest Diabetes Research Institute, 720 Broadway, Seattle, WA 98120, USA
| |
Collapse
|
2
|
Roach JC, Glusman G, Smit AF, Huff CD, Hubley R, Shannon PT, Rowen L, Pant KP, Goodman N, Bamshad M, Shendure J, Drmanac R, Jorde LB, Hood L, Galas DJ. Analysis of genetic inheritance in a family quartet by whole-genome sequencing. Science 2010; 328:636-9. [PMID: 20220176 PMCID: PMC3037280 DOI: 10.1126/science.1186802] [Citation(s) in RCA: 729] [Impact Index Per Article: 52.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
We analyzed the whole-genome sequences of a family of four, consisting of two siblings and their parents. Family-based sequencing allowed us to delineate recombination sites precisely, identify 70% of the sequencing errors (resulting in > 99.999% accuracy), and identify very rare single-nucleotide polymorphisms. We also directly estimated a human intergeneration mutation rate of approximately 1.1 x 10(-8) per position per haploid genome. Both offspring in this family have two recessive disorders: Miller syndrome, for which the gene was concurrently identified, and primary ciliary dyskinesia, for which causative genes have been previously identified. Family-based genome analysis enabled us to narrow the candidate genes for both of these Mendelian disorders to only four. Our results demonstrate the value of complete genome sequencing in families.
Collapse
Affiliation(s)
| | | | | | - Chad D. Huff
- Institute for Systems Biology, Seattle, WA 98103
- Department of Human Genetics, Eccles Institute of Human Genetics, University of Utah, Salt Lake City, Utah 84112, USA
| | | | | | - Lee Rowen
- Institute for Systems Biology, Seattle, WA 98103
| | | | | | - Michael Bamshad
- Department of Pediatrics, University of Washington, Seattle WA 98195
| | - Jay Shendure
- Department of Genome Sciences, University of Washington, Seattle WA 98195
| | | | - Lynn B. Jorde
- Department of Human Genetics, Eccles Institute of Human Genetics, University of Utah, Salt Lake City, Utah 84112, USA
| | - Leroy Hood
- Institute for Systems Biology, Seattle, WA 98103
| | | |
Collapse
|
3
|
Saleem RA, Rogers RS, Ratushny AV, Dilworth DJ, Shannon PT, Shteynberg D, Wan Y, Moritz RL, Nesvizhskii AI, Rachubinski RA, Aitchison JD. Integrated phosphoproteomics analysis of a signaling network governing nutrient response and peroxisome induction. Mol Cell Proteomics 2010; 9:2076-88. [PMID: 20395639 DOI: 10.1074/mcp.m000116-mcp201] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Phosphorylation of proteins is a key posttranslational modification in cellular signaling, regulating many aspects of cellular responses. We used a quantitative, integrated, phosphoproteomics approach to characterize the cellular responses of the yeast Saccharomyces cerevisiae to the fatty acid oleic acid, a molecule with broad human health implications and a potent inducer of peroxisomes. A combination of cryolysis and urea solubilization was used to minimize the opportunity for reorientation of the phosphoproteome, and hydrophilic interaction liquid chromatography and IMAC chemistries were used to fractionate and enrich for phosphopeptides. Using these approaches, numerous phosphorylated peptides specific to oleate-induced and glucose-repressed conditions were identified and mapped to known signaling pathways. These include several transcription factors, two of which, Pip2p and Cst6p, must be phosphorylated for the normal transcriptional response of fatty acid-responsive loci encoding peroxisomal proteins. The phosphoproteome data were integrated with results from genome-wide assays studying the effects of signaling molecule deletions and known protein-protein interactions to generate a putative fatty acid-responsive signaling network. In this network, the most highly connected nodes are those with the largest effects on cellular responses to oleic acid. These properties are consistent with a scale-free topology, demonstrating that scale-free properties are conserved in condition-specific networks.
Collapse
Affiliation(s)
- Ramsey A Saleem
- Institute for Systems Biology, Seattle, Washington 98103, USA
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
4
|
Ng SB, Buckingham KJ, Lee C, Bigham AW, Tabor HK, Dent KM, Huff CD, Shannon PT, Jabs EW, Nickerson DA, Shendure J, Bamshad MJ. Exome sequencing identifies the cause of a mendelian disorder. Nat Genet 2009; 42:30-5. [PMID: 19915526 PMCID: PMC2847889 DOI: 10.1038/ng.499] [Citation(s) in RCA: 1363] [Impact Index Per Article: 90.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2009] [Accepted: 11/09/2009] [Indexed: 12/15/2022]
Abstract
We demonstrate the first successful application of exome sequencing to discover the gene for a rare, Mendelian disorder of unknown cause, Miller syndrome (OMIM %263750). For four affected individuals in three independent kindreds, we captured and sequenced coding regions to a mean coverage of 40X, and sufficient depth to call variants at ~97% of each targeted exome. Filtering against public SNP databases and a small number of HapMap exomes for genes with two novel variants in each of the four cases identified a single candidate gene, DHODH, which encodes a key enzyme in the pyrimidine de novo biosynthesis pathway. Sanger sequencing confirmed the presence of DHODH mutations in three additional families with Miller syndrome. Exome sequencing of a small number of unrelated, affected individuals is a powerful, efficient strategy for identifying the genes underlying rare Mendelian disorders and will likely transform the genetic analysis of monogenic traits.
Collapse
Affiliation(s)
- Sarah B Ng
- Department of Genome Sciences, University of Washington, Seattle, Washington, USA
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
5
|
Bare JC, Shannon PT, Schmid AK, Baliga NS. The Firegoose: two-way integration of diverse data from different bioinformatics web resources with desktop applications. BMC Bioinformatics 2007; 8:456. [PMID: 18021453 PMCID: PMC2211326 DOI: 10.1186/1471-2105-8-456] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2007] [Accepted: 11/19/2007] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Information resources on the World Wide Web play an indispensable role in modern biology. But integrating data from multiple sources is often encumbered by the need to reformat data files, convert between naming systems, or perform ongoing maintenance of local copies of public databases. Opportunities for new ways of combining and re-using data are arising as a result of the increasing use of web protocols to transmit structured data. RESULTS The Firegoose, an extension to the Mozilla Firefox web browser, enables data transfer between web sites and desktop tools. As a component of the Gaggle integration framework, Firegoose can also exchange data with Cytoscape, the R statistical package, Multiexperiment Viewer (MeV), and several other popular desktop software tools. Firegoose adds the capability to easily use local data to query KEGG, EMBL STRING, DAVID, and other widely-used bioinformatics web sites. Query results from these web sites can be transferred to desktop tools for further analysis with a few clicks. Firegoose acquires data from the web by screen scraping, microformats, embedded XML, or web services. We define a microformat, which allows structured information compatible with the Gaggle to be embedded in HTML documents. We demonstrate the capabilities of this software by performing an analysis of the genes activated in the microbe Halobacterium salinarum NRC-1 in response to anaerobic environments. Starting with microarray data, we explore functions of differentially expressed genes by combining data from several public web resources and construct an integrated view of the cellular processes involved. CONCLUSION The Firegoose incorporates Mozilla Firefox into the Gaggle environment and enables interactive sharing of data between diverse web resources and desktop software tools without maintaining local copies. Additional web sites can be incorporated easily into the framework using the scripting platform of the Firefox browser. Performing data integration in the browser allows the excellent search and navigation capabilities of the browser to be used in combination with powerful desktop tools.
Collapse
Affiliation(s)
- J Christopher Bare
- Institute for Systems Biology, 1441 N 34th Street, Seattle, WA 98103, USA.
| | | | | | | |
Collapse
|
6
|
Shannon PT, Reiss DJ, Bonneau R, Baliga NS. The Gaggle: an open-source software system for integrating bioinformatics software and data sources. BMC Bioinformatics 2006; 7:176. [PMID: 16569235 PMCID: PMC1464137 DOI: 10.1186/1471-2105-7-176] [Citation(s) in RCA: 122] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2005] [Accepted: 03/28/2006] [Indexed: 01/16/2023] Open
Abstract
Background Systems biologists work with many kinds of data, from many different sources, using a variety of software tools. Each of these tools typically excels at one type of analysis, such as of microarrays, of metabolic networks and of predicted protein structure. A crucial challenge is to combine the capabilities of these (and other forthcoming) data resources and tools to create a data exploration and analysis environment that does justice to the variety and complexity of systems biology data sets. A solution to this problem should recognize that data types, formats and software in this high throughput age of biology are constantly changing. Results In this paper we describe the Gaggle -a simple, open-source Java software environment that helps to solve the problem of software and database integration. Guided by the classic software engineering strategy of separation of concerns and a policy of semantic flexibility, it integrates existing popular programs and web resources into a user-friendly, easily-extended environment. We demonstrate that four simple data types (names, matrices, networks, and associative arrays) are sufficient to bring together diverse databases and software. We highlight some capabilities of the Gaggle with an exploration of Helicobacter pylori pathogenesis genes, in which we identify a putative ricin-like protein -a discovery made possible by simultaneous data exploration using a wide range of publicly available data and a variety of popular bioinformatics software tools. Conclusion We have integrated diverse databases (for example, KEGG, BioCyc, String) and software (Cytoscape, DataMatrixViewer, R statistical environment, and TIGR Microarray Expression Viewer). Through this loose coupling of diverse software and databases the Gaggle enables simultaneous exploration of experimental data (mRNA and protein abundance, protein-protein and protein-DNA interactions), functional associations (operon, chromosomal proximity, phylogenetic pattern), metabolic pathways (KEGG) and Pubmed abstracts (STRING web resource), creating an exploratory environment useful to 'web browser and spreadsheet biologists', to statistically savvy computational biologists, and those in between. The Gaggle uses Java RMI and Java Web Start technologies and can be found at .
Collapse
Affiliation(s)
- Paul T Shannon
- Institute for Systems Biology, 1441 N 34Street, Seattle, WA 98103, USA
| | - David J Reiss
- Institute for Systems Biology, 1441 N 34Street, Seattle, WA 98103, USA
| | - Richard Bonneau
- Institute for Systems Biology, 1441 N 34Street, Seattle, WA 98103, USA
- Department of Biology, New York University, 100 Washington Square E, New York, NY 10003, USA
| | - Nitin S Baliga
- Institute for Systems Biology, 1441 N 34Street, Seattle, WA 98103, USA
| |
Collapse
|
7
|
Ginzberg HH, Cherapanov V, Dong Q, Cantin A, McCulloch CA, Shannon PT, Downey GP. Neutrophil-mediated epithelial injury during transmigration: role of elastase. Am J Physiol Gastrointest Liver Physiol 2001; 281:G705-17. [PMID: 11518683 DOI: 10.1152/ajpgi.2001.281.3.g705] [Citation(s) in RCA: 131] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Abstract
Neutrophil-mediated injury to gut epithelium may lead to disruption of the epithelial barrier function with consequent organ dysfunction, but the mechanisms of this are incompletely characterized. Because the epithelial apical junctional complex, comprised of tight and adherens junctions, is responsible in part for this barrier function, we investigated the effects of neutrophil transmigration on these structures. Using a colonic epithelial cell line, we observed that neutrophils migrating across cell monolayers formed clusters that were associated with focal epithelial cell loss and the creation of circular defects within the monolayer. The loss of epithelial cells was partly attributable to neutrophil-derived proteases, likely elastase, because it was prevented by elastase inhibitors. Spatially delimited disruption of epithelial junctional complexes with focal loss of E-cadherin, beta-catenin, and zonula occludens 1 was observed adjacent to clusters of transmigrating neutrophils. During neutrophil transmigration, fragments of E-cadherin were released into the apical supernatant, and inhibitors of neutrophil elastase prevented this proteolytic degradation. Addition of purified leukocyte elastase also resulted in release of E-cadherin fragments, but only after opening of tight junctions. Taken together, these data demonstrate that neutrophil-derived proteases can mediate spatially delimited disruption of epithelial apical junctions during transmigration. These processes may contribute to epithelial loss and disruption of epithelial barrier function in inflammatory diseases.
Collapse
Affiliation(s)
- H H Ginzberg
- Department of Pediatrics, Division of Gastroenterology and Nutrition, The University of Toronto, Toronto, Ontario, Canada M5S 1A8
| | | | | | | | | | | | | |
Collapse
|
8
|
Jones NL, Shannon PT, Cutz E, Yeger H, Sherman PM. Increase in proliferation and apoptosis of gastric epithelial cells early in the natural history of Helicobacter pylori infection. Am J Pathol 1997; 151:1695-703. [PMID: 9403720 PMCID: PMC1858359] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
Childhood acquisition of Helicobacter pylori is a critical risk factor for gastric cancer. Since tumorigenesis involves deregulation of proliferation and apoptosis, we examined gastric epithelial cell proliferation and apoptosis in H. pylori-infected children. Apoptosis and proliferation of gastric antral epithelial cells in biopsy specimens from patients with H. pylori-induced gastritis, secondary gastritis, and noninflamed controls were compared. p53 protein expression was examined immunohistochemically. Apoptotic cells were identified in the surface epithelium in each group. The apoptotic index was higher in specimens from patients with H. pylori gastritis (120 +/- 10) than secondary gastritis (50 +/- 10) and noninflamed controls (40 +/- 10, analysis of variance P < 0.005). Apoptosis decreased following H. pylori eradication and resolution of gastritis (P < 0.02). An expanded proliferative compartment was identified in H. pylori-induced gastritis (32.4 +/- 3.5; proliferative labeling index +/- SE) compared with secondary gastritis (18.9 +/- 2.8) and noninflamed controls (13.7 +/- 3.1, analysis of variance P < 0.01). The accelerated cell turnover was associated with p53 overexpression (analysis of variance P < 0.005). Accumulation of p53 was not associated with expression of the cyclin-dependent kinase inhibitor p21. The occurrence of altered cell turnover early in the natural history of chronic infection provides an explanation for the increased risk of gastric cancer development associated with childhood acquisition of infection.
Collapse
Affiliation(s)
- N L Jones
- Research Institute, The Hospital for Sick Children, and the Department of Pediatrics, University of Toronto, Ontario, Canada
| | | | | | | | | |
Collapse
|
9
|
|
10
|
|
11
|
Shannon PT, O'Neill PS. Enthalpy of nonideal binary solutions:T-butanol-benzene,T-butanol-water. AIChE J 1963. [DOI: 10.1002/aic.690090218] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
|
12
|
Shannon PT, Gustafson DB, O'Neill PS. Enthalpy of nonideal solutions:n-butanol-benzene. AIChE J 1963. [DOI: 10.1002/aic.690090114] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
|
13
|
|