1
|
Abstract
We present a computational approach for studying the effect of potential drug combinations on the protein networks associated with tumor cells. The majority of therapeutics are designed to target single proteins, yet most diseased states are characterized by a combination of many interacting genes and proteins. Using the topology of protein-protein interaction networks, our methods can explicitly model the possible synergistic effect of targeting multiple proteins using drug combinations in different cancer types. The methodology can be conceptually split into two distinct stages. Firstly, we integrate protein interaction and gene expression data to develop network representations of different tissue types and cancer types. Secondly, we model network perturbations to search for target combinations which cause significant damage to a relevant cancer network but only minimal damage to an equivalent normal network. We have developed sets of predicted target and drug combinations for multiple cancer types, which are validated using known cancer and drug associations, and are currently in experimental testing for prostate cancer. Our methods also revealed significant bias in curated interaction data sources towards targets with associations compared with high-throughput data sources from model organisms. The approach developed can potentially be applied to many other diseased cell types.
Collapse
|
2
|
Radovich M, Clare SE, Sledge GW, Pardo I, Mathieson T, Kassem N, Hancock BA, Storniolo AMV, Rufenbarger C, Lillemoe HA, Sun J, Henry JE, Goulet R, Hilligoss EE, Siddiqui AS, Breu H, Sakarya O, Hyland FC, Muller MW, Popescu L, Zhu J, Hickenbotham M, Glasscock J, Ivan M, Liu Y, Schneider BP. Abstract PD01-08: Decoding the Transcriptional Landscape of Triple-Negative Breast Cancer Using Next-Generation Whole Transcriptome Sequencing. Cancer Res 2010. [DOI: 10.1158/0008-5472.sabcs10-pd01-08] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Abstract
Background: Triple-negative breast cancer (TNBC) has been plagued by the absence of targeted therapies. Discovery of therapeutic targets in TNBC has in part, been hampered by an inadequate understanding of the transcriptional biology of the normal breast as an optimal comparator. Using next-generation sequencing, we embarked on a study to compare the transcriptomes of TNBC and normal breast to comprehensively identify novel targets by analyzing all full length transcripts expressed in these tissues.
Methods: Normal breast tissues from healthy pre-menopausal volunteers with no history of disease were procured from the Susan G. Komen for the Cure® Tissue Bank at the IU Simon Cancer Center. To eliminate bias from stromal tissue, normal tissues were laser capture microdissected for ductal epithelium. cDNA libraries from 10 TNBC tumors and 10 normal breast tissues were sequenced on an Applied Biosystems (AB) SOLiD3 sequencer using 50bp fragment runs. For gene expression, mapping of reads to the genome was performed using the AB BioScope 1.2 Pipeline and outputs imported into Partek Genomics Suite for analysis. In Partek, mapped reads were cross-referenced against known genes from the UCSC database followed by statistical comparison of RPKM values for each gene between TNBC and normal. Dimensionality reduction analyses (PCA & Hierarchical clustering) and identification of Novel Transcribed Regions were also performed in Partek, whereas construction of gene networks was performed using Ingenuity Pathway Analysis. To identify gene fusions, partially mapped reads were interrogated utilizing a novel algorithm that searched for reads spanning exons from two different genes. Fusions that were supported by at least 3 reads (of which 2 had to be unique) were considered candidates and were subsequently validated. Results/Discussion: Sequencing produced 1.1 billion reads equaling 57.3GB of data of which 36.0GB (63%) mapped to the human genome. In comparing RPKM values between TNBC and Normal, we report 7140 RefSeq Genes, 22 pre-miRNAs, 109 lincRNA exons, and 15 ultraconserved regions that were differentially expressed between these tissues (FDR<0.01). Biological interpretation of these results reveals upregulation of genes and miRNAs involved in DNA repair, angiogenesis, and inhibitors of Estrogen Receptor-alpha. Some previous drug targets (e.g. EGFR and c-kit) were not found to be upregulated here which may explain lack of clinical success to date. Conversely, PARP was significantly upregulated and early trial results suggest a strong signal for efficacy with inhibition of PARP. We also surveyed the genome for Novel Transcribed Regions (NTRs), defined as areas of significant transcription where no annotated gene is present. When comparing between TNBC and Normal, we report 6408 NTRs to be differentially expressed (FDR<0.01). Lastly, when analyzing the dataset for gene fusions, we identified several gene fusions in the TNBC samples, though no individual fusion was present in more than one sample.
Conclusion: We report an extensive comparison of the transcriptomes of TNBC and normal ductal epithelium. We identified numerous genes previously unknown to be dysregulated in TNBC that can be utilized for therapeutic discovery.
Citation Information: Cancer Res 2010;70(24 Suppl):Abstract nr PD01-08.
Collapse
Affiliation(s)
- M Radovich
- Indiana University School of Medicine, Indianapolis, IN; Life Technologies, Inc, Foster City, CA; Cofactor Genomics, LLC, St. Louis, MO
| | - SE Clare
- Indiana University School of Medicine, Indianapolis, IN; Life Technologies, Inc, Foster City, CA; Cofactor Genomics, LLC, St. Louis, MO
| | - GW Sledge
- Indiana University School of Medicine, Indianapolis, IN; Life Technologies, Inc, Foster City, CA; Cofactor Genomics, LLC, St. Louis, MO
| | - I Pardo
- Indiana University School of Medicine, Indianapolis, IN; Life Technologies, Inc, Foster City, CA; Cofactor Genomics, LLC, St. Louis, MO
| | - T Mathieson
- Indiana University School of Medicine, Indianapolis, IN; Life Technologies, Inc, Foster City, CA; Cofactor Genomics, LLC, St. Louis, MO
| | - N Kassem
- Indiana University School of Medicine, Indianapolis, IN; Life Technologies, Inc, Foster City, CA; Cofactor Genomics, LLC, St. Louis, MO
| | - BA Hancock
- Indiana University School of Medicine, Indianapolis, IN; Life Technologies, Inc, Foster City, CA; Cofactor Genomics, LLC, St. Louis, MO
| | - AMV Storniolo
- Indiana University School of Medicine, Indianapolis, IN; Life Technologies, Inc, Foster City, CA; Cofactor Genomics, LLC, St. Louis, MO
| | - C Rufenbarger
- Indiana University School of Medicine, Indianapolis, IN; Life Technologies, Inc, Foster City, CA; Cofactor Genomics, LLC, St. Louis, MO
| | - HA Lillemoe
- Indiana University School of Medicine, Indianapolis, IN; Life Technologies, Inc, Foster City, CA; Cofactor Genomics, LLC, St. Louis, MO
| | - J Sun
- Indiana University School of Medicine, Indianapolis, IN; Life Technologies, Inc, Foster City, CA; Cofactor Genomics, LLC, St. Louis, MO
| | - JE Henry
- Indiana University School of Medicine, Indianapolis, IN; Life Technologies, Inc, Foster City, CA; Cofactor Genomics, LLC, St. Louis, MO
| | - R Goulet
- Indiana University School of Medicine, Indianapolis, IN; Life Technologies, Inc, Foster City, CA; Cofactor Genomics, LLC, St. Louis, MO
| | - EE Hilligoss
- Indiana University School of Medicine, Indianapolis, IN; Life Technologies, Inc, Foster City, CA; Cofactor Genomics, LLC, St. Louis, MO
| | - AS Siddiqui
- Indiana University School of Medicine, Indianapolis, IN; Life Technologies, Inc, Foster City, CA; Cofactor Genomics, LLC, St. Louis, MO
| | - H Breu
- Indiana University School of Medicine, Indianapolis, IN; Life Technologies, Inc, Foster City, CA; Cofactor Genomics, LLC, St. Louis, MO
| | - O Sakarya
- Indiana University School of Medicine, Indianapolis, IN; Life Technologies, Inc, Foster City, CA; Cofactor Genomics, LLC, St. Louis, MO
| | - FC Hyland
- Indiana University School of Medicine, Indianapolis, IN; Life Technologies, Inc, Foster City, CA; Cofactor Genomics, LLC, St. Louis, MO
| | - MW Muller
- Indiana University School of Medicine, Indianapolis, IN; Life Technologies, Inc, Foster City, CA; Cofactor Genomics, LLC, St. Louis, MO
| | - L Popescu
- Indiana University School of Medicine, Indianapolis, IN; Life Technologies, Inc, Foster City, CA; Cofactor Genomics, LLC, St. Louis, MO
| | - J Zhu
- Indiana University School of Medicine, Indianapolis, IN; Life Technologies, Inc, Foster City, CA; Cofactor Genomics, LLC, St. Louis, MO
| | - M Hickenbotham
- Indiana University School of Medicine, Indianapolis, IN; Life Technologies, Inc, Foster City, CA; Cofactor Genomics, LLC, St. Louis, MO
| | - J Glasscock
- Indiana University School of Medicine, Indianapolis, IN; Life Technologies, Inc, Foster City, CA; Cofactor Genomics, LLC, St. Louis, MO
| | - M Ivan
- Indiana University School of Medicine, Indianapolis, IN; Life Technologies, Inc, Foster City, CA; Cofactor Genomics, LLC, St. Louis, MO
| | - Y Liu
- Indiana University School of Medicine, Indianapolis, IN; Life Technologies, Inc, Foster City, CA; Cofactor Genomics, LLC, St. Louis, MO
| | - BP. Schneider
- Indiana University School of Medicine, Indianapolis, IN; Life Technologies, Inc, Foster City, CA; Cofactor Genomics, LLC, St. Louis, MO
| |
Collapse
|
3
|
Quayle AP, Siddiqui AS, Jones SJM. Perturbation of interaction networks for application to cancer therapy. Cancer Inform 2007; 5:45-65. [PMID: 19390668 PMCID: PMC2666951] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/29/2022] Open
Abstract
We present a computational approach for studying the effect of potential drug combinations on the protein networks associated with tumor cells. The majority of therapeutics are designed to target single proteins, yet most diseased states are characterized by a combination of many interacting genes and proteins. Using the topology of protein-protein interaction networks, our methods can explicitly model the possible synergistic effect of targeting multiple proteins using drug combinations in different cancer types. The methodology can be conceptually split into two distinct stages. Firstly, we integrate protein interaction and gene expression data to develop network representations of different tissue types and cancer types. Secondly, we model network perturbations to search for target combinations which cause significant damage to a relevant cancer network but only minimal damage to an equivalent normal network. We have developed sets of predicted target and drug combinations for multiple cancer types, which are validated using known cancer and drug associations, and are currently in experimental testing for prostate cancer. Our methods also revealed significant bias in curated interaction data sources towards targets with associations compared with high-throughput data sources from model organisms. The approach developed can potentially be applied to many other diseased cell types.
Collapse
Affiliation(s)
| | | | - Steven J. M. Jones
- Correspondence: Dr Steven Jones, Genome Sciences Centre, BC Cancer Agency, 675 West 10th Avenue, Vancouver, BC. V5Z 1L3. Tel: 604-675-8170;
| |
Collapse
|
4
|
McLeod MP, Warren RL, Hsiao WWL, Araki N, Myhre M, Fernandes C, Miyazawa D, Wong W, Lillquist AL, Wang D, Dosanjh M, Hara H, Petrescu A, Morin RD, Yang G, Stott JM, Schein JE, Shin H, Smailus D, Siddiqui AS, Marra MA, Jones SJM, Holt R, Brinkman FSL, Miyauchi K, Fukuda M, Davies JE, Mohn WW, Eltis LD. The complete genome of Rhodococcus sp. RHA1 provides insights into a catabolic powerhouse. Proc Natl Acad Sci U S A 2006; 103:15582-7. [PMID: 17030794 PMCID: PMC1622865 DOI: 10.1073/pnas.0607048103] [Citation(s) in RCA: 431] [Impact Index Per Article: 23.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Rhodococcus sp. RHA1 (RHA1) is a potent polychlorinated biphenyl-degrading soil actinomycete that catabolizes a wide range of compounds and represents a genus of considerable industrial interest. RHA1 has one of the largest bacterial genomes sequenced to date, comprising 9,702,737 bp (67% G+C) arranged in a linear chromosome and three linear plasmids. A targeted insertion methodology was developed to determine the telomeric sequences. RHA1's 9,145 predicted protein-encoding genes are exceptionally rich in oxygenases (203) and ligases (192). Many of the oxygenases occur in the numerous pathways predicted to degrade aromatic compounds (30) or steroids (4). RHA1 also contains 24 nonribosomal peptide synthase genes, six of which exceed 25 kbp, and seven polyketide synthase genes, providing evidence that rhodococci harbor an extensive secondary metabolism. Among sequenced genomes, RHA1 is most similar to those of nocardial and mycobacterial strains. The genome contains few recent gene duplications. Moreover, three different analyses indicate that RHA1 has acquired fewer genes by recent horizontal transfer than most bacteria characterized to date and far fewer than Burkholderia xenovorans LB400, whose genome size and catabolic versatility rival those of RHA1. RHA1 and LB400 thus appear to demonstrate that ecologically similar bacteria can evolve large genomes by different means. Overall, RHA1 appears to have evolved to simultaneously catabolize a diverse range of plant-derived compounds in an O(2)-rich environment. In addition to establishing RHA1 as an important model for studying actinomycete physiology, this study provides critical insights that facilitate the exploitation of these industrially important microorganisms.
Collapse
Affiliation(s)
- Michael P. McLeod
- *Department of Microbiology and Immunology, Life Sciences Institute, University of British Columbia, Vancouver, BC, Canada V6T 1Z3
| | - René L. Warren
- Michael Smith Genome Sciences Centre, Vancouver, BC, Canada V5Z 1L3
| | - William W. L. Hsiao
- Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, BC, Canada V5A 1S6; and
| | - Naoto Araki
- Department of Bioengineering, Nagaoka University of Technology, Nagaoka 940-2118, Japan
| | - Matthew Myhre
- *Department of Microbiology and Immunology, Life Sciences Institute, University of British Columbia, Vancouver, BC, Canada V6T 1Z3
| | - Clinton Fernandes
- *Department of Microbiology and Immunology, Life Sciences Institute, University of British Columbia, Vancouver, BC, Canada V6T 1Z3
| | - Daisuke Miyazawa
- *Department of Microbiology and Immunology, Life Sciences Institute, University of British Columbia, Vancouver, BC, Canada V6T 1Z3
| | - Wendy Wong
- *Department of Microbiology and Immunology, Life Sciences Institute, University of British Columbia, Vancouver, BC, Canada V6T 1Z3
| | - Anita L. Lillquist
- *Department of Microbiology and Immunology, Life Sciences Institute, University of British Columbia, Vancouver, BC, Canada V6T 1Z3
| | - Dennis Wang
- *Department of Microbiology and Immunology, Life Sciences Institute, University of British Columbia, Vancouver, BC, Canada V6T 1Z3
| | - Manisha Dosanjh
- *Department of Microbiology and Immunology, Life Sciences Institute, University of British Columbia, Vancouver, BC, Canada V6T 1Z3
| | - Hirofumi Hara
- *Department of Microbiology and Immunology, Life Sciences Institute, University of British Columbia, Vancouver, BC, Canada V6T 1Z3
| | - Anca Petrescu
- Michael Smith Genome Sciences Centre, Vancouver, BC, Canada V5Z 1L3
| | - Ryan D. Morin
- Michael Smith Genome Sciences Centre, Vancouver, BC, Canada V5Z 1L3
| | - George Yang
- Michael Smith Genome Sciences Centre, Vancouver, BC, Canada V5Z 1L3
| | - Jeff M. Stott
- Michael Smith Genome Sciences Centre, Vancouver, BC, Canada V5Z 1L3
| | | | - Heesun Shin
- Michael Smith Genome Sciences Centre, Vancouver, BC, Canada V5Z 1L3
| | - Duane Smailus
- Michael Smith Genome Sciences Centre, Vancouver, BC, Canada V5Z 1L3
| | - Asim S. Siddiqui
- Michael Smith Genome Sciences Centre, Vancouver, BC, Canada V5Z 1L3
| | - Marco A. Marra
- Michael Smith Genome Sciences Centre, Vancouver, BC, Canada V5Z 1L3
| | | | - Robert Holt
- Michael Smith Genome Sciences Centre, Vancouver, BC, Canada V5Z 1L3
| | - Fiona S. L. Brinkman
- Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, BC, Canada V5A 1S6; and
| | - Keisuke Miyauchi
- Department of Bioengineering, Nagaoka University of Technology, Nagaoka 940-2118, Japan
| | - Masao Fukuda
- Department of Bioengineering, Nagaoka University of Technology, Nagaoka 940-2118, Japan
| | - Julian E. Davies
- *Department of Microbiology and Immunology, Life Sciences Institute, University of British Columbia, Vancouver, BC, Canada V6T 1Z3
| | - William W. Mohn
- *Department of Microbiology and Immunology, Life Sciences Institute, University of British Columbia, Vancouver, BC, Canada V6T 1Z3
| | - Lindsay D. Eltis
- *Department of Microbiology and Immunology, Life Sciences Institute, University of British Columbia, Vancouver, BC, Canada V6T 1Z3
- To whom correspondence should be addressed. E-mail:
| |
Collapse
|
5
|
Bainbridge MN, Warren RL, Hirst M, Romanuik T, Zeng T, Go A, Delaney A, Griffith M, Hickenbotham M, Magrini V, Mardis ER, Sadar MD, Siddiqui AS, Marra MA, Jones SJM. Analysis of the prostate cancer cell line LNCaP transcriptome using a sequencing-by-synthesis approach. BMC Genomics 2006; 7:246. [PMID: 17010196 PMCID: PMC1592491 DOI: 10.1186/1471-2164-7-246] [Citation(s) in RCA: 131] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2006] [Accepted: 09/29/2006] [Indexed: 11/10/2022] Open
Abstract
Background High throughput sequencing-by-synthesis is an emerging technology that allows the rapid production of millions of bases of data. Although the sequence reads are short, they can readily be used for re-sequencing. By re-sequencing the mRNA products of a cell, one may rapidly discover polymorphisms and splice variants particular to that cell. Results We present the utility of massively parallel sequencing by synthesis for profiling the transcriptome of a human prostate cancer cell-line, LNCaP, that has been treated with the synthetic androgen, R1881. Through the generation of approximately 20 megabases (MB) of EST data, we detect transcription from over 10,000 gene loci, 25 previously undescribed alternative splicing events involving known exons, and over 1,500 high quality single nucleotide discrepancies with the reference human sequence. Further, we map nearly 10,000 ESTs to positions on the genome where no transcription is currently predicted to occur. We also characterize various obstacles with using sequencing by synthesis for transcriptome analysis and propose solutions to these problems. Conclusion The use of high-throughput sequencing-by-synthesis methods for transcript profiling allows the specific and sensitive detection of many of a cell's transcripts, and also allows the discovery of high quality base discrepancies, and alternative splice variants. Thus, this technology may provide an effective means of understanding various disease states, discovering novel targets for disease treatment, and discovery of novel transcripts.
Collapse
MESH Headings
- Adenocarcinoma/genetics
- Adenocarcinoma/pathology
- Alternative Splicing
- Androgens
- Cell Line, Tumor/chemistry
- Cell Line, Tumor/drug effects
- Chromosome Mapping
- Chromosomes, Human/genetics
- DNA, Complementary/genetics
- Exons/genetics
- Expressed Sequence Tags
- Gene Expression Profiling/methods
- Gene Expression Regulation, Neoplastic/drug effects
- Humans
- Male
- Metribolone/pharmacology
- Neoplasms, Hormone-Dependent/genetics
- Neoplasms, Hormone-Dependent/pathology
- Polymorphism, Single Nucleotide
- Prostatic Neoplasms/genetics
- Prostatic Neoplasms/pathology
- RNA, Messenger/genetics
- RNA, Neoplasm/genetics
- Sequence Alignment
- Sequence Analysis, DNA/methods
- Sequence Homology, Nucleic Acid
- Transcription, Genetic
Collapse
Affiliation(s)
- Matthew N Bainbridge
- British Columbia Cancer Agency (BCCA) Genome Sciences Centre, Vancouver, British Columbia, Canada
| | - René L Warren
- British Columbia Cancer Agency (BCCA) Genome Sciences Centre, Vancouver, British Columbia, Canada
| | - Martin Hirst
- British Columbia Cancer Agency (BCCA) Genome Sciences Centre, Vancouver, British Columbia, Canada
| | - Tammy Romanuik
- British Columbia Cancer Agency (BCCA) Genome Sciences Centre, Vancouver, British Columbia, Canada
| | - Thomas Zeng
- British Columbia Cancer Agency (BCCA) Genome Sciences Centre, Vancouver, British Columbia, Canada
| | - Anne Go
- British Columbia Cancer Agency (BCCA) Genome Sciences Centre, Vancouver, British Columbia, Canada
| | - Allen Delaney
- British Columbia Cancer Agency (BCCA) Genome Sciences Centre, Vancouver, British Columbia, Canada
| | - Malachi Griffith
- British Columbia Cancer Agency (BCCA) Genome Sciences Centre, Vancouver, British Columbia, Canada
| | - Matthew Hickenbotham
- Washington University School of Medicine, Genome Sequencing Center, St. Louis, Missouri 63108, USA
| | - Vincent Magrini
- Washington University School of Medicine, Genome Sequencing Center, St. Louis, Missouri 63108, USA
| | - Elaine R Mardis
- Washington University School of Medicine, Genome Sequencing Center, St. Louis, Missouri 63108, USA
| | - Marianne D Sadar
- British Columbia Cancer Agency (BCCA) Genome Sciences Centre, Vancouver, British Columbia, Canada
| | - Asim S Siddiqui
- British Columbia Cancer Agency (BCCA) Genome Sciences Centre, Vancouver, British Columbia, Canada
| | - Marco A Marra
- British Columbia Cancer Agency (BCCA) Genome Sciences Centre, Vancouver, British Columbia, Canada
| | - Steven JM Jones
- British Columbia Cancer Agency (BCCA) Genome Sciences Centre, Vancouver, British Columbia, Canada
| |
Collapse
|
6
|
Abstract
We present the results of a simple, statistical assay that measures the G+C content sensitivity bias of gene expression experiments without the requirement of a duplicate experiment. We analyse five gene expression profiling methods: Affymetrix GeneChip, Long Serial Analysis of Gene Expression (LongSAGE), LongSAGELite, 'Classic' Massively Parallel Signature Sequencing (MPSS) and 'Signature' MPSS. We demonstrate the methods have systematic and random errors leading to a different G+C content sensitivity. The relationship between this experimental error and the G+C content of the probe set or tag that identifies each gene influences whether the gene is detected and, if detected, the level of gene expression measured. LongSAGE has the least bias, while Signature MPSS shows a strong bias to G+C rich tags and Affymetrix data show different bias depending on the data processing method (MAS 5.0, RMA or GC-RMA). The bias in the Affymetrix data primarily impacts genes expressed at lower levels. Despite the larger sampling of the MPSS library, SAGE identifies significantly more genes (60% more RefSeq genes in a single comparison).
Collapse
Affiliation(s)
| | | | | | | | | | - Marco A. Marra
- To whom correspondence should be addressed at Genome Sciences Centre, Suite 100, 570 West 7th Avenue, Vancouver BC, Canada V5Z 4S6. Tel: 604 877 6082; Fax: 604 877 6085;
| |
Collapse
|
7
|
Warren RL, Varabei D, Platt D, Huang X, Messina D, Yang SP, Kronstad JW, Krzywinski M, Warren WC, Wallis JW, Hillier LW, Chinwalla AT, Schein JE, Siddiqui AS, Marra MA, Wilson RK, Jones SJ. Physical map-assisted whole-genome shotgun sequence assemblies. Genes Dev 2006; 16:768-75. [PMID: 16741162 PMCID: PMC1473187 DOI: 10.1101/gr.5090606] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2005] [Accepted: 04/11/2006] [Indexed: 01/15/2023]
Abstract
We describe a targeted approach to improve the contiguity of whole-genome shotgun sequence (WGS) assemblies at run-time, using information from Bacterial Artificial Chromosome (BAC)-based physical maps. Clone sizes and overlaps derived from clone fingerprints are used for the calculation of length constraints between any two BAC neighbors sharing 40% of their size. These constraints are used to promote the linkage and guide the arrangement of sequence contigs within a sequence scaffold at the layout phase of WGS assemblies. This process is facilitated by FASSI, a stand-alone application that calculates BAC end and BAC overlap length constraints from clone fingerprint map contigs created by the FPC package. FASSI is designed to work with the assembly tool PCAP, but its output can be formatted to work with other WGS assembly algorithms able to use length constraints for individual clones. The FASSI method is simple to implement, potentially cost-effective, and has resulted in the increase of scaffold contiguity for both the Drosophila melanogaster and Cryptococcus gattii genomes when compared to a control assembly without map-derived constraints. A 6.5-fold coverage draft DNA sequence of the Pan troglodytes (chimpanzee) genome was assembled using map-derived constraints and resulted in a 26.1% increase in scaffold contiguity.
Collapse
Affiliation(s)
- René L. Warren
- British Columbia Cancer Agency, Genome Sciences Centre, Vancouver, British Columbia V5Z 4S6, Canada
| | - Dmitry Varabei
- British Columbia Cancer Agency, Genome Sciences Centre, Vancouver, British Columbia V5Z 4S6, Canada
| | - Darren Platt
- U.S. Department of Energy, Joint Genome Institute, Walnut Creek, California 94598, USA
| | - Xiaoqiu Huang
- Department of Computer Science, Iowa State University, Ames, Iowa 50011-1040, USA
| | - David Messina
- Washington University School of Medicine, Genome Sequencing Center, St. Louis, Missouri 63108, USA
| | - Shiaw-Pyng Yang
- Washington University School of Medicine, Genome Sequencing Center, St. Louis, Missouri 63108, USA
| | - James W. Kronstad
- The Michael Smith Laboratories, Department of Microbiology and Immunology, The University of British Columbia, Vancouver, British Columbia V6T 2Z4, Canada
| | - Martin Krzywinski
- British Columbia Cancer Agency, Genome Sciences Centre, Vancouver, British Columbia V5Z 4S6, Canada
| | - Wesley C. Warren
- Washington University School of Medicine, Genome Sequencing Center, St. Louis, Missouri 63108, USA
| | - John W. Wallis
- Washington University School of Medicine, Genome Sequencing Center, St. Louis, Missouri 63108, USA
| | - LaDeana W. Hillier
- Washington University School of Medicine, Genome Sequencing Center, St. Louis, Missouri 63108, USA
| | - Asif T. Chinwalla
- Washington University School of Medicine, Genome Sequencing Center, St. Louis, Missouri 63108, USA
| | - Jacqueline E. Schein
- British Columbia Cancer Agency, Genome Sciences Centre, Vancouver, British Columbia V5Z 4S6, Canada
| | - Asim S. Siddiqui
- British Columbia Cancer Agency, Genome Sciences Centre, Vancouver, British Columbia V5Z 4S6, Canada
| | - Marco A. Marra
- British Columbia Cancer Agency, Genome Sciences Centre, Vancouver, British Columbia V5Z 4S6, Canada
| | - Richard K. Wilson
- Washington University School of Medicine, Genome Sequencing Center, St. Louis, Missouri 63108, USA
| | - Steven J.M. Jones
- British Columbia Cancer Agency, Genome Sciences Centre, Vancouver, British Columbia V5Z 4S6, Canada
| |
Collapse
|
8
|
Morin RD, Chang E, Petrescu A, Liao N, Griffith M, Kirkpatrick R, Butterfield YS, Young AC, Stott J, Barber S, Babakaiff R, Dickson MC, Matsuo C, Wong D, Yang GS, Smailus DE, Wetherby KD, Kwong PN, Grimwood J, Brinkley CP, Brown-John M, Reddix-Dugue ND, Mayo M, Schmutz J, Beland J, Park M, Gibson S, Olson T, Bouffard GG, Tsai M, Featherstone R, Chand S, Siddiqui AS, Jang W, Lee E, Klein SL, Blakesley RW, Zeeberg BR, Narasimhan S, Weinstein JN, Pennacchio CP, Myers RM, Green ED, Wagner L, Gerhard DS, Marra MA, Jones SJ, Holt RA. Sequencing and analysis of 10,967 full-length cDNA clones from Xenopus laevis and Xenopus tropicalis reveals post-tetraploidization transcriptome remodeling. Genome Res 2006; 16:796-803. [PMID: 16672307 PMCID: PMC1479861 DOI: 10.1101/gr.4871006] [Citation(s) in RCA: 60] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
Abstract
Sequencing of full-insert clones from full-length cDNA libraries from both Xenopus laevis and Xenopus tropicalis has been ongoing as part of the Xenopus Gene Collection Initiative. Here we present 10,967 full ORF verified cDNA clones (8049 from X. laevis and 2918 from X. tropicalis) as a community resource. Because the genome of X. laevis, but not X. tropicalis, has undergone allotetraploidization, comparison of coding sequences from these two clawed (pipid) frogs provides a unique angle for exploring the molecular evolution of duplicate genes. Within our clone set, we have identified 445 gene trios, each comprised of an allotetraploidization-derived X. laevis gene pair and their shared X. tropicalis ortholog. Pairwise dN/dS, comparisons within trios show strong evidence for purifying selection acting on all three members. However, dN/dS ratios between X. laevis gene pairs are elevated relative to their X. tropicalis ortholog. This difference is highly significant and indicates an overall relaxation of selective pressures on duplicated gene pairs. We have found that the paralogs that have been lost since the tetraploidization event are enriched for several molecular functions, but have found no such enrichment in the extant paralogs. Approximately 14% of the paralogous pairs analyzed here also show differential expression indicative of subfunctionalization.
Collapse
Affiliation(s)
- Ryan D. Morin
- British Columbia Genome Sciences Centre, BCCA, Vancouver, BC V5Z 1L3 Canada
| | - Elbert Chang
- British Columbia Genome Sciences Centre, BCCA, Vancouver, BC V5Z 1L3 Canada
| | - Anca Petrescu
- British Columbia Genome Sciences Centre, BCCA, Vancouver, BC V5Z 1L3 Canada
| | - Nancy Liao
- British Columbia Genome Sciences Centre, BCCA, Vancouver, BC V5Z 1L3 Canada
| | - Malachi Griffith
- British Columbia Genome Sciences Centre, BCCA, Vancouver, BC V5Z 1L3 Canada
| | - Robert Kirkpatrick
- British Columbia Genome Sciences Centre, BCCA, Vancouver, BC V5Z 1L3 Canada
| | | | - Alice C. Young
- NIH Intramural Sequencing Center, National Human Genome Research Institute
| | - Jeffrey Stott
- British Columbia Genome Sciences Centre, BCCA, Vancouver, BC V5Z 1L3 Canada
| | - Sarah Barber
- British Columbia Genome Sciences Centre, BCCA, Vancouver, BC V5Z 1L3 Canada
| | - Ryan Babakaiff
- British Columbia Genome Sciences Centre, BCCA, Vancouver, BC V5Z 1L3 Canada
| | - Mark C. Dickson
- Stanford Human Genome Center and Department of Genetics, Stanford University School of Medicine, Stanford, California 94305, USA
| | - Corey Matsuo
- British Columbia Genome Sciences Centre, BCCA, Vancouver, BC V5Z 1L3 Canada
| | - David Wong
- British Columbia Genome Sciences Centre, BCCA, Vancouver, BC V5Z 1L3 Canada
| | - George S. Yang
- British Columbia Genome Sciences Centre, BCCA, Vancouver, BC V5Z 1L3 Canada
| | - Duane E. Smailus
- British Columbia Genome Sciences Centre, BCCA, Vancouver, BC V5Z 1L3 Canada
| | - Keith D. Wetherby
- NIH Intramural Sequencing Center, National Human Genome Research Institute
| | - Peggy N. Kwong
- NIH Intramural Sequencing Center, National Human Genome Research Institute
| | - Jane Grimwood
- Stanford Human Genome Center and Department of Genetics, Stanford University School of Medicine, Stanford, California 94305, USA
| | | | - Mabel Brown-John
- British Columbia Genome Sciences Centre, BCCA, Vancouver, BC V5Z 1L3 Canada
| | | | - Michael Mayo
- British Columbia Genome Sciences Centre, BCCA, Vancouver, BC V5Z 1L3 Canada
| | - Jeremy Schmutz
- Stanford Human Genome Center and Department of Genetics, Stanford University School of Medicine, Stanford, California 94305, USA
| | - Jaclyn Beland
- British Columbia Genome Sciences Centre, BCCA, Vancouver, BC V5Z 1L3 Canada
| | - Morgan Park
- NIH Intramural Sequencing Center, National Human Genome Research Institute
| | - Susan Gibson
- British Columbia Genome Sciences Centre, BCCA, Vancouver, BC V5Z 1L3 Canada
| | - Teika Olson
- British Columbia Genome Sciences Centre, BCCA, Vancouver, BC V5Z 1L3 Canada
| | - Gerard G. Bouffard
- NIH Intramural Sequencing Center, National Human Genome Research Institute
| | - Miranda Tsai
- British Columbia Genome Sciences Centre, BCCA, Vancouver, BC V5Z 1L3 Canada
| | - Ruth Featherstone
- British Columbia Genome Sciences Centre, BCCA, Vancouver, BC V5Z 1L3 Canada
| | - Steve Chand
- British Columbia Genome Sciences Centre, BCCA, Vancouver, BC V5Z 1L3 Canada
| | - Asim S. Siddiqui
- British Columbia Genome Sciences Centre, BCCA, Vancouver, BC V5Z 1L3 Canada
| | - Wonhee Jang
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, Maryland 20894, USA
| | - Ed Lee
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, Maryland 20894, USA
| | - Steven L. Klein
- National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, Maryland 20892, USA
| | | | - Barry R. Zeeberg
- Genomics and Bioinformatics Group, Laboratory of Molecular Pharmacology
| | | | - John N. Weinstein
- Genomics and Bioinformatics Group, Laboratory of Molecular Pharmacology
| | - Christa Prange Pennacchio
- The I.M.A.G.E Consortium, Biology and Biotechnology Research Program, Lawrence Livermore National Laboratory, Livermore, California 94550, USA
| | - Richard M. Myers
- Stanford Human Genome Center and Department of Genetics, Stanford University School of Medicine, Stanford, California 94305, USA
| | - Eric D. Green
- NIH Intramural Sequencing Center, National Human Genome Research Institute
| | - Lukas Wagner
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, Maryland 20894, USA
| | | | - Marco A. Marra
- British Columbia Genome Sciences Centre, BCCA, Vancouver, BC V5Z 1L3 Canada
| | - Steven J.M. Jones
- British Columbia Genome Sciences Centre, BCCA, Vancouver, BC V5Z 1L3 Canada
| | - Robert A. Holt
- British Columbia Genome Sciences Centre, BCCA, Vancouver, BC V5Z 1L3 Canada
- Corresponding author.E-mail ; fax (604) 877-6085
| |
Collapse
|
9
|
Griffith OL, Pleasance ED, Fulton DL, Oveisi M, Ester M, Siddiqui AS, Jones SJM. Assessment and integration of publicly available SAGE, cDNA microarray, and oligonucleotide microarray expression data for global coexpression analyses. Genomics 2006; 86:476-88. [PMID: 16098712 DOI: 10.1016/j.ygeno.2005.06.009] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2005] [Revised: 05/12/2005] [Accepted: 06/16/2005] [Indexed: 10/25/2022]
Abstract
Large amounts of gene expression data from several different technologies are becoming available to the scientific community. A common practice is to use these data to calculate global gene coexpression for validation or integration of other "omic" data. To assess the utility of publicly available datasets for this purpose we have analyzed Homo sapiens data from 1202 cDNA microarray experiments, 242 SAGE libraries, and 667 Affymetrix oligonucleotide microarray experiments. The three datasets compared demonstrate significant but low levels of global concordance (rc<0.11). Assessment against Gene Ontology (GO) revealed that all three platforms identify more coexpressed gene pairs with common biological processes than expected by chance. As the Pearson correlation for a gene pair increased it was more likely to be confirmed by GO. The Affymetrix dataset performed best individually with gene pairs of correlation 0.9-1.0 confirmed by GO in 74% of cases. However, in all cases, gene pairs confirmed by multiple platforms were more likely to be confirmed by GO. We show that combining results from different expression platforms increases reliability of coexpression. A comparison with other recently published coexpression studies found similar results in terms of performance against GO but with each method producing distinctly different gene pair lists.
Collapse
Affiliation(s)
- Obi L Griffith
- Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, BC, Canada V5Z 4E6
| | | | | | | | | | | | | |
Collapse
|
10
|
Robertson G, Bilenky M, Lin K, He A, Yuen W, Dagpinar M, Varhol R, Teague K, Griffith OL, Zhang X, Pan Y, Hassel M, Sleumer MC, Pan W, Pleasance ED, Chuang M, Hao H, Li YY, Robertson N, Fjell C, Li B, Montgomery SB, Astakhova T, Zhou J, Sander J, Siddiqui AS, Jones SJM. cisRED: a database system for genome-scale computational discovery of regulatory elements. Nucleic Acids Res 2006; 34:D68-73. [PMID: 16381958 PMCID: PMC1347438 DOI: 10.1093/nar/gkj075] [Citation(s) in RCA: 88] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2005] [Revised: 10/08/2005] [Accepted: 10/08/2005] [Indexed: 11/30/2022] Open
Abstract
We describe cisRED, a database for conserved regulatory elements that are identified and ranked by a genome-scale computational system (www.cisred.org). The database and high-throughput predictive pipeline are designed to address diverse target genomes in the context of rapidly evolving data resources and tools. Motifs are predicted in promoter regions using multiple discovery methods applied to sequence sets that include corresponding sequence regions from vertebrates. We estimate motif significance by applying discovery and post-processing methods to randomized sequence sets that are adaptively derived from target sequence sets, retain motifs with p-values below a threshold and identify groups of similar motifs and co-occurring motif patterns. The database offers information on atomic motifs, motif groups and patterns. It is web-accessible, and can be queried directly, downloaded or installed locally.
Collapse
Affiliation(s)
- G Robertson
- Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, BC, Canada.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
11
|
Siddiqui AS, Khattra J, Delaney AD, Zhao Y, Astell C, Asano J, Babakaiff R, Barber S, Beland J, Bohacec S, Brown-John M, Chand S, Charest D, Charters AM, Cullum R, Dhalla N, Featherstone R, Gerhard DS, Hoffman B, Holt RA, Hou J, Kuo BYL, Lee LLC, Lee S, Leung D, Ma K, Matsuo C, Mayo M, McDonald H, Prabhu AL, Pandoh P, Riggins GJ, de Algara TR, Rupert JL, Smailus D, Stott J, Tsai M, Varhol R, Vrljicak P, Wong D, Wu MK, Xie YY, Yang G, Zhang I, Hirst M, Jones SJM, Helgason CD, Simpson EM, Hoodless PA, Marra MA. A mouse atlas of gene expression: large-scale digital gene-expression profiles from precisely defined developing C57BL/6J mouse tissues and cells. Proc Natl Acad Sci U S A 2005; 102:18485-90. [PMID: 16352711 PMCID: PMC1311911 DOI: 10.1073/pnas.0509455102] [Citation(s) in RCA: 100] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
We analyzed 8.55 million LongSAGE tags generated from 72 libraries. Each LongSAGE library was prepared from a different mouse tissue. Analysis of the data revealed extensive overlap with existing gene data sets and evidence for the existence of approximately 24,000 previously undescribed genomic loci. The visual cortex, pancreas, mammary gland, preimplantation embryo, and placenta contain the largest number of differentially expressed transcripts, 25% of which are previously undescribed loci.
Collapse
Affiliation(s)
- Asim S Siddiqui
- Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Research Centre, British Columbia Cancer Agency, Vancouver, BC, Canada V5Z 4S6
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
12
|
Warren RL, Butterfield YS, Morin RD, Siddiqui AS, Marra MA, Jones SJM. Management and visualization of whole genome shotgun assemblies using SAM. Biotechniques 2005; 38:715-6, 718, 720. [PMID: 15945370 DOI: 10.2144/05385st01] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
We have designed and implemented a system to manage whole genome shotgun sequences and whole genome sequence assembly data flow. The Sequence Assembly Manager (SAM) consists primarily of a MySQL relational database and Perl applications designed to easily manipulate and coordinate the analysis of sequence information and to view and report genome assembly progress through its Common Gateway Interface (CGI) web interface. The application includes a tool to compare sequence assemblies to fingerprint maps that has been used successfully to improve and validate both maps and sequence assemblies of the Rhodococcus sp.RHAI and Cryptococcus neoformans WM276 genomes.
Collapse
|
13
|
Montgomery SB, Astakhova T, Bilenky M, Birney E, Fu T, Hassel M, Melsopp C, Rak M, Robertson AG, Sleumer M, Siddiqui AS, Jones SJM. Sockeye: a 3D environment for comparative genomics. Genome Res 2004; 14:956-62. [PMID: 15123592 PMCID: PMC479126 DOI: 10.1101/gr.1890304] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Comparative genomics techniques are used in bioinformatics analyses to identify the structural and functional properties of DNA sequences. As the amount of available sequence data steadily increases, the ability to perform large-scale comparative analyses has become increasingly relevant. In addition, the growing complexity of genomic feature annotation means that new approaches to genomic visualization need to be explored. We have developed a Java-based application called Sockeye that uses three-dimensional (3D) graphics technology to facilitate the visualization of annotation and conservation across multiple sequences. This software uses the Ensembl database project to import sequence and annotation information from several eukaryotic species. A user can additionally import their own custom sequence and annotation data. Individual annotation objects are displayed in Sockeye by using custom 3D models. Ensembl-derived and imported sequences can be analyzed by using a suite of multiple and pair-wise alignment algorithms. The results of these comparative analyses are also displayed in the 3D environment of Sockeye. By using the Java3D API to visualize genomic data in a 3D environment, we are able to compactly display cross-sequence comparisons. This provides the user with a novel platform for visualizing and comparing genomic feature organization.
Collapse
Affiliation(s)
- Stephen B Montgomery
- Canada's Michael Smith Genome Sciences Centre, Vancouver, British Columbia V5Z 4E6, Canada
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
14
|
Dengler U, Siddiqui AS, Barton GJ. Protein structural domains: analysis of the 3Dee domains database. Proteins 2001; 42:332-44. [PMID: 11151005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/18/2023]
Abstract
The 3Dee database of domain definitions was developed as a comprehensive collection of domain definitions for all three-dimensional structures in the Protein Data Bank (PDB). The database includes definitions for complex, multiple-segment and multiple-chain domains as well as simple sequential domains, organized in a structural hierarchy. Two different snapshots of the 3Dee database were analyzed at September 1996 and November 1999. For the November 1999 release, 7,995 PDB entries contained 13,767 protein chains and gave rise to 18,896 domains. The domain sequences clustered into 1,715 domain sequence families, which were further clustered into a conservative 1,199 domain structure families (families with similar folds). The proportion of different domain structure families per domain sequence family increases from 84% for domains 1-100 residues long to 100% for domains greater than 600 residues. This is in keeping with the idea that longer chains will have more alternative folds available to them. Of the representative domains from the domain sequence families, 49% are in the range of 51-150 residues, whereas 64% of the representative chains over 200 residues have more than 1 domain. Of the representative chains, 8.5% are part of multichain domains. The largest multichain domain in the database has 14 chains and 1,400 residues, whereas the largest single-chain domain has 907 residues. The largest number of domains found in a protein is 13. The analysis shows that over the history of the PDB, new domain folds have been discovered at a slower rate than by random selection of all known folds. Between 1992 and 1997, a constant 1 in 11 new domains deposited in the PDB has shown no sequence similarity to a previously known domain sequence family, and only 1 in 15 new domain structures has had a fold that has not been seen previously. A comparison of the September 1996 release of 3Dee to the Structural Classification of Proteins (SCOP) showed that the domain definitions agreed for 80% of the representative protein chains. However, 3Dee provided explicit domain boundaries for more proteins. 3Dee is accessible on the World Wide Web at http://barton.ebi.ac.uk/servers/3Dee.html.
Collapse
Affiliation(s)
- U Dengler
- EMBL, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom
| | | | | |
Collapse
|
15
|
Abstract
UNLABELLED The 3Dee database is a repository of protein structural domains. It stores alternative domain definitions for the same protein, organises domains into sequence and structural hierarchies, contains non-redundant set(s) of sequences and structures, multiple structure alignments for families of domains, and allows previous versions of the database to be regenerated. AVAILABILITY 3Dee is accessible on the World Wide Web at the URL http://barton.ebi.ac.uk/servers/3Dee.html.
Collapse
Affiliation(s)
- A S Siddiqui
- University of Oxford, Laboratory of Molecular Biophysics, The Rex Richards Building, South Parks Road, Oxford OX1 3QU, UK
| | | | | |
Collapse
|
16
|
|
17
|
|
18
|
Bebbington DH, Siddiqui AS. Analytical description of backscattered states of polarization in polarization optical time-domain reflectometry measurements on uniformly twisted linearly birefringent optical fiber. J Opt Soc Am A Opt Image Sci Vis 2000; 17:2260-2266. [PMID: 11140486 DOI: 10.1364/josaa.17.002260] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
We present a detailed derivation of the locus of Rayleigh backscattered states of polarization for polarization optical time domain reflectometry in uniformly twisted optical fiber with intrinsic linear birefringence. The locus is algebraically a quartic whose topology is determined by the relative orientation on the Poincaré sphere of the input SOP and the effective birefringence vector. We present an analysis that indicates how experimental data may be interpreted, through geometric parameters of the locus, for evaluating the fiber parameters. The analysis also indicates the minimum number of experimental data points required for meaningful values for the fiber parameters to be obtained.
Collapse
Affiliation(s)
- D H Bebbington
- Department of Electronic Systems Engineering, University of Essex, Colchester, UK.
| | | |
Collapse
|
19
|
|
20
|
|
21
|
|
22
|
Abstract
UNLABELLED An interactive protein secondary structure prediction Internet server is presented. The server allows a single sequence or multiple alignment to be submitted, and returns predictions from six secondary structure prediction algorithms that exploit evolutionary information from multiple sequences. A consensus prediction is also returned which improves the average Q3 accuracy of prediction by 1% to 72.9%. The server simplifies the use of current prediction algorithms and allows conservation patterns important to structure and function to be identified. AVAILABILITY http://barton.ebi.ac.uk/servers/jpred.h tml CONTACT geoff@ebi.ac.uk
Collapse
Affiliation(s)
- J A Cuff
- Laboratory of Molecular Biophysics, Rex Richards Building, South Parks Road, Oxford OX1 3QU, UK
| | | | | | | | | |
Collapse
|
23
|
Abstract
An algorithm is presented for the fast and accurate definition of protein structural domains from coordinate data without prior knowledge of the number or type of domains. The algorithm explicitly locates domains that comprise one or two continuous segments of protein chain. Domains that include more than two segments are also located. The algorithm was applied to a nonredundant database of 230 protein structures and the results compared to domain definitions obtained from the literature, or by inspection of the coordinates on molecular graphics. For 70% of the proteins, the derived domains agree with the reference definitions, 18% show minor differences and only 12% (28 proteins) show very different definitions. Three screens were applied to identify the derived domains least likely to agree with the subjective definition set. These screens revealed a set of 173 proteins, 97% of which agree well with the subjective definitions. The algorithm represents a practical domain identification tool that can be run routinely on the entire structural database. Adjustment of parameters also allows smaller compact units to be identified in proteins.
Collapse
Affiliation(s)
- A S Siddiqui
- Laboratory of Molecular Biophysics, University of Oxford, United Kingdom
| | | |
Collapse
|
24
|
Siddiqui AS, D'Costa DF, Moore-Smith B. Covert hypothyroidism with weight loss and atrial fibrillation. Br J Clin Pract 1993; 47:268. [PMID: 8292476] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 05/22/2023]
Abstract
Hypothyroidism may present with weight gain and/or cardiovascular manifestations such as bradycardia or cardiac failure, but has not previously been documented as presenting with atrial fibrillation and weight loss. Our case highlights the importance of thyroid function tests in heart failure and emphasises the importance of regular follow-up after irradiation to the thyroid.
Collapse
Affiliation(s)
- A S Siddiqui
- Department of Geriatric Medicine, Ipswich Hospital, Suffolk
| | | | | |
Collapse
|
25
|
Abstract
Three bacteria, two of which were previously noted as active heterotrophic nitrifiers, were examined for their ability to grow and nitrify with the siderophore deferrioxamine B as the carbon source. Pseudomonas aureofaciens displayed limited growth and nitrification while a heterotrophic nitrifying Alcaligenes sp. was without action concerning its metabolism of deferrioxamine B. The third bacterium, a unique Gram-negative soil isolate, was unable to nitrify deferrioxamine B but grew well when the siderophore was employed as the sole C source. The Gram-negative bacterium removed deferrioxamine B from the medium and left only residual amounts of the compound in solution at the termination of its growth. The organism was without action when the ferrated analogue of deferrioxamine B, ferrioxamine B, served as either the C source for growth, for metabolism by resting cells, or as the substrate for cell-free extracts. Deferrioxamine B, by contrast, was rapidly metabolized by resting cells. Cell-free extracts of the bacterium synthesized a monohydroxamate(s) when deferrioxamine B was the substrate. The catabolism of deferrioxamine B, which is synthesized by soil microbes, suggests that soil microflora have the ability to return deferrioxamine B, and perhaps other, siderophores to the C cycle.
Collapse
Affiliation(s)
- D Castignetti
- Department of Biology, Loyola University of Chicago, IL 60626
| | | |
Collapse
|