1
|
Khelik K, Sandve GK, Nederbragt AJ, Rognes T. NucBreak: location of structural errors in a genome assembly by using paired-end Illumina reads. BMC Bioinformatics 2020; 21:66. [PMID: 32085722 PMCID: PMC7035700 DOI: 10.1186/s12859-020-3414-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2018] [Accepted: 02/12/2020] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Advances in whole genome sequencing strategies have provided the opportunity for genomic and comparative genomic analysis of a vast variety of organisms. The analysis results are highly dependent on the quality of the genome assemblies used. Assessment of the assembly accuracy may significantly increase the reliability of the analysis results and is therefore of great importance. RESULTS Here, we present a new tool called NucBreak aimed at localizing structural errors in assemblies, including insertions, deletions, duplications, inversions, and different inter- and intra-chromosomal rearrangements. The approach taken by existing alternative tools is based on analysing reads that do not map properly to the assembly, for instance discordantly mapped reads, soft-clipped reads and singletons. NucBreak uses an entirely different and unique method to localise the errors. It is based on analysing the alignments of reads that are properly mapped to an assembly and exploit information about the alternative read alignments. It does not annotate detected errors. We have compared NucBreak with other existing assembly accuracy assessment tools, namely Pilon, REAPR, and FRCbam as well as with several structural variant detection tools, including BreakDancer, Lumpy, and Wham, by using both simulated and real datasets. CONCLUSIONS The benchmarking results have shown that NucBreak in general predicts assembly errors of different types and sizes with relatively high sensitivity and with lower false discovery rate than the other tools. Such a balance between sensitivity and false discovery rate makes NucBreak a good alternative to the existing assembly accuracy assessment tools and SV detection tools. NucBreak is freely available at https://github.com/uio-bmi/NucBreak under the MPL license.
Collapse
Affiliation(s)
- Ksenia Khelik
- Biomedical Informatics Research Group, Department of Informatics, University of Oslo, PO Box 1080 Blindern, NO-0316, Oslo, Norway
| | - Geir Kjetil Sandve
- Biomedical Informatics Research Group, Department of Informatics, University of Oslo, PO Box 1080 Blindern, NO-0316, Oslo, Norway
| | - Alexander Johan Nederbragt
- Biomedical Informatics Research Group, Department of Informatics, University of Oslo, PO Box 1080 Blindern, NO-0316, Oslo, Norway.,Centre for Ecological and Evolutionary Synthesis, Department of Biosciences, University of Oslo, PO Box 1066 Blindern, NO-0316, Oslo, Norway
| | - Torbjørn Rognes
- Biomedical Informatics Research Group, Department of Informatics, University of Oslo, PO Box 1080 Blindern, NO-0316, Oslo, Norway. .,Department of Microbiology, Oslo University Hospital, Rikshospitalet, PO Box 4950 Nydalen, NO-0424, Oslo, Norway.
| |
Collapse
|
2
|
Khelik K, Lagesen K, Sandve GK, Rognes T, Nederbragt AJ. NucDiff: in-depth characterization and annotation of differences between two sets of DNA sequences. BMC Bioinformatics 2017; 18:338. [PMID: 28701187 PMCID: PMC5508607 DOI: 10.1186/s12859-017-1748-z] [Citation(s) in RCA: 31] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2016] [Accepted: 07/04/2017] [Indexed: 12/05/2022] Open
Abstract
Background Comparing sets of sequences is a situation frequently encountered in bioinformatics, examples being comparing an assembly to a reference genome, or two genomes to each other. The purpose of the comparison is usually to find where the two sets differ, e.g. to find where a subsequence is repeated or deleted, or where insertions have been introduced. Such comparisons can be done using whole-genome alignments. Several tools for making such alignments exist, but none of them 1) provides detailed information about the types and locations of all differences between the two sets of sequences, 2) enables visualisation of alignment results at different levels of detail, and 3) carefully takes genomic repeats into consideration. Results We here present NucDiff, a tool aimed at locating and categorizing differences between two sets of closely related DNA sequences. NucDiff is able to deal with very fragmented genomes, repeated sequences, and various local differences and structural rearrangements. NucDiff determines differences by a rigorous analysis of alignment results obtained by the NUCmer, delta-filter and show-snps programs in the MUMmer sequence alignment package. All differences found are categorized according to a carefully defined classification scheme covering all possible differences between two sequences. Information about the differences is made available as GFF3 files, thus enabling visualisation using genome browsers as well as usage of the results as a component in an analysis pipeline. NucDiff was tested with varying parameters for the alignment step and compared with existing alternatives, called QUAST and dnadiff. Conclusions We have developed a whole genome alignment difference classification scheme together with the program NucDiff for finding such differences. The proposed classification scheme is comprehensive and can be used by other tools. NucDiff performs comparably to QUAST and dnadiff but gives much more detailed results that can easily be visualized. NucDiff is freely available on https://github.com/uio-cels/NucDiff under the MPL license. Electronic supplementary material The online version of this article (doi:10.1186/s12859-017-1748-z) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Ksenia Khelik
- Biomedical Informatics Research Group, Department of Informatics, University of Oslo, PO Box 1080, 0316, Oslo, Norway
| | - Karin Lagesen
- Biomedical Informatics Research Group, Department of Informatics, University of Oslo, PO Box 1080, 0316, Oslo, Norway.,Norwegian Veterinary Institute, PO Box 750 Sentrum, 0106, Oslo, Norway
| | - Geir Kjetil Sandve
- Biomedical Informatics Research Group, Department of Informatics, University of Oslo, PO Box 1080, 0316, Oslo, Norway
| | - Torbjørn Rognes
- Biomedical Informatics Research Group, Department of Informatics, University of Oslo, PO Box 1080, 0316, Oslo, Norway.,Department of Microbiology, Oslo University Hospital, Rikshospitalet, PO Box 4950 Nydalen, 0424, Oslo, Norway
| | - Alexander Johan Nederbragt
- Biomedical Informatics Research Group, Department of Informatics, University of Oslo, PO Box 1080, 0316, Oslo, Norway. .,Centre for Ecological and Evolutionary Synthesis, Department of Biosciences, University of Oslo, PO Box 1066 Blindern, 0316, Oslo, Norway.
| |
Collapse
|
3
|
Simovski B, Vodák D, Gundersen S, Domanska D, Azab A, Holden L, Holden M, Grytten I, Rand K, Drabløs F, Johansen M, Mora A, Lund-Andersen C, Fromm B, Eskeland R, Gabrielsen OS, Ferkingstad E, Nakken S, Bengtsen M, Nederbragt AJ, Thorarensen HS, Akse JA, Glad I, Hovig E, Sandve GK. GSuite HyperBrowser: integrative analysis of dataset collections across the genome and epigenome. Gigascience 2017; 6:1-12. [PMID: 28459977 PMCID: PMC5493745 DOI: 10.1093/gigascience/gix032] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2016] [Revised: 01/17/2017] [Accepted: 04/24/2017] [Indexed: 12/01/2022] Open
Abstract
Background Recent large-scale undertakings such as ENCODE and Roadmap Epigenomics have generated experimental data mapped to the human reference genome (as genomic tracks) representing a variety of functional elements across a large number of cell types. Despite the high potential value of these publicly available data for a broad variety of investigations, little attention has been given to the analytical methodology necessary for their widespread utilisation. Findings We here present a first principled treatment of the analysis of collections of genomic tracks. We have developed novel computational and statistical methodology to permit comparative and confirmatory analyses across multiple and disparate data sources. We delineate a set of generic questions that are useful across a broad range of investigations and discuss the implications of choosing different statistical measures and null models. Examples include contrasting analyses across different tissues or diseases. The methodology has been implemented in a comprehensive open-source software system, the GSuite HyperBrowser. To make the functionality accessible to biologists, and to facilitate reproducible analysis, we have also developed a web-based interface providing an expertly guided and customizable way of utilizing the methodology. With this system, many novel biological questions can flexibly be posed and rapidly answered. Conclusions Through a combination of streamlined data acquisition, interoperable representation of dataset collections, and customizable statistical analysis with guided setup and interpretation, the GSuite HyperBrowser represents a first comprehensive solution for integrative analysis of track collections across the genome and epigenome. The software is available at: https://hyperbrowser.uio.no.
Collapse
Affiliation(s)
- Boris Simovski
- Department of Informatics, University of Oslo, Oslo, Norway
| | - Daniel Vodák
- Department of Tumor Biology, Institute for Cancer Research, Oslo University Hospital, Oslo, Norway
| | | | - Diana Domanska
- Department of Informatics, University of Oslo, Oslo, Norway
| | - Abdulrahman Azab
- Department of Informatics, University of Oslo, Oslo, Norway
- Research Support Services Group, University Center for Information Technology, Oslo, Norway
| | - Lars Holden
- Statistics For Innovation, Norwegian Computing Center, Oslo, Norway
| | - Marit Holden
- Statistics For Innovation, Norwegian Computing Center, Oslo, Norway
| | - Ivar Grytten
- Department of Informatics, University of Oslo, Oslo, Norway
| | - Knut Rand
- Department of Mathematics, University of Oslo, Oslo, Norway
| | - Finn Drabløs
- Department of Cancer Research and Molecular Medicine, Norwegian University of Science and Technology (NTNU), Trondheim, Norway
| | - Morten Johansen
- Institute for Medical Informatics, The Norwegian Radium Hospital, Oslo University Hospital, Oslo, Norway
| | - Antonio Mora
- Department of Informatics, University of Oslo, Oslo, Norway
- Department of Biosciences, University of Oslo, Oslo, Norway
| | - Christin Lund-Andersen
- Department of Tumor Biology, Institute for Cancer Research, Oslo University Hospital, Oslo, Norway
| | - Bastian Fromm
- Department of Tumor Biology, Institute for Cancer Research, Oslo University Hospital, Oslo, Norway
| | - Ragnhild Eskeland
- Department of Biosciences, University of Oslo, Oslo, Norway
- Norwegian Center for Stem Cell Research, Department of Immunology, Oslo University Hospital, Oslo, Norway
| | | | | | - Sigve Nakken
- Department of Tumor Biology, Institute for Cancer Research, Oslo University Hospital, Oslo, Norway
| | - Mads Bengtsen
- Department of Biosciences, University of Oslo, Oslo, Norway
| | - Alexander Johan Nederbragt
- Department of Informatics, University of Oslo, Oslo, Norway
- Centre for Ecological and Evolutionary Synthesis (CEES), Department of Biosciences, University of Oslo, Oslo, Norway
| | | | | | - Ingrid Glad
- Department of Mathematics, University of Oslo, Oslo, Norway
| | - Eivind Hovig
- Department of Informatics, University of Oslo, Oslo, Norway
- Department of Tumor Biology, Institute for Cancer Research, Oslo University Hospital, Oslo, Norway
- Statistics For Innovation, Norwegian Computing Center, Oslo, Norway
- Institute for Medical Informatics, The Norwegian Radium Hospital, Oslo University Hospital, Oslo, Norway
| | | |
Collapse
|
4
|
Reynolds DJ, Scourse JD, Halloran PR, Nederbragt AJ, Wanamaker AD, Butler PG, Richardson CA, Heinemeier J, Eiríksson J, Knudsen KL, Hall IR. Annually resolved North Atlantic marine climate over the last millennium. Nat Commun 2016; 7:13502. [PMID: 27922004 PMCID: PMC5150573 DOI: 10.1038/ncomms13502] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2015] [Accepted: 10/10/2016] [Indexed: 11/24/2022] Open
Abstract
Owing to the lack of absolutely dated oceanographic information before the modern instrumental period, there is currently significant debate as to the role played by North Atlantic Ocean dynamics in previous climate transitions (for example, Medieval Climate Anomaly-Little Ice Age, MCA-LIA). Here we present analyses of a millennial-length, annually resolved and absolutely dated marine δ18O archive. We interpret our record of oxygen isotope ratios from the shells of the long-lived marine bivalve Arctica islandica (δ18O-shell), from the North Icelandic shelf, in relation to seawater density variability and demonstrate that solar and volcanic forcing coupled with ocean circulation dynamics are key drivers of climate variability over the last millennium. During the pre-industrial period (AD 1000–1800) variability in the sub-polar North Atlantic leads changes in Northern Hemisphere surface air temperatures at multi-decadal timescales, indicating that North Atlantic Ocean dynamics played an active role in modulating the response of the atmosphere to solar and volcanic forcing. A lack of annually resolved climate records from the marine archive limits our understanding of oceanic processes. Here, the authors present a millennial-length, annually-resolved and absolutely-dated marine δ18O record from the shells of marine bivalves and offer insight into North Atlantic climate dynamics.
Collapse
Affiliation(s)
- D J Reynolds
- School of Earth and Ocean Sciences, Cardiff University, Main Building, Park Place, Cardiff CF10 3AT, UK
| | - J D Scourse
- School of Ocean Sciences, College of Natural Science, Bangor University, Menai Bridge, Anglesey LL59 5AB, UK
| | - P R Halloran
- Department of Geography, College of Life and Environmental Sciences, University of Exeter, Exeter EX4 4RJ, UK
| | - A J Nederbragt
- School of Earth and Ocean Sciences, Cardiff University, Main Building, Park Place, Cardiff CF10 3AT, UK
| | - A D Wanamaker
- Department of Geological and Atmospheric Sciences, Iowa State University, Ames, Iowa 50011-3212, USA
| | - P G Butler
- School of Ocean Sciences, College of Natural Science, Bangor University, Menai Bridge, Anglesey LL59 5AB, UK
| | - C A Richardson
- School of Ocean Sciences, College of Natural Science, Bangor University, Menai Bridge, Anglesey LL59 5AB, UK
| | - J Heinemeier
- Aarhus AMS Centre, Department of Physics and Astronomy, Aarhus University, Ny Munkegade 120, DK-8000 Aarhus C, Denmark
| | - J Eiríksson
- Institute of Earth Sciences, University of Iceland, Askja, Sturlugata 7, IS-101 Reykjavík, Iceland
| | - K L Knudsen
- Department of Geoscience, Aarhus University, Høegh-Guldbergs Gade 2, DK-8000 Aarhus C, Denmark
| | - I R Hall
- School of Earth and Ocean Sciences, Cardiff University, Main Building, Park Place, Cardiff CF10 3AT, UK
| |
Collapse
|
5
|
Frank JA, Pan Y, Tooming-Klunderud A, Eijsink VGH, McHardy AC, Nederbragt AJ, Pope PB. Improved metagenome assemblies and taxonomic binning using long-read circular consensus sequence data. Sci Rep 2016; 6:25373. [PMID: 27156482 PMCID: PMC4860591 DOI: 10.1038/srep25373] [Citation(s) in RCA: 109] [Impact Index Per Article: 13.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2015] [Accepted: 04/12/2016] [Indexed: 01/22/2023] Open
Abstract
DNA assembly is a core methodological step in metagenomic pipelines used to study the structure and function within microbial communities. Here we investigate the utility of Pacific Biosciences long and high accuracy circular consensus sequencing (CCS) reads for metagenomic projects. We compared the application and performance of both PacBio CCS and Illumina HiSeq data with assembly and taxonomic binning algorithms using metagenomic samples representing a complex microbial community. Eight SMRT cells produced approximately 94 Mb of CCS reads from a biogas reactor microbiome sample that averaged 1319 nt in length and 99.7% accuracy. CCS data assembly generated a comparative number of large contigs greater than 1 kb, to those assembled from a ~190x larger HiSeq dataset (~18 Gb) produced from the same sample (i.e approximately 62% of total contigs). Hybrid assemblies using PacBio CCS and HiSeq contigs produced improvements in assembly statistics, including an increase in the average contig length and number of large contigs. The incorporation of CCS data produced significant enhancements in taxonomic binning and genome reconstruction of two dominant phylotypes, which assembled and binned poorly using HiSeq data alone. Collectively these results illustrate the value of PacBio CCS reads in certain metagenomics applications.
Collapse
Affiliation(s)
- J A Frank
- Department of Chemistry, Biotechnology and Food Science, Norwegian University of Life Sciences, Ås, 1432 Norway
| | - Y Pan
- Computational Biology of Infection Research, Helmholtz Centre for Infection Research, Inhoffenstraβe 7, 38124 Braunschweig, Germany
| | - A Tooming-Klunderud
- University of Oslo, Department of Biosciences, Centre for Ecological and Evolutionary Synthesis, Blindern, 0316 Norway
| | - V G H Eijsink
- Department of Chemistry, Biotechnology and Food Science, Norwegian University of Life Sciences, Ås, 1432 Norway
| | - A C McHardy
- Computational Biology of Infection Research, Helmholtz Centre for Infection Research, Inhoffenstraβe 7, 38124 Braunschweig, Germany
| | - A J Nederbragt
- University of Oslo, Department of Biosciences, Centre for Ecological and Evolutionary Synthesis, Blindern, 0316 Norway
| | - P B Pope
- Department of Chemistry, Biotechnology and Food Science, Norwegian University of Life Sciences, Ås, 1432 Norway
| |
Collapse
|
6
|
Crusoe MR, Alameldin HF, Awad S, Boucher E, Caldwell A, Cartwright R, Charbonneau A, Constantinides B, Edvenson G, Fay S, Fenton J, Fenzl T, Fish J, Garcia-Gutierrez L, Garland P, Gluck J, González I, Guermond S, Guo J, Gupta A, Herr JR, Howe A, Hyer A, Härpfer A, Irber L, Kidd R, Lin D, Lippi J, Mansour T, McA'Nulty P, McDonald E, Mizzi J, Murray KD, Nahum JR, Nanlohy K, Nederbragt AJ, Ortiz-Zuazaga H, Ory J, Pell J, Pepe-Ranney C, Russ ZN, Schwarz E, Scott C, Seaman J, Sievert S, Simpson J, Skennerton CT, Spencer J, Srinivasan R, Standage D, Stapleton JA, Steinman SR, Stein J, Taylor B, Trimble W, Wiencko HL, Wright M, Wyss B, Zhang Q, Zyme E, Brown CT. The khmer software package: enabling efficient nucleotide sequence analysis. F1000Res 2015; 4:900. [PMID: 26535114 PMCID: PMC4608353 DOI: 10.12688/f1000research.6924.1] [Citation(s) in RCA: 230] [Impact Index Per Article: 25.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 09/25/2015] [Indexed: 12/02/2022] Open
Abstract
The khmer package is a freely available software library for working efficiently with fixed length DNA words, or k-mers. khmer provides implementations of a probabilistic k-mer counting data structure, a compressible De Bruijn graph representation, De Bruijn graph partitioning, and digital normalization. khmer is implemented in C++ and Python, and is freely available under the BSD license at
https://github.com/dib-lab/khmer/.
Collapse
Affiliation(s)
- Michael R Crusoe
- Microbiology and Molecular Genetics, Michigan State University, East Lansing, MI, USA
| | - Hussien F Alameldin
- Department of Plant, Soil and Microbial Sciences, Michigan State University, East Lansing, MI, USA
| | - Sherine Awad
- Population Health and Reproduction, University of California, Davis, Davis, CA, USA
| | - Elmar Boucher
- Department of Biomedical Engineering, Oregon Health and Science University, Portland, OR, USA
| | - Adam Caldwell
- Biology Department, San Jose State University, San Jose, CA, USA
| | - Reed Cartwright
- School of Life Sciences and The Biodesign Institute, Arizona State University, Tempe, AZ, USA
| | | | - Bede Constantinides
- Computational and Evolutionary Biology, Faculty of Life Sciences, University of Manchester, Manchester, UK
| | | | | | - Jacob Fenton
- Computer Science and Engineering, Michigan State University, East Lansing, MI, USA
| | | | - Jordan Fish
- Computer Science and Engineering, Michigan State University, East Lansing, MI, USA
| | | | | | - Jonathan Gluck
- Graduate Program, University of Maryland, College Park, MD, USA
| | - Iván González
- Athinoula A. Martinos Center for Biomedical Imaging, Department of Radiology, Massachusetts General Hospital, Charlestown, MA, USA
| | | | - Jiarong Guo
- Center for Microbial Ecology, Michigan State University, East Lansing, MI, USA
| | - Aditi Gupta
- Microbiology and Molecular Genetics, Michigan State University, East Lansing, MI, USA
| | - Joshua R Herr
- Microbiology and Molecular Genetics, Michigan State University, East Lansing, MI, USA
| | - Adina Howe
- Department of Agricultural and Biosystems Engineering, Iowa State University, Ames, IA, USA
| | - Alex Hyer
- Department of Biology, University of Utah, Salt Lake City, UT, USA
| | | | - Luiz Irber
- Computer Science and Engineering, Michigan State University, East Lansing, MI, USA
| | - Rhys Kidd
- Independent Researcher, Sydney, Australia
| | | | | | - Tamer Mansour
- Population Health and Reproduction, University of California, Davis, Davis, CA, USA ; Clinical Pathology, Mansoura University, Mansoura, Egypt
| | | | - Eric McDonald
- Computer Science and Engineering, Michigan State University, East Lansing, MI, USA
| | - Jessica Mizzi
- Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI, USA
| | - Kevin D Murray
- ARC Centre of Excellence in Plant Energy Biology, The Australian National University, Canberra, ACT, Australia
| | - Joshua R Nahum
- BEACON Center, Michigan State University, East Lansing, MI, USA
| | | | - Alexander Johan Nederbragt
- Centre for Ecological and Evolutionary Synthesis, Dept. of Biosciences, University of Oslo, Oslo, Norway
| | - Humberto Ortiz-Zuazaga
- Department of Computer Science, Rio Piedras Campus, University of Puerto Rico, San Juan, Puerto Rico
| | - Jeramia Ory
- Biochemistry, St. Louis College of Pharmacy, St. Louis, MO, USA
| | - Jason Pell
- Computer Science and Engineering, Michigan State University, East Lansing, MI, USA
| | | | - Zachary N Russ
- Department of Bioengineering, UC Berkeley, Berkeley, CA, USA
| | - Erich Schwarz
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY, USA
| | - Camille Scott
- Computer Science and Engineering, Michigan State University, East Lansing, MI, USA
| | - Josiah Seaman
- Data Visualization, Newline Technical Innovations, Windsor, CO, USA
| | - Scott Sievert
- Electrical and Computer Engineering, University of Minnesota, Minneapolis, MN, USA
| | - Jared Simpson
- Ontario Institute for Cancer Research, Toronto, ON, Canada ; Computer Science, University of Toronto, Toronto, ON, Canada
| | - Connor T Skennerton
- Division of Geological and Planetary Sciences, California Institute of Technology, Pasadena, CA, USA
| | - James Spencer
- Dept of Physics and Dept of Materials, Imperial College London, London, UK
| | | | - Daniel Standage
- Department of Biology, Indiana University, Bloomington, IN, USA ; Bioinformatics and Computational Biology Graduate Program, Iowa State University, Ames, IA, USA
| | - James A Stapleton
- Chemical Engineering & Materials Science, Michigan State University, East Lansing, MIS, USA
| | - Susan R Steinman
- The New York Eye and Ear Infirmary of Mount Sinai, New York, NY, USA
| | - Joe Stein
- Independent Researcher, Providence, RI, USA
| | - Benjamin Taylor
- Computer Science and Engineering, Michigan State University, East Lansing, MI, USA
| | - Will Trimble
- Mathematics and Computer Science Division, Argonne National Laboratory, Lemont, IL, USA
| | - Heather L Wiencko
- Department of Genetics, Smurfit Institute, Trinity College Dublin, Dublin, Ireland
| | - Michael Wright
- Computer Science and Engineering, Michigan State University, East Lansing, MI, USA
| | - Brian Wyss
- Computer Science and Engineering, Michigan State University, East Lansing, MI, USA
| | - Qingpeng Zhang
- Computer Science and Engineering, Michigan State University, East Lansing, MI, USA
| | - En Zyme
- Independent Researcher, Boston, MA, USA
| | - C Titus Brown
- Microbiology and Molecular Genetics, Michigan State University, East Lansing, MI, USA ; Population Health and Reproduction, University of California, Davis, Davis, CA, USA ; Computer Science and Engineering, Michigan State University, East Lansing, MI, USA
| |
Collapse
|
7
|
Abstract
As a first step in analyzing the function of a cdc25 homolog during the embryonic development of Patella vulgata (Pv), genomic clones encoding these stringlike proteins (Stl) were isolated and characterized. These clones belong to four groups which are derived from different regions of the Pv genome. As the sequences of Stl genes from two of these groups are almost identical, we suggest that these genes represent copies of the same gene. The Stl3 gene, which has been analyzed in detail, consists of four exons separated by three introns. Its sequence encodes a 250-amino-acid protein with a calculated weight of 28 kDa. The Stl protein contains regions conserved in all other cdc25 proteins. Stl messengers are not stored maternally in Pv oocytes and Stl transcription only starts after the first embryonic cleavages.
Collapse
Affiliation(s)
- A van der Kooij
- Department of Experimental Zoology, Utrecht University, The Netherlands
| | | | | | | |
Collapse
|
8
|
Sijmons PC, Nederbragt AJ, Klis FM, Van den Ende H. Isolation and composition of the constitutive agglutinins from haploid Saccharomyces cerevisiae cells. Arch Microbiol 1987; 148:208-12. [PMID: 3314767 DOI: 10.1007/bf00414813] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
Abstract
Sex-specific agglutinins from the cell surface of haploid cells of Saccharomyces cerevisiae (X2180, mta and mt alpha) were purified and analysed. The constitutive agglutinin from mta cells was extractable with 3 mM dithiothreitol. It was shown to be a glycoprotein (3% mannose) with an apparent Mr of 43,000 based on gel filtration, but in SDS-PAGE it behaved as a much smaller molecule (Mr between 18,000 and 26,000). About one in three amino acids was a hydroxyamino acid. Its biological activity was resistant to boiling for 1 h, but sensitive to pronase. Intact mt alpha cells retained their agglutinability in the presence of dithiothreitol but limited trypsinizing released a biologically active agglutinin fragment. It had an apparent Mr of 320,000 (gel filtration). When analysed by SDS-PAGE, a single diffuse band with an apparent Mr of 225,000 was observed. The protein was 94% (w/w) mannose with a trace of N-acetyl glucosamine. Its biological activity was almost completely lost after boiling for 1 h. Both agglutinins behaved as monovalent molecules and specifically inhibited the biological activity of both noninduced and pheromone-induced cells. Pheromone treatment of mta cells resulted in an apparent 32-fold increase in agglutinin activity at the cell surface, whereas pheromone treatment of mt alpha cells only doubled the apparent agglutinin activity.
Collapse
Affiliation(s)
- P C Sijmons
- Department of Plant Physiology, University of Amsterdam, The Netherlands
| | | | | | | |
Collapse
|