1
|
Adaptation to the High-Arctic island environment despite long-term reduced genetic variation in Svalbard reindeer. iScience 2023; 26:107811. [PMID: 37744038 PMCID: PMC10514459 DOI: 10.1016/j.isci.2023.107811] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2023] [Revised: 07/24/2023] [Accepted: 08/30/2023] [Indexed: 09/26/2023] Open
Abstract
Typically much smaller in number than their mainland counterparts, island populations are ideal systems to investigate genetic threats to small populations. The Svalbard reindeer (Rangifer tarandus platyrhynchus) is an endemic subspecies that colonized the Svalbard archipelago ca. 6,000-8,000 years ago and now shows numerous physiological and morphological adaptations to its arctic habitat. Here, we report a de-novo chromosome-level assembly for Svalbard reindeer and analyze 133 reindeer genomes spanning Svalbard and most of the species' Holarctic range, to examine the genomic consequences of long-term isolation and small population size in this insular subspecies. Empirical data, demographic reconstructions, and forward simulations show that long-term isolation and high inbreeding levels may have facilitated the reduction of highly deleterious-and to a lesser extent, moderately deleterious-variation. Our study indicates that long-term reduced genetic diversity did not preclude local adaptation to the High Arctic, suggesting that even severely bottlenecked populations can retain evolutionary potential.
Collapse
|
2
|
De novo annotation of lncRNA HOTAIR transcripts by long-read RNA capture-seq reveals a differentiation-driven isoform switch. BMC Genomics 2022; 23:658. [PMID: 36115964 PMCID: PMC9482196 DOI: 10.1186/s12864-022-08887-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2022] [Accepted: 09/09/2022] [Indexed: 11/10/2022] Open
Abstract
Background LncRNAs are tissue-specific and emerge as important regulators of various biological processes and as disease biomarkers. HOTAIR is a well-established pro-oncogenic lncRNA which has been attributed a variety of functions in cancer and native contexts. However, a lack of an exhaustive, cell type-specific annotation questions whether HOTAIR functions are supported by the expression of multiple isoforms. Results Using a capture long-read sequencing approach, we characterize HOTAIR isoforms expressed in human primary adipose stem cells. We find HOTAIR isoforms population displays varied splicing patterns, frequently leading to the exclusion or truncation of canonical LSD1 and PRC2 binding domains. We identify a highly cell type-specific HOTAIR isoform pool regulated by distinct promoter usage, and uncover a shift in the HOTAIR TSS usage that modulates the balance of HOTAIR isoforms at differentiation onset. Conclusion Our results highlight the complexity and cell type-specificity of HOTAIR isoforms and open perspectives on functional implications of these variants and their balance to key cellular processes. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-022-08887-w.
Collapse
|
3
|
Long-read sequence capture of the haemoglobin gene clusters across codfish species. Mol Ecol Resour 2018; 19:245-259. [PMID: 30329222 PMCID: PMC7379720 DOI: 10.1111/1755-0998.12955] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2018] [Revised: 10/05/2018] [Accepted: 10/09/2018] [Indexed: 11/30/2022]
Abstract
Combining high-throughput sequencing with targeted sequence capture has become an attractive tool to study specific genomic regions of interest. Most studies have so far focused on the exome using short-read technology. These approaches are not designed to capture intergenic regions needed to reconstruct genomic organization, including regulatory regions and gene synteny. Here, we demonstrate the power of combining targeted sequence capture with long-read sequencing technology for comparative genomic analyses of the haemoglobin (Hb) gene clusters across eight species separated by up to 70 million years. Guided by the reference genome assembly of the Atlantic cod (Gadus morhua) together with genome information from draft assemblies of selected codfishes, we designed probes covering the two Hb gene clusters. Use of custom-made barcodes combined with PacBio RSII sequencing led to highly continuous assemblies of the LA (~100 kb) and MN (~200 kb) clusters, which include syntenic regions of coding and intergenic sequences. Our results revealed an overall conserved genomic organization of the Hb genes within this lineage, yet with several, lineage-specific gene duplications. Moreover, for some of the species examined, we identified amino acid substitutions at two sites in the Hbb1 gene as well as length polymorphisms in its regulatory region, which has previously been linked to temperature adaptation in Atlantic cod populations. This study highlights the use of targeted long-read capture as a versatile approach for comparative genomic studies by generation of a cross-species genomic resource elucidating the evolutionary history of the Hb gene family across the highly divergent group of codfishes.
Collapse
|
4
|
Loss of stomach, loss of appetite? Sequencing of the ballan wrasse (Labrus bergylta) genome and intestinal transcriptomic profiling illuminate the evolution of loss of stomach function in fish. BMC Genomics 2018; 19:186. [PMID: 29510660 PMCID: PMC5840709 DOI: 10.1186/s12864-018-4570-8] [Citation(s) in RCA: 27] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2017] [Accepted: 02/28/2018] [Indexed: 02/06/2023] Open
Abstract
BACKGROUND The ballan wrasse (Labrus bergylta) belongs to a large teleost family containing more than 600 species showing several unique evolutionary traits such as lack of stomach and hermaphroditism. Agastric fish are found throughout the teleost phylogeny, in quite diverse and unrelated lineages, indicating stomach loss has occurred independently multiple times in the course of evolution. By assembling the ballan wrasse genome and transcriptome we aimed to determine the genetic basis for its digestive system function and appetite regulation. Among other, this knowledge will aid the formulation of aquaculture diets that meet the nutritional needs of agastric species. RESULTS Long and short read sequencing technologies were combined to generate a ballan wrasse genome of 805 Mbp. Analysis of the genome and transcriptome assemblies confirmed the absence of genes that code for proteins involved in gastric function. The gene coding for the appetite stimulating protein ghrelin was also absent in wrasse. Gene synteny mapping identified several appetite-controlling genes and their paralogs previously undescribed in fish. Transcriptome profiling along the length of the intestine found a declining expression gradient from the anterior to the posterior, and a distinct expression profile in the hind gut. CONCLUSIONS We showed gene loss has occurred for all known genes related to stomach function in the ballan wrasse, while the remaining functions of the digestive tract appear intact. The results also show appetite control in ballan wrasse has undergone substantial changes. The loss of ghrelin suggests that other genes, such as motilin, may play a ghrelin like role. The wrasse genome offers novel insight in to the evolutionary traits of this large family. As the stomach plays a major role in protein digestion, the lack of genes related to stomach digestion in wrasse suggests it requires formulated diets with higher levels of readily digestible protein than those for gastric species.
Collapse
|
5
|
PacBio metabarcoding of Fungi and other eukaryotes: errors, biases and perspectives. THE NEW PHYTOLOGIST 2018; 217:1370-1385. [PMID: 28906012 DOI: 10.1111/nph.14776] [Citation(s) in RCA: 109] [Impact Index Per Article: 18.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/05/2017] [Accepted: 07/31/2017] [Indexed: 05/04/2023]
Abstract
Second-generation, high-throughput sequencing methods have greatly improved our understanding of the ecology of soil microorganisms, yet the short barcodes (< 500 bp) provide limited taxonomic and phylogenetic information for species discrimination and taxonomic assignment. Here, we utilized the third-generation Pacific Biosciences (PacBio) RSII and Sequel instruments to evaluate the suitability of full-length internal transcribed spacer (ITS) barcodes and longer rRNA gene amplicons for metabarcoding Fungi, Oomycetes and other eukaryotes in soil samples. Metabarcoding revealed multiple errors and biases: Taq polymerase substitution errors and mis-incorporating indels in sequencing homopolymers constitute major errors; sequence length biases occur during PCR, library preparation, loading to the sequencing instrument and quality filtering; primer-template mismatches bias the taxonomic profile when using regular and highly degenerate primers. The RSII and Sequel platforms enable the sequencing of amplicons up to 3000 bp, but the sequence quality remains slightly inferior to Illumina sequencing especially in longer amplicons. The full ITS barcode and flanking rRNA small subunit gene greatly improve taxonomic identification at the species and phylum levels, respectively. We conclude that PacBio sequencing provides a viable alternative for metabarcoding of organisms that are of relatively low diversity, require > 500-bp barcode for reliable identification or when phylogenetic approaches are intended.
Collapse
|
6
|
An improved genome assembly uncovers prolific tandem repeats in Atlantic cod. BMC Genomics 2017; 18:95. [PMID: 28100185 PMCID: PMC5241972 DOI: 10.1186/s12864-016-3448-x] [Citation(s) in RCA: 115] [Impact Index Per Article: 16.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2016] [Accepted: 12/20/2016] [Indexed: 01/06/2023] Open
Abstract
BACKGROUND The first Atlantic cod (Gadus morhua) genome assembly published in 2011 was one of the early genome assemblies exclusively based on high-throughput 454 pyrosequencing. Since then, rapid advances in sequencing technologies have led to a multitude of assemblies generated for complex genomes, although many of these are of a fragmented nature with a significant fraction of bases in gaps. The development of long-read sequencing and improved software now enable the generation of more contiguous genome assemblies. RESULTS By combining data from Illumina, 454 and the longer PacBio sequencing technologies, as well as integrating the results of multiple assembly programs, we have created a substantially improved version of the Atlantic cod genome assembly. The sequence contiguity of this assembly is increased fifty-fold and the proportion of gap-bases has been reduced fifteen-fold. Compared to other vertebrates, the assembly contains an unusual high density of tandem repeats (TRs). Indeed, retrospective analyses reveal that gaps in the first genome assembly were largely associated with these TRs. We show that 21% of the TRs across the assembly, 19% in the promoter regions and 12% in the coding sequences are heterozygous in the sequenced individual. CONCLUSIONS The inclusion of PacBio reads combined with the use of multiple assembly programs drastically improved the Atlantic cod genome assembly by successfully resolving long TRs. The high frequency of heterozygous TRs within or in the vicinity of genes in the genome indicate a considerable standing genomic variation in Atlantic cod populations, which is likely of evolutionary importance.
Collapse
|
7
|
Improved metagenome assemblies and taxonomic binning using long-read circular consensus sequence data. Sci Rep 2016; 6:25373. [PMID: 27156482 PMCID: PMC4860591 DOI: 10.1038/srep25373] [Citation(s) in RCA: 109] [Impact Index Per Article: 13.6] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2015] [Accepted: 04/12/2016] [Indexed: 01/22/2023] Open
Abstract
DNA assembly is a core methodological step in metagenomic pipelines used to study the structure and function within microbial communities. Here we investigate the utility of Pacific Biosciences long and high accuracy circular consensus sequencing (CCS) reads for metagenomic projects. We compared the application and performance of both PacBio CCS and Illumina HiSeq data with assembly and taxonomic binning algorithms using metagenomic samples representing a complex microbial community. Eight SMRT cells produced approximately 94 Mb of CCS reads from a biogas reactor microbiome sample that averaged 1319 nt in length and 99.7% accuracy. CCS data assembly generated a comparative number of large contigs greater than 1 kb, to those assembled from a ~190x larger HiSeq dataset (~18 Gb) produced from the same sample (i.e approximately 62% of total contigs). Hybrid assemblies using PacBio CCS and HiSeq contigs produced improvements in assembly statistics, including an increase in the average contig length and number of large contigs. The incorporation of CCS data produced significant enhancements in taxonomic binning and genome reconstruction of two dominant phylotypes, which assembled and binned poorly using HiSeq data alone. Collectively these results illustrate the value of PacBio CCS reads in certain metagenomics applications.
Collapse
|
8
|
Functional interactions among filamentous Epsilonproteobacteria and Bacteroidetes in a deep-sea hydrothermal vent biofilm. Environ Microbiol 2015; 17:4063-77. [PMID: 26147346 DOI: 10.1111/1462-2920.12970] [Citation(s) in RCA: 41] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2015] [Revised: 06/25/2015] [Accepted: 06/25/2015] [Indexed: 11/30/2022]
Abstract
Little is known about how lithoautotrophic primary production is connected to microbial organotrophic consumption in hydrothermal systems. Using a multifaceted approach, we analysed the structure and metabolic capabilities within a biofilm growing on the surface of a black smoker chimney in the Loki's Castle vent field. Imaging revealed the presence of rod-shaped Bacteroidetes growing as ectobionts on long, sheathed microbial filaments (> 100 μm) affiliated with the Sulfurovum genus within Epsilonproteobacteria. The filaments were composed of a thick (> 200 nm) stable polysaccharide, representing a substantial fraction of organic carbon produced by primary production. An integrated -omics approach enabled us to assess the metabolic potential and in situ metabolism of individual taxonomic and morphological groups identified by imaging. Specifically, we provide evidence that organotrophic Bacteroidetes attach to and glide along the surface of Sulfurovum filaments utilizing organic polymers produced by the lithoautotrophic Sulfurovum. Furthermore, in situ expression of acetyl-CoA synthetase by Sulfurovum suggested the ability to assimilate acetate, indicating recycling of organic matter in the biofilm. This study expands our understanding of the lifestyles of Epsilonproteobacteria in hydrothermal vents, their metabolic properties and co-operative interactions in deep-sea hydrothermal vent food webs.
Collapse
|
9
|
Use of high throughput sequencing and light microscopy show contrasting results in a study of phytoplankton occurrence in a freshwater environment. PLoS One 2014; 9:e106510. [PMID: 25171164 PMCID: PMC4149573 DOI: 10.1371/journal.pone.0106510] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2014] [Accepted: 07/19/2014] [Indexed: 11/25/2022] Open
Abstract
Assessing phytoplankton diversity is of primary importance for both basic and applied ecological studies. Following the advances in molecular methods, phytoplankton studies are switching from using classical microscopy to high throughput sequencing approaches. However, methodological comparisons of these approaches have rarely been reported. In this study, we compared the two methods, using a unique dataset of multiple water samples taken from a natural freshwater environment. Environmental DNA was extracted from 300 water samples collected weekly during 20 years, followed by high throughput sequencing of amplicons from the 16S and 18S rRNA hypervariable regions. For each water sample, phytoplankton diversity was also estimated using light microscopy. Our study indicates that species compositions detected by light microscopy and 454 high throughput sequencing do not always match. High throughput sequencing detected more rare species and picoplankton than light microscopy, and thus gave a better assessment of phytoplankton diversity. However, when compared to light microscopy, high throughput sequencing of 16S and 18S rRNA amplicons did not adequately identify phytoplankton at the species level. In summary, our study recommends a combined strategy using both morphological and molecular techniques.
Collapse
|
10
|
The chloroplast genome of the diatom Seminavis robusta: New features introduced through multiple mechanisms of horizontal gene transfer. Mar Genomics 2014; 16:17-27. [DOI: 10.1016/j.margen.2013.12.002] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2013] [Revised: 11/29/2013] [Accepted: 12/04/2013] [Indexed: 10/25/2022]
|
11
|
The genome sequence of Atlantic cod reveals a unique immune system. Nature 2011; 477:207-10. [PMID: 21832995 DOI: 10.1038/nature10342] [Citation(s) in RCA: 532] [Impact Index Per Article: 40.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2011] [Accepted: 06/28/2011] [Indexed: 01/24/2023]
Abstract
Atlantic cod (Gadus morhua) is a large, cold-adapted teleost that sustains long-standing commercial fisheries and incipient aquaculture. Here we present the genome sequence of Atlantic cod, showing evidence for complex thermal adaptations in its haemoglobin gene cluster and an unusual immune architecture compared to other sequenced vertebrates. The genome assembly was obtained exclusively by 454 sequencing of shotgun and paired-end libraries, and automated annotation identified 22,154 genes. The major histocompatibility complex (MHC) II is a conserved feature of the adaptive immune system of jawed vertebrates, but we show that Atlantic cod has lost the genes for MHC II, CD4 and invariant chain (Ii) that are essential for the function of this pathway. Nevertheless, Atlantic cod is not exceptionally susceptible to disease under natural conditions. We find a highly expanded number of MHC I genes and a unique composition of its Toll-like receptor (TLR) families. This indicates how the Atlantic cod immune system has evolved compensatory mechanisms in both adaptive and innate immunity in the absence of MHC II. These observations affect fundamental assumptions about the evolution of the adaptive immune system and its components in vertebrates.
Collapse
|
12
|
Polymorphism, selection and tandem duplication of transferrin genes in Atlantic cod (Gadus morhua)--conserved synteny between fish monolobal and tetrapod bilobal transferrin loci. BMC Genet 2011; 12:51. [PMID: 21612617 PMCID: PMC3125230 DOI: 10.1186/1471-2156-12-51] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2011] [Accepted: 05/25/2011] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The two homologous iron-binding lobes of transferrins are thought to have evolved by gene duplication of an ancestral monolobal form, but any conserved synteny between bilobal and monolobal transferrin loci remains unexplored. The important role played by transferrin in the resistance to invading pathogens makes this polymorphic gene a highly valuable candidate for studying adaptive divergence among local populations. RESULTS The Atlantic cod genome was shown to harbour two tandem duplicated serum transferrin genes (Tf1, Tf2), a melanotransferrin gene (MTf), and a monolobal transferrin gene (Omp). Whereas Tf1 and Tf2 were differentially expressed in liver and brain, the Omp transcript was restricted to the otoliths. Fish, chicken and mammals showed highly conserved syntenic regions in which monolobal and bilobal transferrins reside, but contrasting with tetrapods, the fish transferrin genes are positioned on three different linkage groups. Sequence alignment of cod Tf1 cDNAs from Northeast (NE) and Northwest (NW) Atlantic populations revealed 22 single nucleotide polymorphisms (SNP) causing the replacement of 16 amino acids, including eight surface residues revealed by the modelled 3D-structures, that might influence the binding of pathogens for removal of iron. SNP analysis of a total of 375 individuals from 14 trans-Atlantic populations showed that the Tf1-NE variant was almost fixed in the Baltic cod and predominated in the other NE Atlantic populations, whereas the NW Atlantic populations were more heterozygous and showed high frequencies of the Tf-NW SNP alleles. CONCLUSIONS The highly conserved synteny between fish and tetrapod transferrin loci infers that the fusion of tandem duplicated Omp-like genes gave rise to the modern transferrins. The multiple nonsynonymous substitutions in cod Tf1 with putative structural effects, together with highly divergent allele frequencies among different cod populations, strongly suggest evidence for positive selection and local adaptation in trans-Atlantic cod populations.
Collapse
|
13
|
Abstract
The dinoflagellates have repeatedly replaced their ancestral peridinin-plastid by plastids derived from a variety of algal lineages ranging from green algae to diatoms. Here, we have characterized the genome of a dinoflagellate plastid of tertiary origin in order to understand the evolutionary processes that have shaped the organelle since it was acquired as a symbiont cell. To address this, the genome of the haptophyte-derived plastid in Karlodinium veneficum was analyzed by Sanger sequencing of library clones and 454 pyrosequencing of plastid enriched DNA fractions. The sequences were assembled into a single contig of 143 kb, encoding 70 proteins, 3 rRNAs and a nearly full set of tRNAs. Comparative genomics revealed massive rearrangements and gene losses compared to the haptophyte plastid; only a small fraction of the gene clusters usually found in haptophytes as well as other types of plastids are present in K. veneficum. Despite the reduced number of genes, the K. veneficum plastid genome has retained a large size due to expanded intergenic regions. Some of the plastid genes are highly diverged and may be pseudogenes or subject to RNA editing. Gene losses and rearrangements are also features of the genomes of the peridinin-containing plastids, apicomplexa and Chromera, suggesting that the evolutionary processes that once shaped these plastids have occurred at multiple independent occasions over the history of the Alveolata.
Collapse
|
14
|
Structural analysis of a non-ribosomal halogenated cyclic peptide and its putative operon from Microcystis: implication for evolution of cyanopeptolins. Microbiology (Reading) 2009; 155:2106-2108. [DOI: 10.1099/mic.0.30275-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
|
15
|
Differential spatio-temporal expression and functional diversification of the myogenic regulatory factors MyoD1 and MyoD2 in Atlantic halibut (Hippoglossus hippoglossus). Comp Biochem Physiol B Biochem Mol Biol 2009; 154:93-101. [PMID: 19454321 DOI: 10.1016/j.cbpb.2009.05.009] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2009] [Revised: 04/30/2009] [Accepted: 05/11/2009] [Indexed: 10/20/2022]
Abstract
Development of the vertebrate skeletal muscle is orchestrated by the myogenic regulatory factors MyoD, Myf5, myogenin and MRF4, which likely arose from the duplications of a single ancestral gene early in vertebrate evolution. We have isolated two myod genes from Atlantic halibut and examined their differential expression during embryogenesis using quantitative PCR and in situ hybridization to address their functional roles in this asymmetrically organized flatfish. myod1 was initially maternally expressed, while myod2 mRNA was first detectable during gastrulation. The myod1 mRNA levels predominated throughout somitogenesis, and both slow and fast muscle precursor cells displayed the bilateral symmetric myod1 signal during the formation of the symmetric somite pairs. In contrast, myod2 was left-right asymmetrically expressed in the fast muscle precursors. The random expression of myod2 was not associated with the right-sided eye migration and the development of thicker fast skeletal muscle on the eyed side than on the blind side. The nucleotide substitution analysis indicated that the teleost MyoDs essentially have evolved under purifying selection, but a subset of amino acid sites under strong positive selection were identified in the MyoD2 branch. Altogether, halibut MyoD1 seems to have retained the central role of MyoD in driving skeletal myogenesis, whereas the function of MyoD2 is uncertain in this flatfish species.
Collapse
|
16
|
Evidence for positive selection acting on microcystin synthetase adenylation domains in three cyanobacterial genera. BMC Evol Biol 2008; 8:256. [PMID: 18808704 PMCID: PMC2564945 DOI: 10.1186/1471-2148-8-256] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2008] [Accepted: 09/22/2008] [Indexed: 11/30/2022] Open
Abstract
Background Cyanobacteria produce a wealth of secondary metabolites, including the group of small cyclic heptapeptide hepatotoxins that constitutes the microcystin family. The enzyme complex that directs the biosynthesis of microcystin is encoded in a single large gene cluster (mcy). mcy genes have a widespread distribution among cyanobacteria and are likely to have an ancient origin. The notable diversity within some of the Mcy modules is generated through various recombination events including horizontal gene transfer. Results A comparative analysis of the adenylation domains from the first module of McyB (McyB1) and McyC in the microcystin synthetase complex was performed on a large number of microcystin-producing strains from the Anabaena, Microcystis and Planktothrix genera. We found no decisive evidence for recombination between strains from different genera. However, we detected frequent recombination events in the mcyB and mcyC genes between strains within the same genus. Frequent interdomain recombination events were also observed between mcyB and mcyC sequences in Anabaena and Microcystis. Recombination and mutation rate ratios suggest that the diversification of mcyB and mcyC genes is driven by recombination events as well as point mutations in all three genera. Sequence analysis suggests that generally the adenylation domains of the first domain of McyB and McyC are under purifying selection. However, we found clear evidence for positive selection acting on a number of amino acid residues within these adenylation domains. These include residues important for active site selectivity of the adenylation domain, strongly suggesting selection for novel microcystin variants. Conclusion We provide the first clear evidence for positive selection acting on amino acid residues involved directly in the recognition and activation of amino acids incorporated into microcystin, indicating that the microcystin complement of a given strain may influence the ability of a particular strain to interact with its environment.
Collapse
|
17
|
Abstract
An extensive study of the mcyABC genes and regions flanking the mcy gene cluster was performed in naturally occurring Microcystis strains. Lack of methylation in strains producing only desmethyl(7)-microcystin was found to be associated with point mutations in substrate-binding sequence motifs of the N-methyltransferase (NMT) domain in McyA. Multiple recombination events giving rise to 'phylogenetic mosaics' were detected within the NMT-domain-encoding mcyA sequences and the adenylation (A) domain sequences of mcyB and mcyC. Recombination leading to exchanges between the mcyB and mcyC regions encoding A domains in modules McyB1 and McyC was also detected. A previously reported replacement of the A domain in McyB1 was found to involve the region between the conserved motifs A3 and A8/A9. In all microcystin-producing strains the mcy gene cluster was flanked by the genes uma1 and dnaN. Clear indications of recombination, an insertion element and footprints of IS elements were found in the dnaN-mcyJ intergenic region. Among the non-microcystin producers, uma1 and dnaN were linked in some, but not all strains. Most non-producing strains lacked all mcy genes, while one strain possessed a partially deleted mcy operon. Our results show that frequent horizontal gene transfer events in addition to point mutations and insertions/deletions contribute to variation in the mcy gene cluster.
Collapse
|
18
|
Natural occurrence of microcystin synthetase deletion mutants capable of producing microcystins in strains of the genus Anabaena (Cyanobacteria). Microbiology (Reading) 2008; 154:1007-1014. [DOI: 10.1099/mic.0.2007/016097-0] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
|
19
|
Structural analysis of a non-ribosomal halogenated cyclic peptide and its putative operon from Microcystis: implications for evolution of cyanopeptolins. MICROBIOLOGY-SGM 2007; 153:1382-1393. [PMID: 17464052 DOI: 10.1099/mic.0.2006/001123-0] [Citation(s) in RCA: 42] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
The structure of the major peptide produced by Microcystis cf. wesenbergii NIVA-CYA 172/5, the halogenated heptapeptide cyanopeptolin-984, was determined using LC/MS/MS. A gene cluster encoding a peptide synthetase putatively producing a cyanopeptolin was cloned from the same strain and sequenced. The cluster consists of four genes encoding peptide synthetases and one gene encoding a halogenase. Two additional ORFs transcribed in the opposite direction were found in the 5' flanking sequence; one of these encodes an ABC transporter. The overall organization of the cyanopeptolin synthetase operon (mcn) resembles a previously analysed anabaenopeptilide synthetase operon (apd) from Anabaena strain 90. Phylogenetic analyses of the individual domains from Mcn, Apd and other cyanobacterial peptide synthetases showed clustering of the adenylation domains according to function irrespective of operon origin - indicating strong functional constraints across peptide synthetases. In contrast, the condensation and thiolation domains to a large extent grouped according to operon affiliation or position in the respective operons. Phylogenetic analyses of condensation domains indicated that N-terminal domains and domains that condense L-amino acids and D-amino acids, respectively, form three separate groups. Although recombination events are likely to be involved in the evolution of mcn, no clear evidence of genetic recombination between the two cyanopeptolin gene clusters was found. Within the genus Microcystis, microcystin and cyanopeptolin synthetases have an evolutionary history of genomic coexistence. However, the data indicated that the two classes of peptide synthetase gene clusters have evolved independently.
Collapse
MESH Headings
- ATP-Binding Cassette Transporters/genetics
- Anabaena/genetics
- Bacterial Proteins/genetics
- Chromatography, Liquid
- DNA, Bacterial/chemistry
- DNA, Bacterial/genetics
- Evolution, Molecular
- Gene Order
- Mass Spectrometry
- Microcystis/genetics
- Microcystis/metabolism
- Molecular Sequence Data
- Molecular Structure
- Multigene Family
- Oligopeptides/biosynthesis
- Oligopeptides/chemistry
- Oligopeptides/genetics
- Open Reading Frames
- Operon
- Oxidoreductases/genetics
- Peptide Synthases/genetics
- Peptides, Cyclic/biosynthesis
- Peptides, Cyclic/chemistry
- Peptides, Cyclic/genetics
- Phylogeny
- Protein Structure, Tertiary/genetics
- Sequence Analysis, DNA
- Sequence Homology, Amino Acid
Collapse
|
20
|
Myostatin (MSTN) gene duplications in Atlantic salmon (Salmo salar): evidence for different selective pressure on teleost MSTN-1 and -2. Gene 2007; 403:159-69. [PMID: 17890020 DOI: 10.1016/j.gene.2007.08.008] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2007] [Revised: 08/08/2007] [Accepted: 08/16/2007] [Indexed: 12/14/2022]
Abstract
Whereas the negative muscle regulator myostatin (MSTN) in mammals is almost exclusively expressed in the muscle by a single encoding gene, teleost fish possess at least two MSTN genes which are differentially expressed in both muscular and non-muscular tissues. Duplicated MSTN-1 genes have previously been identified in the tetraploid salmonid genome. From Atlantic salmon we succeeded in isolating the paralogous genes of MSTN-2, which shared about 70% identity with MSTN-1a and -1b. The salmon MSTN-2a cDNA encoded a predicted protein of 363 residues and included the conserved C-terminal bioactive domain. MSTN-2a seemed to be primarily expressed in the brain, and a functional role of teleost MSTN-2 in the neurogenesis similar to the inhibitory action of the closely related GDF-11 in the mammalian brain was proposed. In contrast, a frame-shift mutation in exon 1 of salmon MSTN-2b would lead to the synthesis of a putatively non-functional truncated protein. The absence of processed MSTN-2b mRNA in the examined tissues indicated that this gene has become a non-functional pseudogene. The differential, but partially overlapping, expression patterns of salmon MSTN-2a, -1a and -1b in muscular and non-muscular tissues are probably due to the different arrangement of the potential cis-acting regulatory elements identified in their putative promoter regions. Single and paired E-boxes in the MSTN-1b promoter were shown to bind both homo-and hetero-dimers of the myogenic regulatory factor MyoD and E47 in vitro of importance for initiating the myogenic program. Analyses of nucleotide substitution patterns indicated that the teleost MSTNs essentially have evolved under purifying selection, but a subset of amino acid sites under positive selective pressure were identified within the MSTN1 branch. The results may reflect the evolutionary forces related to adoption of the different functional roles proposed for the teleost MSTN isoforms. The phylogenetic analysis of multiple vertebrate MSTNs suggested at least two separate gene duplication events in the fish lineage. Linkage analysis of polymorphic microsatellites within intron 2 of salmon MSTN-1a and -1b mapped the two genes to different linkage groups in agreement with the tetraploid origin of the duplicated salmonid MSTN-1 and MSTN-2 genes.
Collapse
|