51
|
Elsner D, Kremer LP, Arning N, Bornberg-Bauer E. Chapter 6. Comparative genomic approaches to investigate molecular traits specific to social insects. CURRENT OPINION IN INSECT SCIENCE 2016; 16:87-94. [PMID: 27720056 DOI: 10.1016/j.cois.2016.05.016] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/31/2016] [Revised: 05/01/2016] [Accepted: 05/20/2016] [Indexed: 06/06/2023]
Abstract
Ageing is a feature of nearly all known organisms and, by its connection to survival, appears to trade off with fecundity. However, in some organisms such as in queens of social insects, this negative relation appears reversed and individuals live long and reproduce much. Since new experimental techniques, transcriptomes and genomes of many social insects have recently become available, a comparison of these data in a phylogenetic framework becomes feasible. This allows the study of general trends, species specific oddities and evolutionary dynamics of the molecular properties and changes which underlie ageing, fecundity and the reversal of this negative association. In the framework of social insect evolution, we review the most important recent insights, computational methods, their applications and data resources which are available.
Collapse
|
52
|
Haase D, Rieger JK, Witten A, Stoll M, Bornberg-Bauer E, Kalbe M, Schmidt-Drewello A, Scharsack JP, Reusch TB. Comparative transcriptomics of stickleback immune gene responses upon infection by two helminth parasites, Diplostomum pseudospathaceum and Schistocephalus solidus. ZOOLOGY 2016; 119:307-13. [DOI: 10.1016/j.zool.2016.05.005] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2015] [Revised: 03/07/2016] [Accepted: 05/22/2016] [Indexed: 12/13/2022]
|
53
|
Schmitz JF, Zimmer F, Bornberg-Bauer E. Mechanisms of transcription factor evolution in Metazoa. Nucleic Acids Res 2016; 44:6287-97. [PMID: 27288445 PMCID: PMC5291267 DOI: 10.1093/nar/gkw492] [Citation(s) in RCA: 42] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2016] [Revised: 05/18/2016] [Accepted: 05/22/2016] [Indexed: 11/12/2022] Open
Abstract
Transcriptions factors (TFs) are pivotal for the regulation of virtually all cellular processes, including growth and development. Expansions of TF families are causally linked to increases in organismal complexity. Here we study the evolutionary dynamics, genetic causes and functional implications of the five largest metazoan TF families. We find that family expansions dominate across the whole metazoan tree; however, some branches experience exceptional family-specific accelerated expansions. Additionally, we find that such expansions are often predated by modular domain rearrangements, which spur the expansion of a new sub-family by separating it from the rest of the TF family in terms of protein-protein interactions. This separation allows for radical shifts in the functional spectrum of a duplicated TF. We also find functional differentiation inside TF sub-families as changes in expression specificity. Furthermore, accelerated family expansions are facilitated by repeats of sequence motifs such as C2H2 zinc fingers. We quantify whole genome duplications and single gene duplications as sources of TF family expansions, implying that some, but not all, TF duplicates are preferentially retained. We conclude that trans-regulatory changes (domain rearrangements) are instrumental for fundamental functional innovations, that cis-regulatory changes (affecting expression) accomplish wide-spread fine tuning and both jointly contribute to the functional diversification of TFs.
Collapse
|
54
|
Dohmen E, Kremer LPM, Bornberg-Bauer E, Kemena C. DOGMA: domain-based transcriptome and proteome quality assessment. Bioinformatics 2016; 32:2577-81. [PMID: 27153665 DOI: 10.1093/bioinformatics/btw231] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2016] [Accepted: 04/21/2016] [Indexed: 12/29/2022] Open
Abstract
MOTIVATION Genome studies have become cheaper and easier than ever before, due to the decreased costs of high-throughput sequencing and the free availability of analysis software. However, the quality of genome or transcriptome assemblies can vary a lot. Therefore, quality assessment of assemblies and annotations are crucial aspects of genome analysis pipelines. RESULTS We developed DOGMA, a program for fast and easy quality assessment of transcriptome and proteome data based on conserved protein domains. DOGMA measures the completeness of a given transcriptome or proteome and provides information about domain content for further analysis. DOGMA provides a very fast way to do quality assessment within seconds. AVAILABILITY AND IMPLEMENTATION DOGMA is implemented in Python and published under GNU GPL v.3 license. The source code is available on https://ebbgit.uni-muenster.de/domainWorld/DOGMA/ CONTACTS: e.dohmen@wwu.de or c.kemena@wwu.de SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
|
55
|
Huang Y, Chain FJJ, Panchal M, Eizaguirre C, Kalbe M, Lenz TL, Samonte IE, Stoll M, Bornberg-Bauer E, Reusch TBH, Milinski M, Feulner PGD. Transcriptome profiling of immune tissues reveals habitat-specific gene expression between lake and river sticklebacks. Mol Ecol 2016; 25:943-58. [PMID: 26749022 PMCID: PMC4790908 DOI: 10.1111/mec.13520] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2015] [Revised: 11/18/2015] [Accepted: 12/10/2015] [Indexed: 12/16/2022]
Abstract
The observation of habitat-specific phenotypes suggests the action of natural selection. The three-spined stickleback (Gasterosteus aculeatus) has repeatedly colonized and adapted to diverse freshwater habitats across the northern hemisphere since the last glaciation, while giving rise to recurring phenotypes associated with specific habitats. Parapatric lake and river populations of sticklebacks harbour distinct parasite communities, a factor proposed to contribute to adaptive differentiation between these ecotypes. However, little is known about the transcriptional response to the distinct parasite pressure of those fish in a natural setting. Here, we sampled wild-caught sticklebacks across four geographical locations from lake and river habitats differing in their parasite load. We compared gene expression profiles between lake and river populations using 77 whole-transcriptome libraries from two immune-relevant tissues, the head kidney and the spleen. Differential expression analyses revealed 139 genes with habitat-specific expression patterns across the sampled population pairs. Among the 139 differentially expressed genes, eight are annotated with an immune function and 42 have been identified as differentially expressed in previous experimental studies in which fish have been immune challenged. Together, these findings reinforce the hypothesis that parasites contribute to adaptation of sticklebacks in lake and river habitats.
Collapse
|
56
|
Haase D, Rieger JK, Witten A, Stoll M, Bornberg-Bauer E, Kalbe M, Reusch TBH. Immunity comes first: the effect of parasite genotypes on adaptive immunity and immunization in three-spined sticklebacks. DEVELOPMENTAL AND COMPARATIVE IMMUNOLOGY 2016; 54:137-144. [PMID: 26400836 DOI: 10.1016/j.dci.2015.09.008] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/31/2015] [Revised: 09/16/2015] [Accepted: 09/17/2015] [Indexed: 06/05/2023]
Abstract
Adaptive immunity in vertebrates can confer increased resistance against invading pathogens upon re-infection. But how specific parasite genotypes affect the temporal transition from innate to adaptive immunity under continual exposure to parasites is poorly understood. Here, we investigated the effects of homologous and heterologous exposures of genetically distinct parasite lineages of the eye fluke Diplostomum pseudospathaceum on gene expression patterns of adaptive immunity in sticklebacks (Gasterosteus aculeatus). Observable differences in gene expression were largely attributable to final exposures while there was no transcription pattern characteristic for a general response to repeated infections with D. pseudospathaceum. None of the final exposure treatments was able to erase the distinct expression patterns resulting from a heterologous pre-exposed fish. Interestingly, heterologous final exposures showed similarities between different treatment groups subjected to homologous pre-exposure. The observed pattern was supported by parasite infection rates and suggests that host immunization was optimized towards an adaptive immune response that favored effectiveness against parasite diversity over specificity.
Collapse
|
57
|
Feulner PGD, Chain FJJ, Panchal M, Huang Y, Eizaguirre C, Kalbe M, Lenz TL, Samonte IE, Stoll M, Bornberg-Bauer E, Reusch TBH, Milinski M. Correction: Genomics of Divergence along a Continuum of Parapatric Population Differentiation. PLoS Genet 2015; 11:e1005414. [PMID: 26226517 PMCID: PMC4520447 DOI: 10.1371/journal.pgen.1005414] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
|
58
|
Smith CR, Helms Cahan S, Kemena C, Brady SG, Yang W, Bornberg-Bauer E, Eriksson T, Gadau J, Helmkampf M, Gotzek D, Okamoto Miyakawa M, Suarez AV, Mikheyev A. How Do Genomes Create Novel Phenotypes? Insights from the Loss of the Worker Caste in Ant Social Parasites. Mol Biol Evol 2015; 32:2919-31. [PMID: 26226984 PMCID: PMC4651238 DOI: 10.1093/molbev/msv165] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
A central goal of biology is to uncover the genetic basis for the origin of new
phenotypes. A particularly effective approach is to examine the genomic
architecture of species that have secondarily lost a phenotype with respect to
their close relatives. In the eusocial Hymenoptera, queens and workers have
divergent phenotypes that may be produced via either expression of alternative
sets of caste-specific genes and pathways or differences in expression patterns
of a shared set of multifunctional genes. To distinguish between these two
hypotheses, we investigated how secondary loss of the worker phenotype in
workerless ant social parasites impacted genome evolution across two independent
origins of social parasitism in the ant genera Pogonomyrmex and
Vollenhovia. We sequenced the genomes of three social
parasites and their most-closely related eusocial host species and compared gene
losses in social parasites with gene expression differences between host queens
and workers. Virtually all annotated genes were expressed to some degree in both
castes of the host, with most shifting in queen-worker bias across developmental
stages. As a result, despite >1 My of divergence from the last common
ancestor that had workers, the social parasites showed strikingly little
evidence of gene loss, damaging mutations, or shifts in selection regime
resulting from loss of the worker caste. This suggests that regulatory changes
within a multifunctional genome, rather than sequence differences, have played a
predominant role in the evolution of social parasitism, and perhaps also in the
many gains and losses of phenotypes in the social insects.
Collapse
|
59
|
Schüler A, Schmitz G, Reft A, Özbek S, Thurm U, Bornberg-Bauer E. The Rise and Fall of TRP-N, an Ancient Family of Mechanogated Ion Channels, in Metazoa. Genome Biol Evol 2015; 7:1713-27. [PMID: 26100409 PMCID: PMC4494053 DOI: 10.1093/gbe/evv091] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
Mechanoreception, the sensing of mechanical forces, is an ancient means of orientation and communication and tightly linked to the evolution of motile animals. In flies, the transient-receptor-potential N protein (TRP-N) was found to be a cilia-associated mechanoreceptor. TRP-N belongs to a large and diverse family of ion channels. Its unusually long N-terminal repeat of 28 ankyrin domains presumably acts as the gating spring by which mechanical energy induces channel gating. We analyzed the evolutionary origins and possible diversification of TRP-N. Using a custom-made set of highly discriminative sequence profiles we scanned a representative set of metazoan genomes and subsequently corrected several gene models. We find that, contrary to other ion channel families, TRP-N is remarkably conserved in its domain arrangements and copy number (1) in all Bilateria except for amniotes, even in the wake of several whole-genome duplications. TRP-N is absent in Porifera but present in Ctenophora and Placozoa. Exceptional multiplications of TRP-N occurred in Cnidaria, independently along the Hydra and the Nematostella lineage. Molecular signals of subfunctionalization can be attributed to different mechanisms of activation of the gating spring. In Hydra this is further supported by in situ hybridization and immune staining, suggesting that at least three paralogs adapted to nematocyte discharge, which is key for predation and defense. We propose that these new candidate proteins help explain the sensory complexity of Cnidaria which has been previously observed but so far has lacked a molecular underpinning. Also, the ancient appearance of TRP-N supports a common origin of important components of the nervous systems in Ctenophores, Cnidaria, and Bilateria.
Collapse
|
60
|
Kersting AR, Mizrachi E, Bornberg-Bauer E, Myburg AA. Protein domain evolution is associated with reproductive diversification and adaptive radiation in the genus Eucalyptus. THE NEW PHYTOLOGIST 2015; 206:1328-36. [PMID: 25494981 DOI: 10.1111/nph.13211] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/15/2014] [Accepted: 11/04/2014] [Indexed: 05/04/2023]
Abstract
Eucalyptus is a pivotal genus within the rosid order Myrtales with distinct geographic history and adaptations. Comparative analysis of protein domain evolution in the newly sequenced Eucalyptus grandis genome and other rosid lineages sheds light on the adaptive mechanisms integral to the success of this genus of woody perennials. We reconstructed the ancestral domain content to elucidate the gain, loss and expansion of protein domains and domain arrangements in Eucalyptus in the context of rosid phylogeny. We used functional gene ontology (GO) annotation of genes to investigate the possible biological and evolutionary consequences of protein domain expansion. We found that protein modulation within the angiosperms occurred primarily on the level of expansion of certain domains and arrangements. Using RNA-Seq data from E. grandis, we showed that domain expansions have contributed to tissue-specific expression of tandemly duplicated genes. Our results indicate that tandem duplication of genes, a key feature of the Eucalyptus genome, has played an important role in the expansion of domains, particularly in proteins related to the specialization of reproduction and biotic and abiotic interactions affecting root and floral biology, and that tissue-specific expression of proteins with expanded domains has facilitated subfunctionalization in domain families.
Collapse
|
61
|
Masri L, Branca A, Sheppard AE, Papkou A, Laehnemann D, Guenther PS, Prahl S, Saebelfeld M, Hollensteiner J, Liesegang H, Brzuszkiewicz E, Daniel R, Michiels NK, Schulte RD, Kurtz J, Rosenstiel P, Telschow A, Bornberg-Bauer E, Schulenburg H. Host-Pathogen Coevolution: The Selective Advantage of Bacillus thuringiensis Virulence and Its Cry Toxin Genes. PLoS Biol 2015; 13:e1002169. [PMID: 26042786 PMCID: PMC4456383 DOI: 10.1371/journal.pbio.1002169] [Citation(s) in RCA: 62] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2014] [Accepted: 05/07/2015] [Indexed: 01/11/2023] Open
Abstract
Reciprocal coevolution between host and pathogen is widely seen as a major driver of evolution and biological innovation. Yet, to date, the underlying genetic mechanisms and associated trait functions that are unique to rapid coevolutionary change are generally unknown. We here combined experimental evolution of the bacterial biocontrol agent Bacillus thuringiensis and its nematode host Caenorhabditis elegans with large-scale phenotyping, whole genome analysis, and functional genetics to demonstrate the selective benefit of pathogen virulence and the underlying toxin genes during the adaptation process. We show that: (i) high virulence was specifically favoured during pathogen-host coevolution rather than pathogen one-sided adaptation to a nonchanging host or to an environment without host; (ii) the pathogen genotype BT-679 with known nematocidal toxin genes and high virulence specifically swept to fixation in all of the independent replicate populations under coevolution but only some under one-sided adaptation; (iii) high virulence in the BT-679-dominated populations correlated with elevated copy numbers of the plasmid containing the nematocidal toxin genes; (iv) loss of virulence in a toxin-plasmid lacking BT-679 isolate was reconstituted by genetic reintroduction or external addition of the toxins. We conclude that sustained coevolution is distinct from unidirectional selection in shaping the pathogen's genome and life history characteristics. To our knowledge, this study is the first to characterize the pathogen genes involved in coevolutionary adaptation in an animal host-pathogen interaction system.
Collapse
|
62
|
Bitard-Feildel T, Kemena C, Greenwood JM, Bornberg-Bauer E. Domain similarity based orthology detection. BMC Bioinformatics 2015; 16:154. [PMID: 25968113 PMCID: PMC4443542 DOI: 10.1186/s12859-015-0570-8] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2014] [Accepted: 04/10/2015] [Indexed: 11/10/2022] Open
Abstract
Background Orthologous protein detection software mostly uses pairwise comparisons of amino-acid sequences to assert whether two proteins are orthologous or not. Accordingly, when the number of sequences for comparison increases, the number of comparisons to compute grows in a quadratic order. A current challenge of bioinformatic research, especially when taking into account the increasing number of sequenced organisms available, is to make this ever-growing number of comparisons computationally feasible in a reasonable amount of time. We propose to speed up the detection of orthologous proteins by using strings of domains to characterize the proteins. Results We present two new protein similarity measures, a cosine and a maximal weight matching score based on domain content similarity, and new software, named porthoDom. The qualities of the cosine and the maximal weight matching similarity measures are compared against curated datasets. The measures show that domain content similarities are able to correctly group proteins into their families. Accordingly, the cosine similarity measure is used inside porthoDom, the wrapper developed for proteinortho. porthoDom makes use of domain content similarity measures to group proteins together before searching for orthologs. By using domains instead of amino acid sequences, the reduction of the search space decreases the computational complexity of an all-against-all sequence comparison. Conclusion We demonstrate that representing and comparing proteins as strings of discrete domains, i.e. as a concatenation of their unique identifiers, allows a drastic simplification of search space. porthoDom has the advantage of speeding up orthology detection while maintaining a degree of accuracy similar to proteinortho. The implementation of porthoDom is released using python and C++ languages and is available under the GNU GPL licence 3 at http://www.bornberglab.org/pages/porthoda. Electronic supplementary material The online version of this article (doi:10.1186/s12859-015-0570-8) contains supplementary material, which is available to authorized users.
Collapse
|
63
|
Sadd BM, Barribeau SM, Bloch G, de Graaf DC, Dearden P, Elsik CG, Gadau J, Grimmelikhuijzen CJP, Hasselmann M, Lozier JD, Robertson HM, Smagghe G, Stolle E, Van Vaerenbergh M, Waterhouse RM, Bornberg-Bauer E, Klasberg S, Bennett AK, Câmara F, Guigó R, Hoff K, Mariotti M, Munoz-Torres M, Murphy T, Santesmasses D, Amdam GV, Beckers M, Beye M, Biewer M, Bitondi MMG, Blaxter ML, Bourke AFG, Brown MJF, Buechel SD, Cameron R, Cappelle K, Carolan JC, Christiaens O, Ciborowski KL, Clarke DF, Colgan TJ, Collins DH, Cridge AG, Dalmay T, Dreier S, du Plessis L, Duncan E, Erler S, Evans J, Falcon T, Flores K, Freitas FCP, Fuchikawa T, Gempe T, Hartfelder K, Hauser F, Helbing S, Humann FC, Irvine F, Jermiin LS, Johnson CE, Johnson RM, Jones AK, Kadowaki T, Kidner JH, Koch V, Köhler A, Kraus FB, Lattorff HMG, Leask M, Lockett GA, Mallon EB, Antonio DSM, Marxer M, Meeus I, Moritz RFA, Nair A, Näpflin K, Nissen I, Niu J, Nunes FMF, Oakeshott JG, Osborne A, Otte M, Pinheiro DG, Rossié N, Rueppell O, Santos CG, Schmid-Hempel R, Schmitt BD, Schulte C, Simões ZLP, Soares MPM, Swevers L, Winnebeck EC, Wolschin F, Yu N, Zdobnov EM, Aqrawi PK, Blankenburg KP, Coyle M, Francisco L, Hernandez AG, Holder M, Hudson ME, Jackson L, Jayaseelan J, Joshi V, Kovar C, Lee SL, Mata R, Mathew T, Newsham IF, Ngo R, Okwuonu G, Pham C, Pu LL, Saada N, Santibanez J, Simmons D, Thornton R, Venkat A, Walden KKO, Wu YQ, Debyser G, Devreese B, Asher C, Blommaert J, Chipman AD, Chittka L, Fouks B, Liu J, O'Neill MP, Sumner S, Puiu D, Qu J, Salzberg SL, Scherer SE, Muzny DM, Richards S, Robinson GE, Gibbs RA, Schmid-Hempel P, Worley KC. The genomes of two key bumblebee species with primitive eusocial organization. Genome Biol 2015; 16:76. [PMID: 25908251 PMCID: PMC4414376 DOI: 10.1186/s13059-015-0623-3] [Citation(s) in RCA: 249] [Impact Index Per Article: 27.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2014] [Accepted: 03/10/2015] [Indexed: 12/25/2022] Open
Abstract
Background The shift from solitary to social behavior is one of the major evolutionary transitions. Primitively eusocial bumblebees are uniquely placed to illuminate the evolution of highly eusocial insect societies. Bumblebees are also invaluable natural and agricultural pollinators, and there is widespread concern over recent population declines in some species. High-quality genomic data will inform key aspects of bumblebee biology, including susceptibility to implicated population viability threats. Results We report the high quality draft genome sequences of Bombus terrestris and Bombus impatiens, two ecologically dominant bumblebees and widely utilized study species. Comparing these new genomes to those of the highly eusocial honeybee Apis mellifera and other Hymenoptera, we identify deeply conserved similarities, as well as novelties key to the biology of these organisms. Some honeybee genome features thought to underpin advanced eusociality are also present in bumblebees, indicating an earlier evolution in the bee lineage. Xenobiotic detoxification and immune genes are similarly depauperate in bumblebees and honeybees, and multiple categories of genes linked to social organization, including development and behavior, show high conservation. Key differences identified include a bias in bumblebee chemoreception towards gustation from olfaction, and striking differences in microRNAs, potentially responsible for gene regulation underlying social and other traits. Conclusions These two bumblebee genomes provide a foundation for post-genomic research on these key pollinators and insect societies. Overall, gene repertoires suggest that the route to advanced eusociality in bees was mediated by many small changes in many genes and processes, and not by notable expansion or depauperation. Electronic supplementary material The online version of this article (doi:10.1186/s13059-015-0623-3) contains supplementary material, which is available to authorized users.
Collapse
|
64
|
Bitard-Feildel T, Heberlein M, Bornberg-Bauer E, Callebaut I. Detection of orphan domains in Drosophila using "hydrophobic cluster analysis". Biochimie 2015; 119:244-53. [PMID: 25736992 DOI: 10.1016/j.biochi.2015.02.019] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2014] [Accepted: 02/20/2015] [Indexed: 11/30/2022]
Abstract
INTRODUCTION Comparative genomics has become an important strategy in life science research. While many genes, and the proteins they code for, can be well characterized by assigning orthologs, a significant amount of proteins or domains remain obscure "orphans". Some orphans are overlooked by current computational methods because they rapidly diverged, others emerged relatively recently (de novo). Recent research has demonstrated the importance of orphans, and of de novo proteins and domains for development of new phenotypic traits and adaptation. New approaches for detecting novel domains are thus of paramount importance. RESULTS The hydrophobic cluster analysis (HCA) method delineates globular-like domains from the information of a protein sequence and thereby allows bypassing some of the established methods limitations based on conserved sequence similarity. In this study, HCA is tested for orphan domain detection on 12 Drosophila genomes. After their detection, the oprhan domains are classified into two categories, depending on their presence/absence in distantly related species. The two categories show significantly different physico-chemical properties when compared to previously characterized domains from the Pfam database. The newly detected domains have a higher degree of intrinsic disorder and a particular hydrophobic cluster composition. The older the domains are, the more similar their hydrophobic cluster content is to the cluster content of Pfam domains. The results suggest that, over time, newly created domains acquire a canonical set of hydrophobic clusters but conserve some features of intrinsically disordered regions. CONCLUSION Our results agree with previous findings on orphan domains and suggest that the physico-chemical properties of domains change over evolutionary long time scale. The presented HCA-based method is able to detect domains with unusual properties without relying on prior knowledge, such as the availability of homologs. Therefore, the method has large potential for complementing existing strategies to annotate genomes, and for better understanding how molecular features emerge.
Collapse
|
65
|
Feulner PGD, Chain FJJ, Panchal M, Huang Y, Eizaguirre C, Kalbe M, Lenz TL, Samonte IE, Stoll M, Bornberg-Bauer E, Reusch TBH, Milinski M. Genomics of divergence along a continuum of parapatric population differentiation. PLoS Genet 2015; 11:e1004966. [PMID: 25679225 PMCID: PMC4334544 DOI: 10.1371/journal.pgen.1004966] [Citation(s) in RCA: 101] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2014] [Accepted: 12/19/2014] [Indexed: 12/20/2022] Open
Abstract
The patterns of genomic divergence during ecological speciation are shaped by a combination of evolutionary forces. Processes such as genetic drift, local reduction of gene flow around genes causing reproductive isolation, hitchhiking around selected variants, variation in recombination and mutation rates are all factors that can contribute to the heterogeneity of genomic divergence. On the basis of 60 fully sequenced three-spined stickleback genomes, we explore these different mechanisms explaining the heterogeneity of genomic divergence across five parapatric lake and river population pairs varying in their degree of genetic differentiation. We find that divergent regions of the genome are mostly specific for each population pair, while their size and abundance are not correlated with the extent of genome-wide population differentiation. In each pair-wise comparison, an analysis of allele frequency spectra reveals that 25–55% of the divergent regions are consistent with a local restriction of gene flow. Another large proportion of divergent regions (38–75%) appears to be mainly shaped by hitchhiking effects around positively selected variants. We provide empirical evidence that alternative mechanisms determining the evolution of genomic patterns of divergence are not mutually exclusive, but rather act in concert to shape the genome during population differentiation, a first necessary step towards ecological speciation. A variety of evolutionary forces influence the genomic landscape of divergence during ecological speciation. Here we characterize the evolution of genomic divergence patterns based on 60 fully sequenced three-spined stickleback genomes, contrasting lake and river populations that differ in parasite abundance. Our comparison of the size and abundance of divergent regions in the genomes across a continuum of population differentiation suggests that selection and the hitchhiking effect on neutral sites mainly contributes to the observed heterogeneous patterns of genomic divergence. Additional divergent regions of the genome can be explained by a local reduction of gene flow. Our description of genomic divergence patterns across a continuum of population differentiation combined with an analysis of molecular signatures of evolution highlights how adaptation shapes the differentiation of sticklebacks in freshwater habitats.
Collapse
|
66
|
Kemena C, Bitard-Feildel T, Bornberg-Bauer E. MDAT- Aligning multiple domain arrangements. BMC Bioinformatics 2015; 16:19. [PMID: 25626688 PMCID: PMC4384290 DOI: 10.1186/s12859-014-0442-7] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2014] [Accepted: 12/03/2014] [Indexed: 11/26/2022] Open
Abstract
Background Proteins are composed of domains, protein segments that fold independently from the rest of the protein and have a specific function. During evolution the arrangement of domains can change: domains are gained, lost or their order is rearranged. To facilitate the analysis of these changes we propose the use of multiple domain alignments. Results We developed an alignment program, called MDAT, which aligns multiple domain arrangements. MDAT extends earlier programs which perform pairwise alignments of domain arrangements. MDAT uses a domain similarity matrix to score domain pairs and aligns the domain arrangements using a consistency supported progressive alignment method. Conclusion MDAT will be useful for analysing changes in domain arrangements within and between protein families and will thus provide valuable insights into the evolution of proteins and their domains. MDAT is coded in C++, and the source code is freely available for download at http://www.bornberglab.org/pages/mdat. Electronic supplementary material The online version of this article (doi:10.1186/s12859-014-0442-7) contains supplementary material, which is available to authorized users.
Collapse
|
67
|
Haase D, Rieger JK, Witten A, Stoll M, Bornberg-Bauer E, Kalbe M, Reusch TBH. Specific gene expression responses to parasite genotypes reveal redundancy of innate immunity in vertebrates. PLoS One 2014; 9:e108001. [PMID: 25254967 PMCID: PMC4177871 DOI: 10.1371/journal.pone.0108001] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2014] [Accepted: 07/08/2014] [Indexed: 11/27/2022] Open
Abstract
Vertebrate innate immunity is the first line of defense against an invading pathogen and has long been assumed to be largely unspecific with respect to parasite/pathogen species. However, recent phenotypic evidence suggests that immunogenetic variation, i.e. allelic variability in genes associated with the immune system, results in host-parasite genotype-by-genotype interactions and thus specific innate immune responses. Immunogenetic variation is common in all vertebrate taxa and this reflects an effective immunological function in complex environments. However, the underlying variability in host gene expression patterns as response of innate immunity to within-species genetic diversity of macroparasites in vertebrates is unknown. We hypothesized that intra-specific variation among parasite genotypes must be reflected in host gene expression patterns. Here we used high-throughput RNA-sequencing to examine the effect of parasite genotypes on gene expression patterns of a vertebrate host, the three-spined stickleback (Gasterosteus aculeatus). By infecting naïve fish with distinct trematode genotypes of the species Diplostomum pseudospathaceum we show that gene activity of innate immunity in three-spined sticklebacks depended on the identity of an infecting macroparasite genotype. In addition to a suite of genes indicative for a general response against the trematode we also find parasite-strain specific gene expression, in particular in the complement system genes, despite similar infection rates of single clone treatments. The observed discrepancy between infection rates and gene expression indicates the presence of alternative pathways which execute similar functions. This suggests that the innate immune system can induce redundant responses specific to parasite genotypes.
Collapse
|
68
|
Behrens S, Peuß R, Milutinović B, Eggert H, Esser D, Rosenstiel P, Schulenburg H, Bornberg-Bauer E, Kurtz J. Infection routes matter in population-specific responses of the red flour beetle to the entomopathogen Bacillus thuringiensis. BMC Genomics 2014; 15:445. [PMID: 24908078 PMCID: PMC4079954 DOI: 10.1186/1471-2164-15-445] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2014] [Accepted: 06/04/2014] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Pathogens can infect their hosts through different routes. For studying the consequences for host resistance, we here used the entomopathogen Bacillus thuringiensis and the red flour beetle Tribolium castaneum for oral and systemic (i. e. pricking the cuticle) experimental infection. In order to characterize the molecular mechanisms underpinning the two different infection routes, the transcriptomes of beetles of two different T. castaneum populations--one recently collected population (Cro1) and a commonly used laboratory strain (SB)--were analyzed using a next generation RNA sequencing approach. RESULTS The genetically more diverse population Cro1 showed a significantly larger number of differentially expressed genes. While both populations exhibited similar reactions to pricking, their expression patterns in response to oral infection differed remarkably. In particular, the Cro1 population showed a strong response of cuticular proteins and developmental genes, which might indicate an adaptive developmental flexibility that was lost in the SB population presumably as a result of inbreeding. The immune response of SB was primarily based on antimicrobial peptides, while Cro1 relied on responses mediated by phenoloxidase and reactive oxygen species, which may explain the higher resistance of this strain against oral infection. CONCLUSIONS Our data demonstrate that immunological and physiological processes underpinning the two different routes of infection are clearly distinct, and that host populations particularly differ in responses to oral infection. Furthermore, gene expression upon pricking infection entailed a strong signal of wounding, highlighting the importance of pricking controls in future infection studies.
Collapse
|
69
|
Terrapon N, Li C, Robertson HM, Ji L, Meng X, Booth W, Chen Z, Childers CP, Glastad KM, Gokhale K, Gowin J, Gronenberg W, Hermansen RA, Hu H, Hunt BG, Huylmans AK, Khalil SMS, Mitchell RD, Munoz-Torres MC, Mustard JA, Pan H, Reese JT, Scharf ME, Sun F, Vogel H, Xiao J, Yang W, Yang Z, Yang Z, Zhou J, Zhu J, Brent CS, Elsik CG, Goodisman MAD, Liberles DA, Roe RM, Vargo EL, Vilcinskas A, Wang J, Bornberg-Bauer E, Korb J, Zhang G, Liebig J. Molecular traces of alternative social organization in a termite genome. Nat Commun 2014; 5:3636. [PMID: 24845553 DOI: 10.1038/ncomms4636] [Citation(s) in RCA: 268] [Impact Index Per Article: 26.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2013] [Accepted: 03/13/2014] [Indexed: 01/28/2023] Open
Abstract
Although eusociality evolved independently within several orders of insects, research into the molecular underpinnings of the transition towards social complexity has been confined primarily to Hymenoptera (for example, ants and bees). Here we sequence the genome and stage-specific transcriptomes of the dampwood termite Zootermopsis nevadensis (Blattodea) and compare them with similar data for eusocial Hymenoptera, to better identify commonalities and differences in achieving this significant transition. We show an expansion of genes related to male fertility, with upregulated gene expression in male reproductive individuals reflecting the profound differences in mating biology relative to the Hymenoptera. For several chemoreceptor families, we show divergent numbers of genes, which may correspond to the more claustral lifestyle of these termites. We also show similarities in the number and expression of genes related to caste determination mechanisms. Finally, patterns of DNA methylation and alternative splicing support a hypothesized epigenetic regulation of caste differentiation.
Collapse
|
70
|
Moore AD, Held A, Terrapon N, Weiner J, Bornberg-Bauer E. DoMosaics: software for domain arrangement visualization and domain-centric analysis of proteins. Bioinformatics 2013; 30:282-3. [PMID: 24222210 DOI: 10.1093/bioinformatics/btt640] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open
Abstract
UNLABELLED DoMosaics is an application that unifies protein domain annotation, domain arrangement analysis and visualization in a single tool. It simplifies the analysis of protein families by consolidating disjunct procedures based on often inconvenient command-line applications and complex analysis tools. It provides a simple user interface with access to domain annotation services such as InterProScan or a local HMMER installation, and can be used to compare, analyze and visualize the evolution of domain architectures. AVAILABILITY AND IMPLEMENTATION DoMosaics is licensed under theApache License, Version 2.0, and binaries can be freely obtained from www.domosaics.net.
Collapse
|
71
|
Guo B, Chain FJJ, Bornberg-Bauer E, Leder EH, Merilä J. Genomic divergence between nine- and three-spined sticklebacks. BMC Genomics 2013; 14:756. [PMID: 24188282 PMCID: PMC4046692 DOI: 10.1186/1471-2164-14-756] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2013] [Accepted: 10/31/2013] [Indexed: 12/22/2022] Open
Abstract
Background Comparative genomics approaches help to shed light on evolutionary processes that shape differentiation between lineages. The nine-spined stickleback (Pungitius pungitius) is a closely related species of the ecological ‘supermodel’ three-spined stickleback (Gasterosteus aculeatus). It is an emerging model system for evolutionary biology research but has garnered less attention and lacks extensive genomic resources. To expand on these resources and aid the study of sticklebacks in a phylogenetic framework, we characterized nine-spined stickleback transcriptomes from brain and liver using deep sequencing. Results We obtained nearly eight thousand assembled transcripts, of which 3,091 were assigned as putative one-to-one orthologs to genes found in the three-spined stickleback. These sequences were used for evaluating overall differentiation and substitution rates between nine- and three-spined sticklebacks, and to identify genes that are putatively evolving under positive selection. The synonymous substitution rate was estimated to be 7.1 × 10-9 per site per year between the two species, and a total of 165 genes showed patterns of adaptive evolution in one or both species. A few nine-spined stickleback contigs lacked an obvious ortholog in three-spined sticklebacks but were found to match genes in other fish species, suggesting several gene losses within 13 million years since the divergence of the two stickleback species. We identified 47 SNPs in 25 different genes that differentiate pond and marine ecotypes. We also identified 468 microsatellites that could be further developed as genetic markers in nine-spined sticklebacks. Conclusion With deep sequencing of nine-spined stickleback cDNA libraries, our study provides a significant increase in the number of gene sequences and microsatellite markers for this species, and identifies a number of genes showing patterns of adaptive evolution between nine- and three-spined sticklebacks. We also report several candidate genes that might be involved in differential adaptation between marine and freshwater nine-spined sticklebacks. This study provides a valuable resource for future studies aiming to identify candidate genes underlying ecological adaptation in this and other stickleback species. Electronic supplementary material The online version of this article (doi:10.1186/1471-2164-14-756) contains supplementary material, which is available to authorized users.
Collapse
|
72
|
Wissler L, Gadau J, Simola DF, Helmkampf M, Bornberg-Bauer E. Mechanisms and dynamics of orphan gene emergence in insect genomes. Genome Biol Evol 2013; 5:439-55. [PMID: 23348040 PMCID: PMC3590893 DOI: 10.1093/gbe/evt009] [Citation(s) in RCA: 100] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
Orphan genes are defined as genes that lack detectable similarity to genes in other species and therefore no clear signals of common descent (i.e., homology) can be inferred. Orphans are an enigmatic portion of the genome because their origin and function are mostly unknown and they typically make up 10% to 30% of all genes in a genome. Several case studies demonstrated that orphans can contribute to lineage-specific adaptation. Here, we study orphan genes by comparing 30 arthropod genomes, focusing in particular on seven recently sequenced ant genomes. This setup allows analyzing a major metazoan taxon and a comparison between social Hymenoptera (ants and bees) and nonsocial Diptera (flies and mosquitoes). First, we find that recently split lineages undergo accelerated genomic reorganization, including the rapid gain of many orphan genes. Second, between the two insect orders Hymenoptera and Diptera, orphan genes are more abundant and emerge more rapidly in Hymenoptera, in particular, in leaf-cutter ants. With respect to intragenomic localization, we find that ant orphan genes show little clustering, which suggests that orphan genes in ants are scattered uniformly over the genome and between nonorphan genes. Finally, our results indicate that the genetic mechanisms creating orphan genes—such as gene duplication, frame-shift fixation, creation of overlapping genes, horizontal gene transfer, and exaptation of transposable elements—act at different rates in insects, primates, and plants. In Formicidae, the majority of orphan genes has their origin in intergenic regions, pointing to a high rate of de novo gene formation or generalized gene loss, and support a recently proposed dynamic model of frequent gene birth and death.
Collapse
|
73
|
Terrapon N, Weiner J, Grath S, Moore AD, Bornberg-Bauer E. Rapid similarity search of proteins using alignments of domain arrangements. ACTA ACUST UNITED AC 2013; 30:274-81. [PMID: 23828785 DOI: 10.1093/bioinformatics/btt379] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]
Abstract
MOTIVATION Homology search methods are dominated by the central paradigm that sequence similarity is a proxy for common ancestry and, by extension, functional similarity. For determining sequence similarity in proteins, most widely used methods use models of sequence evolution and compare amino-acid strings in search for conserved linear stretches. Probabilistic models or sequence profiles capture the position-specific variation in an alignment of homologous sequences and can identify conserved motifs or domains. While profile-based search methods are generally more accurate than simple sequence comparison methods, they tend to be computationally more demanding. In recent years, several methods have emerged that perform protein similarity searches based on domain composition. However, few methods have considered the linear arrangements of domains when conducting similarity searches, despite strong evidence that domain order can harbour considerable functional and evolutionary signal. RESULTS Here, we introduce an alignment scheme that uses a classical dynamic programming approach to the global alignment of domains. We illustrate that representing proteins as strings of domains (domain arrangements) and comparing these strings globally allows for a both fast and sensitive homology search. Further, we demonstrate that the presented methods complement existing methods by finding similar proteins missed by popular amino-acid-based comparison methods. AVAILABILITY An implementation of the presented algorithms, a web-based interface as well as a command-line program for batch searching against the UniProt database can be found at http://rads.uni-muenster.de. Furthermore, we provide a JAVA API for programmatic access to domain-string–based search methods.
Collapse
|
74
|
Niehuis O, Hartig G, Grath S, Pohl H, Lehmann J, Tafer H, Donath A, Krauss V, Eisenhardt C, Hertel J, Petersen M, Mayer C, Meusemann K, Peters R, Stadler P, Beutel R, Bornberg-Bauer E, McKenna D, Misof B. Genomic and Morphological Evidence Converge to Resolve the Enigma of Strepsiptera. Curr Biol 2013. [DOI: 10.1016/j.cub.2013.06.062] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|
75
|
Simola DF, Wissler L, Donahue G, Waterhouse RM, Helmkampf M, Roux J, Nygaard S, Glastad KM, Hagen DE, Viljakainen L, Reese JT, Hunt BG, Graur D, Elhaik E, Kriventseva EV, Wen J, Parker BJ, Cash E, Privman E, Childers CP, Muñoz-Torres MC, Boomsma JJ, Bornberg-Bauer E, Currie CR, Elsik CG, Suen G, Goodisman MAD, Keller L, Liebig J, Rawls A, Reinberg D, Smith CD, Smith CR, Tsutsui N, Wurm Y, Zdobnov EM, Berger SL, Gadau J. Social insect genomes exhibit dramatic evolution in gene composition and regulation while preserving regulatory features linked to sociality. Genome Res 2013; 23:1235-47. [PMID: 23636946 PMCID: PMC3730098 DOI: 10.1101/gr.155408.113] [Citation(s) in RCA: 166] [Impact Index Per Article: 15.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
Genomes of eusocial insects code for dramatic examples of phenotypic plasticity and social organization. We compared the genomes of seven ants, the honeybee, and various solitary insects to examine whether eusocial lineages share distinct features of genomic organization. Each ant lineage contains ∼4000 novel genes, but only 64 of these genes are conserved among all seven ants. Many gene families have been expanded in ants, notably those involved in chemical communication (e.g., desaturases and odorant receptors). Alignment of the ant genomes revealed reduced purifying selection compared with Drosophila without significantly reduced synteny. Correspondingly, ant genomes exhibit dramatic divergence of noncoding regulatory elements; however, extant conserved regions are enriched for novel noncoding RNAs and transcription factor–binding sites. Comparison of orthologous gene promoters between eusocial and solitary species revealed significant regulatory evolution in both cis (e.g., Creb) and trans (e.g., fork head) for nearly 2000 genes, many of which exhibit phenotypic plasticity. Our results emphasize that genomic changes can occur remarkably fast in ants, because two recently diverged leaf-cutter ant species exhibit faster accumulation of species-specific genes and greater divergence in regulatory elements compared with other ants or Drosophila. Thus, while the “socio-genomes” of ants and the honeybee are broadly characterized by a pervasive pattern of divergence in gene composition and regulation, they preserve lineage-specific regulatory features linked to eusociality. We propose that changes in gene regulation played a key role in the origins of insect eusociality, whereas changes in gene composition were more relevant for lineage-specific eusocial adaptations.
Collapse
|