1
|
Rubinstein ND, Feldstein T, Shenkar N, Botero-Castro F, Griggio F, Mastrototaro F, Delsuc F, Douzery EJ, Gissi C, Huchon D. Deep sequencing of mixed total DNA without barcodes allows efficient assembly of highly plastic ascidian mitochondrial genomes. Genome Biol Evol 2013; 5:1185-99. [PMID: 23709623 PMCID: PMC3698926 DOI: 10.1093/gbe/evt081] [Citation(s) in RCA: 51] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
Ascidians or sea squirts form a diverse group within chordates, which includes a few thousand members of marine sessile filter-feeding animals. Their mitochondrial genomes are characterized by particularly high evolutionary rates and rampant gene rearrangements. This extreme variability complicates standard polymerase chain reaction (PCR) based techniques for molecular characterization studies, and consequently only a few complete Ascidian mitochondrial genome sequences are available. Using the standard PCR and Sanger sequencing approach, we produced the mitochondrial genome of Ascidiella aspersa only after a great effort. In contrast, we produced five additional mitogenomes (Botrylloides aff. leachii, Halocynthia spinosa, Polycarpa mytiligera, Pyura gangelion, and Rhodosoma turcicum) with a novel strategy, consisting in sequencing the pooled total DNA samples of these five species using one Illumina HiSeq 2000 flow cell lane. Each mitogenome was efficiently assembled in a single contig using de novo transcriptome assembly, as de novo genome assembly generally performed poorly for this task. Each of the new six mitogenomes presents a different and novel gene order, showing that no syntenic block has been conserved at the ordinal level (in Stolidobranchia and in Phlebobranchia). Phylogenetic analyses support the paraphyly of both Ascidiacea and Phlebobranchia, with Thaliacea nested inside Phlebobranchia, although the deepest nodes of the Phlebobranchia-Thaliacea clade are not well resolved. The strategy described here thus provides a cost-effective approach to obtain complete mitogenomes characterized by a highly plastic gene order and a fast nucleotide/amino acid substitution rate.
Collapse
Affiliation(s)
- Nimrod D. Rubinstein
- Department of Cell Research and Immunology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Ramat Aviv, Israel
- Present address: Department of Molecular and Cellular Biology, Harvard University
| | - Tamar Feldstein
- Department of Zoology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Ramat Aviv, Israel
- The Steinhardt National Collections of Natural History Tel Aviv University, Ramat Aviv, Israel
| | - Noa Shenkar
- Department of Zoology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Ramat Aviv, Israel
| | - Fidel Botero-Castro
- Institut des Sciences de l'Evolution de Montpellier (ISEM), UMR 5554 - CNRS, Université Montpellier II, Montpellier, France
| | | | | | - Frédéric Delsuc
- Institut des Sciences de l'Evolution de Montpellier (ISEM), UMR 5554 - CNRS, Université Montpellier II, Montpellier, France
| | - Emmanuel J.P. Douzery
- Institut des Sciences de l'Evolution de Montpellier (ISEM), UMR 5554 - CNRS, Université Montpellier II, Montpellier, France
| | - Carmela Gissi
- Dip. di Bioscienze, Università degli Studi di Milano, Milano, Italy
- *Corresponding authors: E-mail: ;
| | - Dorothée Huchon
- Department of Zoology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Ramat Aviv, Israel
- *Corresponding authors: E-mail: ;
| |
Collapse
|
2
|
New insights into the evolution of Entomopoxvirinae from the complete genome sequences of four entomopoxviruses infecting Adoxophyes honmai, Choristoneura biennis, Choristoneura rosaceana, and Mythimna separata. J Virol 2013; 87:7992-8003. [PMID: 23678178 DOI: 10.1128/jvi.00453-13] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Poxviruses are nucleocytoplasmic large DNA viruses encompassing two subfamilies, the Chordopoxvirinae and the Entomopoxvirinae, infecting vertebrates and insects, respectively. While chordopoxvirus genomics have been widely studied, only two entomopoxvirus (EPV) genomes have been entirely sequenced. We report the genome sequences of four EPVs of the Betaentomopoxvirus genus infecting the Lepidoptera: Adoxophyes honmai EPV (AHEV), Choristoneura biennis EPV (CBEV), Choristoneura rosaceana EPV (CREV), and Mythimna separata EPV (MySEV). The genomes are 80% AT rich, are 228 to 307 kbp long, and contain 247 to 334 open reading frames (ORFs). Most genes are homologous to those of Amsacta moorei entomopoxvirus and encode several protein families repeated in tandem in terminal regions. Some genomes also encode proteins of unknown functions with similarity to those of other insect viruses. Comparative genomic analyses highlight a high colinearity among the lepidopteran EPV genomes and little gene order conservation with other poxvirus genomes. As with previously sequenced EPVs, the genomes include a relatively conserved central region flanked by inverted terminal repeats. Protein clustering identified 104 core EPV genes. Among betaentomopoxviruses, 148 core genes were found in relatively high synteny, pointing to low genomic diversity. Whole-genome and spheroidin gene phylogenetic analyses showed that the lepidopteran EPVs group closely in a monophyletic lineage, corroborating their affiliation with the Betaentomopoxvirus genus as well as a clear division of the EPVs according to the orders of insect hosts (Lepidoptera, Coleoptera, and Orthoptera). This suggests an ancient coevolution of EPVs with their insect hosts and the need to revise the current EPV taxonomy to separate orthopteran EPVs from the lepidopteran-specific betaentomopoxviruses so as to form a new genus.
Collapse
|
3
|
Proost S, Fostier J, De Witte D, Dhoedt B, Demeester P, Van de Peer Y, Vandepoele K. i-ADHoRe 3.0--fast and sensitive detection of genomic homology in extremely large data sets. Nucleic Acids Res 2011; 40:e11. [PMID: 22102584 PMCID: PMC3258164 DOI: 10.1093/nar/gkr955] [Citation(s) in RCA: 138] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Comparative genomics is a powerful means to gain insight into the evolutionary processes that shape the genomes of related species. As the number of sequenced genomes increases, the development of software to perform accurate cross-species analyses becomes indispensable. However, many implementations that have the ability to compare multiple genomes exhibit unfavorable computational and memory requirements, limiting the number of genomes that can be analyzed in one run. Here, we present a software package to unveil genomic homology based on the identification of conservation of gene content and gene order (collinearity), i-ADHoRe 3.0, and its application to eukaryotic genomes. The use of efficient algorithms and support for parallel computing enable the analysis of large-scale data sets. Unlike other tools, i-ADHoRe can process the Ensembl data set, containing 49 species, in 1 h. Furthermore, the profile search is more sensitive to detect degenerate genomic homology than chaining pairwise collinearity information based on transitive homology. From ultra-conserved collinear regions between mammals and birds, by integrating coexpression information and protein–protein interactions, we identified more than 400 regions in the human genome showing significant functional coherence. The different algorithmical improvements ensure that i-ADHoRe 3.0 will remain a powerful tool to study genome evolution.
Collapse
|
4
|
High-throughput sequencing of complete human mtDNA genomes from the Caucasus and West Asia: high diversity and demographic inferences. Eur J Hum Genet 2011; 19:988-94. [PMID: 21487439 DOI: 10.1038/ejhg.2011.62] [Citation(s) in RCA: 57] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022] Open
Abstract
To investigate the demographic history of human populations from the Caucasus and surrounding regions, we used high-throughput sequencing to generate 147 complete mtDNA genome sequences from random samples of individuals from three groups from the Caucasus (Armenians, Azeri and Georgians), and one group each from Iran and Turkey. Overall diversity is very high, with 144 different sequences that fall into 97 different haplogroups found among the 147 individuals. Bayesian skyline plots (BSPs) of population size change through time show a population expansion around 40-50 kya, followed by a constant population size, and then another expansion around 15-18 kya for the groups from the Caucasus and Iran. The BSP for Turkey differs the most from the others, with an increase from 35 to 50 kya followed by a prolonged period of constant population size, and no indication of a second period of growth. An approximate Bayesian computation approach was used to estimate divergence times between each pair of populations; the oldest divergence times were between Turkey and the other four groups from the South Caucasus and Iran (~400-600 generations), while the divergence time of the three Caucasus groups from each other was comparable to their divergence time from Iran (average of ~360 generations). These results illustrate the value of random sampling of complete mtDNA genome sequences that can be obtained with high-throughput sequencing platforms.
Collapse
|
5
|
Iannelli F, Griggio F, Pesole G, Gissi C. The mitochondrial genome of Phallusia mammillata and Phallusia fumigata (Tunicata, Ascidiacea): high genome plasticity at intra-genus level. BMC Evol Biol 2007; 7:155. [PMID: 17764550 PMCID: PMC2220002 DOI: 10.1186/1471-2148-7-155] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2007] [Accepted: 08/31/2007] [Indexed: 11/24/2022] Open
Abstract
Background Within Chordata, the subphyla Vertebrata and Cephalochordata (lancelets) are characterized by a remarkable stability of the mitochondrial (mt) genome, with constancy of gene content and almost invariant gene order, whereas the limited mitochondrial data on the subphylum Tunicata suggest frequent and extensive gene rearrangements, observed also within ascidians of the same genus. Results To confirm this evolutionary trend and to better understand the evolutionary dynamics of the mitochondrial genome in Tunicata Ascidiacea, we have sequenced and characterized the complete mt genome of two congeneric ascidian species, Phallusia mammillata and Phallusia fumigata (Phlebobranchiata, Ascidiidae). The two mtDNAs are surprisingly rearranged, both with respect to one another and relative to those of other tunicates and chordates, with gene rearrangements affecting both protein-coding and tRNA genes. The new data highlight the extraordinary variability of ascidian mt genome in base composition, tRNA secondary structure, tRNA gene content, and non-coding regions (number, size, sequence and location). Indeed, both Phallusia genomes lack the trnD gene, show loss/acquisition of DHU-arm in two tRNAs, and have a G+C content two-fold higher than other ascidians. Moreover, the mt genome of P. fumigata presents two identical copies of trnI, an extra tRNA gene with uncertain amino acid specificity, and four almost identical sequence regions. In addition, a truncated cytochrome b, lacking a C-terminal tail that commonly protrudes into the mt matrix, has been identified as a new mt feature probably shared by all tunicates. Conclusion The frequent occurrence of major gene order rearrangements in ascidians both at high taxonomic level and within the same genus makes this taxon an excellent model to study the mechanisms of gene rearrangement, and renders the mt genome an invaluable phylogenetic marker to investigate molecular biodiversity and speciation events in this largely unexplored group of basal chordates.
Collapse
Affiliation(s)
- Fabio Iannelli
- Dipartimento di Scienze Biomolecolari e Biotecnologie, Università di Milano, Via Celoria 26, 20133 Milano, Italy
| | - Francesca Griggio
- Dipartimento di Scienze Biomolecolari e Biotecnologie, Università di Milano, Via Celoria 26, 20133 Milano, Italy
| | - Graziano Pesole
- Dipartimento di Scienze Biomolecolari e Biotecnologie, Università di Milano, Via Celoria 26, 20133 Milano, Italy
- Dipartimento di Biochimica e Biologia Molecolare "E. Quagliariello", Università di Bari, Via Orabona 4, 70126 Bari, Italy
| | - Carmela Gissi
- Dipartimento di Scienze Biomolecolari e Biotecnologie, Università di Milano, Via Celoria 26, 20133 Milano, Italy
| |
Collapse
|
6
|
Pavesi G, Pesole G. Finding conserved gene order across multiple genomes. Methods Mol Biol 2007; 396:111-119. [PMID: 18025689 DOI: 10.1007/978-1-59745-515-2_8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/25/2023]
Abstract
The ever increasing amount of annotated genomic sequences permits now to shed light on the molecular dynamics at the basis of evolution. Genome evolution involves both changes at the sequence level, but also rearrangements on the gene organization along the genome. Comparative analysis of gene order in multiple genomes may be thus of help in the investigation of general or lineage-specific genome plasticity, as well as in the inference of phylogenetic relationships. In particular, conserved gene contiguity in a chromosome, even if interrupted by intervening genes, may suggest potential functional couplings. This unit explains the usage of GeneSyn, a software tool for the automatic identification of conserved gene order across multiple annotated genomes.
Collapse
|
7
|
Current Awareness on Comparative and Functional Genomics. Comp Funct Genomics 2005. [PMCID: PMC2448604 DOI: 10.1002/cfg.419] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2022] Open
|
8
|
Pavesi G, Mereghetti P, Mauri G, Pesole G. Weeder Web: discovery of transcription factor binding sites in a set of sequences from co-regulated genes. Nucleic Acids Res 2004; 32:W199-203. [PMID: 15215380 PMCID: PMC441603 DOI: 10.1093/nar/gkh465] [Citation(s) in RCA: 309] [Impact Index Per Article: 15.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
One of the greatest challenges that modern molecular biology is facing is the understanding of the complex mechanisms regulating gene expression. A fundamental step in this process requires the characterization of regulatory motifs playing key roles in the regulation of gene expression at transcriptional and post-transcriptional levels. In particular, transcription is modulated by the interaction of transcription factors with their corresponding binding sites. Weeder Web is a web interface to Weeder, an algorithm for the automatic discovery of conserved motifs in a set of related regulatory DNA sequences. The motifs found are in turn likely to be instances of binding sites for some transcription factor. Other than providing access to the program, the interface has been designed so to make usage of the program itself as simple as possible, and to require very little prior knowledge about the length and the conservation of the motifs to be found. In fact, the interface automatically starts different runs of the program, each one with different parameters, and provides the user with an overall summary of the results as well as some 'advice' on which motifs look more interesting according to their statistical significance and some simple considerations. The web interface is available at the address www.pesolelab.it by following the 'Tools' link.
Collapse
|
9
|
Celamkoti S, Kundeti S, Purkayastha A, Mazumder R, Buck C, Seto D. GeneOrder3.0: software for comparing the order of genes in pairs of small bacterial genomes. BMC Bioinformatics 2004; 5:52. [PMID: 15128433 PMCID: PMC419981 DOI: 10.1186/1471-2105-5-52] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2004] [Accepted: 05/05/2004] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND An increasing number of whole viral and bacterial genomes are being sequenced and deposited in public databases. In parallel to the mounting interest in whole genomes, the number of whole genome analyses software tools is also increasing. GeneOrder was originally developed to provide an analysis of genes between two genomes, allowing visualization of gene order and synteny comparisons of any small genomes. It was originally developed for comparing virus, mitochondrion and chloroplast genomes. This is now extended to small bacterial genomes of sizes less than 2 Mb. RESULTS GeneOrder3.0 has been developed and validated successfully on several small bacterial genomes (ca. 580 kb to 1.83 Mb) archived in the NCBI GenBank database. It is an updated web-based "on-the-fly" computational tool allowing gene order and synteny comparisons of any two small bacterial genomes. Analyses of several bacterial genomes show that a large amount of gene and genome re-arrangement occurs, as seen with earlier DNA software tools. This can be displayed at the protein level using GeneOrder3.0. Whole genome alignments of genes are presented in both a table and a dot plot. This allows the detection of evolutionary more distant relationships since protein sequences are more conserved than DNA sequences. CONCLUSIONS GeneOrder3.0 allows researchers to perform comparative analysis of gene order and synteny in genomes of sizes up to 2 Mb "on-the-fly." AVAILABILITY http://binf.gmu.edu/genometools.html and http://pasteur.atcc.org:8050/GeneOrder3.0.
Collapse
Affiliation(s)
- Srikanth Celamkoti
- Bioinformatics and Computational Biology, School of Computational Sciences, George Mason University. 10900 University Boulevard, MSN 5B3, Manassas, VA 20110 USA
| | - Sashidhara Kundeti
- Bioinformatics and Computational Biology, School of Computational Sciences, George Mason University. 10900 University Boulevard, MSN 5B3, Manassas, VA 20110 USA
| | - Anjan Purkayastha
- Bioinformatics and Computational Biology, School of Computational Sciences, George Mason University. 10900 University Boulevard, MSN 5B3, Manassas, VA 20110 USA
| | - Raja Mazumder
- Biochemistry and Molecular Biology Department, Georgetown University School of Medicine. 4000 Reservoir Road, Washington, D.C. 20057 USA
| | - Charles Buck
- Virology Program, American Type Culture Collection (ATCC). 10801 University Boulevard, Manassas, VA 20110, USA
| | - Donald Seto
- Bioinformatics and Computational Biology, School of Computational Sciences, George Mason University. 10900 University Boulevard, MSN 5B3, Manassas, VA 20110 USA
| |
Collapse
|