1
|
Parvathaneni RK, DeLeo VL, Spiekerman JJ, Chakraborty D, Devos KM. Parallel loss of introns in the ABCB1 gene in angiosperms. BMC Evol Biol 2017; 17:238. [PMID: 29202710 PMCID: PMC5716013 DOI: 10.1186/s12862-017-1077-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2017] [Accepted: 11/16/2017] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The presence of non-coding introns is a characteristic feature of most eukaryotic genes. While the size of the introns, number of introns per gene and the number of intron-containing genes can vary greatly between sequenced eukaryotic genomes, the structure of a gene with reference to intron presence and positions is typically conserved in closely related species. Unexpectedly, the ABCB1 (ATP-Binding Cassette Subfamily B Member 1) gene which encodes a P-glycoprotein and underlies dwarfing traits in maize (br2), sorghum (dw3) and pearl millet (d2) displayed considerable variation in intron composition. RESULTS An analysis of the ABCB1 gene structure in 80 angiosperms revealed that the number of introns ranged from one to nine. All introns in ABCB1 underwent either a one-time loss (single loss in one lineage/species) or multiple independent losses (parallel loss in two or more lineages/species) with the majority of losses occurring within the grass family. In contrast, the structure of the closest homolog to ABCB1, ABCB19, remained constant in the majority of angiosperms analyzed. Using known phylogenetic relationships within the grasses, we determined the ancestral branch-points where the losses occurred. Intron 7, the longest intron, was lost in only a single species, Mimulus guttatus, following duplication of ABCB1. Semiquantitative PCR showed that the M. guttatus ABCB1 gene copy without intron 7 had significantly lower transcript levels than the gene copy with intron 7. We further demonstrated that intron 7 carried two motifs that were highly conserved across the monocot-dicot divide. CONCLUSIONS The ABCB1 gene structure is highly dynamic, while the structure of ABCB19 remained largely conserved through evolution. Precise removal of introns, preferential removal of smaller introns and presence of at least 2 bp of microhomology flanking most introns indicated that intron loss may have predominantly occurred through non-homologous end-joining (NHEJ) repair of double strand breaks. Lack of microhomology in the exon upstream of lost phase I introns was likely due to release of the selective constraint on the penultimate base (3rd base in codon) of the terminal codon by the splicing machinery. In addition to size, the presence of regulatory motifs will make introns recalcitrant to loss.
Collapse
Affiliation(s)
- Rajiv K Parvathaneni
- Institute of Plant Breeding, Genetics and Genomics, University of Georgia, 30602, Athens, Georgia, United States.,Current address: Donald Danforth Plant Science Center, St. Louis, MO, 63132, United States
| | - Victoria L DeLeo
- Department of Genetics, University of Georgia, 30602, Athens, GA, United States.,Current address: Department of Biology, Pennsylvania State University, University Park, PA, 16802, United States
| | - John J Spiekerman
- Department of Plant Biology, University of Georgia, 30602, Athens, GA, United States
| | - Debkanta Chakraborty
- Institute of Bioinformatics, University of Georgia, 30602, Athens, GA, United States
| | - Katrien M Devos
- Institute of Plant Breeding, Genetics and Genomics, University of Georgia, 30602, Athens, Georgia, United States. .,Department of Plant Biology, University of Georgia, 30602, Athens, GA, United States. .,Institute of Bioinformatics, University of Georgia, 30602, Athens, GA, United States.
| |
Collapse
|
2
|
Ma MY, Che XR, Porceddu A, Niu DK. Evaluation of the mechanisms of intron loss and gain in the social amoebae Dictyostelium. BMC Evol Biol 2015; 15:286. [PMID: 26678305 PMCID: PMC4683709 DOI: 10.1186/s12862-015-0567-y] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2015] [Accepted: 12/13/2015] [Indexed: 11/18/2022] Open
Abstract
Background Spliceosomal introns are a common feature of eukaryotic genomes. To approach a comprehensive understanding of intron evolution on Earth, studies should look beyond repeatedly studied groups such as animals, plants, and fungi. The slime mold Dictyostelium belongs to a supergroup of eukaryotes not covered in previous studies. Results We found 441 precise intron losses in Dictyostelium discoideum and 202 precise intron losses in Dictyostelium purpureum. Consistent with these observations, Dictyostelium discoideum was found to have significantly more copies of reverse transcriptase genes than Dictyostelium purpureum. We also found that the lost introns are significantly further from the 5′ end of genes than the conserved introns. Adjacent introns were prone to be lost simultaneously in Dictyostelium discoideum. In both Dictyostelium species, the exonic sequences flanking lost introns were found to have a significantly higher GC content than those flanking conserved introns. Together, these observations support a reverse-transcription model of intron loss in which intron losses were caused by gene conversion between genomic DNA and cDNA reverse transcribed from mature mRNA. We also identified two imprecise intron losses in Dictyostelium discoideum that may have resulted from genomic deletions. Ninety-eight putative intron gains were also observed. Consistent with previous studies of other lineages, the source sequences were found in only a small number of cases, with only two instances of intron gain identified in Dictyostelium discoideum. Conclusions Although they diverged very early from animals and fungi, Dictyostelium species have similar mechanisms of intron loss. Electronic supplementary material The online version of this article (doi:10.1186/s12862-015-0567-y) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Ming-Yue Ma
- MOE Key Laboratory for Biodiversity Science and Ecological Engineering, College of Life Sciences, Beijing Normal University, Beijing, 100875, China. .,Beijing Key Laboratory of Gene Resource and Molecular Development, College of Life Sciences, Beijing Normal University, Beijing, 100875, China.
| | - Xun-Ru Che
- The High School Affiliated to Renmin University of China, Beijing, 100080, China.
| | - Andrea Porceddu
- Department of Agricultural Sciences, University of Sassari, Viale Italia, 39, 07100, Sassari, Italy.
| | - Deng-Ke Niu
- MOE Key Laboratory for Biodiversity Science and Ecological Engineering, College of Life Sciences, Beijing Normal University, Beijing, 100875, China. .,Beijing Key Laboratory of Gene Resource and Molecular Development, College of Life Sciences, Beijing Normal University, Beijing, 100875, China.
| |
Collapse
|
3
|
Wang H, Devos KM, Bennetzen JL. Recurrent loss of specific introns during angiosperm evolution. PLoS Genet 2014; 10:e1004843. [PMID: 25474210 PMCID: PMC4256211 DOI: 10.1371/journal.pgen.1004843] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2013] [Accepted: 10/22/2014] [Indexed: 11/18/2022] Open
Abstract
Numerous instances of presence/absence variations for introns have been documented in eukaryotes, and some cases of recurrent loss of the same intron have been suggested. However, there has been no comprehensive or phylogenetically deep analysis of recurrent intron loss. Of 883 cases of intron presence/absence variation that we detected in five sequenced grass genomes, 93 were confirmed as recurrent losses and the rest could be explained by single losses (652) or single gains (118). No case of recurrent intron gain was observed. Deep phylogenetic analysis often indicated that apparent intron gains were actually numerous independent losses of the same intron. Recurrent loss exhibited extreme non-randomness, in that some introns were removed independently in many lineages. The two larger genomes, maize and sorghum, were found to have a higher rate of both recurrent loss and overall loss and/or gain than foxtail millet, rice or Brachypodium. Adjacent introns and small introns were found to be preferentially lost. Intron loss genes exhibited a high frequency of germ line or early embryogenesis expression. In addition, flanking exon A+T-richness and intron TG/CG ratios were higher in retained introns. This last result suggests that epigenetic status, as evidenced by a loss of methylated CG dinucleotides, may play a role in the process of intron loss. This study provides the first comprehensive analysis of recurrent intron loss, makes a series of novel findings on the patterns of recurrent intron loss during the evolution of the grass family, and provides insight into the molecular mechanism(s) underlying intron loss. The spliceosomal introns are nucleotide sequences that interrupt coding regions of eukaryotic genes and are removed by RNA splicing after transcription. Recent studies have reported several examples of possible recurrent intron loss or gain, i.e., introns that are independently removed from or inserted into the identical sites more than once in an investigated phylogeny. However, the frequency, evolutionary patterns or other characteristics of recurrent intron turnover remain unknown. We provide results for the first comprehensive analysis of recurrent intron turnover within a plant family and show that recurrent intron loss represents a considerable portion of all intron losses identified and intron loss events far outnumber intron gain events. We also demonstrate that recurrent intron loss is non-random, affecting only a small number of introns that are repeatedly lost, and that different lineages show significantly different rates of intron loss. Our results suggest a possible role of DNA methylation in the process of intron loss. Moreover, this study provides strong support for the model of intron loss by reverse transcriptase mediated conversion of genes by their processed mRNA transcripts.
Collapse
Affiliation(s)
- Hao Wang
- Department of Genetics, University of Georgia, Athens, Georgia, United States of America
| | - Katrien M. Devos
- Department of Crop and Soil Sciences, and Department of Plant Biology, University of Georgia, Athens, Georgia, United States of America
| | - Jeffrey L. Bennetzen
- Department of Genetics, University of Georgia, Athens, Georgia, United States of America
- Germplasm Bank of Wild Species in Southwestern China, Kunming Institute of Botany, Kunming, Yunnan, P.R. China
- * E-mail:
| |
Collapse
|
4
|
Moreno M, Bickersmith S, Harlow W, Hildebrandt J, McKeon SN, Silva-do-Nascimento TF, Loaiza JR, Ruiz F, Lourenço-de-Oliveira R, Sallum MAM, Bergo ES, Fritz GN, Wilkerson RC, Linton YM, Juri MJD, Rangel Y, Póvoa MM, Gutiérrez-Builes LA, Correa MM, Conn JE. Phylogeography of the neotropical Anopheles triannulatus complex (Diptera: Culicidae) supports deep structure and complex patterns. Parasit Vectors 2013; 6:47. [PMID: 23433428 PMCID: PMC3606328 DOI: 10.1186/1756-3305-6-47] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2012] [Accepted: 02/13/2013] [Indexed: 04/11/2024] Open
Abstract
BACKGROUND The molecular phylogenetic relationships and population structure of the species of the Anopheles triannulatus complex: Anopheles triannulatus s.s., Anopheles halophylus and the putative species Anopheles triannulatus C were investigated. METHODS The mitochondrial COI gene, the nuclear white gene and rDNA ITS2 of samples that include the known geographic distribution of these taxa were analyzed. Phylogenetic analyses were performed using Bayesian inference, Maximum parsimony and Maximum likelihood approaches. RESULTS Each data set analyzed septely yielded a different topology but none provided evidence for the seption of An. halophylus and An. triannulatus C, consistent with the hypothesis that the two are undergoing incipient speciation. The phylogenetic analyses of the white gene found three main clades, whereas the statistical parsimony network detected only a single metapopulation of Anopheles triannulatus s.l. Seven COI lineages were detected by phylogenetic and network analysis. In contrast, the network, but not the phylogenetic analyses, strongly supported three ITS2 groups. Combined data analyses provided the best resolution of the trees, with two major clades, Amazonian (clade I) and trans-Andean + Amazon Delta (clade II). Clade I consists of multiple subclades: An. halophylus + An. triannulatus C; trans-Andean Venezuela; central Amazonia + central Bolivia; Atlantic coastal lowland; and Amazon delta. Clade II includes three subclades: Panama; cis-Andean Colombia; and cis-Venezuela. The Amazon delta specimens are in both clades, likely indicating local sympatry. Spatial and molecular variance analyses detected nine groups, corroborating some of subclades obtained in the combined data analysis. CONCLUSION Combination of the three molecular markers provided the best resolution for differentiation within An. triannulatus s.s. and An. halophylus and C. The latest two species seem to be very closely related and the analyses performed were not conclusive regarding species differentiation. Further studies including new molecular markers would be desirable to solve this species status question. Besides, results of the study indicate a trans-Andean origin for An. triannulatus s.l. The potential implications for malaria epidemiology remain to be investigated.
Collapse
Affiliation(s)
- Marta Moreno
- New York State Department of Health, Wadsworth Center, Griffin Laboratory, Albany, NY, USA
- Present address: Division Infectious Diseases University of California San Diego, George Palade Labs, School of Medicine, 92093, 9500 Gilman Drive, MC 0741, La Jolla, CA, USA
| | - Sara Bickersmith
- New York State Department of Health, Wadsworth Center, Griffin Laboratory, Albany, NY, USA
| | - Wesley Harlow
- New York State Department of Health, Wadsworth Center, Griffin Laboratory, Albany, NY, USA
| | - Jessica Hildebrandt
- New York State Department of Health, Wadsworth Center, Griffin Laboratory, Albany, NY, USA
| | - Sascha N McKeon
- New York State Department of Health, Wadsworth Center, Griffin Laboratory, Albany, NY, USA
- Department of Biomedical Sciences, School of Public Health, State University of New York, Albany, NY, USA
| | | | - Jose R Loaiza
- Instituto de Investigaciones Científicas y Servicios de Alta Tecnología, Clayton, Panamá, República de Panamá
| | - Freddy Ruiz
- Division of Entomology, Walter Reed Army Institute of Research, Silver Spring, MD, USA
| | | | - Maria AM Sallum
- Departamento de Epidemiologia, Faculdade de Saúde Pública, Universidade de São Paulo, São Paulo, Brazil
| | - Eduardo S Bergo
- Superintendência de Controle de Endemias, SUCEN, São Paulo, Brazil
| | - Gary N Fritz
- Department of Biological Sciences, Eastern Illinois University, Charleston, IL, USA
| | - Richard C Wilkerson
- Division of Entomology, Walter Reed Army Institute of Research, Silver Spring, MD, USA
| | - Yvonne M Linton
- Division of Entomology, Walter Reed Army Institute of Research, Silver Spring, MD, USA
| | - Maria J Dantur Juri
- Instituto Superior de Entomología "Dr. Abraham Willink", Facultad de Ciencias, Naturales e Instituto Miguel Lillo, Universidad Nacional de Tucumán, Tucumán, Argentina
| | - Yadira Rangel
- Laboratorio de Biologia de Vectores, Instituto de Zoología y Ecología Tropical, Universidad Central de Venezuela, Caracas, Venezuela
| | | | - Lina A Gutiérrez-Builes
- Grupo de Microbiología Molecular, Escuela de Microbiología, Universidad de Antioquia, Medellín, Colombia
| | - Margarita M Correa
- Grupo de Microbiología Molecular, Escuela de Microbiología, Universidad de Antioquia, Medellín, Colombia
| | - Jan E Conn
- New York State Department of Health, Wadsworth Center, Griffin Laboratory, Albany, NY, USA
- Department of Biomedical Sciences, School of Public Health, State University of New York, Albany, NY, USA
| |
Collapse
|
5
|
Da Lage JL, Binder M, Hua-Van A, Janeček S, Casane D. Gene make-up: rapid and massive intron gains after horizontal transfer of a bacterial α-amylase gene to Basidiomycetes. BMC Evol Biol 2013; 13:40. [PMID: 23405862 PMCID: PMC3584928 DOI: 10.1186/1471-2148-13-40] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2012] [Accepted: 01/30/2013] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Increasing genome data show that introns, a hallmark of eukaryotes, already existed at a high density in the last common ancestor of extant eukaryotes. However, intron content is highly variable among species. The tempo of intron gains and losses has been irregular and several factors may explain why some genomes are intron-poor whereas other are intron-rich. RESULTS We studied the dynamics of intron gains and losses in an α-amylase gene, whose product breaks down starch and other polysaccharides. It was transferred from an Actinobacterium to an ancestor of Agaricomycotina. This gene underwent further duplications in several species. The results indicate a high rate of intron insertions soon after the gene settled in the fungal genome. A number of these oldest introns, regularly scattered along the gene, remained conserved. Subsequent gains and losses were lineage dependent, with a majority of losses. Moreover, a few species exhibited a high number of both specific intron gains and losses in recent periods. There was little sequence conservation around insertion sites, then probably little information for splicing, whereas splicing sites, inside introns, showed typical and conserved patterns. There was little variation of intron size. CONCLUSIONS Since most Basidiomycetes have intron-rich genomes and this richness was ancestral in Fungi, long before the transfer event, we suggest that the new gene was shaped to comply with requirements of the splicing machinery, such as short exon and intron sizes, in order to be correctly processed.
Collapse
Affiliation(s)
- Jean-Luc Da Lage
- Laboratoire Evolution, génomes et spéciation UPR 9034 CNRS, 91198 Gif-sur-Yvette, and Université Paris-Sud, Orsay, 91405, France.
| | | | | | | | | |
Collapse
|
6
|
Da Lage JL, Maczkowiak F, Cariou ML. Phylogenetic distribution of intron positions in alpha-amylase genes of bilateria suggests numerous gains and losses. PLoS One 2011; 6:e19673. [PMID: 21611157 PMCID: PMC3096672 DOI: 10.1371/journal.pone.0019673] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2010] [Accepted: 04/03/2011] [Indexed: 11/19/2022] Open
Abstract
Most eukaryotes have at least some genes interrupted by introns. While it is well accepted that introns were already present at moderate density in the last eukaryote common ancestor, the conspicuous diversity of intron density among genomes suggests a complex evolutionary history, with marked differences between phyla. The question of the rates of intron gains and loss in the course of evolution and factors influencing them remains controversial. We have investigated a single gene family, alpha-amylase, in 55 species covering a variety of animal phyla. Comparison of intron positions across phyla suggests a complex history, with a likely ancestral intronless gene undergoing frequent intron loss and gain, leading to extant intron/exon structures that are highly variable, even among species from the same phylum. Because introns are known to play no regulatory role in this gene and there is no alternative splicing, the structural differences may be interpreted more easily: intron positions, sizes, losses or gains may be more likely related to factors linked to splicing mechanisms and requirements, and to recognition of introns and exons, or to more extrinsic factors, such as life cycle and population size. We have shown that intron losses outnumbered gains in recent periods, but that "resets" of intron positions occurred at the origin of several phyla, including vertebrates. Rates of gain and loss appear to be positively correlated. No phase preference was found. We also found evidence for parallel gains and for intron sliding. Presence of introns at given positions was correlated to a strong protosplice consensus sequence AG/G, which was much weaker in the absence of intron. In contrast, recent intron insertions were not associated with a specific sequence. In animal Amy genes, population size and generation time seem to have played only minor roles in shaping gene structures.
Collapse
Affiliation(s)
- Jean-Luc Da Lage
- Laboratoire Evolution, génomes et spéciation, UPR 9034 CNRS, Gif sur Yvette, France.
| | | | | |
Collapse
|
7
|
Li W, Liu B, Yu L, Feng D, Wang H, Wang J. Phylogenetic analysis, structural evolution and functional divergence of the 12-oxo-phytodienoate acid reductase gene family in plants. BMC Evol Biol 2009; 9:90. [PMID: 19416520 PMCID: PMC2688005 DOI: 10.1186/1471-2148-9-90] [Citation(s) in RCA: 76] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2008] [Accepted: 05/05/2009] [Indexed: 11/30/2022] Open
Abstract
Background The 12-oxo-phytodienoic acid reductases (OPRs) are enzymes that catalyze the reduction of double-bonds in α, β-unsaturated aldehydes or ketones and are part of the octadecanoid pathway that converts linolenic acid to jasmonic acid. In plants, OPRs belong to the old yellow enzyme family and form multigene families. Although discoveries about this family in Arabidopsis and other species have been reported in some studies, the evolution and function of multiple OPRs in plants are not clearly understood. Results A comparative genomic analysis was performed to investigate the phylogenetic relationship, structural evolution and functional divergence among OPR paralogues in plants. In total, 74 OPR genes were identified from 11 species representing the 6 major green plant lineages: green algae, mosses, lycophytes, gymnosperms, monocots and dicots. Phylogenetic analysis showed that seven well-conserved subfamilies exist in plants. All OPR genes from green algae were clustered into a single subfamily, while those from land plants fell into six other subfamilies, suggesting that the events leading to the expansion of the OPR family occurred in land plants. Further analysis revealed that lineage-specific expansion, especially by tandem duplication, contributed to the current OPR subfamilies in land plants after divergence from aquatic plants. Interestingly, exon/intron structure analysis showed that the gene structures of OPR paralogues exhibits diversity in intron number and length, while the intron positions and phase were highly conserved across different lineage species. These observations together with the phylogenetic tree revealed that successive single intron loss, as well as indels within introns, occurred during the process of structural evolution of OPR paralogues. Functional divergence analysis revealed that altered functional constraints have occurred at specific amino acid positions after diversification of the paralogues. Most notably, significant functional divergence was also found in all pairs, except for the II/IV, II/V and V/VI pairs. Strikingly, analysis of the site-specific profiles established by posterior probability revealed that the positive-selection sites and/or critical amino acid residues for functional divergence are mainly distributed in α-helices and substrate binding loop (SBL), indicating the functional importance of these regions for this protein family. Conclusion This study highlights the molecular evolution of the OPR gene family in all plant lineages and indicates critical amino acid residues likely relevant for the distinct functional properties of the paralogues. Further experimental verification of these findings may provide valuable information on the OPRs' biochemical and physiological functions.
Collapse
Affiliation(s)
- Wenyan Li
- State Key Laboratory for Biocontrol and Key Laboratory of Gene Engineering of Ministry of Education, School of Life Sciences, Sun Yat-sen University, 510275, Guangzhou, PR China.
| | | | | | | | | | | |
Collapse
|
8
|
Mirabello L, Conn JE. Population analysis using the nuclear white gene detects Pliocene/Pleistocene lineage divergence within Anopheles nuneztovari in South America. MEDICAL AND VETERINARY ENTOMOLOGY 2008; 22:109-119. [PMID: 18498609 DOI: 10.1111/j.1365-2915.2008.00731.x] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/26/2023]
Abstract
Anopheles (Nyssorhynchus) nuneztovari Gabaldón (Diptera: Culicidae), a locally important malaria vector in some regions of South America, has been hypothesized to consist of at least two cryptic incipient species. We investigated its phylogeographic structure in several South American localities to determine the number of lineages and levels of divergence using the nuclear white gene, a marker that detected two recently diverged genotypes in the primary Neotropical malaria vector Anopheles darlingi Root. In An. nuneztovari, five distinct lineages (1-5) were elucidated: (1) populations from northeastern and central Amazonia; (2) populations from Venezuela east and west of the Andes; (3) populations from Colombia and Venezuela west of the Andes; (4) southeastern and western Amazonian Brazil populations, and (5) southeastern and western Amazonian Brazil and Bolivian populations. There was a large amount of genetic differentiation among these lineages. The deepest and earliest divergence was found between lineage 3 and lineages 1, 2 and 4, which probably accounts for the detection of lineage 3 in some earlier studies. The multiple lineages within Amazonia are partially congruent with previous mtDNA and ITS2 data, but were undetected in many earlier studies, probably because of their recent (Pleistocene) divergence and the differential mutation rates of the markers. The estimates for the five lineages, interpreted as recently evolved or incipient species, date to the Pleistocene and Pliocene. We hypothesize that the diversification in An. nuneztovari is the result of an interaction between the Miocene/Pliocene marine incursion and Pleistocene climatic changes leading to refugial isolation. The identification of cryptic lineages in An. nuneztovari could have a significant impact on local vector control measures.
Collapse
Affiliation(s)
- L Mirabello
- Department of Biomedical Sciences, School of Public Health, University at Albany, State University of New York, Albany, New York, USA
| | | |
Collapse
|
9
|
Mirabello L, Vineis JH, Yanoviak SP, Scarpassa VM, Póvoa MM, Padilla N, Achee NL, Conn JE. Microsatellite data suggest significant population structure and differentiation within the malaria vector Anopheles darlingi in Central and South America. BMC Ecol 2008; 8:3. [PMID: 18366795 PMCID: PMC2292152 DOI: 10.1186/1472-6785-8-3] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2007] [Accepted: 03/26/2008] [Indexed: 12/05/2022] Open
Abstract
Background Anopheles darlingi is the most important malaria vector in the Neotropics. An understanding of A. darlingi's population structure and contemporary gene flow patterns is necessary if vector populations are to be successfully controlled. We assessed population genetic structure and levels of differentiation based on 1,376 samples from 31 localities throughout the Peruvian and Brazilian Amazon and Central America using 5–8 microsatellite loci. Results We found high levels of polymorphism for all of the Amazonian populations (mean RS = 7.62, mean HO = 0.742), and low levels for the Belize and Guatemalan populations (mean RS = 4.3, mean HO = 0.457). The Bayesian clustering analysis revealed five population clusters: northeastern Amazonian Brazil, southeastern and central Amazonian Brazil, western and central Amazonian Brazil, Peruvian Amazon, and the Central American populations. Within Central America there was low non-significant differentiation, except for between the populations separated by the Maya Mountains. Within Amazonia there was a moderate level of significant differentiation attributed to isolation by distance. Within Peru there was no significant population structure and low differentiation, and some evidence of a population expansion. The pairwise estimates of genetic differentiation between Central America and Amazonian populations were all very high and highly significant (FST = 0.1859 – 0.3901, P < 0.05). Both the DA and FST distance-based trees illustrated the main division to be between Central America and Amazonia. Conclusion We detected a large amount of population structure in Amazonia, with three population clusters within Brazil and one including the Peru populations. The considerable differences in Ne among the populations may have contributed to the observed genetic differentiation. All of the data suggest that the primary division within A. darlingi corresponds to two white gene genotypes between Amazonia (genotype 1) and Central America, parts of Colombia and Venezuela (genotype 2), and are in agreement with previously published mitochondrial COI gene sequences interpreted as incipient species. Overall, it appears that two main factors have contributed to the genetic differentiation between the population clusters: physical distance between the populations and the differences in effective population sizes among the subpopulations.
Collapse
Affiliation(s)
- Lisa Mirabello
- Department of Biomedical Sciences, School of Public Health, State University of New York at Albany, Albany, New York 12222, USA.
| | | | | | | | | | | | | | | |
Collapse
|
10
|
Krauss V, Thümmler C, Georgi F, Lehmann J, Stadler PF, Eisenhardt C. Near Intron Positions Are Reliable Phylogenetic Markers: An Application to Holometabolous Insects. Mol Biol Evol 2008; 25:821-30. [DOI: 10.1093/molbev/msn013] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
|
11
|
Irimia M, Roy SW. Spliceosomal introns as tools for genomic and evolutionary analysis. Nucleic Acids Res 2008; 36:1703-12. [PMID: 18263615 PMCID: PMC2275149 DOI: 10.1093/nar/gkn012] [Citation(s) in RCA: 74] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Over the past 5 years, the availability of dozens of whole genomic sequences from a wide variety of eukaryotic lineages has revealed a very large amount of information about the dynamics of intron loss and gain through eukaryotic history, as well as the evolution of intron sequences. Implicit in these advances is a great deal of information about the structure and evolution of surrounding sequences. Here, we review the wealth of ways in which structures of spliceosomal introns as well as their conservation and change through evolution may be harnessed for evolutionary and genomic analysis. First, we discuss uses of intron length distributions and positions in sequence assembly and annotation, and for improving alignment of homologous regions. Second, we review uses of introns in evolutionary studies, including the utility of introns as indicators of rates of sequence evolution, for inferences about molecular evolution, as signatures of orthology and paralogy, and for estimating rates of nucleotide substitution. We conclude with a discussion of phylogenetic methods utilizing intron sequences and positions.
Collapse
Affiliation(s)
- Manuel Irimia
- Departament de Genètica, Universitat de Barcelona, Barcelona, Spain
| | | |
Collapse
|
12
|
Sharpton TJ, Neafsey DE, Galagan JE, Taylor JW. Mechanisms of intron gain and loss in Cryptococcus. Genome Biol 2008; 9:R24. [PMID: 18234113 PMCID: PMC2395259 DOI: 10.1186/gb-2008-9-1-r24] [Citation(s) in RCA: 64] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2007] [Revised: 11/29/2007] [Accepted: 01/30/2008] [Indexed: 01/14/2023] Open
Abstract
BACKGROUND Genome comparisons across deep phylogenetic divergences have revealed that spliceosomal intron gain and loss are common evolutionary events. However, because of the deep divergences involved in these comparisons, little is understood about how these changes occur, particularly in the case of intron gain. To ascertain mechanisms of intron gain and loss, we compared five relatively closely related genomes from the yeast Cryptococcus. RESULTS We observe a predominance of intron loss over gain and identify a relatively slow intron loss rate in Cryptococcus. Some genes preferentially lose introns and a large proportion of intron losses occur in the middle of genes (so called internal intron loss). Finally, we identify a gene that displays a differential number of introns in a repetitive DNA region. CONCLUSION Based the observed patterns of intron loss and gain, population resequencing and population genetic analysis, it appears that recombination causes the widely observed but poorly understood phenomenon of internal intron loss and that DNA repeat expansion can create new introns in a population.
Collapse
Affiliation(s)
- Thomas J Sharpton
- Department of Plant and Microbial Biology, University of California at Berkeley, Berkeley, CA 94720, USA.
| | | | | | | |
Collapse
|
13
|
Stajich JE, Dietrich FS, Roy SW. Comparative genomic analysis of fungal genomes reveals intron-rich ancestors. Genome Biol 2007; 8:R223. [PMID: 17949488 PMCID: PMC2246297 DOI: 10.1186/gb-2007-8-10-r223] [Citation(s) in RCA: 105] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2006] [Revised: 10/12/2007] [Accepted: 10/19/2007] [Indexed: 11/17/2022] Open
Abstract
BACKGROUND Eukaryotic protein-coding genes are interrupted by spliceosomal introns, which are removed from transcripts before protein translation. Many facets of spliceosomal intron evolution, including age, mechanisms of origins, the role of natural selection, and the causes of the vast differences in intron number between eukaryotic species, remain debated. Genome sequencing and comparative analysis has made possible whole genome analysis of intron evolution to address these questions. RESULTS We analyzed intron positions in 1,161 sets of orthologous genes across 25 eukaryotic species. We find strong support for an intron-rich fungus-animal ancestor, with more than four introns per kilobase, comparable to the highest known modern intron densities. Indeed, the fungus-animal ancestor is estimated to have had more introns than any of the extant fungi in this study. Thus, subsequent fungal evolution has been characterized by widespread and recurrent intron loss occurring in all fungal clades. These results reconcile three previously proposed methods for estimation of ancestral intron number, which previously gave very different estimates of ancestral intron number for eight eukaryotic species, as well as a fourth more recent method. We do not find a clear inverse correspondence between rates of intron loss and gain, contrary to the predictions of selection-based proposals for interspecific differences in intron number. CONCLUSION Our results underscore the high intron density of eukaryotic ancestors and the widespread importance of intron loss through eukaryotic evolution.
Collapse
Affiliation(s)
- Jason E Stajich
- Department of Molecular Genetics and Microbiology, Center for Genome Technology, Institute for Genome Science and Policy, Duke University, Durham, NC 27710, USA
- Miller Institute for Basic Research and Department of Plant and Microbial Biology, 111 Koshland Hall #3102, University of California, Berkeley, CA 94720-3102, USA
| | - Fred S Dietrich
- Department of Molecular Genetics and Microbiology, Center for Genome Technology, Institute for Genome Science and Policy, Duke University, Durham, NC 27710, USA
| | - Scott W Roy
- Department of Molecular Genetics and Microbiology, Center for Genome Technology, Institute for Genome Science and Policy, Duke University, Durham, NC 27710, USA
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, 8600 Rockville Pike, Bethesda, MD 20894, USA
| |
Collapse
|
14
|
Abstract
Phylogenetic analysis has changed greatly in the last decade, and the most important themes in that change are reviewed here. Sequence data have become the most common source of phylogenetic information. This means that explicit models for evolutionary processes have been developed in a likelihood context, which allow more realistic data analyses. These models are becoming increasingly complex, both for nucleotides and for amino acid sequences, and so all such models need to be quantitatively assessed for each data set, to find the most appropriate one for use in any particular tree-building analysis. Bayesian analysis has been developed for tree-building and is greatly increasing in popularity. This is because a good heuristic strategy exists, which allows large data sets to be analyzed with complex evolutionary models in a practical time. Perhaps the most disappointing aspect of tree interpretation is the ongoing confusion between rooted and unrooted trees, while the effect of taxon and character sampling is often overlooked when constructing a phylogeny (especially in parasitology). The review finishes with a detailed consideration of the analysis of a multi-gene data set for several dozen taxa of Cryptosporidium (Apicomplexa), illustrating many of the theoretical and practical points highlighted in the review.
Collapse
Affiliation(s)
- David A Morrison
- Department of Parasitology (SWEPAR), National Veterinary Institute and Swedish University of Agricultural Sciences, 751 89 Uppsala, Sweden
| |
Collapse
|
15
|
Abstract
Research into the origins of introns is at a critical juncture in the resolution of theories on the evolution of early life (which came first, RNA or DNA?), the identity of LUCA (the last universal common ancestor, was it prokaryotic- or eukaryotic-like?), and the significance of noncoding nucleotide variation. One early notion was that introns would have evolved as a component of an efficient mechanism for the origin of genes. But alternative theories emerged as well. From the debate between the "introns-early" and "introns-late" theories came the proposal that introns arose before the origin of genetically encoded proteins and DNA, and the more recent "introns-first" theory, which postulates the presence of introns at that early evolutionary stage from a reconstruction of the "RNA world." Here we review seminal and recent ideas about intron origins. Recent discoveries about the patterns and causes of intron evolution make this one of the most hotly debated and exciting topics in molecular evolutionary biology today.
Collapse
Affiliation(s)
- Francisco Rodríguez-Trelles
- Department of Ecology and Evolutionary Biology, University of California, Irvine, California 92697-2525, USA.
| | | | | |
Collapse
|
16
|
Abstract
The exon/intron structure of eukaryotic genes differs extensively across species, but the mechanisms and relative rates of intron loss and gain are still poorly understood. Here, we used whole-genome sequence alignments of human, mouse, rat, and dog to perform a genome-wide analysis of intron loss and gain events in >17,000 mammalian genes. We found no evidence for intron gain and 122 cases of intron loss, most of which occurred within the rodent lineage. The majority (68%) of the deleted introns were extremely small (<150 bp), significantly smaller than average. The intron losses occurred almost exclusively within highly expressed, housekeeping genes, supporting the hypothesis that intron loss is mediated via germline recombination of genomic DNA with intronless cDNA. This study constitutes the largest scale analysis for intron dynamics in vertebrates to date and allows us to confirm and extend several hypotheses previously based on much smaller samples. Our results in mammals show that intron gain has not been a factor in the evolution of gene structure during the past 95 Myr and has likely been restricted to more ancient history.
Collapse
Affiliation(s)
| | - Jacek Majewski
- Department of Human Genetics, McGill University, Montreal, Quebec H3A 1A4, Canada
- Corresponding author.E-mail ; fax (514) 398-1790
| |
Collapse
|
17
|
Knowles DG, McLysaght A. High Rate of Recent Intron Gain and Loss in Simultaneously Duplicated Arabidopsis Genes. Mol Biol Evol 2006; 23:1548-57. [PMID: 16720694 DOI: 10.1093/molbev/msl017] [Citation(s) in RCA: 49] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
We examined the gene structure of a set of 2563 Arabidopsis thaliana paralogous pairs that were duplicated simultaneously 20-60 MYA by tetraploidy. Out of a total of 23,164 introns in these genes, we found that 10,004 pairs have been conserved and 578 introns have been inserted or deleted in the time since the duplication event. This intron insertion/deletion rate of 2.7 x 10(-3) to 9.1 x 10(-4) per site per million years is high in comparison to previous studies. At least 56 introns were gained and 39 lost based on parsimony analysis of the phylogenetic distribution of these introns. We found weak evidence that genes undergoing intron gain and loss are biased with respect to gene ontology terms. Gene pairs that experienced at least 2 intron insertions or deletions show evidence of enrichment for membrane location and transport and transporter activity function. We do not find any relationship of intron flux to expression level or G + C content of the gene. Detection of a bias in the location of intron gains and losses within a gene depends on the method of measurement: an intragene method indicates that events (specifically intron losses) are biased toward the 3' end of the gene. Despite the relatively recent acquisition of these introns, we found only one case where we could identify the mechanism of intron origin--the TOUCH3 gene has experienced 2 tandem, partial, internal gene duplications that duplicated a preexisting intron and also created a novel, alternatively spliced intron that makes use of a duplicated pair of cryptic splice sites.
Collapse
Affiliation(s)
- David G Knowles
- Smurfit Institute of Genetics, University of Dublin, Trinity College, Dublin, Ireland
| | | |
Collapse
|
18
|
Danforth BN, Fang J, Sipes S. Analysis of family-level relationships in bees (Hymenoptera: Apiformes) using 28S and two previously unexplored nuclear genes: CAD and RNA polymerase II. Mol Phylogenet Evol 2006; 39:358-72. [PMID: 16412668 DOI: 10.1016/j.ympev.2005.09.022] [Citation(s) in RCA: 96] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2005] [Revised: 09/13/2005] [Accepted: 09/20/2005] [Indexed: 11/27/2022]
Abstract
We analyzed a combined data set of two protein-coding nuclear genes (CAD and RNA polymerase II) and a nuclear ribosomal gene (28S D2-D4 region) for 68 bee species and 11 wasp outgroups. Our taxon sampling included all seven extant bee families, 17 of 20 subfamilies, and diverse tribes. Wasp outgroups included the two families most closely related to bees: Crabronidae and Sphecidae. We analyzed the combined and single gene data sets using parsimony and Bayesian methods, which yielded largely congruent results. Our results provide reasonably strong support for family and subfamily-level relationships among bees. Our data set strongly supports the sister-group relationship of the Colletidae and Stenotritidae, and places Halictidae as sister to this clade combined. Our analyses place the Melittidae and the long-tongued (LT) bee clade (Apidae+Megachilidae) near the base of the tree with Colletidae (and Stenotritidae) in a fairly highly derived position. This topology ("Melittidae-LT basal") was obtained in previous morphological studies under certain methods of character coding. A more widely accepted tree topology that places Colletidae (and/or Stenotritidae) as sister to all other bees ("Colletidae basal") is not supported by our data. The "Melittidae-LT basal" hypothesis may better explain patterns in the bee fossil record as well as historical biogeography of certain bee groups. Our results provide new insights into higher-level bee phylogeny and indicate that CAD, RNA polymerase II, and 28S are useful data sets for resolving Cretaceous-age divergences in bees and other Hymenoptera.
Collapse
Affiliation(s)
- Bryan N Danforth
- Department of Entomology, 3119 Comstock Hall, Cornell University, Ithaca, NY 14853, USA.
| | | | | |
Collapse
|
19
|
Jiménez-Delgado S, Crespo M, Permanyer J, Garcia-Fernàndez J, Manzanares M. Evolutionary genomics of the recently duplicated amphioxus Hairy genes. Int J Biol Sci 2006; 2:66-72. [PMID: 16733536 PMCID: PMC1458425 DOI: 10.7150/ijbs.2.66] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2006] [Accepted: 02/24/2006] [Indexed: 11/20/2022] Open
Abstract
Amphioxus Hairy genes have gone through a number of lineage-specific duplications, resulting in eight members, some of which are differentially expressed in the embryo. In order to gain insights into the evolution and function of this gene family we have compared their genomic structure and searched for conserved non-coding sequence elements. We have found that introns have been lost independently from these genes at least twice and after the duplication events. By carrying out phylogenetic footprinting between paralogues expressed in the embryo, we have found a differential distribution of conserved elements that could explain the limited overlap in expression patterns of Hairy genes in the amphioxus embryo. Furthermore, clustering of RBP-Jk binding sites in these conserved elements suggests that amphioxus Hairy genes are downstream targets of the Notch signaling pathway, as occurs in vertebrates. All of this evidence suggests that amphioxus Hairy genes have gone through a process of subfunctionalization shortly after their duplication, representing an extreme and rapid case of the duplication-degeneration-complementation model.
Collapse
Affiliation(s)
- Senda Jiménez-Delgado
- 1 Departament de Genètica, Facultat de Biologia, Universitat de Barcelona, Avda. Diagonal 645, 08028 Barcelona, Spain
| | - Miguel Crespo
- 2 Instituto de Investigaciones Biomédicas CSIC-UAM, Arturo Duperier 4, 28029 Madrid, Spain
| | - Jon Permanyer
- 1 Departament de Genètica, Facultat de Biologia, Universitat de Barcelona, Avda. Diagonal 645, 08028 Barcelona, Spain
| | - Jordi Garcia-Fernàndez
- 1 Departament de Genètica, Facultat de Biologia, Universitat de Barcelona, Avda. Diagonal 645, 08028 Barcelona, Spain
| | - Miguel Manzanares
- 2 Instituto de Investigaciones Biomédicas CSIC-UAM, Arturo Duperier 4, 28029 Madrid, Spain
| |
Collapse
|
20
|
Abstract
The origins and importance of spliceosomal introns comprise one of the longest-abiding mysteries of molecular evolution. Considerable debate remains over several aspects of the evolution of spliceosomal introns, including the timing of intron origin and proliferation, the mechanisms by which introns are lost and gained, and the forces that have shaped intron evolution. Recent important progress has been made in each of these areas. Patterns of intron-position correspondence between widely diverged eukaryotic species have provided insights into the origins of the vast differences in intron number between eukaryotic species, and studies of specific cases of intron loss and gain have led to progress in understanding the underlying molecular mechanisms and the forces that control intron evolution.
Collapse
Affiliation(s)
- Scott William Roy
- Allan Wilson Centre for Molecular Ecology and Evolution, Massey University, Palmerston North, New Zealand.
| | | |
Collapse
|
21
|
Rodríguez-Trelles F, Tarrío R, Ayala FJ. Models of spliceosomal intron proliferation in the face of widespread ectopic expression. Gene 2006; 366:201-8. [PMID: 16288838 DOI: 10.1016/j.gene.2005.09.004] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2005] [Revised: 08/04/2005] [Accepted: 09/02/2005] [Indexed: 11/27/2022]
Abstract
It is now certain that today living organisms can acquire new spliceosomal introns in their genes. The proposed sources of spliceosomal introns are exons, transposons, and other introns, including spliceosomal and group II self-splicing introns. Spliceosomal introns are thought to be the most likely source, because the inserted sequence would immediately be endowed with the essential set of intron recognition sequences, thereby preventing the deleterious effects associated with incorrect splicing. The most obvious spliceosomal intron duplication pathways involve an RNA transcript intermediate step. Therefore, for a spliceosomal intron to be originated by duplication, either the source gene from which the novel intron is derived, or that gene and the recipient gene, which contains the novel intron, would need to be expressed in the germ line. Intron proliferation surveys indicate that putative intron duplicate-containing genes do not always match detectable expression in the germ line, which casts doubt on the generality of the duplication model. However, judging mechanisms of intron gain (or loss) from present-day gene expression profiles could be erroneous, if expression patterns were different at the time the introns arose. In fact, this may likely be so in most cases. Ectopic expression, i.e., the expression of genes at times and locations where the target gene is not known to have a function, is a much more common phenomenon than previously realized. We conclude with a speculation on a possible interplay between spliceosomal introns and ectopic expression at the origin of multicellularity.
Collapse
Affiliation(s)
- Francisco Rodríguez-Trelles
- Department of Ecology and Evolutionary Biology, University of California, Irvine, California 92697-2525, USA.
| | | | | |
Collapse
|
22
|
Merritt TJS, Young CR, Vogt RG, Wilkerson RC, Quattro JM. Intron retention identifies a malaria vector within the Anopheles (Nyssorhynchus) albitaris complex (Diptera: Culicidae). Mol Phylogenet Evol 2005; 35:719-24. [PMID: 15878139 DOI: 10.1016/j.ympev.2005.03.009] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2004] [Revised: 03/09/2005] [Accepted: 03/10/2005] [Indexed: 11/18/2022]
Affiliation(s)
- T J S Merritt
- Department of Biological Sciences, Program in Marine Science, Baruch Institute and School of the Environment, University of South Carolina, Columbia, SC 29208, USA.
| | | | | | | | | |
Collapse
|
23
|
Delsuc F, Brinkmann H, Philippe H. Phylogenomics and the reconstruction of the tree of life. Nat Rev Genet 2005; 6:361-75. [PMID: 15861208 DOI: 10.1038/nrg1603] [Citation(s) in RCA: 758] [Impact Index Per Article: 37.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
As more complete genomes are sequenced, phylogenetic analysis is entering a new era - that of phylogenomics. One branch of this expanding field aims to reconstruct the evolutionary history of organisms on the basis of the analysis of their genomes. Recent studies have demonstrated the power of this approach, which has the potential to provide answers to several fundamental evolutionary questions. However, challenges for the future have also been revealed. The very nature of the evolutionary history of organisms and the limitations of current phylogenetic reconstruction methods mean that part of the tree of life might prove difficult, if not impossible, to resolve with confidence.
Collapse
Affiliation(s)
- Frédéric Delsuc
- Canadian Institute for Advanced Research, Département de Biochimie, Centre Robert-Cedergren, Université de Montréal, Succursale Centre-Ville, Montréal, Québec H3C3J7, Canada
| | | | | |
Collapse
|
24
|
Niu DK, Hou WR, Li SW. mRNA-mediated intron losses: evidence from extraordinarily large exons. Mol Biol Evol 2005; 22:1475-81. [PMID: 15788745 DOI: 10.1093/molbev/msi138] [Citation(s) in RCA: 38] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023] Open
Abstract
Multicellular eukaryotes that have high intron density have their introns almost evenly distributed within genes, but unicellular eukaryotes that are generally intron poor have their introns asymmetrically distributed toward the 5' ends of genes. This was explained by homologous recombination of genomic DNA with the cDNA reverse transcribed from the 3' polyadenylated tail of spliced mRNA. This paper is to study whether mRNA-mediated intron losses have ever occurred in multicellular eukaryotes. If intron losses were mRNA-mediated, adjacent introns should be commonly lost together. A direct result is fusion of several previously adjacent exons and producing a large exon. We found that extraordinarily large exons (ELEs) are common not only in unicellular eukaryotes but also in multicellular eukaryotes. The percentage of genes having ELEs is negatively correlated with intron abundance. In addition, the number of lost introns estimated from the relative lengths of ELEs is negatively correlated with the number of extant introns. These results support mRNA-mediated intron losses in all eukaryotes. Moreover, we found that the ELEs of intron-common eukaryotes (with more than 0.5 intron per gene on average) are not only located at 3' ends but also at 5' ends and the middle of genes. This is contrary to what would be expected if the involved cDNAs were reverse transcribed from the 3' polyadenosine ends. A remarkable difference in intron distribution was revealed between intron-rare eukaryotes and intron-common eukaryotes. The intron-rare eukaryotes show very strong 5'-biased intron distribution, whereas the intron-common eukaryotes display even intron distribution or only weak 5'-biased distribution. We suspected that intron losses from 3' end of genes may be limited in intron-rare eukaryotes. The intron losses from intron-common eukaryotes should have other priming mechanism, like self-primed reverse transcription.
Collapse
Affiliation(s)
- Deng-Ke Niu
- Ministry of Education Key Laboratory for Biodiversity Science and Ecological Engineering, College of Life Sciences, Beijing Normal University, Beijing, China.
| | | | | |
Collapse
|
25
|
Hu G, Leger RJS. A phylogenomic approach to reconstructing the diversification of serine proteases in fungi. J Evol Biol 2005; 17:1204-14. [PMID: 15525405 DOI: 10.1111/j.1420-9101.2004.00786.x] [Citation(s) in RCA: 62] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Using a phylogenomic approach with 10 fungi of very different virulence and habitat, we determined that there was substantial diversification of subtilase-type proteases early in ascomycete history (with subsequent loss in many lineages) but with no comparable diversification of trypsins. Patterns of intron loss and the degree of divergence between paralogues demonstrated that the proliferation of proteinase K subtilases and subtilisin type subtilases seen in pathogenic ascomycetes (Metarhizium anisopliae, Magnaporthe grisea, Fusarium graminearum) occurred after the basidiomycete/ascomycete split but predated radiation of ascomycete lineages. This suggests that the early ascomycetes had a lifestyle that selected for multiple proteases, whereas the current disparity in gene numbers between ascomycete lineages results from retention of genes in at least some pathogens that have been lost in other lineages (yeasts, Aspergillus nidulans, Neurospora crassa). A similar prevailing trend towards lineage specific gene loss of trypsins in saprophytes and some pathogens suggests that their phylogenetic breadth will have been much wider in early fungi than currently.
Collapse
Affiliation(s)
- G Hu
- Department of Entomology, University of Maryland, College Park, MD 20742-4454, USA.
| | | |
Collapse
|
26
|
Krauss V, Pecyna M, Kurz K, Sass H. Phylogenetic Mapping of Intron Positions: A Case Study of Translation Initiation Factor eIF2γ. Mol Biol Evol 2004; 22:74-84. [PMID: 15356279 DOI: 10.1093/molbev/msh255] [Citation(s) in RCA: 39] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Eukaryotic translation initiation factor 2 (eIF2) is a G protein that delivers the methionyl initiator tRNA to the small ribosomal subunit and releases it upon GTP hydrolysis after the recognition of the initiation codon. eIF2 is composed of three subunits, alpha, beta, and gamma. Subunit gamma shows the strongest conservation, and it confers both tRNA and GTP/GDP binding. Using intron positioning and protein sequence alignment, here we show that eIF2gamma is a suitable phylogenetic marker for eukaryotes. We determined or completed the sequences of 13 arthropod eIF2gamma genes. Analyzing the phylogenetic distribution of 52 different intron positions in 55 distantly related eIF2gamma genes, we identified ancient ones and shared derived introns in our data set. Obviously, intron positioning in eIF2gamma is evolutionarily conserved. However, there were episodes of complete and partial intron losses followed by intron gains. We identified 17 clusters of intron positions based on their distribution. The evolution of these clusters appears to be connected with preferred exon length and can be used to estimate the relative timing of intron gain because nearby precursor introns had to be erased from the gene before the new introns could be inserted. Moreover, we identified a putative case of intron sliding that constitutes a synapomorphic character state supporting monophyly of Coleoptera, Lepidoptera, and Diptera excluding Hymenoptera. We also performed tree reconstructions using the eIF2gamma protein sequences and intron positioning as phylogenetic information. Our results support the monophyly of Viridoplantae, Ascomycota, Homobasidiomyceta, and Apicomplexa.
Collapse
Affiliation(s)
- Veiko Krauss
- Department of Genetics, University of Leipzig, Leipzig, Germany.
| | | | | | | |
Collapse
|
27
|
Cho S, Jin SW, Cohen A, Ellis RE. A phylogeny of caenorhabditis reveals frequent loss of introns during nematode evolution. Genome Res 2004; 14:1207-20. [PMID: 15231741 PMCID: PMC442136 DOI: 10.1101/gr.2639304] [Citation(s) in RCA: 142] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
Since introns were discovered 26 years ago, people have wondered how changes in intron/exon structure occur, and what role these changes play in evolution. To answer these questions, we have begun studying gene structure in nematodes related to Caenorhabditis elegans. As a first step, we cloned a set of five genes from six different Caenorhabditis species, and used their amino acid sequences to construct the first detailed phylogeny of this genus. Our data indicate that nematode introns are lost at a very high rate during evolution, almost 400-fold higher than in mammals. These losses do not occur randomly, but instead, favor some introns and do not affect others. In contrast, intron gains are far less common than losses in these genes. On the basis of the sequences at each intron site, we suggest that several distinct mechanisms can cause introns to be lost. The small size of C. elegans introns should increase the rate at which each of these types of loss can occur, and might account for the dramatic difference in loss rate between nematodes and mammals.
Collapse
Affiliation(s)
- Soochin Cho
- Department of Molecular, Cellular and Developmental Biology, University of Michigan, Ann Arbor, Michigan 48864, USA
| | | | | | | |
Collapse
|
28
|
Brady SG, Danforth BN. Recent Intron Gain in Elongation Factor-1α of Colletid Bees (Hymenoptera: Colletidae). Mol Biol Evol 2004; 21:691-6. [PMID: 14739243 DOI: 10.1093/molbev/msh062] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023] Open
Abstract
We discovered the presence of a unique spliceosomal intron in the F1 copy of elongation factor-1alpha (EF-1alpha) restricted to the bee family Colletidae (Hymenoptera: Apoidae). The intron ranges in size from 101 to 1044 bp and shows no positional sliding. Our data also demonstrate the complete absence of this intron from exemplars representing all other bee families, as well as from close hymenopteran relatives. A review of the literature finds that this intron is likewise absent from all other arthropods for which data are available. This provides unambiguous evidence for a relatively recent intron insertion event in the colletid common ancestor and, at least in this specific instance, lends support to the introns-late hypothesis. The comparative distribution of this novel intron also supports the monophyly of Colletidae and the exclusion of the Stenotritidae from this family, providing an example of the potential of some introns to act as robust markers of shared descent.
Collapse
Affiliation(s)
- Seán G Brady
- Department of Entomology, Cornell University, Ithaca, New York, USA.
| | | |
Collapse
|
29
|
Tarrio R, Rodríguez-Trelles F, Ayala FJ. A new Drosophila spliceosomal intron position is common in plants. Proc Natl Acad Sci U S A 2003; 100:6580-3. [PMID: 12750476 PMCID: PMC164489 DOI: 10.1073/pnas.0731952100] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The 25-year-old debate about the origin of introns between proponents of "introns early" and "introns late" has yielded significant advances, yet important questions remain to be ascertained. One question concerns the density of introns in the last common ancestor of the three multicellular kingdoms. Approaches to this issue thus far have relied on counts of the numbers of identical intron positions across present-day taxa on the assumption that the introns at those sites are orthologous. However, dismissing parallel intron gain for those sites may be unwarranted, because various factors can potentially constrain the site of intron insertion. Demonstrating parallel intron gain is severely handicapped, because intron sequences often evolve exceedingly fast and intron phylogenetic distributions are usually ambiguous, such that alternative loss and gain scenarios cannot be clearly distinguished. We have identified an intron position that was gained independently in animals and plants in the xanthine dehydrogenase gene. The extremely disjointed phylogenetic distribution of the intron argues strongly for separate gain rather than recurrent loss. If the observed phylogenetic pattern had resulted from recurrent loss, all observational support previously gathered for the introns-late theory of intron origins based on the phylogenetic distribution of introns would be invalidated.
Collapse
Affiliation(s)
- Rosa Tarrio
- Department of Ecology and Evolutionary Biology, University of California, Irvine 92697-2525, USA
| | | | | |
Collapse
|
30
|
Jordal BH. Elongation Factor 1 alpha resolves the monophyly of the haplodiploid ambrosia beetles Xyleborini (Coleoptera: Curculionidae). INSECT MOLECULAR BIOLOGY 2002; 11:453-465. [PMID: 12230544 DOI: 10.1046/j.1365-2583.2002.00354.x] [Citation(s) in RCA: 38] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
Elongation Factor 1-alpha was used to test the monophyly of the wood boring beetle tribe Xyleborini, where all species are haplodiploid and perform regular inbreeding by brother-sister mating. Due to their feeding requirements, being highly dependent on ophiostomatoid fungi which they cultivate in wood tunnels, monophyly may be expected due to nutritional constraints. During the course of analyses, two copies of EF-1alpha were amplified in these beetles, differing in intron structure. The high similarity between paralogous amino acid sequences (93-94%) indicates a rather recent duplication in beetles, but phylogenetic analyses of different copies in insects rejected this hypothesis. Subsequent phylogenetic analyses of eighty orthologous sequences from Xyleborini and allied taxa, using the single-intron bearing copy, were greatly improved in resolution and node support by including the intron sequences (c. 60 bp). Most analyses resulted in a monophyletic Xyleborini, implying one origin of fungus feeding in this tribe. However, clear evidence for a polyphyletic Xyleborus and three more xyleborine genera calls for further revision of xyleborine classification.
Collapse
Affiliation(s)
- B H Jordal
- Department of Zoology, University of Bergen, Allegt 41, N-5007 Bergen, Norway.
| |
Collapse
|