1
|
Konrad A, Teufel AI, Grahnen JA, Liberles DA. Toward a general model for the evolutionary dynamics of gene duplicates. Genome Biol Evol 2011; 3:1197-209. [PMID: 21920903 PMCID: PMC3205605 DOI: 10.1093/gbe/evr093] [Citation(s) in RCA: 49] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023] Open
Abstract
Gene duplication is an important process in the functional divergence of genes and genomes. Several processes have been described that lead to duplicate gene retention over different timescales after both smaller-scale events and whole-genome duplication, including neofunctionalization, subfunctionalization, and dosage balance. Two common modes of duplicate gene loss include nonfunctionalization and loss due to population dynamics (failed fixation). Previous work has characterized expectations of duplicate gene retention under the neofunctionalization and subfunctionalization models. Here, that work is extended to dosage balance using simulations. A general model for duplicate gene loss/retention is then presented that is capable of fitting expectations under the different models, is defined at t = 0, and decays to an orthologous asymptotic rate rather than zero, based upon a modified Weibull hazard function. The model in a maximum likelihood framework shows the property of identifiability, recovering the evolutionary mechanism and parameters of simulation. This model is also capable of recovering the evolutionary mechanism of simulation from data generated using an unrelated network population genetic model. Lastly, the general model is applied as part of a mixture model to recent gene duplicates from the Oikopleura dioica genome, suggesting that neofunctionalization may be an important process leading to duplicate gene retention in that organism.
Collapse
Affiliation(s)
- Anke Konrad
- Department of Molecular Biology, University of Wyoming
| | | | | | | |
Collapse
|
2
|
Teixeira KN, Souza KN, Vidigal THDA, Brito CA, Santos AMC, Santoro MM. Size polymorphism in alleles of the myoglobin gene from biomphalaria mollusks. Genes (Basel) 2010; 1:357-370. [PMID: 24710092 PMCID: PMC3966218 DOI: 10.3390/genes1030357] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2010] [Revised: 10/07/2010] [Accepted: 10/14/2010] [Indexed: 11/17/2022] Open
Abstract
Introns are common among all eukaryotes, while only a limited number of introns are found in prokaryotes. Globin and globin-like proteins are widely distributed in nature, being found even in prokaryotes and a wide range of patterns of intron-exon have been reported in several eukaryotic globin genes. Globin genes in invertebrates show considerable variation in the positions of introns; globins can be found without introns, with only one intron or with three introns in different positions. In this work we analyzed the introns in the myoglobin gene from Biomphalaria glabrata, B. straminea and B. tenagophila. In the Biomphalaria genus, the myoglobin gene has three introns; these were amplified by PCR and analyzed by PCR-RFLP. Results showed that the size (number or nucleotides) and the nucleotide sequence of the coding gene of the myoglobin are variable in the three species. We observed the presence of size polymorphisms in intron 2 and 3; this characterizes a homozygous/heterozygous profile and it indicates the existence of two alleles which are different in size in each species of Biomphalaria. This polymorphism could be explored for specific identification of Biomphalaria individuals.
Collapse
Affiliation(s)
- Kádima N Teixeira
- Department of Biochemistry and Immunology, Federal University of Minas Gerais, Belo Horizonte/MG, 31270-901, Brazil.
| | - Karyne N Souza
- Department of Zoology, Federal University of Minas Gerais, Belo Horizonte/MG, 31270-901, Brazil.
| | - Teofânia H D A Vidigal
- Department of Zoology, Federal University of Minas Gerais, Belo Horizonte/MG, 31270-901, Brazil.
| | | | - Alexandre M C Santos
- Department of Biochemistry and Immunology, Federal University of Minas Gerais, Belo Horizonte/MG, 31270-901, Brazil.
| | - Marcelo M Santoro
- Department of Biochemistry and Immunology, Federal University of Minas Gerais, Belo Horizonte/MG, 31270-901, Brazil.
| |
Collapse
|
3
|
Romanel EAC, Schrago CG, Couñago RM, Russo CAM, Alves-Ferreira M. Evolution of the B3 DNA binding superfamily: new insights into REM family gene diversification. PLoS One 2009; 4:e5791. [PMID: 19503786 PMCID: PMC2688026 DOI: 10.1371/journal.pone.0005791] [Citation(s) in RCA: 106] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2008] [Accepted: 04/14/2009] [Indexed: 11/24/2022] Open
Abstract
Background The B3 DNA binding domain includes five families: auxin response factor (ARF), abscisic acid-insensitive3 (ABI3), high level expression of sugar inducible (HSI), related to ABI3/VP1 (RAV) and reproductive meristem (REM). The release of the complete genomes of the angiosperm eudicots Arabidopsis thaliana and Populus trichocarpa, the monocot Orysa sativa, the bryophyte Physcomitrella patens,the green algae Chlamydomonas reinhardtii and Volvox carteri and the red algae Cyanidioschyzon melorae provided an exceptional opportunity to study the evolution of this superfamily. Methodology In order to better understand the origin and the diversification of B3 domains in plants, we combined comparative phylogenetic analysis with exon/intron structure and duplication events. In addition, we investigated the conservation and divergence of the B3 domain during the origin and evolution of each family. Conclusions Our data indicate that showed that the B3 containing genes have undergone extensive duplication events, and that the REM family B3 domain has a highly diverged DNA binding. Our results also indicate that the founding member of the B3 gene family is likely to be similar to the ABI3/HSI genes found in C. reinhardtii and V. carteri. Among the B3 families, ABI3, HSI, RAV and ARF are most structurally conserved, whereas the REM family has experienced a rapid divergence. These results are discussed in light of their functional and evolutionary roles in plant development.
Collapse
Affiliation(s)
- Elisson A. C. Romanel
- Department of Genetics, Federal University of Rio de Janeiro, Rio de Janeiro, Brazil
| | - Carlos G. Schrago
- Department of Genetics, Federal University of Rio de Janeiro, Rio de Janeiro, Brazil
| | - Rafael M. Couñago
- Department of Biochemistry, University of Otago, Dunedin, New Zealand
| | - Claudia A. M. Russo
- Department of Genetics, Federal University of Rio de Janeiro, Rio de Janeiro, Brazil
| | - Márcio Alves-Ferreira
- Department of Genetics, Federal University of Rio de Janeiro, Rio de Janeiro, Brazil
- * E-mail:
| |
Collapse
|
4
|
Variability of esterase patterns in adult flies of the saltans species group of Drosophila (subgenus Sophophora). Genetica 2009; 137:111-24. [PMID: 19301127 DOI: 10.1007/s10709-009-9357-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2007] [Accepted: 02/16/2009] [Indexed: 10/21/2022]
Abstract
Esterases are known for their involvement in several physiological processes and high degree of polymorphism, in many organisms. Such polymorphism has been used to characterize species and species groups and to study genetic changes occurred in their evolutionary history. In the present study, the esterase patterns of 19 strains from 10 species representative of the five subgroups of the saltans species group were analyzed using polyacrylamide gel electrophoresis and alpha- and beta- naphthyl acetates as substrates. Fifty-one esterase bands were detected and classified as 31 alpha-esterases, 18 beta-esterases and two alpha/beta-esterases. On the basis of the inhibition patterns using Malathion and eserine sulfate, 34 bands were classified as carboxylesterases, 14 as acethylesterases and three as cholinesterases. Ten gene loci were tentatively established on the basis of data on band position in the gel, substrate preference and inhibition pattern. Twenty bands were species-specific, the remaining being shared by species from the same or different subgroups. Bands detected exclusively in males and bands with a different frequency or degree of expression between sexes were also detected. In the gels prepared for analysis of gene expression in the body parts (head, thorax and abdomen), the degree of expression of the beta-esterases was higher in the thorax, while the alpha-esterases were expressed predominantly in the abdomen and thorax. A global view of the data available at present on the esterases of the species from the saltans group and their degree of polymorphism are presented, as well as the possibility of using some beta-esterases, because of their characteristics in the gels, as markers for species identification.
Collapse
|
5
|
Abstract
Spliceosomal introns, a hallmark of eukaryotic gene organization, were an unexpected discovery. After three decades, crucial issues such as when and how introns first appeared in evolution remain unsettled. An issue yet to be answered is how intron positions arise de novo. Phylogenetic investigations concur that intron positions continue to emerge, at least in some lineages. Yet genomic scans for the sources of introns occupying new positions have been fruitless. Two alternative solutions to this paradox are: (i) formation of new intron positions halted before the recent past and (ii) it continues to occur, but through processes different from those generally assumed. One process generally dismissed is intron sliding--the relocation of a preexisting intron over short distances--because of supposed associated deleterious effects. The puzzle of intron gain arises owing to a pervasive operational definition of introns, which sees them as precisely demarcated segments of the genome separated from the neighboring nonintronic DNA by unmovable limits. Intron homology is defined as position homology. Recent studies of pre-mRNA processing indicate that this assumption needs to be revised. We incorporate recent advances on the evolutionarily frequent process of alternative splicing, by which exons of primary transcripts are spliced in different patterns, into a new model of intron sliding that accounts for the diversity of intron positions. We posit that intron positional diversity is driven by two overlapping processes: (i) background process of continuous relocation of preexisting introns by sliding and (ii) spurts of extensive gain/loss of new intron sequences.
Collapse
|
6
|
Abstract
Research into the origins of introns is at a critical juncture in the resolution of theories on the evolution of early life (which came first, RNA or DNA?), the identity of LUCA (the last universal common ancestor, was it prokaryotic- or eukaryotic-like?), and the significance of noncoding nucleotide variation. One early notion was that introns would have evolved as a component of an efficient mechanism for the origin of genes. But alternative theories emerged as well. From the debate between the "introns-early" and "introns-late" theories came the proposal that introns arose before the origin of genetically encoded proteins and DNA, and the more recent "introns-first" theory, which postulates the presence of introns at that early evolutionary stage from a reconstruction of the "RNA world." Here we review seminal and recent ideas about intron origins. Recent discoveries about the patterns and causes of intron evolution make this one of the most hotly debated and exciting topics in molecular evolutionary biology today.
Collapse
Affiliation(s)
- Francisco Rodríguez-Trelles
- Department of Ecology and Evolutionary Biology, University of California, Irvine, California 92697-2525, USA.
| | | | | |
Collapse
|
7
|
Fridmanis D, Fredriksson R, Kapa I, Schiöth HB, Klovins J. Formation of new genes explains lower intron density in mammalian Rhodopsin G protein-coupled receptors. Mol Phylogenet Evol 2006; 43:864-80. [PMID: 17188520 DOI: 10.1016/j.ympev.2006.11.007] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2006] [Revised: 10/06/2006] [Accepted: 11/02/2006] [Indexed: 10/23/2022]
Abstract
Mammalian G protein-coupled receptor (GPCR) genes are characterised by a large proportion of intronless genes or a lower density of introns when compared with GPCRs of invertebrates. It is unclear which mechanisms have influenced intron density in this protein family, which is one of the largest in the mammalian genomes. We used a combination of Hidden Markov Models (HMM) and BLAST searches to establish the comprehensive repertoire of Rhodopsin GPCRs from seven species and performed overall alignments and phylogenetic analysis using the maximum parsimony method for over 1400 receptors in 12 subgroups. We identified 14 different Ancestral Receptor Groups (ARGs) that have members in both vertebrate and invertebrate species. We found that there exists a remarkable difference in the intron density among ancestral and new Rhodopsin GPCRs. The intron density among ARGs members was more than 3.5-fold higher than that within non-ARG members and more than 2-fold higher when considering only the 7TM region. This suggests that the new GPCR genes have been predominantly formed intronless while the ancestral receptors likely accumulated introns during their evolution. Many of the intron positions found in mammalian ARG receptor sequences were found to be present in orthologue invertebrate receptors suggesting that these intron positions are ancient. This analysis also revealed that one intron position is much more frequent than any other position and it is common for a number of phylogenetically different Rhodopsin GPCR groups. This intron position lies within a functionally important, conserved, DRY motif which may form a proto-splice site that could contribute to positional intron insertion. Moreover, we have found that other receptor motifs, similar to DRY, also contain introns between the second and third nucleotide of the arginine codon which also forms a proto-splice site. Our analysis presents compelling evidence that there was not a major loss of introns in mammalian GPCRs and formation of new GPCRs among mammals explains why these have fewer introns compared to invertebrate GPCRs. We also discuss and speculate about the possible role of different RNA- and DNA-based mechanisms of intron insertion and loss.
Collapse
Affiliation(s)
- Davids Fridmanis
- Biomedical Research and Study Centre, University of Latvia, Ratsupites 1, Riga, Latvia
| | | | | | | | | |
Collapse
|
8
|
Singh ND, Arndt PF, Petrov DA. Minor shift in background substitutional patterns in the Drosophila saltans and willistoni lineages is insufficient to explain GC content of coding sequences. BMC Biol 2006; 4:37. [PMID: 17049096 PMCID: PMC1626080 DOI: 10.1186/1741-7007-4-37] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2006] [Accepted: 10/18/2006] [Indexed: 11/10/2022] Open
Abstract
Background Several lines of evidence suggest that codon usage in the Drosophila saltans and D. willistoni lineages has shifted towards a less frequent use of GC-ending codons. Introns in these lineages show a parallel shift toward a lower GC content. These patterns have been alternatively ascribed to either a shift in mutational patterns or changes in the definition of preferred and unpreferred codons in these lineages. Results and discussion To gain additional insight into this question, we quantified background substitutional patterns in the saltans/willistoni group using inactive copies of a novel, Q-like retrotransposable element. We demonstrate that the pattern of background substitutions in the saltans/willistoni lineage has shifted to a significant degree, primarily due to changes in mutational biases. These differences predict a lower equilibrium GC content in the genomes of the saltans/willistoni species compared with that in the D. melanogaster species group. The magnitude of the difference can readily account for changes in intronic GC content, but it appears insufficient to explain changes in codon usage within the saltans/willistoni lineage. Conclusion We suggest that the observed changes in codon usage in the saltans/willistoni clade reflects either lineage-specific changes in the definitions of preferred and unpreferred codons, or a weaker selective pressure on codon bias in this lineage.
Collapse
Affiliation(s)
- Nadia D Singh
- Department of Biological Sciences, Stanford University, 371 Serra Mall, Stanford, CA 94305, USA
| | - Peter F Arndt
- Max Planck for Molecular Genetics, 14195 Berlin, Germany
| | - Dmitri A Petrov
- Department of Biological Sciences, Stanford University, 371 Serra Mall, Stanford, CA 94305, USA
| |
Collapse
|
9
|
Abstract
There has been a lively debate over the evolution of eukaryote introns: at what point in the tree of life did they appear and from where, and what has been their subsequent pattern of loss and gain? A diverse range of recent research papers is relevant to this debate, and it is timely to bring them together. The absence of introns that are not self-splicing in prokaryotes and several other lines of evidence suggest an ancient eukaryotic origin for these introns, and the subsequent gain and loss of introns appears to be an ongoing process in many organisms. Some introns are now functionally important and there have been suggestions that invoke natural selection for the ancient and recent gain of introns, but it is also possible that fixation and loss of introns can occur in the absence of positive selection.
Collapse
Affiliation(s)
- R Belshaw
- Department of Zoology, University of Oxford, South Parks Road, Oxford OX1 3PS, UK.
| | | |
Collapse
|
10
|
Rodríguez-Trelles F, Tarrío R, Ayala FJ. Models of spliceosomal intron proliferation in the face of widespread ectopic expression. Gene 2006; 366:201-8. [PMID: 16288838 DOI: 10.1016/j.gene.2005.09.004] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2005] [Revised: 08/04/2005] [Accepted: 09/02/2005] [Indexed: 11/27/2022]
Abstract
It is now certain that today living organisms can acquire new spliceosomal introns in their genes. The proposed sources of spliceosomal introns are exons, transposons, and other introns, including spliceosomal and group II self-splicing introns. Spliceosomal introns are thought to be the most likely source, because the inserted sequence would immediately be endowed with the essential set of intron recognition sequences, thereby preventing the deleterious effects associated with incorrect splicing. The most obvious spliceosomal intron duplication pathways involve an RNA transcript intermediate step. Therefore, for a spliceosomal intron to be originated by duplication, either the source gene from which the novel intron is derived, or that gene and the recipient gene, which contains the novel intron, would need to be expressed in the germ line. Intron proliferation surveys indicate that putative intron duplicate-containing genes do not always match detectable expression in the germ line, which casts doubt on the generality of the duplication model. However, judging mechanisms of intron gain (or loss) from present-day gene expression profiles could be erroneous, if expression patterns were different at the time the introns arose. In fact, this may likely be so in most cases. Ectopic expression, i.e., the expression of genes at times and locations where the target gene is not known to have a function, is a much more common phenomenon than previously realized. We conclude with a speculation on a possible interplay between spliceosomal introns and ectopic expression at the origin of multicellularity.
Collapse
Affiliation(s)
- Francisco Rodríguez-Trelles
- Department of Ecology and Evolutionary Biology, University of California, Irvine, California 92697-2525, USA.
| | | | | |
Collapse
|
11
|
Morán T, Fontdevila A. Phylogeny and molecular evolution of the Drosophila hydei subgroup (Drosophila repleta group) inferred from the Xanthine dehydrogenase gene. Mol Phylogenet Evol 2005; 36:695-705. [PMID: 15935705 DOI: 10.1016/j.ympev.2005.04.009] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2004] [Revised: 03/18/2005] [Accepted: 04/05/2005] [Indexed: 11/30/2022]
Abstract
The hydei subgroup (Drosophila repleta group) consists of seven species divided into two complexes: bifurca and hydei, whose phylogenetic relationships are not well understood. To evaluate the molecular phylogeny of this subgroup, we analyzed 2085 bp of coding sequence of the Xanthine dehydrogenase gene in six available species of the hydei subgroup, with Drosophila buzzatii and Drosophila mulleri as an outgroup. For phylogenetic reconstruction we adopted a maximum-likelihood framework, based on the adjustment of descriptive models of nucleotide substitution to real data. We employed distance-based and weighted parsimony methods to construct candidate phylogenies. In all cases, we obtained only one completely resolved tree with strong statistical support for each node, that shows a phylogeny that is partially discordant with the proposed systematics of the subgroup. This tree suggests that the two species complexes are paraphyletic, as opposed to classic phylogenies using morphologic and cytologic traits. This discordance is discussed in relation to its implication for the evolutionary history of the hydei subgroup.
Collapse
Affiliation(s)
- Tomás Morán
- Grup de Biología Evolutiva, Departament de Genètica i de Microbiologia, Universitat Autònoma de Barcelona, 08193 Bellaterra (Barcelona), Spain
| | | |
Collapse
|
12
|
Binford GJ, Cordes MHJ, Wells MA. Sphingomyelinase D from venoms of Loxosceles spiders: evolutionary insights from cDNA sequences and gene structure. Toxicon 2005; 45:547-60. [PMID: 15777950 DOI: 10.1016/j.toxicon.2004.11.011] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2004] [Accepted: 11/01/2004] [Indexed: 10/25/2022]
Abstract
Loxosceles spider venoms cause dermonecrosis in mammalian tissues. The toxin sphingomyelinase D (SMaseD) is a sufficient causative agent in lesion formation and is only known in these spiders and a few pathogenic bacteria. Similarities between spider and bacterial SMaseD in molecular weights, pIs and N-terminal amino acid sequence suggest an evolutionary relationship between these molecules. We report three cDNA sequences from venom-expressed mRNAs, analyses of amino acid sequences, and partial characterization of gene structure of SMaseD homologs from Loxosceles arizonica with the goal of better understanding the evolution of this toxin. Sequence analyses indicate SMaseD is a single domain protein and a divergent member of the ubitiquous, broadly conserved glycerophosphoryl diester phosphodiesterase family (GDPD). Bacterial SMaseDs are not identifiable as homologs of spider SMaseD or GDPD family members. Amino acid sequence similarities do not afford clear distinction between independent origin of toxic SMaseD activity in spiders and bacteria and origin in one lineage by ancient horizontal transfer from the other. The SMaseD genes span at least 6500bp and contain at least 5 introns. Together, these data indicate L. arizonica SMaseD has been evolving within a eukaryotic genome for a long time ruling out origin by recent transfer from bacteria.
Collapse
Affiliation(s)
- Greta J Binford
- Department of Biochemistry and Molecular Biophysics, University of Arizona, Tucson, AZ 85719, USA.
| | | | | |
Collapse
|
13
|
Coghlan A, Wolfe KH. Origins of recently gained introns in Caenorhabditis. Proc Natl Acad Sci U S A 2004; 101:11362-7. [PMID: 15243155 PMCID: PMC509176 DOI: 10.1073/pnas.0308192101] [Citation(s) in RCA: 103] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2003] [Indexed: 11/18/2022] Open
Abstract
The genomes of the nematodes Caenorhabditis elegans and Caenorhabditis briggsae both contain approximately 100,000 introns, of which >6,000 are unique to one or the other species. To study the origins of new introns, we used a conservative method involving phylogenetic comparisons to animal orthologs and nematode paralogs to identify cases where an intron content difference between C. elegans and C. briggsae was caused by intron insertion rather than deletion. We identified 81 recently gained introns in C. elegans and 41 in C. briggsae. Novel introns have a stronger exon splice site consensus sequence than the general population of introns and show the same preference for phase 0 sites in codons over phases 1 and 2. More of the novel introns are inserted in genes that are expressed in the C. elegans germ line than expected by chance. Thirteen of the 122 gained introns are in genes whose protein products function in premRNA processing, including three gains in the gene for spliceosomal protein SF3B1 and two in the nonsense-mediated decay gene smg-2. Twenty-eight novel introns have significant DNA sequence identity to other introns, including three that are similar to other introns in the same gene. All of these similarities involve minisatellites or palindromes in the intron sequences. Our results suggest that at least some of the intron gains were caused by reverse splicing of a preexisting intron.
Collapse
Affiliation(s)
- Avril Coghlan
- Department of Genetics, Smurfit Institute, University of Dublin, Trinity College, Dublin 2, Ireland
| | | |
Collapse
|
14
|
Brady SG, Danforth BN. Recent Intron Gain in Elongation Factor-1α of Colletid Bees (Hymenoptera: Colletidae). Mol Biol Evol 2004; 21:691-6. [PMID: 14739243 DOI: 10.1093/molbev/msh062] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023] Open
Abstract
We discovered the presence of a unique spliceosomal intron in the F1 copy of elongation factor-1alpha (EF-1alpha) restricted to the bee family Colletidae (Hymenoptera: Apoidae). The intron ranges in size from 101 to 1044 bp and shows no positional sliding. Our data also demonstrate the complete absence of this intron from exemplars representing all other bee families, as well as from close hymenopteran relatives. A review of the literature finds that this intron is likewise absent from all other arthropods for which data are available. This provides unambiguous evidence for a relatively recent intron insertion event in the colletid common ancestor and, at least in this specific instance, lends support to the introns-late hypothesis. The comparative distribution of this novel intron also supports the monophyly of Colletidae and the exclusion of the Stenotritidae from this family, providing an example of the potential of some introns to act as robust markers of shared descent.
Collapse
Affiliation(s)
- Seán G Brady
- Department of Entomology, Cornell University, Ithaca, New York, USA.
| | | |
Collapse
|
15
|
Abstract
For nearly 15 years, it has been widely believed that many introns were recently acquired by the genes of multicellular organisms. However, the mechanism of acquisition has yet to be described for a single animal intron. Here, we report a large-scale computational analysis of the human, Drosophila melanogaster, Caenorhabditis elegans, and Arabidopsis thaliana genomes. We divided 147,796 human intron sequences into batches of similar lengths and aligned them with each other. Different types of homologies between introns were found, but none showed evidence of simple intron transposition. Also, 106,902 plant, 39,624 Drosophila, and 6021 C. elegans introns were examined. No single case of homologous introns in nonhomologous genes was detected. Thus, we found no example of transposition of introns in the last 50 million years in humans, in 3 million years in Drosophila and C. elegans, or in 5 million years in Arabidopsis. Either new introns do not arise via transposition of other introns or intron transposition must have occurred so early in evolution that all traces of homology have been lost.
Collapse
Affiliation(s)
- Alexei Fedorov
- Department of Medicine, Medical College of Ohio, Toledo, Ohio 43614, USA.
| | | | | | | |
Collapse
|
16
|
Tarrio R, Rodríguez-Trelles F, Ayala FJ. A new Drosophila spliceosomal intron position is common in plants. Proc Natl Acad Sci U S A 2003; 100:6580-3. [PMID: 12750476 PMCID: PMC164489 DOI: 10.1073/pnas.0731952100] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The 25-year-old debate about the origin of introns between proponents of "introns early" and "introns late" has yielded significant advances, yet important questions remain to be ascertained. One question concerns the density of introns in the last common ancestor of the three multicellular kingdoms. Approaches to this issue thus far have relied on counts of the numbers of identical intron positions across present-day taxa on the assumption that the introns at those sites are orthologous. However, dismissing parallel intron gain for those sites may be unwarranted, because various factors can potentially constrain the site of intron insertion. Demonstrating parallel intron gain is severely handicapped, because intron sequences often evolve exceedingly fast and intron phylogenetic distributions are usually ambiguous, such that alternative loss and gain scenarios cannot be clearly distinguished. We have identified an intron position that was gained independently in animals and plants in the xanthine dehydrogenase gene. The extremely disjointed phylogenetic distribution of the intron argues strongly for separate gain rather than recurrent loss. If the observed phylogenetic pattern had resulted from recurrent loss, all observational support previously gathered for the introns-late theory of intron origins based on the phylogenetic distribution of introns would be invalidated.
Collapse
Affiliation(s)
- Rosa Tarrio
- Department of Ecology and Evolutionary Biology, University of California, Irvine 92697-2525, USA
| | | | | |
Collapse
|
17
|
Bird S, Zou J, Wang T, Munday B, Cunningham C, Secombes CJ. Evolution of interleukin-1beta. Cytokine Growth Factor Rev 2002; 13:483-502. [PMID: 12401481 DOI: 10.1016/s1359-6101(02)00028-x] [Citation(s) in RCA: 184] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
All jawed vertebrates possess a complex immune system, which is capable of anticipatory and innate immune responses. Jawless vertebrates possess an equally complex immune system but with no evidence of an anticipatory immune response. From these findings it has been speculated that the initiation and regulation of the immune system within vertebrates will be equally complex, although very little has been done to look at the evolution of cytokine genes, despite well-known biological activities within vertebrates. In recent years, cytokines, which have been well characterised within mammals, have begun to be cloned and sequenced within non-mammalian vertebrates, with the number of cytokine sequences available from primitive vertebrates growing rapidly. The identification of cytokines, which are mammalian homologues, will give a better insight into where immune system communicators arose and may also reveal molecules, which are unique to certain organisms. Work has focussed on interleukin-1 (IL-1), a major mediator of inflammation which initiates and/or increases a wide variety of non-structural, function associated genes that are characteristically expressed during inflammation. Other than mammalian IL-1beta sequences there are now full cDNA sequences and genomic organisations available from bird, amphibian, bony fish and cartilaginous fish, with many of these genes having been obtained using an homology cloning approach. This review considers how the IL-1beta gene has changed through vertebrate evolution and whether its role and regulation are conserved within selected non-mammalian vertebrates.
Collapse
Affiliation(s)
- Steve Bird
- Department of Zoology, University of Aberdeen, Tillydrone Avenue, Aberdeen AB24 2TZ, UK
| | | | | | | | | | | |
Collapse
|
18
|
Di Maro A, Pizzo E, Cubellis MV, D'Alessio G. An intron-less betagamma-crystallin-type gene from the sponge Geodia cydonium. Gene 2002; 299:79-82. [PMID: 12459254 DOI: 10.1016/s0378-1119(02)01014-4] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
We report the cloning of a gene encoding a betagamma-crystallin-type protein from a porifera, the Geodia cydonium sponge. The data provide direct, conclusive evidence of the existence of such a gene in the genome of an early diverged metazoan. The cloned gene is found to contain no introns, while proto-splice sites are identified in the nucleotide sequence at positions where introns are located in homologous, very recently diverged vertebrate genes. These findings are discussed in the light of the debate between the introns-late and introns-early theories.
Collapse
Affiliation(s)
- Antimo Di Maro
- Dipartimento di Scienze della Vita, Seconda Università di Napoli, Via Vivaldi 43, 81100 Caserta, Italy
| | | | | | | |
Collapse
|
19
|
Jordal BH. Elongation Factor 1 alpha resolves the monophyly of the haplodiploid ambrosia beetles Xyleborini (Coleoptera: Curculionidae). INSECT MOLECULAR BIOLOGY 2002; 11:453-465. [PMID: 12230544 DOI: 10.1046/j.1365-2583.2002.00354.x] [Citation(s) in RCA: 38] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
Elongation Factor 1-alpha was used to test the monophyly of the wood boring beetle tribe Xyleborini, where all species are haplodiploid and perform regular inbreeding by brother-sister mating. Due to their feeding requirements, being highly dependent on ophiostomatoid fungi which they cultivate in wood tunnels, monophyly may be expected due to nutritional constraints. During the course of analyses, two copies of EF-1alpha were amplified in these beetles, differing in intron structure. The high similarity between paralogous amino acid sequences (93-94%) indicates a rather recent duplication in beetles, but phylogenetic analyses of different copies in insects rejected this hypothesis. Subsequent phylogenetic analyses of eighty orthologous sequences from Xyleborini and allied taxa, using the single-intron bearing copy, were greatly improved in resolution and node support by including the intron sequences (c. 60 bp). Most analyses resulted in a monophyletic Xyleborini, implying one origin of fungus feeding in this tribe. However, clear evidence for a polyphyletic Xyleborus and three more xyleborine genera calls for further revision of xyleborine classification.
Collapse
Affiliation(s)
- B H Jordal
- Department of Zoology, University of Bergen, Allegt 41, N-5007 Bergen, Norway.
| |
Collapse
|
20
|
Abstract
Debate over the mechanisms responsible for the phylogenetic and genomic distribution of introns has proceeded largely without consideration of the population-genetic forces influencing the establishment and retention of novel genetic elements. However, a simple model incorporating random genetic drift and weak mutation pressure against intron-containing alleles yields predictions consistent with a diversity of observations: (i) the rarity of introns in unicellular organisms with large population sizes, and their expansion after the origin of multicellular organisms with reduced population sizes; (ii) the relationship between intron abundance and the stringency of splice-site requirements; (iii) the tendency for introns to be more numerous and longer in regions of low recombination; and (iv) the bias toward phase-0 introns. This study provides a second example of a mechanism whereby genomic complexity originates passively as a "pathological" response to small population size, and raises difficulties for the idea that ancient introns played a major role in the origin of genes by exon shuffling.
Collapse
Affiliation(s)
- Michael Lynch
- Department of Biology, Indiana University, Bloomington, IN 47405, USA.
| |
Collapse
|
21
|
Nixon JEJ, Wang A, Morrison HG, McArthur AG, Sogin ML, Loftus BJ, Samuelson J. A spliceosomal intron in Giardia lamblia. Proc Natl Acad Sci U S A 2002; 99:3701-5. [PMID: 11854456 PMCID: PMC122587 DOI: 10.1073/pnas.042700299] [Citation(s) in RCA: 120] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2001] [Accepted: 12/26/2001] [Indexed: 11/18/2022] Open
Abstract
Short introns occur in numerous protist lineages, but there are no reports of intervening sequences in the protists Giardia lamblia and Trichomonas vaginalis, which may represent the deepest known branches in the eukaryotic line of descent. We have discovered a 35-bp spliceosomal intron in a gene encoding a putative [2Fe-2S] ferredoxin of G. lamblia. The Giardia intron contains a canonical splice site at its 3' end (AG), a noncanonical splice site at its 5' end (CT), and a branch point sequence that fits the yeast consensus sequence of TACTAAC except for the first nucleotide (AACTAAC). We have also identified several G. lamblia genes with spliceosomal peptides, including homologues of eukaryote-specific spliceosomal peptides (Prp8 and Prp11), several DExH-box RNA-helicases that have homologues in eubacteria, but serve essential functions in the splicing of introns in eukaryotes, and 11 predicted archaebacteria-like Sm and like-Sm core peptides, which coat small nuclear RNAs. Phylogenetic analyses show the Giardia Sm core peptides are the products of multiple, ancestral gene duplications followed by divergence, but they retain strong similarity to Sm and like-Sm peptides of other eukaryotes. Although we have documented only a single intron in Giardia, it likely has other introns and fully functional, spliceosomal machinery. If introns were added during eukaryotic evolution (the introns-late hypothesis), then these results push back the date of this event before the branching of G. lamblia.
Collapse
Affiliation(s)
- Julie E J Nixon
- Department of Immunology and Infectious Diseases, Harvard School of Public Health, 665 Huntington Avenue, Boston, MA 02115-6018, USA
| | | | | | | | | | | | | |
Collapse
|
22
|
Boudet N, Aubourg S, Toffano-Nioche C, Kreis M, Lecharny A. Evolution of intron/exon structure of DEAD helicase family genes in Arabidopsis, Caenorhabditis, and Drosophila. Genome Res 2001; 11:2101-14. [PMID: 11731501 PMCID: PMC311229 DOI: 10.1101/gr.200801] [Citation(s) in RCA: 108] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
The DEAD box RNA helicase (RH) proteins are homologs involved in diverse cellular functions in all of the organisms from prokaryotes to eukaryotes. Nevertheless, there is a lack of conservation in the splicing pattern in the 53 Arabidopsis thaliana (AtRHs), the 32 Caenorhabditis elegans (CeRHs) and the 29 Drosophila melanogaster (DmRHs) genes. Of the 153 different observed intron positions, 4 are conserved between AtRHs, CeRHs, and DmRHs, and one position is also found in RHs from yeast and human. Of the 27 different AtRH structures with introns, 20 have at least one predicted ancient intron in the regions coding for the catalytic domain. In all of the organisms examined, we found at least one gene with most of its intron predicted to be ancient. In A. thaliana, the large diversity in RH structures suggests that duplications of the ancestral RH were followed by a high number of intron deletions and additions. The very high bias toward phase 0 introns is in favor of intron addition, preferentially in phase 0. Results from this comparative study of the same gene family in a plant and in two animals are discussed in terms of the general mechanisms of gene family evolution.
Collapse
Affiliation(s)
- N Boudet
- Institut de Biotechnologie des Plantes, Unité Mixte de Recherche-Centre National Recherche Scientifique 8618, Université de Paris-Sud, Bât. 630, F-91405 Orsay Cedex, France
| | | | | | | | | |
Collapse
|
23
|
Mattick JS. Non-coding RNAs: the architects of eukaryotic complexity. EMBO Rep 2001; 2:986-91. [PMID: 11713189 PMCID: PMC1084129 DOI: 10.1093/embo-reports/kve230] [Citation(s) in RCA: 558] [Impact Index Per Article: 23.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2001] [Revised: 09/10/2001] [Accepted: 09/11/2001] [Indexed: 11/14/2022] Open
Abstract
Around 98% of all transcriptional output in humans is non-coding RNA. RNA-mediated gene regulation is widespread in higher eukaryotes and complex genetic phenomena like RNA interference, co-suppression, transgene silencing, imprinting, methylation, and possibly position-effect variegation and transvection, all involve intersecting pathways based on or connected to RNA signaling. I suggest that the central dogma is incomplete, and that intronic and other non-coding RNAs have evolved to comprise a second tier of gene expression in eukaryotes, which enables the integration and networking of complex suites of gene activity. Although proteins are the fundamental effectors of cellular function, the basis of eukaryotic complexity and phenotypic variation may lie primarily in a control architecture composed of a highly parallel system of trans-acting RNAs that relay state information required for the coordination and modulation of gene expression, via chromatin remodeling, RNA-DNA, RNA-RNA and RNA-protein interactions. This system has interesting and perhaps informative analogies with small world networks and dataflow computing.
Collapse
Affiliation(s)
- J S Mattick
- ARC Special Research Centre for Functional and Applied Genomics, Institute for Molecular Bioscience, University of Queensland, Brisbane 4072, Australia.
| |
Collapse
|
24
|
Mattick JS, Gagen MJ. The evolution of controlled multitasked gene networks: the role of introns and other noncoding RNAs in the development of complex organisms. Mol Biol Evol 2001; 18:1611-30. [PMID: 11504843 DOI: 10.1093/oxfordjournals.molbev.a003951] [Citation(s) in RCA: 301] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Eukaryotic phenotypic diversity arises from multitasking of a core proteome of limited size. Multitasking is routine in computers, as well as in other sophisticated information systems, and requires multiple inputs and outputs to control and integrate network activity. Higher eukaryotes have a mosaic gene structure with a dual output, mRNA (protein-coding) sequences and introns, which are released from the pre-mRNA by posttranscriptional processing. Introns have been enormously successful as a class of sequences and comprise up to 95% of the primary transcripts of protein-coding genes in mammals. In addition, many other transcripts (perhaps more than half) do not encode proteins at all, but appear both to be developmentally regulated and to have genetic function. We suggest that these RNAs (eRNAs) have evolved to function as endogenous network control molecules which enable direct gene-gene communication and multitasking of eukaryotic genomes. Analysis of a range of complex genetic phenomena in which RNA is involved or implicated, including co-suppression, transgene silencing, RNA interference, imprinting, methylation, and transvection, suggests that a higher-order regulatory system based on RNA signals operates in the higher eukaryotes and involves chromatin remodeling as well as other RNA-DNA, RNA-RNA, and RNA-protein interactions. The evolution of densely connected gene networks would be expected to result in a relatively stable core proteome due to the multiple reuse of components, implying that cellular differentiation and phenotypic variation in the higher eukaryotes results primarily from variation in the control architecture. Thus, network integration and multitasking using trans-acting RNA molecules produced in parallel with protein-coding sequences may underpin both the evolution of developmentally sophisticated multicellular organisms and the rapid expansion of phenotypic complexity into uncontested environments such as those initiated in the Cambrian radiation and those seen after major extinction events.
Collapse
Affiliation(s)
- J S Mattick
- Centre for Functional and Applied Genomics, Institute for Molecular Bioscience, University of Queensland, Brisbane, Queensland, Australia.
| | | |
Collapse
|
25
|
Pitts RJ, Zwiebel LJ. Isolation and characterization of the Xanthine dehydrogenase gene of the Mediterranean fruit fly, Ceratitis capitata. Genetics 2001; 158:1645-55. [PMID: 11514452 PMCID: PMC1461762 DOI: 10.1093/genetics/158.4.1645] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Xanthine dehydrogenase (XDH) is a member of the molybdenum hydroxylase family of enzymes catalyzing the oxidation of hypoxanthine and xanthine to uric acid. The enzyme is also required for the production of one of the major Drosophila eye pigments, drosopterin. The XDH gene has been isolated in many species representing a broad cross section of the major groups of living organisms, including the cDNA encoding XDH from the Mediterranean fruit fly Ceratitis capitata (CcXDH) described here. CcXDH is closely related to other insect XDHs and is able to rescue the phenotype of the Drosophila melanogaster XDH mutant, rosy, in germline transformation experiments. A previously identified medfly mutant, termed rosy, whose phenotype is suggestive of a disruption in XDH function, has been examined for possible mutations in the XDH gene. However, we find no direct evidence that a mutation in the CcXDH gene or that a reduction in the CcXDH enzyme activity is present in rosy medflies. Conclusive studies of the nature of the medfly rosy mutant will require rescue by germline transformation of mutant medflies.
Collapse
Affiliation(s)
- R J Pitts
- Department of Biological Sciences, Vanderbilt University, Nashville, Tennessee 37235, USA
| | | |
Collapse
|
26
|
Tarrío R, Rodríguez-Trelles F, Ayala FJ. Shared nucleotide composition biases among species and their impact on phylogenetic reconstructions of the Drosophilidae. Mol Biol Evol 2001; 18:1464-73. [PMID: 11470837 DOI: 10.1093/oxfordjournals.molbev.a003932] [Citation(s) in RCA: 62] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Compositional changes are a major feature of genome evolution. Overlooking nucleotide composition differences among sequences can seriously mislead phylogenetic reconstructions. Large compositional variation exists among the members of the family Drosophilidae. Until now, however, base composition differences have been largely neglected in the formulations of the nucleotide substitution process used to reconstruct the phylogeny of this important group of species. The present study adopts a maximum-likelihood framework of phylogenetic inference in order to analyze five nuclear gene regions and shows that (1) the pattern of compositional variation in the Drosophilidae does not match the phylogeny of the species; (2) accounting for the heterogeneous GC content with Galtier and Gouy's nucleotide substitution model leads to a tree that differs in significant aspects from the tree inferred when the nucleotide composition differences are ignored, even though both phylogenetic hypotheses attain strong nodal support in the bootstrap analyses; and (3) the LogDet distance correction cannot completely overcome the distorting effects of the compositional variation that exists among the species of the Drosophilidae. Our analyses confidently place the Chymomyza genus as an outgroup closer than the genus Scaptodrosophila to the Drosophila genus and conclusively support the monophyly of the Sophophora subgenus.
Collapse
Affiliation(s)
- R Tarrío
- Department of Ecology and Evolutionary Biology, University of California at Irvine, 92697-2525, USA
| | | | | |
Collapse
|
27
|
Ragg H, Lokot T, Kamp PB, Atchley WR, Dress A. Vertebrate serpins: construction of a conflict-free phylogeny by combining exon-intron and diagnostic site analyses. Mol Biol Evol 2001; 18:577-84. [PMID: 11264410 DOI: 10.1093/oxfordjournals.molbev.a003838] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
A combination of three independent biological features, genomic organization, diagnostic amino acid sites, and rare indels, was used to elucidate the phylogeny of the vertebrate serpin (serine protease inhibitor) superfamily. A strong correlation between serpin gene families displaying (1) a conserved exon-intron pattern and (2) family-specific combinations of amino acid residues at specific sites suggests that present-day vertebrates encompass six serpin gene families which evolved from primordial genes by massive intron insertion before or during early vertebrate radiation. Introns placed at homologous positions in the gene sequences in combination with diagnostic sequence characters may also constitute a reliable kinship indicator for other protein superfamilies.
Collapse
Affiliation(s)
- H Ragg
- Faculty of Technology and Faculty of Mathematics, University of Bielefeld, Bielefeld, Germany.
| | | | | | | | | |
Collapse
|
28
|
Long M, Rosenberg C. Testing the "proto-splice sites" model of intron origin: evidence from analysis of intron phase correlations. Mol Biol Evol 2000; 17:1789-96. [PMID: 11110894 DOI: 10.1093/oxfordjournals.molbev.a026279] [Citation(s) in RCA: 39] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
A few nucleotide sites of nuclear exons that flank introns are often conserved. A hypothesis has suggested that these sites, called "proto-splice sites," are remnants of recognition signals for the insertion of introns in the early evolution of eukaryotic genes. This notion of proto-splice sites has been an important basis for the insertional theory of introns. This hypothesis predicts that the distribution of proto-splice sites would determine the distribution of intron phases, because the positions of introns are just a subset of the proto-splice sites. We previously tested this prediction by examining the proportions of the phases of proto-splice sites, revealing nothing in these proportion distributions similar to observed proportions of intron phases. Here, we provide a second independent test of the proto-splice site hypothesis, with regard to its prediction that the proto-splice sites would mimic intron phase correlations, using a CDS database we created from GenBank. We tested four hypothetical proto-splice sites G / G, AG / G, AG / GT, and C/AAG / R. Interestingly, while G / G and AG / GT site phase distributions are not consistent with actual introns, we observed that AG / G and C/AAG / R sites have a symmetric phase excess. However, the patterns of the excess are quite different from the actual intron phase distribution. In addition, particular amino acid repeats in proteins were found to partially contribute to the excess of symmetry at these two types of sites. The phase associations of all four sites are significantly different from those of intron phases. Furthermore, a general model of intron insertion into proto-splice sites was simulated by Monte Carlo simulation to investigate the probability that the random insertion of introns into AG / G and C/AAG / R sites could generate the observed intron phase distribution. The simulation showed that (1) no observed correlation of intron phases was statistically consistent with the phase distribution of proto-splice sites in the simulated virtual genes; (2) most conservatively, no simulation in 10,000 Monte Carlo experiments gave a pattern with an excess of symmetric (1, 1) exons larger than those of (0, 0) and (2, 2), a major statistical feature of intron phase distribution that is consistent with the directly observed cases of exon shuffling. Thus, these results reject the null hypothesis that introns are randomly inserted into preexisting proto-splice sites, as suggested by the insertional theory of introns.
Collapse
Affiliation(s)
- M Long
- Department of Ecology and Evolution, University of Chicago, Illinois 60637, USa.
| | | |
Collapse
|
29
|
Abstract
Study of the nucleotide composition in Drosophila, focusing on the saltans and willistoni groups, has revealed unanticipated differences in nucleotide composition among lineages. Compositional differences are associated with an accelerated rate of nucleotide substitution in functionally less constrained regions. These observations have been set forth against the extended opinion that the pattern of point mutation has remained constant during the evolution of the genus. A crucial assumption has been that the most recent common ancestor of the subgenus Sophophora had an elevated GC content. Until now, this assumption has been supported by indirect arguments, consisting of extrapolations from closely related outgroups and limited by the robustness of mathematical descriptions concerning the extensive nucleotide composition differences among sequences. The present study seeks to test the assumption of a high ancestral GC content using realistic representations of the nucleotide substitution process to account for potential biases induced by the heterogeneous GC content of the taxa. The analysis of eight nuclear genes unambiguously corroborates that the common ancestor of Sophophora had an elevated GC content.
Collapse
Affiliation(s)
- F Rodríguez-Trelles
- Department of Ecology and Evolutionary Biology, University of California at Irvine, Irvine, CA 92697-2525, USA.
| | | | | |
Collapse
|
30
|
Tarrío R, Rodríguez-Trelles F, Ayala FJ. Tree rooting with outgroups when they differ in their nucleotide composition from the ingroup: the Drosophila saltans and willistoni groups, a case study. Mol Phylogenet Evol 2000; 16:344-9. [PMID: 10991788 DOI: 10.1006/mpev.2000.0813] [Citation(s) in RCA: 45] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Rooting is frequently the most precarious step in any phylogenetic analysis. Outgroups can become useless for rooting if they are too distantly related to the ingroup. Specifically, little attention has been paid to scenarios where outgroups have evolved different nucleotide frequencies from the ingroup. We investigate one empirical example that arose seeking to determine the phylogenetic relationship between the saltans and the willistoni groups of Drosophila (subgenus Sophophora). We have analyzed 2085 coding nucleotides from the xanthine dehydrogenase (Xdh) gene in 14 species, 6 from the saltans group and 8 from the willistoni group. We adopt a two-step strategy: (1) we investigate the phylogeny without outgroups, rooting the network by the midpoint method; (2) we reinvestigate the rooting of this phylogeny using predefined outgroups in both a parsimony- and a model-based maximum-likelihood framework. A satisfactory description of the substitution process along the Xdh region calls for six substitution types and substitution rate variation among codon positions. When the ingroup sequences are considered alone, the phylogeny obtained using this description corroborates the known relationships derived from anatomical criteria. Inclusion of the outgroups makes the root unstable, apparently because of differences between ingroups and outgroups in the substitution processes; these differences are better accounted for by a simplified model of evolution than by more complex, realistic descriptions of the substitution process.
Collapse
Affiliation(s)
- R Tarrío
- Department of Ecology and Evolutionary Biology, University of California, Irvine, California 92697-2525, USA
| | | | | |
Collapse
|
31
|
Reddy KL, Wohlwill A, Dzitoeva S, Lin MH, Holbrook S, Storti RV. The Drosophila PAR domain protein 1 (Pdp1) gene encodes multiple differentially expressed mRNAs and proteins through the use of multiple enhancers and promoters. Dev Biol 2000; 224:401-14. [PMID: 10926776 DOI: 10.1006/dbio.2000.9797] [Citation(s) in RCA: 37] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Abstract
Transcription factors are often expressed at several times and in multiple tissues during development and regulate diverse sets of downstream target genes by varying their combinatorial interactions with other transcription factors. The Drosophila Tropomyosin I (TmI) gene is regulated by a complex of proteins within the enhancer that synergistically interacts with MEF2 to activate TmI transcription as muscle cells fuse and differentiate. One of the components of this complex is PDP1 (PAR domain protein 1), a basic leucine zipper transcription factor that is highly homologous to three vertebrate genes that are members of the PAR domain subfamily. We have isolated and describe here the structure of the Pdp1 gene. The Pdp1 gene is complex, containing at least four transcriptional start sites and producing at least six different mRNAs and PDP1 isoforms. Five of the PDP1 isoforms differ by the substitution or insertion of amino acids at or near the N-terminal of the protein. At least three of these alternately spliced transcripts are differentially expressed in different tissues of the developing embryo in which PDP1 expression is correlated with the differentiation of different cell types. A sixth isoform is produced by splicing out part of the PAR and basic DNA binding domains, and DNA binding and transient transfection experiments suggest that it functions as a dominant negative inhibitor of transcription. Furthermore, two enhancers have been identified within the gene that express in the somatic mesodermal precursors to body wall muscles and fat body and together direct expression in other tissues that closely mimics that of the endogenous gene. These results show that Pdp1 is widely expressed, including in muscle, fat, and gut precursors, and is likely involved in the transcriptional control of different developmental pathways through the use of differentially expressed PDP1 isoforms. Furthermore, the similarities between Pdp1 and the other PAR domain genes suggest that Pdp1 is the homologue of the vertebrate genes.
Collapse
Affiliation(s)
- K L Reddy
- Department of Biochemistry and Molecular Biology M/C536, University of Illinois College of Medicine, Chicago 60612, USA
| | | | | | | | | | | |
Collapse
|
32
|
Rodríguez-Trelles F, Alarcón L, Fontdevila A. Molecular evolution and phylogeny of the buzzatii complex (Drosophila repleta group): a maximum-likelihood approach. Mol Biol Evol 2000; 17:1112-22. [PMID: 10889224 DOI: 10.1093/oxfordjournals.molbev.a026392] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
The buzzatii complex of the mulleri subgroup (Drosophila repleta group) consists of three clusters of species whose evolutionary relationships are poorly known. We analyzed 2,085 coding nucleotides from the xanthine dehydrogenase (XDH:) gene in the 10 available species of the complex and Drosophila mulleri and Drosophila hydei. We adopted a statistical model-fitting approach within the maximum-likelihood (ML) framework of phylogenetic inference. We first modeled the process of nucleotide substitution using a tree topology which was reasonably accurate. Then we used the most satisfactory description so attained to reconstruct the evolutionary relationships in the buzzatii complex. We found that a minimally realistic description of the substitution process of XDH: should allow six substitution types and different substitution rates for codon positions. Using this description we obtained a strongly supported, fully resolved tree which is congruent with the already-known (yet few) relationships. We also analyzed published data from three mitochondrial cytochrome oxidases (CO I, II, and III). In our analyses, these relatively short DNA sequences failed to discriminate statistically among alternative phylogenies. When the data of these three gene regions are combined with the XDH: sequences, the phylogenetic signal emerging from XDH: becomes reinforced. All four of the gene regions evolve faster in the buzzatii and martensis clusters than in the stalkeri cluster, paralleling the amount of chromosomal evolution.
Collapse
Affiliation(s)
- F Rodríguez-Trelles
- Grup de Biologia Evolutiva, Departament de Genètica i Microbiologia, Universitat Autònoma de Barcelona, Barcelona, Spain.
| | | | | |
Collapse
|
33
|
Abstract
Membrane heredity was central to the unique symbiogenetic origin from cyanobacteria of chloroplasts in the ancestor of Plantae (green plants, red algae, glaucophytes) and to subsequent lateral transfers of plastids to form even more complex photosynthetic chimeras. Each symbiogenesis integrated disparate genomes and several radically different genetic membranes into a more complex cell. The common ancestor of Plantae evolved transit machinery for plastid protein import. In later secondary symbiogeneses, signal sequences were added to target proteins across host perialgal membranes: independently into green algal plastids (euglenoids, chlorarachneans) and red algal plastids (alveolates, chromists). Conservatism and innovation during early plastid diversification are discussed.
Collapse
|
34
|
Caterino MS, Cho S, Sperling FA. The current state of insect molecular systematics: a thriving Tower of Babel. ANNUAL REVIEW OF ENTOMOLOGY 2000; 45:1-54. [PMID: 10761569 DOI: 10.1146/annurev.ento.45.1.1] [Citation(s) in RCA: 252] [Impact Index Per Article: 10.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
Insect molecular systematics has undergone remarkable recent growth. Advances in methods of data generation and analysis have led to the accumulation of large amounts of DNA sequence data from most major insect groups. In addition to reviewing theoretical and methodological advances, we have compiled information on the taxa and regions sequenced from all available phylogenetic studies of insects. It is evident that investigators have not usually coordinated their efforts. The genes and regions that have been sequenced differ substantially among studies and the whole of our efforts is thus little greater than the sum of its parts. The cytochrome oxidase I, 16S, 18S, and elongation factor-1 alpha genes have been widely used and are informative across a broad range of divergences in insects. We advocate their use as standards for insect phylogenetics. Insect molecular systematics has complemented and enhanced the value of morphological and ecological data, making substantial contributions to evolutionary biology in the process. A more coordinated approach focused on gathering homologous sequence data will greatly facilitate such efforts.
Collapse
Affiliation(s)
- M S Caterino
- Department of Environmental Science, Policy and Management, University of California, Berkeley 94720-3112, USA
| | | | | |
Collapse
|
35
|
Rokas A, Kathirithamby J, Holland PW. Intron insertion as a phylogenetic character: the engrailed homeobox of Strepsiptera does not indicate affinity with Diptera. INSECT MOLECULAR BIOLOGY 1999; 8:527-530. [PMID: 10620047 DOI: 10.1046/j.1365-2583.1999.00149.x] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
The phylogenetic relationships of the order Strepsiptera are unclear. Affiliation to Coleoptera has been proposed, however this implies that dipteran halteres and strep-sipteran haltere-like organs evolved convergently. An alternative is a sister group relationship with Diptera. In this case, halteres could be homologous but a radical homeotic mutation may have switched their position to the Strepsipteran mesothorax. Ribosomal DNA sequence analysis has been used to support Dipteran affiliation, although this is controversial. Here we investigate the potential of an intron insertion site as a phylogenetic character. We find that the en homeobox gene of the strepsipteran Stichotrema dallatorreanum lacks a derived intron insertion shared by representatives of Diptera and Lepidoptera. We argue against a close affiliation between Strepsiptera and Diptera.
Collapse
Affiliation(s)
- A Rokas
- School of Animal and Microbial Sciences, University of Reading, U.K
| | | | | |
Collapse
|
36
|
Rodríguez-Trelles F, Tarrío R, Ayala FJ. Molecular evolution and phylogeny of the Drosophila saltans species group inferred from the Xdh gene. Mol Phylogenet Evol 1999; 13:110-21. [PMID: 10508544 DOI: 10.1006/mpev.1999.0631] [Citation(s) in RCA: 20] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
The Drosophila saltans group of the subgenus Sophophora consists of five species subgroups whose phylogenetic relationships are poorly known. We have analyzed 2085 coding nucleotides from the xanthine dehydrogenase (Xdh) gene in six species, at least one from each subgroup. We follow a model-based maximum likelihood framework. We first model the substitution process using a tree topology that is approximately accurate. Then we evaluate several candidate tree topologies using a working model of nucleotide substitution. We found that a minimally realistic description of the substitution process along the Xdh region should allow two transition and four transversion rate parameters and different fixed rates for codon positions, which are distributed statistically according to different gamma distributions. The phylogeny obtained using this description differs in significant respects from a phylogeny based on anatomical criteria. We have also analyzed data from five additional (three nuclear and two mitochondrial) gene regions. In our analysis, these relatively short DNA sequences, either separately or jointly, fail to discriminate statistically among alternative phylogenies. When the data for these five gene regions are combined with the Xdh sequences, the strong phylogenetic signal emerging from Xdh becomes somewhat diluted rather than reinforced. The phylogeny of the species and biogeographical considerations suggest that the D. saltans group originated in the tropics of the New World, similarly as the closely related D. willistoni group.
Collapse
Affiliation(s)
- F Rodríguez-Trelles
- Department of Ecology and Evolutionary Biology, University of California, Irvine, California 92697-2525, USA
| | | | | |
Collapse
|
37
|
Rodríguez-Trelles F, Tarrío R, Ayala FJ. Switch in codon bias and increased rates of amino acid substitution in the Drosophila saltans species group. Genetics 1999; 153:339-50. [PMID: 10471717 PMCID: PMC1460741 DOI: 10.1093/genetics/153.1.339] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
We investigated the nucleotide composition of five genes, Xdh, Adh, Sod, Per, and 28SrRNA, in nine species of Drosophila (subgenus Sophophora) and one of Scaptodrosophila. The six species of the Drosophila saltans group markedly differ from the others in GC content and codon use bias. The GC content in the third codon position, and to a lesser extent in the first position and the introns, is higher in the D. melanogaster and D. obscura groups than in the D. saltans group (in Scaptodrosophila it is intermediate but closer to the melanogaster and obscura species). Differences are greater for Xdh than for Adh, Sod, Per, and 28SrRNA, which are functionally more constrained. We infer that rapid evolution of GC content in the saltans lineage is largely due to a shift in mutation pressure, which may have been associated with diminished natural selection due to smaller effective population numbers rather than reduced recombination rates. The rate of GC content evolution impacts the rate of protein evolution and may distort phylogenetic inferences. Previous observations suggesting that GC content evolution is very limited in Drosophila may have been distorted due to the restricted number of genes and species (mostly D. melanogaster) investigated.
Collapse
Affiliation(s)
- F Rodríguez-Trelles
- Department of Ecology and Evolutionary Biology, University of California, Irvine, California 92697-2525, USA.
| | | | | |
Collapse
|
38
|
Venkatesh B, Ning Y, Brenner S. Late changes in spliceosomal introns define clades in vertebrate evolution. Proc Natl Acad Sci U S A 1999; 96:10267-71. [PMID: 10468597 PMCID: PMC17877 DOI: 10.1073/pnas.96.18.10267] [Citation(s) in RCA: 175] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The evolutionary origin of spliceosomal introns has been the subject of much controversy. Introns are proposed to have been both lost and gained during evolution. If the gain or loss of introns are unique events in evolution, they can serve as markers for phylogenetic analysis. We have made an extensive survey of the phylogenetic distribution of seven spliceosomal introns that are present in Fugu genes, but not in their mammalian homologues; we show that these introns were acquired by actinopterygian (ray-finned) fishes at various stages of evolution. We have also investigated the intron pattern of the rhodopsin gene in fishes, and show that the four introns found in the ancestral chordate rhodopsin gene were simultaneously lost in a common ancestor of ray-finned fishes. These changes in introns serve as excellent markers for phylogenetic analysis because they reliably define clades. Our intron-based cladogram establishes the difficult-to-ascertain phylogenetic relationships of some ray-finned fishes. For example, it shows that bichirs (Polypterus) are the sister group of all other extant ray-finned fishes.
Collapse
Affiliation(s)
- B Venkatesh
- Marine Molecular Genetics Laboratory, Institute of Molecular and Cell Biology, 30 Medical Drive, Singapore 117609.
| | | | | |
Collapse
|
39
|
He M, Haymer DS. Genetic relationships of populations and the origins of new infestations of the Mediterranean fruit fly. Mol Ecol 1999; 8:1247-57. [PMID: 10447866 DOI: 10.1046/j.1365-294x.1999.00685.x] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
A polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP) approach based on the variation in intron sequences of the glucose-6-phosphate dehydrogenase (Zw) gene was used to assess genetic variability in 26 populations and infestations of the Mediterranean fruit fly (medfly), Ceratitis capitata. Beginning with the exon-primed intron-crossing PCR (EPIC-PCR) method to amplify introns of this gene, five alleles were identified on the basis of DNA sequence variants. Several of these variants affect recognition sites for the restriction enzymes RsaI and TaqI. Using these enzymes in successive digestions of the EPIC-PCR products, each of these alleles can be identified directly from individuals. From this, surveys were conducted to document genotypes and allele frequencies in these samples. The relationships of existing populations and the invasion process represented by new infestations of the medfly were analysed using a principal coordinate analysis and the amova method to quantify the distribution of genetic diversity at different levels in a hierarchical manner. From these results, a framework of genetic relationships among the populations and infestations is presented. In addition, for at least some of the infestations, populations that are probably acting as sources of origin have been identified.
Collapse
Affiliation(s)
- M He
- Department of Genetics and Molecular Biology, University of Hawaii, Honolulu, Hawaii 96822, USA
| | | |
Collapse
|
40
|
Kômoto N, Yukuhiro K, Tamura T. Structure and expression of tandemly duplicated xanthine dehydrogenase genes of the silkworm (Bombyx mori). INSECT MOLECULAR BIOLOGY 1999; 8:73-83. [PMID: 9927176 DOI: 10.1046/j.1365-2583.1999.810073.x] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/22/2023]
Abstract
Xanthine dehydrogenase (XDH) is a molybdoenzyme which catalyses oxidation of xanthine and hypoxanthine to uric acid. We isolated genomic clones of silkworm (Bombyx mori) XDH genes (BmXDH1 and BmXDH2). The BmXDH2 gene is located upstream from the BmXDH1 gene and they show a tandemly duplicated structure. Both BmXDH genes were expressed in the fat body and Malpighian tubules, whereas only the BmXDH1 gene was expressed in the midgut. Phylogenetic analysis indicates that BmXDH gene duplication occurred after the divergence of the silkworm and dipteran species. Intron insertion site comparison shows that some introns were lost during insect XDH gene evolution.
Collapse
Affiliation(s)
- N Kômoto
- Department of Insect Genetics and Breeding, National Institute of Sericultural and Entomological Science, Tsukuba, Ibaraki, Japan.
| | | | | |
Collapse
|
41
|
Abstract
Does the intron/exon structure of eukaryotic genes belie their ancient assembly by exon-shuffling or have introns been inserted into preformed genes during eukaryotic evolution? These are the central questions in the ongoing 'introns-early' versus 'introns-late' controversy. The phylogenetic distribution of spliceosomal introns continues to strongly favor the intronslate theory. The introns-early theory, however, has claimed support from intron phase and protein structure correlations.
Collapse
Affiliation(s)
- J M Logsdon
- Department of Biochemistry, Dalhousie University, Halifax, Nova Scotia,B3H 4H7, Canada.
| |
Collapse
|
42
|
Abstract
The 'introns-late' theory holds that spliceosomal introns have been added to genes during eukaryotic evolution. Few clear examples of recent intron gains have been well documented, but two such cases have now been reported, one with possible identification of the source of the intron.
Collapse
Affiliation(s)
- J M Logsdon
- Program in Evolutionary Biology Canadian Institute for Advanced Research Department of Biochemistry Dalhousie University Halifax, Nova Scotia, Canada B3H 4H7.
| | | | | |
Collapse
|