1
|
Liu K, Zhao S, Wang S, Wang H, Zhang Z. Identification and analysis of the FAD gene family in walnuts (Juglans regia L.) based on transcriptome data. BMC Genomics 2020; 21:299. [PMID: 32293267 PMCID: PMC7158092 DOI: 10.1186/s12864-020-6692-z] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2019] [Accepted: 03/24/2020] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Walnut kernels contain a large amount of unsaturated fatty acids, such as linoleic acid and linolenic acid, which are essential fatty acids for humans and have important effects on growth and health. The main function of fatty acid desaturase (FAD), which is widely distributed in organisms, is to remove hydrogen from carbon chains in the biosynthesis of unsaturated fatty acids to generate C=C bonds. RESULTS By performing a series of bioinformatics analysis, 24 members of the JrFAD gene family were identified from the genome database of walnut, and then compared with the homologous genes from Arabidopsis. Phylogenetic analysis showed that JrFADs were classified into four subfamilies: the SAD desaturase subfamily, Δ7/Δ9 desaturase subfamily, Δ12/ω-3 desaturase subfamily and "front-end" desaturase subfamily. Meanwhile, the expression of fatty acid synthesis genes in walnut kernels at different developmental stages was analysed by transcriptome sequencing, with expression of JrFAD3-1, which encodes an enzyme involved in linolenic acid synthesis, being particularly prominent. The relative expression level of JrFAD3-1 changed dramatically with the kernel development stages and exhibited a Bell-Shaped Curve. A significant positive correlation was observed between the expression of JrFAD3-1 during 70-100 DAF (Days after flowering) and the content of alpha-linolenic acid during 100-130 DAF, with a correlation coefficient of 0.991. Additionally, JrFAD3-1 was proved closely related to homologous genes in Betula pendula and Corylus heterophylla, indicating that the conserved structure of FADs is consistent with classical plant taxonomy. CONCLUSION Twenty-four members JrFADs in walnut were identified and classified into four subfamilies. JrFAD3-1 may play significant roles in the biosynthesis of polyunsaturated fatty acids in walnut.
Collapse
Affiliation(s)
- Kai Liu
- Mountainous Area Research Institute of Hebei Province, Hebei Agricultural University, Baoding, 071001, China
| | - Shugang Zhao
- College of Life Sciences, Hebei Agricultural University, Baoding, 071001, China.
| | - Shuang Wang
- Mountainous Area Research Institute of Hebei Province, Hebei Agricultural University, Baoding, 071001, China
| | - Hongxia Wang
- Mountainous Area Research Institute of Hebei Province, Hebei Agricultural University, Baoding, 071001, China. .,Research Center for Agricultural Engineering Technology of Mountain District of Hebei, Baoding, 071001, China. .,National Engineering Research Center for Agriculture in Northern Mountainous Areas, Baoding, 071001, China.
| | - Zhihua Zhang
- Mountainous Area Research Institute of Hebei Province, Hebei Agricultural University, Baoding, 071001, China. .,Research Center for Agricultural Engineering Technology of Mountain District of Hebei, Baoding, 071001, China. .,National Engineering Research Center for Agriculture in Northern Mountainous Areas, Baoding, 071001, China.
| |
Collapse
|
2
|
Wang Y, Tao XF, Su ZX, Liu AK, Liu TL, Sun L, Yao Q, Chen KP, Gu X. Current Bacterial Gene Encoding Capsule Biosynthesis Protein CapI Contains Nucleotides Derived from Exonization. Evol Bioinform Online 2016; 12:303-312. [PMID: 27980385 PMCID: PMC5154736 DOI: 10.4137/ebo.s40703] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2016] [Revised: 09/18/2016] [Accepted: 09/22/2016] [Indexed: 12/04/2022] Open
Abstract
Since the proposition of introns-early hypothesis, although many studies have shown that most eukaryotic ancestors possessed intron-rich genomes, evidence of intron existence in genomes of ancestral bacteria has still been absent. While not a single intron has been found in all protein-coding genes of current bacteria, analyses on bacterial genes horizontally transferred into eukaryotes at ancient time may provide evidence of intron existence in bacterial ancestors. In this study, a bacterial gene encoding capsule biosynthesis protein CapI was found in the genome of sea anemone, Nematostella vectensis. This horizontally transferred gene contains a phase 1 intron of 40 base pairs. The nucleotides of this intron have high sequence identity with those encoding amino acids in current bacterial CapI gene, indicating that the intron and the amino acid-coding nucleotides are originated from the same ancestor sequence. Moreover, 5′-splice site of this intron is located in a GT-poor region associated with a closely following AG-rich region, suggesting that deletion mutation at 5′-splice site has been employed to remove this intron and the intron-like amino acid-coding nucleotides in current bacterial CapI gene are derived from exonization. These data suggest that bacterial CapI gene contained intron(s) at ancient time. This is the first report providing the result of sequence analysis to suggest possible existence of spliceosomal introns in ancestral bacterial genes. The methodology employed in this study may be used to identify more such evidence that would aid in settlement of the dispute between introns-early and introns-late theories.
Collapse
Affiliation(s)
- Yong Wang
- School of Food and Biological Engineering, Jiangsu University, Zhenjiang, China
| | - Xia-Fang Tao
- Institute of Life Sciences, Jiangsu University, Zhenjiang, China
| | - Zhi-Xi Su
- School of Life Sciences, Fudan University, Shanghai, China
| | - A-Ke Liu
- School of Life Sciences, Fudan University, Shanghai, China
| | - Tian-Lei Liu
- School of Food and Biological Engineering, Jiangsu University, Zhenjiang, China
| | - Ling Sun
- School of Food and Biological Engineering, Jiangsu University, Zhenjiang, China
| | - Qin Yao
- Institute of Life Sciences, Jiangsu University, Zhenjiang, China
| | - Ke-Ping Chen
- Institute of Life Sciences, Jiangsu University, Zhenjiang, China
| | - Xun Gu
- School of Life Sciences, Fudan University, Shanghai, China.; Department of Genetics, Development, and Cell Biology, Iowa State University, Ames, IA, USA
| |
Collapse
|
3
|
Aldehyde dehydrogenase (ALDH) in Alzheimer’s and Parkinson’s disease. J Neural Transm (Vienna) 2014; 123:83-90. [DOI: 10.1007/s00702-014-1320-1] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2014] [Accepted: 09/26/2014] [Indexed: 10/24/2022]
|
4
|
Koonin EV, Csuros M, Rogozin IB. Whence genes in pieces: reconstruction of the exon-intron gene structures of the last eukaryotic common ancestor and other ancestral eukaryotes. WILEY INTERDISCIPLINARY REVIEWS-RNA 2012; 4:93-105. [PMID: 23139082 DOI: 10.1002/wrna.1143] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Abstract
In eukaryotes, protein-coding sequences are interrupted by non-coding sequences known as introns. During mRNA maturation, introns are excised by the spliceosome and the coding regions, exons, are spliced to form the mature coding region. The intron densities widely differ between eukaryotic lineages, from 6 to 7 introns per kb of coding sequence in vertebrates, some invertebrates and green plants, to only a few introns across the entire genome in many unicellular eukaryotes. Evolutionary reconstructions using maximum likelihood methods suggest intron-rich ancestors for each major group of eukaryotes. For the last common ancestor of animals, the highest intron density of all extant and extinct eukaryotes was inferred, at 120-130% of the human intron density. Furthermore, an intron density within 53-74% of the human values was inferred for the last eukaryotic common ancestor. Accordingly, evolution of eukaryotic genes in all lines of descent involved primarily intron loss, with substantial gain only at the bases of several branches including plants and animals. These conclusions have substantial biological implications indicating that the common ancestor of all modern eukaryotes was a complex organism with a gene architecture resembling those in multicellular organisms. Alternative splicing most likely initially appeared as an inevitable result of splicing errors and only later was employed to generate structural and functional diversification of proteins.
Collapse
Affiliation(s)
- Eugene V Koonin
- National Center for Biotechnology Information NLM/NIH, Bethesda, MD, USA.
| | | | | |
Collapse
|
5
|
Bae KY, Kim SW, Shin HY, Kim JM, Shin IS, Kim SJ, Kim JK, Yoon JS. The acute effects of ethanol and acetaldehyde on physiological responses after ethanol ingestion in young healthy men with different ALDH2 genotypes. Clin Toxicol (Phila) 2012; 50:242-9. [PMID: 22455355 DOI: 10.3109/15563650.2012.672743] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
BACKGROUND This study investigated the acute effects of ethanol and acetaldehyde on physiological responses in healthy men according to ALDH2 genotype by measuring them directly in human blood. METHODS Twenty-four men, 12 with the ALDH2*1/*1 genotype and 12 with the ALDH2*1/*2 genotype, were selected. In a double-blind placebo-controlled crossover design, each subject was administered one of three doses of ethanol (0.25, 0.5 or 0.75 g/kg) or a placebo on four separate occasions. The blood ethanol concentration (BEC), blood acetaldehyde concentration (BAAC) and physiological responses including facial redness, pulse rate and systolic and diastolic blood pressures were assessed. RESULTS Significant differences were observed in the physiological responses between the ALDH2*1/*2 and ALDH2*1/*1 groups. Significant gene effects were observed for pulse rate and facial flushing (F-values =62.344; p values <0.001 and F-values =7.062; p values =0.010, respectively, by repeated-measures analysis of variance), which were significantly greater in subjects with the ALDH2*1/*2 genotype. In a linear regression analysis, BAAC significantly predicted increased facial redness at 30 minutes (adjusted R( 2 ): 0.209; p values <0.001) and pulse rate at 30, 60, 90 and 120 minutes (adjusted R( 2 ): 0.454, 0.490, 0.428 and 0.193, respectively; all p values <0.001), whereas BEC was not associated with any physiological response measure at any time. CONCLUSIONS Facial redness and pulse rate after ethanol ingestion were significantly higher in the ALDH2*1/*2 genotype, and were significantly associated with blood acetaldehyde concentrations. The acute effects of ethanol on the physiological responses were mediated mainly by acetaldehyde, the active metabolite of ethanol, rather than by ethanol itself.
Collapse
Affiliation(s)
- Kyung-Yeol Bae
- Department of Psychiatry, Chonnam National University Medical School, Gwangju, Republic of Korea
| | | | | | | | | | | | | | | |
Collapse
|
6
|
Rogozin IB, Carmel L, Csuros M, Koonin EV. Origin and evolution of spliceosomal introns. Biol Direct 2012; 7:11. [PMID: 22507701 PMCID: PMC3488318 DOI: 10.1186/1745-6150-7-11] [Citation(s) in RCA: 217] [Impact Index Per Article: 18.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2011] [Accepted: 03/15/2012] [Indexed: 12/31/2022] Open
Abstract
Evolution of exon-intron structure of eukaryotic genes has been a matter of long-standing, intensive debate. The introns-early concept, later rebranded ‘introns first’ held that protein-coding genes were interrupted by numerous introns even at the earliest stages of life's evolution and that introns played a major role in the origin of proteins by facilitating recombination of sequences coding for small protein/peptide modules. The introns-late concept held that introns emerged only in eukaryotes and new introns have been accumulating continuously throughout eukaryotic evolution. Analysis of orthologous genes from completely sequenced eukaryotic genomes revealed numerous shared intron positions in orthologous genes from animals and plants and even between animals, plants and protists, suggesting that many ancestral introns have persisted since the last eukaryotic common ancestor (LECA). Reconstructions of intron gain and loss using the growing collection of genomes of diverse eukaryotes and increasingly advanced probabilistic models convincingly show that the LECA and the ancestors of each eukaryotic supergroup had intron-rich genes, with intron densities comparable to those in the most intron-rich modern genomes such as those of vertebrates. The subsequent evolution in most lineages of eukaryotes involved primarily loss of introns, with only a few episodes of substantial intron gain that might have accompanied major evolutionary innovations such as the origin of metazoa. The original invasion of self-splicing Group II introns, presumably originating from the mitochondrial endosymbiont, into the genome of the emerging eukaryote might have been a key factor of eukaryogenesis that in particular triggered the origin of endomembranes and the nucleus. Conversely, splicing errors gave rise to alternative splicing, a major contribution to the biological complexity of multicellular eukaryotes. There is no indication that any prokaryote has ever possessed a spliceosome or introns in protein-coding genes, other than relatively rare mobile self-splicing introns. Thus, the introns-first scenario is not supported by any evidence but exon-intron structure of protein-coding genes appears to have evolved concomitantly with the eukaryotic cell, and introns were a major factor of evolution throughout the history of eukaryotes. This article was reviewed by I. King Jordan, Manuel Irimia (nominated by Anthony Poole), Tobias Mourier (nominated by Anthony Poole), and Fyodor Kondrashov. For the complete reports, see the Reviewers’ Reports section.
Collapse
Affiliation(s)
- Igor B Rogozin
- National Center for Biotechnology Information NLM/NIH, 8600 Rockville Pike, Bldg, 38A, Bethesda, MD 20894, USA
| | | | | | | |
Collapse
|
7
|
Oda T, Ohniwa RL, Suzuki Y, Denawa M, Kumeta M, Okamura H, Takeyasu K. Evolutionary dynamics of spliceosomal intron revealed by in silico analyses of the P-Type ATPase superfamily genes. Mol Biol Rep 2010; 38:2285-93. [DOI: 10.1007/s11033-010-0360-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2010] [Accepted: 10/21/2010] [Indexed: 10/18/2022]
|
8
|
Teixeira KN, Souza KN, Vidigal THDA, Brito CA, Santos AMC, Santoro MM. Size polymorphism in alleles of the myoglobin gene from biomphalaria mollusks. Genes (Basel) 2010; 1:357-70. [PMID: 24710092 PMCID: PMC3966218 DOI: 10.3390/genes1030357] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2010] [Revised: 10/07/2010] [Accepted: 10/14/2010] [Indexed: 11/17/2022] Open
Abstract
Introns are common among all eukaryotes, while only a limited number of introns are found in prokaryotes. Globin and globin-like proteins are widely distributed in nature, being found even in prokaryotes and a wide range of patterns of intron-exon have been reported in several eukaryotic globin genes. Globin genes in invertebrates show considerable variation in the positions of introns; globins can be found without introns, with only one intron or with three introns in different positions. In this work we analyzed the introns in the myoglobin gene from Biomphalaria glabrata, B. straminea and B. tenagophila. In the Biomphalaria genus, the myoglobin gene has three introns; these were amplified by PCR and analyzed by PCR-RFLP. Results showed that the size (number or nucleotides) and the nucleotide sequence of the coding gene of the myoglobin are variable in the three species. We observed the presence of size polymorphisms in intron 2 and 3; this characterizes a homozygous/heterozygous profile and it indicates the existence of two alleles which are different in size in each species of Biomphalaria. This polymorphism could be explored for specific identification of Biomphalaria individuals.
Collapse
Affiliation(s)
- Kádima N Teixeira
- Department of Biochemistry and Immunology, Federal University of Minas Gerais, Belo Horizonte/MG, 31270-901, Brazil.
| | - Karyne N Souza
- Department of Zoology, Federal University of Minas Gerais, Belo Horizonte/MG, 31270-901, Brazil.
| | - Teofânia H D A Vidigal
- Department of Zoology, Federal University of Minas Gerais, Belo Horizonte/MG, 31270-901, Brazil.
| | | | - Alexandre M C Santos
- Department of Biochemistry and Immunology, Federal University of Minas Gerais, Belo Horizonte/MG, 31270-901, Brazil.
| | - Marcelo M Santoro
- Department of Biochemistry and Immunology, Federal University of Minas Gerais, Belo Horizonte/MG, 31270-901, Brazil.
| |
Collapse
|
9
|
Abstract
Spliceosomal introns, a hallmark of eukaryotic gene organization, were an unexpected discovery. After three decades, crucial issues such as when and how introns first appeared in evolution remain unsettled. An issue yet to be answered is how intron positions arise de novo. Phylogenetic investigations concur that intron positions continue to emerge, at least in some lineages. Yet genomic scans for the sources of introns occupying new positions have been fruitless. Two alternative solutions to this paradox are: (i) formation of new intron positions halted before the recent past and (ii) it continues to occur, but through processes different from those generally assumed. One process generally dismissed is intron sliding--the relocation of a preexisting intron over short distances--because of supposed associated deleterious effects. The puzzle of intron gain arises owing to a pervasive operational definition of introns, which sees them as precisely demarcated segments of the genome separated from the neighboring nonintronic DNA by unmovable limits. Intron homology is defined as position homology. Recent studies of pre-mRNA processing indicate that this assumption needs to be revised. We incorporate recent advances on the evolutionarily frequent process of alternative splicing, by which exons of primary transcripts are spliced in different patterns, into a new model of intron sliding that accounts for the diversity of intron positions. We posit that intron positional diversity is driven by two overlapping processes: (i) background process of continuous relocation of preexisting introns by sliding and (ii) spurts of extensive gain/loss of new intron sequences.
Collapse
|
10
|
Abstract
DNA and amino acid sequences contain information about both the phylogenetic relationships among species and the evolutionary processes that caused the sequences to divergence. Mathematical and statistical methods try to detect this information to determine how and why DNA and protein molecules work the way they do. This chapter describes some of the models of evolution of biological sequences most widely used. It first focuses on single nucleotide/amino acid replacement rate models. Then it discusses the modelling of evolution at gene and protein module levels. The chapter concludes with speculations about the future use of molecular evolution studies using genomic and proteomic data.
Collapse
Affiliation(s)
- Pietro Liò
- Computer Laboratory, University of Cambridge, Cambridge, UK
| | | |
Collapse
|
11
|
Fry JD, Donlon K, Saweikis M. A worldwide polymorphism in aldehyde dehydrogenase in Drosophila melanogaster: evidence for selection mediated by dietary ethanol. Evolution 2007; 62:66-75. [PMID: 18070084 DOI: 10.1111/j.1558-5646.2007.00288.x] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Clinally varying traits in Drosophila melanogaster provide good opportunities for elucidating the genetic basis of adaptation. Resistance to ethanol, a natural component of D. melanogaster's breeding sites, increases with latitude on multiple continents, indicating that the trait is under selection. Although the well-studied Alcohol dehydrogenase (Adh) polymorphism makes a contribution to the clines, it accounts for only a small proportion of the phenotypic variation. We describe an amino acid replacement polymorphism in Aldehyde dehydrogenase (Aldh), the gene encoding the second enzyme in the ethanol degradation pathway, that shows hallmarks of also contributing to the clines. The derived Aldh allele, like the Adh-Fast allele, increases in frequency in laboratory populations selected for ethanol resistance, and increases in frequency with latitude in wild populations. Moreover, strains with the derived allele have significantly higher ALDH enzyme activity with acetaldehyde (the breakdown product of ethanol) as a substrate than strains with the ancestral allele. As is the case with the Adh-Fast allele, chromosomes with the derived Aldh allele show markedly reduced molecular variation in the vicinity of the replacement polymorphism compared to those with the ancestral allele, suggesting a single, relatively recent origin. Nonetheless, the Aldh polymorphism differs from the Adh polymorphism in that the ethanol-associated allele remains in relatively low frequency in most populations. We present evidence that this is likely to be the result of a trade-off in catalytic activity, with the advantage of the derived allele in acetaldehyde detoxification being offset by a disadvantage in detoxification of other aldehydes.
Collapse
Affiliation(s)
- James D Fry
- Department of Biology, University of Rochester, Rochester, NY 14627, USA.
| | | | | |
Collapse
|
12
|
Tanaka-Kunishima M, Ishida Y, Takahashi K, Honda M, Oonuma T. Ancient intron insertion sites and palindromic genomic duplication evolutionally shapes an elementally functioning membrane protein family. BMC Evol Biol 2007; 7:143. [PMID: 17708769 PMCID: PMC1999503 DOI: 10.1186/1471-2148-7-143] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2006] [Accepted: 08/20/2007] [Indexed: 12/28/2022] Open
Abstract
Background In spite of the recent accumulation of genomic data, the evolutionary pathway in the individual genes of present-day living taxa is still elusive for most genes. Among ion channels, inward K+ rectifier (IRK) channels are the fundamental and well-defined protein group. We analyzed the genomic structures of this group and compared them among a phylogenetically wide range with our sequenced Halocynthia roretzi, a tunicate, IRK genomic genes. Results A total of 131 IRK genomic genes were analyzed. The phylogenic trees of amino acid sequences revealed a clear diversification of deuterostomic IRKs from protostomic IRKs and suggested that the tunicate IRKs are possibly representatives of the descendants of ancestor forms of three major groups of IRKs in the vertebrate. However, the exon-intron structures of the tunicate IRK genomes showed considerable similarities to those of Caenorhabditis. In the vertebrate clade, the members in each major group increased at least four times those in the tunicate by various types of global gene duplication. The generation of some major groups was inferred to be due to anti-tandem (palindromic) duplication in early history. The intron insertion points greatly decreased during the evolution of the vertebrates, remaining as a unique conservation of an intron insertion site in the portion of protein-protein interaction within the coding regions of all vertebrate G-protein-activated IRK genes. Conclusion From the genomic survey of a family of IRK genes, it was suggested that the ancient intron insertion sites and the unique palindromic genomic duplication evolutionally shaped this membrane protein family.
Collapse
Affiliation(s)
- Motoko Tanaka-Kunishima
- Department of Medical Physiology, Meiji Pharmaceutical University, Noshio 2-522-1, Kiyose, Tokyo, MZC204-8588, Japan
| | - Yoshihiro Ishida
- Department of Medical Physiology, Meiji Pharmaceutical University, Noshio 2-522-1, Kiyose, Tokyo, MZC204-8588, Japan
| | - Kunitaro Takahashi
- Department of Medical Physiology, Meiji Pharmaceutical University, Noshio 2-522-1, Kiyose, Tokyo, MZC204-8588, Japan
| | - Motoo Honda
- Department of Medical Physiology, Meiji Pharmaceutical University, Noshio 2-522-1, Kiyose, Tokyo, MZC204-8588, Japan
| | - Takashi Oonuma
- Department of Medical Physiology, Meiji Pharmaceutical University, Noshio 2-522-1, Kiyose, Tokyo, MZC204-8588, Japan
| |
Collapse
|
13
|
Artamonova II, Gelfand MS. Comparative Genomics and Evolution of Alternative Splicing: The Pessimists' Science. Chem Rev 2007; 107:3407-30. [PMID: 17645315 DOI: 10.1021/cr068304c] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Irena I Artamonova
- Group of Bioinformatics, Vavilov Institute of General Genetics, RAS, Gubkina 3, Moscow 119991, Russia
| | | |
Collapse
|
14
|
De Kee DW, Gopalan V, Stoltzfus A. A Sequence-Based Model Accounts Largely for the Relationship of Intron Positions to Protein Structural Features. Mol Biol Evol 2007; 24:2158-68. [PMID: 17646255 DOI: 10.1093/molbev/msm151] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Claims of intron-structure correlations have played a major role in debates surrounding split gene origins. In the formative (as opposed to disruptive or "insertional") model of split gene origins, introns represent the scars of chimaeric gene assembly. When analyzed retrospectively, formative introns should tend to fall between modular units, if such units exist, or at least to exhibit a preference for sites favorable to chimaera formation. However, there is another possible source of preferences: under a disruptive model of split gene origins, fortuitous intron-structure correlations may arise because the gain of introns is biased with respect to flanking nucleotide sequences. To investigate the extent to which a sequence-biased intron gain model may account for the present-day distribution of introns, data on over 10,000 introns in eukaryotic protein-coding genes were integrated with structural data from a set of 1,851 nonredundant protein chains. The positions of introns with respect to secondary structures, solvent accessibility, and so-called "modules" were evaluated relative to the expectations of a null model, a disruptive model based on amino acid frequencies at splice junctions, and a formative model defined relative to these. The null model can be excluded for most structural features and is highly improbable when intron sites are grouped by reading frame phase. Phase-dependent correlations with secondary structure and side-chain surface accessibility are particularly strong. However, these phase-dependent correlations are explained largely by the sequence-based disruptive model.
Collapse
Affiliation(s)
- Danny W De Kee
- Center for Advanced Research in Biotechnology, Rockville, MD, USA
| | | | | |
Collapse
|
15
|
Abstract
Research into the origins of introns is at a critical juncture in the resolution of theories on the evolution of early life (which came first, RNA or DNA?), the identity of LUCA (the last universal common ancestor, was it prokaryotic- or eukaryotic-like?), and the significance of noncoding nucleotide variation. One early notion was that introns would have evolved as a component of an efficient mechanism for the origin of genes. But alternative theories emerged as well. From the debate between the "introns-early" and "introns-late" theories came the proposal that introns arose before the origin of genetically encoded proteins and DNA, and the more recent "introns-first" theory, which postulates the presence of introns at that early evolutionary stage from a reconstruction of the "RNA world." Here we review seminal and recent ideas about intron origins. Recent discoveries about the patterns and causes of intron evolution make this one of the most hotly debated and exciting topics in molecular evolutionary biology today.
Collapse
Affiliation(s)
- Francisco Rodríguez-Trelles
- Department of Ecology and Evolutionary Biology, University of California, Irvine, California 92697-2525, USA.
| | | | | |
Collapse
|
16
|
Cañestro C, Postlethwait JH, Gonzàlez-Duarte R, Albalat R. Is retinoic acid genetic machinery a chordate innovation? Evol Dev 2006; 8:394-406. [PMID: 16925675 DOI: 10.1111/j.1525-142x.2006.00113.x] [Citation(s) in RCA: 67] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Development of many chordate features depends on retinoic acid (RA). Because the action of RA during development seems to be restricted to chordates, it had been previously proposed that the "invention" of RA genetic machinery, including RA-binding nuclear hormone receptors (Rars), and the RA-synthesizing and RA-degrading enzymes Aldh1a (Raldh) and Cyp26, respectively, was an important step for the origin of developmental mechanisms leading to the chordate body plan. We tested this hypothesis by conducting an exhaustive survey of the RA machinery in genomic databases for twelve deuterostomes. We reconstructed the evolution of these genes in deuterostomes and showed for the first time that RA genetic machinery--that is Aldh1a, Cyp26, and Rar orthologs--is present in nonchordate deuterostomes. This finding implies that RA genetic machinery was already present during early deuterostome evolution, and therefore, is not a chordate innovation. This new evolutionary viewpoint argues against the hypothesis that the acquisition of gene families underlying RA metabolism and signaling was a key event for the origin of chordates. We propose a new hypothesis in which lineage-specific duplication and loss of RA machinery genes could be related to the morphological radiation of deuterostomes.
Collapse
Affiliation(s)
- Cristian Cañestro
- Institute of Neuroscience, University of Oregon, Eugene, OR 97403, USA
| | | | | | | |
Collapse
|
17
|
Koonin EV. The origin of introns and their role in eukaryogenesis: a compromise solution to the introns-early versus introns-late debate? Biol Direct 2006; 1:22. [PMID: 16907971 PMCID: PMC1570339 DOI: 10.1186/1745-6150-1-22] [Citation(s) in RCA: 187] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2006] [Accepted: 08/14/2006] [Indexed: 12/19/2022] Open
Abstract
BACKGROUND Ever since the discovery of 'genes in pieces' and mRNA splicing in eukaryotes, origin and evolution of spliceosomal introns have been considered within the conceptual framework of the 'introns early' versus 'introns late' debate. The 'introns early' hypothesis, which is closely linked to the so-called exon theory of gene evolution, posits that protein-coding genes were interrupted by numerous introns even at the earliest stages of life's evolution and that introns played a major role in the origin of proteins by facilitating recombination of sequences coding for small protein/peptide modules. Under this scenario, the absence of spliceosomal introns in prokaryotes is considered to be a result of "genome streamlining". The 'introns late' hypothesis counters that spliceosomal introns emerged only in eukaryotes, and moreover, have been inserted into protein-coding genes continuously throughout the evolution of eukaryotes. Beyond the formal dilemma, the more substantial side of this debate has to do with possible roles of introns in the evolution of eukaryotes. RESULTS I argue that several lines of evidence now suggest a coherent solution to the introns-early versus introns-late debate, and the emerging picture of intron evolution integrates aspects of both views although, formally, there seems to be no support for the original version of introns-early. Firstly, there is growing evidence that spliceosomal introns evolved from group II self-splicing introns which are present, usually, in small numbers, in many bacteria, and probably, moved into the evolving eukaryotic genome from the alpha-proteobacterial progenitor of the mitochondria. Secondly, the concept of a primordial pool of 'virus-like' genetic elements implies that self-splicing introns are among the most ancient genetic entities. Thirdly, reconstructions of the ancestral state of eukaryotic genes suggest that the last common ancestor of extant eukaryotes had an intron-rich genome. Thus, it appears that ancestors of spliceosomal introns, indeed, have existed since the earliest stages of life's evolution, in a formal agreement with the introns-early scenario. However, there is no evidence that these ancient introns ever became widespread before the emergence of eukaryotes, hence, the central tenet of introns-early, the role of introns in early evolution of proteins, has no support. However, the demonstration that numerous introns invaded eukaryotic genes at the outset of eukaryotic evolution and that subsequent intron gain has been limited in many eukaryotic lineages implicates introns as an ancestral feature of eukaryotic genomes and refutes radical versions of introns-late. Perhaps, most importantly, I argue that the intron invasion triggered other pivotal events of eukaryogenesis, including the emergence of the spliceosome, the nucleus, the linear chromosomes, the telomerase, and the ubiquitin signaling system. This concept of eukaryogenesis, in a sense, revives some tenets of the exon hypothesis, by assigning to introns crucial roles in eukaryotic evolutionary innovation. CONCLUSION The scenario of the origin and evolution of introns that is best compatible with the results of comparative genomics and theoretical considerations goes as follows: self-splicing introns since the earliest stages of life's evolution--numerous spliceosomal introns invading genes of the emerging eukaryote during eukaryogenesis--subsequent lineage-specific loss and gain of introns. The intron invasion, probably, spawned by the mitochondrial endosymbiont, might have critically contributed to the emergence of the principal features of the eukaryotic cell. This scenario combines aspects of the introns-early and introns-late views. REVIEWERS this article was reviewed by W. Ford Doolittle, James Darnell (nominated by W. Ford Doolittle), William Martin, and Anthony Poole.
Collapse
Affiliation(s)
- Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA.
| |
Collapse
|
18
|
Naito H, Yamanoshita O, Kamijima M, Katoh T, Matsunaga T, Lee CH, Kim H, Aoyama T, Gonzalez FJ, Nakajima T. Association of V227A PPARα polymorphism with altered serum biochemistry and alcohol drinking in Japanese men. Pharmacogenet Genomics 2006; 16:569-77. [PMID: 16847426 DOI: 10.1097/01.fpc.0000220565.90466.79] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
OBJECTIVES Peroxisome proliferator-activated receptor (PPAR) alpha plays a major role in alcoholic liver disease in rodents. The two-fold objective of our study was to determine the presence of PPARalpha polymorphisms and their frequencies in Japanese populations and then to evaluate the effects of any alleles on metabolic parameters and alcohol drinking. METHODS Analysis of coding SNP in PPARalpha was performed in 706 Japanese men; from these subjects 655 men were further studied after exclusion criteria were applied. RESULTS PPARalpha-V227A, which has not been reported in Europe and North America as a major polymorphism, was discovered with the frequency of 0.05. PPARalpha-L162V was found in European and North American populations, but not in Japanese, thus confirming the ethnic differences in PPARalpha allele frequencies. The A227 allele was associated with increased serum concentrations of gamma glutamyltranspeptidase. In non-drinkers, the total cholesterol (TC) levels were significantly lower in those having the PPARalpha-V227A polymorphism. In drinkers, however, it was comparable among V227A polymorphisms, and conversely higher in those having both A227 and aldehyde dehydrogenase 2 (ALDH2) variants when further divided according to the ALDH2 polymorphism. Significant interactions between PPARalpha-V227A polymorphism and drinking were also found for TC, triglyceride levels and AST/ALT ratios. These results suggest that the activity of the A227 allele without drinking may be higher than in wild-type allele, but its activity may become lower during drinking habits. CONCLUSION PPARalpha-V227A is a major polymorphism in the Japanese population, and its activity may be greater compared to wild-type, but decreased by alcohol drinking.
Collapse
Affiliation(s)
- Hisao Naito
- Department of Occupational and Environmental Health, Nagoya University Graduate School of Medicine, Nagoya, Japan
| | | | | | | | | | | | | | | | | | | |
Collapse
|
19
|
Abstract
The origins and importance of spliceosomal introns comprise one of the longest-abiding mysteries of molecular evolution. Considerable debate remains over several aspects of the evolution of spliceosomal introns, including the timing of intron origin and proliferation, the mechanisms by which introns are lost and gained, and the forces that have shaped intron evolution. Recent important progress has been made in each of these areas. Patterns of intron-position correspondence between widely diverged eukaryotic species have provided insights into the origins of the vast differences in intron number between eukaryotic species, and studies of specific cases of intron loss and gain have led to progress in understanding the underlying molecular mechanisms and the forces that control intron evolution.
Collapse
Affiliation(s)
- Scott William Roy
- Allan Wilson Centre for Molecular Ecology and Evolution, Massey University, Palmerston North, New Zealand.
| | | |
Collapse
|
20
|
Kim JS, Kim YJ, Kim TY, Song JY, Cho YH, Park YC, Chung HW. Association of ALDH2 polymorphism with sensitivity to acetaldehyde-induced micronuclei and facial flushing after alcohol intake. Toxicology 2005; 210:169-74. [PMID: 15840430 DOI: 10.1016/j.tox.2005.01.016] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2004] [Revised: 01/22/2005] [Accepted: 01/30/2005] [Indexed: 01/08/2023]
Abstract
To investigate whether sensitivity to the induction of micronuclei by acetaldehyde is associated with genetic polymorphisms of the aldehyde dehydrogenase-2 (ALDH2) gene, cytokinesis-block micronucleus (CBMN) assays were performed on peripheral lymphocytes from 47 healthy human subjects exposed to acetaldehyde in vitro. Facial flushing following alcohol intake was analyzed to determine if it was correlated with ALDH2 gene polymorphisms. The frequencies of the ALDH2 genotypes ALDH2(1)/ALDH2(1), ALDH2(1)/ALDH2(2), and ALDH2(2)/ALDH2(2) were 66.0, 27.7, and 6.4%, respectively, in the 47 subjects. Therefore, 34% of the studied subjects carried the mutant allele ALDH2(2), which is associated with the lack of enzyme activity. The frequency of micronuclei induced by acetaldehyde increased in a dose-dependent manner with the largest increase seen in subjects that were homozygous for the ALDH2(2) allele. A significant association was observed between the ALDH2 genotype and alcohol-induced facial flushing. Average alcohol consumption of the study subjects was also associated with the ALDH2 genotype. The frequency of heavy drinking was significantly higher among subjects with the ALDH2(1)/ALDH2(1) genotype than among subjects with the ALDH2(2) allele (ALDH2(1)/ALDH2(2) and ALDH2(2)/ALDH2(2) genotypes). Alcohol-induced facial flushing was also associated with an increased frequency of micronuclei in lymphocytes treated with acetaldehyde. The results suggest that the ALDH2 genotype is significantly associated with acetaldehyde-induced micronuclei and alcohol-induced facial flushing.
Collapse
Affiliation(s)
- Jin Sik Kim
- School of Public Health and Institute of Health and Environment, Seoul National University, 28 Yunkeun-dong Chongno-ku, Seoul 110-460, Korea
| | | | | | | | | | | | | |
Collapse
|
21
|
Wang C, Typas MA, Butt TM. Phylogenetic and exon-intron structure analysis of fungal subtilisins: support for a mixed model of intron evolution. J Mol Evol 2005; 60:238-46. [PMID: 15785852 DOI: 10.1007/s00239-004-0147-z] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2004] [Accepted: 07/18/2004] [Indexed: 10/25/2022]
Abstract
Phylogenetic and exon-intron structure analyses of intra- and interspecific fungal subtilisins in this study provided support for a mixed model of intron evolution: a synthetic theory of introns-early and introns-late speculations. Intraspecifically, there were three phase zero introns in Pr1A and its introns 1 and 2 located at the highly conserved positions were phylogentically congruent with coding region, which is in favor of the view of introns-early speculation, while intron 3 had two different sizes and was evolutionarily incongruent with coding region, the evidence for introns-late speculation. Noticeably, the subtilisin Pr1J gene from different strains of M. ansiopliae contained different number of introns, the strong evidence in support of introns-late theory. Interspecifically, phylogenetic analysis of 60 retrievable fungal subtilisins provided a clear relationship between amino acid sequence and gene exon-intron structure that the homogeneous sequences usually have a similar exon-infron structure. There were 10 intron positions inserted by highly biased phase zero introns across examined fungal subtilisin genes, half of these positions were highly conserved, while the others were species-specific, appearing to be of recent origins due to intron insertion, in favor of the introns-late theory. High conservations of positions 1 and 2 inserted by the high percentage of phase zero introns as well as the evidence of phylogenetic congruence between the evolutionary histories of intron sequences and coding region suggested that the introns at these two positions were primordial.
Collapse
Affiliation(s)
- Chengshu Wang
- School of Biological Sciences, University of Wales Swansea, Swansea SA2 8PP, UK.
| | | | | |
Collapse
|
22
|
|
23
|
Sok AJ, Czajewska K, Ozyhar A, Kochman M. The structure of the juvenile hormone binding protein gene from Galleria mellonella. Biol Chem 2005; 386:1-10. [PMID: 15843141 DOI: 10.1515/bc.2005.001] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
AbstractJuvenile hormone (JH) and ecdysone are the key hormones controlling insect growth and development. The juvenile hormone binding protein (JHBP) is the first member in the array of proteins participating in JH signal transmission. In the present report a wholejhbpgene sequence (9790 bp) is described. Thejhbpgene contains four introns (A–D). All the introns have common flanking sequences: GT at the 5′ and AG at the 3′ end. The first intron is in phase 1, the second in phase 2, and the third and fourth in phase 1. An analysis of these sequences suggests that U2-class spliceosomes are involved in intron excision from pre-mRNA. Several horizontally transmitted elements from other genes were found in the introns. Alljhbpexons are positioned in local AT-reach regions of the gene. A search for core promoter regulatory elements revealed that the TATA box starts 29 bp preceding the start of transcription; the sequence TCAGTA representing a putative initiator sequence (Inr) starts at position +14. Eight characteristic sequences for bindingBroad-Complexgene products, which coordinate the ecdysone temporal response, are present in the non-coding sequence of thejhbpgene. An analysis of exon locations and intron phases indicates thatjhbpgene organization is related to theretinol binding proteingene, a member of the lipocalin family.
Collapse
Affiliation(s)
- Agnieszka J Sok
- Division of Biochemistry, Institute of Organic Chemistry, Biochemistry and Biotechnology, Wrocław University of Technology, Wybrzeze Wyspiańskiego 27, 50-370 Wrocław, Poland
| | | | | | | |
Collapse
|
24
|
Krauss V, Pecyna M, Kurz K, Sass H. Phylogenetic Mapping of Intron Positions: A Case Study of Translation Initiation Factor eIF2γ. Mol Biol Evol 2004; 22:74-84. [PMID: 15356279 DOI: 10.1093/molbev/msh255] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Eukaryotic translation initiation factor 2 (eIF2) is a G protein that delivers the methionyl initiator tRNA to the small ribosomal subunit and releases it upon GTP hydrolysis after the recognition of the initiation codon. eIF2 is composed of three subunits, alpha, beta, and gamma. Subunit gamma shows the strongest conservation, and it confers both tRNA and GTP/GDP binding. Using intron positioning and protein sequence alignment, here we show that eIF2gamma is a suitable phylogenetic marker for eukaryotes. We determined or completed the sequences of 13 arthropod eIF2gamma genes. Analyzing the phylogenetic distribution of 52 different intron positions in 55 distantly related eIF2gamma genes, we identified ancient ones and shared derived introns in our data set. Obviously, intron positioning in eIF2gamma is evolutionarily conserved. However, there were episodes of complete and partial intron losses followed by intron gains. We identified 17 clusters of intron positions based on their distribution. The evolution of these clusters appears to be connected with preferred exon length and can be used to estimate the relative timing of intron gain because nearby precursor introns had to be erased from the gene before the new introns could be inserted. Moreover, we identified a putative case of intron sliding that constitutes a synapomorphic character state supporting monophyly of Coleoptera, Lepidoptera, and Diptera excluding Hymenoptera. We also performed tree reconstructions using the eIF2gamma protein sequences and intron positioning as phylogenetic information. Our results support the monophyly of Viridoplantae, Ascomycota, Homobasidiomyceta, and Apicomplexa.
Collapse
Affiliation(s)
- Veiko Krauss
- Department of Genetics, University of Leipzig, Leipzig, Germany.
| | | | | | | |
Collapse
|
25
|
Babenko VN, Rogozin IB, Mekhedov SL, Koonin EV. Prevalence of intron gain over intron loss in the evolution of paralogous gene families. Nucleic Acids Res 2004; 32:3724-33. [PMID: 15254274 PMCID: PMC484173 DOI: 10.1093/nar/gkh686] [Citation(s) in RCA: 105] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The mechanisms and evolutionary dynamics of intron insertion and loss in eukaryotic genes remain poorly understood. Reconstruction of parsimonious scenarios of gene structure evolution in paralogous gene families in animals and plants revealed numerous gains and losses of introns. In all analyzed lineages, the number of acquired new introns was substantially greater than the number of lost ancestral introns. This trend held even for lineages in which vertical evolution of genes involved more intron losses than gains, suggesting that gene duplication boosts intron insertion. However, dating gene duplications and the associated intron gains and losses based on the molecular clock assumption showed that very few, if any, introns were gained during the last approximately 100 million years of animal and plant evolution, in agreement with previous conclusions reached through analysis of orthologous gene sets. These results are generally compatible with the emerging notion of intensive insertion and loss of introns during transitional epochs in contrast to the relative quiet of the intervening evolutionary spans.
Collapse
Affiliation(s)
- Vladimir N Babenko
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, 8600 Rockville Pike, Bldg 38A, Bethesda, MD 20894, USA
| | | | | | | |
Collapse
|
26
|
Kanzok SM, Hoa NT, Bonizzoni M, Luna C, Huang Y, Malacrida AR, Zheng L. Origin of Toll-like receptor-mediated innate immunity. J Mol Evol 2004; 58:442-8. [PMID: 15114422 DOI: 10.1007/s00239-003-2565-8] [Citation(s) in RCA: 38] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2003] [Accepted: 10/29/2003] [Indexed: 10/26/2022]
Abstract
Toll-related receptors (TLR) have been found in four animal phyla: Nematoda, Arthropoda, Echinodermata, and Chordata. No TLR has been identified thus far in acoelomates. TLR genes play a pivotal role in the innate immunity in both fruit fly and mammals. The prevailing view is that TLR-mediated immunity is ancient. The two pseudocoelomate TLRs, one each from Caenorhabditis elegans and Strongyloides stercoralis, were distinct from the coelomate ones. Further, the only TLR gene (Tol-1) in Ca. elegans did not appear to play a role in innate immunity. We argue that TLR-mediated innate immunity developed only in the coelomates, after they split from pseudocoelomates and acoelomates. We hypothesize that the function of TLR-mediated immunity is to prevent microbial infection in the body cavity present only in the coelomates. Phylogenetic analysis showed that almost all arthropod TLRs form a separate cluster from the mammalian counterparts. We further hypothesize that TLR-mediated immunity developed independently in the protostomia and deuterostomia coelomates.
Collapse
Affiliation(s)
- Stefan M Kanzok
- Yale University School of Medicine, Epidemiology and Public Health, 60 College Street, New Haven, CT 06520, USA
| | | | | | | | | | | | | |
Collapse
|
27
|
Abstract
For nearly 15 years, it has been widely believed that many introns were recently acquired by the genes of multicellular organisms. However, the mechanism of acquisition has yet to be described for a single animal intron. Here, we report a large-scale computational analysis of the human, Drosophila melanogaster, Caenorhabditis elegans, and Arabidopsis thaliana genomes. We divided 147,796 human intron sequences into batches of similar lengths and aligned them with each other. Different types of homologies between introns were found, but none showed evidence of simple intron transposition. Also, 106,902 plant, 39,624 Drosophila, and 6021 C. elegans introns were examined. No single case of homologous introns in nonhomologous genes was detected. Thus, we found no example of transposition of introns in the last 50 million years in humans, in 3 million years in Drosophila and C. elegans, or in 5 million years in Arabidopsis. Either new introns do not arise via transposition of other introns or intron transposition must have occurred so early in evolution that all traces of homology have been lost.
Collapse
Affiliation(s)
- Alexei Fedorov
- Department of Medicine, Medical College of Ohio, Toledo, Ohio 43614, USA.
| | | | | | | |
Collapse
|
28
|
Rogozin IB, Wolf YI, Sorokin AV, Mirkin BG, Koonin EV. Remarkable interkingdom conservation of intron positions and massive, lineage-specific intron loss and gain in eukaryotic evolution. Curr Biol 2003; 13:1512-7. [PMID: 12956953 DOI: 10.1016/s0960-9822(03)00558-x] [Citation(s) in RCA: 301] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
Sequencing of eukaryotic genomes allows one to address major evolutionary problems, such as the evolution of gene structure. We compared the intron positions in 684 orthologous gene sets from 8 complete genomes of animals, plants, fungi, and protists and constructed parsimonious scenarios of evolution of the exon-intron structure for the respective genes. Approximately one-third of the introns in the malaria parasite Plasmodium falciparum are shared with at least one crown group eukaryote; this number indicates that these introns have been conserved through >1.5 billion years of evolution that separate Plasmodium from the crown group. Paradoxically, humans share many more introns with the plant Arabidopsis thaliana than with the fly or nematode. The inferred evolutionary scenario holds that the common ancestor of Plasmodium and the crown group and, especially, the common ancestor of animals, plants, and fungi had numerous introns. Most of these ancestral introns, which are retained in the genomes of vertebrates and plants, have been lost in fungi, nematodes, arthropods, and probably Plasmodium. In addition, numerous introns have been inserted into vertebrate and plant genes, whereas, in other lineages, intron gain was much less prominent.
Collapse
Affiliation(s)
- Igor B Rogozin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | | | | | | | | |
Collapse
|
29
|
Tarrio R, Rodríguez-Trelles F, Ayala FJ. A new Drosophila spliceosomal intron position is common in plants. Proc Natl Acad Sci U S A 2003; 100:6580-3. [PMID: 12750476 PMCID: PMC164489 DOI: 10.1073/pnas.0731952100] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The 25-year-old debate about the origin of introns between proponents of "introns early" and "introns late" has yielded significant advances, yet important questions remain to be ascertained. One question concerns the density of introns in the last common ancestor of the three multicellular kingdoms. Approaches to this issue thus far have relied on counts of the numbers of identical intron positions across present-day taxa on the assumption that the introns at those sites are orthologous. However, dismissing parallel intron gain for those sites may be unwarranted, because various factors can potentially constrain the site of intron insertion. Demonstrating parallel intron gain is severely handicapped, because intron sequences often evolve exceedingly fast and intron phylogenetic distributions are usually ambiguous, such that alternative loss and gain scenarios cannot be clearly distinguished. We have identified an intron position that was gained independently in animals and plants in the xanthine dehydrogenase gene. The extremely disjointed phylogenetic distribution of the intron argues strongly for separate gain rather than recurrent loss. If the observed phylogenetic pattern had resulted from recurrent loss, all observational support previously gathered for the introns-late theory of intron origins based on the phylogenetic distribution of introns would be invalidated.
Collapse
Affiliation(s)
- Rosa Tarrio
- Department of Ecology and Evolutionary Biology, University of California, Irvine 92697-2525, USA
| | | | | |
Collapse
|
30
|
Bellingham J, Wells DJ, Foster RG. In silico characterisation and chromosomal localisation of human RRH (peropsin)--implications for opsin evolution. BMC Genomics 2003; 4:3. [PMID: 12542842 PMCID: PMC149353 DOI: 10.1186/1471-2164-4-3] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2002] [Accepted: 01/24/2003] [Indexed: 11/29/2022] Open
Abstract
BACKGROUND The vertebrate opsins are proteins which utilise a retinaldehyde chromophore in their photosensory or photoisomerase roles in the visual/irradiance detection cycle. The majority of the opsins, such as rod and cone opsins, have a very highly conserved gene structure suggesting a common lineage. Exceptions to this are RGR-opsin and melanopsin, whose genes have very different intron insertion positions. The gene structure of another opsin, peropsin (retinal pigment epithelium-derived rhodopsin homologue, RRH) is unknown. RESULTS By in silico analysis of the GenBank database we have determined that the human RRH comprises 7 exons spanning approximately 16.5 kb and is localised to chromosome 4q25 in the following gene sequence: cen-EGF-RRH-IF-qter - a position that excludes this gene as a candidate for the RP29 autosomal recessive retinitis pigmentosa locus. A comparison of opsin gene structures reveals that RRH and RGR share two common intron (introns 1 and 4) insertion positions which may reflect a shared ancestral gene. CONCLUSION The opsins comprise a diverse group of genes which appear to have arisen from three different lineages. These lineages comprise the "classical opsin superfamily" which includes the rod and cone opsins, pinopsin, VA-opsin, parapinopsin and encephalopsin; the RRH and RGR group; and the melanopsin line. A common lineage for RRH and RGR, together with their sites of expression in the RPE, indicates that peropsin may act as a retinal isomerase.
Collapse
Affiliation(s)
- James Bellingham
- Gene Targeting Unit, Department of Neuromuscular Diseases, Division of Neuroscience and Psychological Medicine, Faculty of Medicine, Imperial College London, Charing Cross Hospital, St. Dunstan's Road, London, W6 8RP, UK
| | - Dominic J Wells
- Gene Targeting Unit, Department of Neuromuscular Diseases, Division of Neuroscience and Psychological Medicine, Faculty of Medicine, Imperial College London, Charing Cross Hospital, St. Dunstan's Road, London, W6 8RP, UK
| | - Russell G Foster
- Department of Integrative and Molecular Neuroscience, Division of Neuroscience and Psychological Medicine, Faculty of Medicine, Imperial College London, Charing Cross Hospital, St. Dunstan's Road, London, W6 8RP, UK
| |
Collapse
|
31
|
Lecharny A, Boudet N, Gy I, Aubourg S, Kreis M. Introns in, introns out in plant gene families: a genomic approach of the dynamics of gene structure. JOURNAL OF STRUCTURAL AND FUNCTIONAL GENOMICS 2003. [PMID: 12836690 DOI: 10.1023/a:1022614001371] [Citation(s) in RCA: 49] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/09/2023]
Abstract
Gene duplication is considered to be a source of genetic information for the creation of new functions. The Arabidopsis thaliana genome sequence revealed that a majority of plant genes belong to gene families. Regarding the problem of genes involved in the genesis of novel organs or functions during evolution, the reconstitution of the evolutionary history of gene families is of critical importance. A comparison of the intron/exon gene structure may provide clues for the understanding of the evolutionary mechanisms underlying the genesis of gene families. An extensive study of A. thaliana genome showed that families of duplicated genes may be organized according to the number and/or density of intron and the diversity in gene structure. In this paper, we propose a genomic classification of several A. thaliana gene families based on introns in an evolutionary perspective.
Collapse
Affiliation(s)
- Alain Lecharny
- Institut de Biotechnologie des Plantes, Unité Mixte de Recherche-Centre National de la Recherche Scientifique 8618, Université de Paris-Sud, Bât. 630, F-91405 Orsay Cedex, France.
| | | | | | | | | |
Collapse
|
32
|
Rogozin IB, Babenko VN, Fedorova ND, Jackson JD, Jacobs AR, Krylov DM, Makarova KS, Mazumder R, Mekhedov SL, Mirkin BG, Nikolskaya AN, Rao BS, Smirnov S, Sorokin AV, Sverdlov AV, Vasudevan S, Wolf YI, Yin JJ, Natale DA, Koonin EV. Evolution of eukaryotic gene repertoire and gene structure: discovering the unexpected dynamics of genome evolution. COLD SPRING HARBOR SYMPOSIA ON QUANTITATIVE BIOLOGY 2003; 68:293-301. [PMID: 15338629 DOI: 10.1101/sqb.2003.68.293] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/30/2023]
Affiliation(s)
- I B Rogozin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
33
|
Abstract
Debate over the mechanisms responsible for the phylogenetic and genomic distribution of introns has proceeded largely without consideration of the population-genetic forces influencing the establishment and retention of novel genetic elements. However, a simple model incorporating random genetic drift and weak mutation pressure against intron-containing alleles yields predictions consistent with a diversity of observations: (i) the rarity of introns in unicellular organisms with large population sizes, and their expansion after the origin of multicellular organisms with reduced population sizes; (ii) the relationship between intron abundance and the stringency of splice-site requirements; (iii) the tendency for introns to be more numerous and longer in regions of low recombination; and (iv) the bias toward phase-0 introns. This study provides a second example of a mechanism whereby genomic complexity originates passively as a "pathological" response to small population size, and raises difficulties for the idea that ancient introns played a major role in the origin of genes by exon shuffling.
Collapse
Affiliation(s)
- Michael Lynch
- Department of Biology, Indiana University, Bloomington, IN 47405, USA.
| |
Collapse
|
34
|
Boudet N, Aubourg S, Toffano-Nioche C, Kreis M, Lecharny A. Evolution of intron/exon structure of DEAD helicase family genes in Arabidopsis, Caenorhabditis, and Drosophila. Genome Res 2001; 11:2101-14. [PMID: 11731501 PMCID: PMC311229 DOI: 10.1101/gr.200801] [Citation(s) in RCA: 110] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
The DEAD box RNA helicase (RH) proteins are homologs involved in diverse cellular functions in all of the organisms from prokaryotes to eukaryotes. Nevertheless, there is a lack of conservation in the splicing pattern in the 53 Arabidopsis thaliana (AtRHs), the 32 Caenorhabditis elegans (CeRHs) and the 29 Drosophila melanogaster (DmRHs) genes. Of the 153 different observed intron positions, 4 are conserved between AtRHs, CeRHs, and DmRHs, and one position is also found in RHs from yeast and human. Of the 27 different AtRH structures with introns, 20 have at least one predicted ancient intron in the regions coding for the catalytic domain. In all of the organisms examined, we found at least one gene with most of its intron predicted to be ancient. In A. thaliana, the large diversity in RH structures suggests that duplications of the ancestral RH were followed by a high number of intron deletions and additions. The very high bias toward phase 0 introns is in favor of intron addition, preferentially in phase 0. Results from this comparative study of the same gene family in a plant and in two animals are discussed in terms of the general mechanisms of gene family evolution.
Collapse
Affiliation(s)
- N Boudet
- Institut de Biotechnologie des Plantes, Unité Mixte de Recherche-Centre National Recherche Scientifique 8618, Université de Paris-Sud, Bât. 630, F-91405 Orsay Cedex, France
| | | | | | | | | |
Collapse
|
35
|
Fedorov A, Cao X, Saxonov S, de Souza SJ, Roy SW, Gilbert W. Intron distribution difference for 276 ancient and 131 modern genes suggests the existence of ancient introns. Proc Natl Acad Sci U S A 2001; 98:13177-82. [PMID: 11687643 PMCID: PMC60844 DOI: 10.1073/pnas.231491498] [Citation(s) in RCA: 36] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
o introns delineate elements of protein tertiary structure? This issue is crucial to the debate about the role and origin of introns. We present an analysis of the full set of proteins with known three-dimensional structures that have homologs with intron positions recorded in GenBank. A computer program was generated that maps on a reference sequence the positions of all introns in homologous genes. We have applied this program to a set of 665 nonredundant protein sequences with defined three-dimensional structures in the Protein Data Bank (PDB), which yielded 8,217 introns in 407 proteins. For the subset of proteins corresponding to ancient conserved regions (ACR), we find that there is a correlation of phase-zero introns with the boundary regions of modules and no correlation for the phase-one and phase-two positions. However, for a subset of proteins without prokaryotic counterparts (131 non-ACR proteins), a set of presumably modern proteins (or proteins that have diverged extremely far from any ancestral form), we do not find any correlation of phase-zero intron positions with three-dimensional structure. Furthermore, we find an anticorrelation of phase-one intron positions with module boundaries: they actually have a preference for the interior of modules. This finding is explicable as a preference for phase-one introns to lie in glycines, between G/G sequences, the preference for glycines being anticorrelated with the three-dimensional modules. We interpret this anticorrelation as a sign that a number of phase-one introns, and hence many modern introns, have been inserted into G/G "protosplice" sequences.
Collapse
Affiliation(s)
- A Fedorov
- Department of Molecular and Cellular Biology, Harvard University, 16 Divinity Avenue, Cambridge, MA 02138, USA
| | | | | | | | | | | |
Collapse
|
36
|
Pineda AO, Ellington WR. Organization of the gene for an invertebrate mitochondrial creatine kinase: comparisons with genes of higher forms and correlation of exon boundaries with functional domains. Gene 2001; 265:115-21. [PMID: 11255014 DOI: 10.1016/s0378-1119(01)00352-3] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
Abstract
Two major gene duplication events are thought to have taken place in the evolution of creatine kinases (CK) in the vertebrates - (1) the formation of distinct mitochondrial (MiCK) and cytoplasmic forms from the primordial gene and (2) subsequent formation of the sarcomeric (sar-) and ubiquitous (ubi-) isoforms of octameric MiCK and muscle (M) and brain (B) isoforms of dimeric, cytoplasmic CK. The genes of these two CK clades reflect a distant divergence as sar- and ubiMiCK genes consistently have nine protein-coding exons while M- and B-CK genes have seven protein-coding exons; these genes share only one common exon. CKs are also widely distributed in the invertebrates and it has recently been shown that MiCKs evolved well before the divergence of the major metazoan groups. In the present communication, we report the structure and topology of the gene for MiCK from the protostome marine worm Chaetopterus variopedatus. The protein-coding region of the gene for this primitive MiCK spans over 10 kb and consists of eight exons, the last five (E4-E8) have identical boundaries to the corresponding exons of sar- and ubiMiCK genes. Exon-3 of the C. variopedatus MiCK gene consists of the corresponding E3 and E4 of the vertebrate MiCKs with no intervening intron. E1 is longer and E2 is shorter in the polychaete MiCK gene than the counterpart sarcomeric and ubiquitous genes. The insertion of the intron in C. variopedatus E3 creating the two exons as well as the rearrangement of the intron between E1 and E2 must have occurred prior to or coincident with the duplication event creating the two vertebrate mitochondrial isoforms. Sarcomeric and ubiMiCKs display substantial differences from their invertebrate MiCK counterparts in properties relating to octamer stability and membrane binding. The evolutionary changes in gene topology may be a component of this functional progression.
Collapse
Affiliation(s)
- A O Pineda
- Department of Biological Science and Institute of Molecular Biophysics, Florida State University, 32306-4370, Tallahassee, FL, USA
| | | |
Collapse
|
37
|
Abstract
The introns-early view has been challenged for several genes; prominent instances are triose phosphate isomerase (TPI), aldolase, pyruvate kinase (PK), alcohol dehydrogenase (ADH), glyceraldehyde-3-phosphate dehydrogenase (GAPDH) and myosin heavy chain. While some of their introns appear to be phylogenetically ancient and/or to delineate exons corresponding to protein modules, a considerable number seemingly do not. But it is argued here that many of these anomalous introns are periodic, that is, relics of internal sequence repetitions within the ancestral gene. Some of these periodic-intron patterns are shared between related genes, as in the alphabeta -barrels of TPI, aldolase and PK, or the Rossmann nucleotide-binding domain common to PK, ADH and GAPDH. This is further evidence for the ancestral status of these introns. The myosin heavy chain C-terminal rod region is paradoxical in that its sequence is clearly periodic but its intron placements are not; however, they exhibit a remarkable coherence of intron translational phases, suggesting that these introns may also have originally had a periodic arrangement now obscured by intron slipping.
Collapse
Affiliation(s)
- D Elder
- School of Pharmacy, University of South Australia, North Terrace, Adelaide, SA, 5000, Australia
| |
Collapse
|
38
|
Bhattacharya D, Lutzoni F, Reeb V, Simon D, Nason J, Fernandez F. Widespread occurrence of spliceosomal introns in the rDNA genes of ascomycetes. Mol Biol Evol 2000; 17:1971-84. [PMID: 11110913 DOI: 10.1093/oxfordjournals.molbev.a026298] [Citation(s) in RCA: 85] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Spliceosomal (pre-mRNA) introns have previously been found in eukaryotic protein-coding genes, in the small nuclear RNAs of some fungi, and in the small- and large-subunit ribosomal DNA genes of a limited number of ascomycetes. How the majority of these introns originate remains an open question because few proven cases of recent and pervasive intron origin have been documented. We report here the widespread occurrence of spliceosomal introns (69 introns at 27 different sites) in the small- and large-subunit nuclear-encoded rDNA of lichen-forming and free-living members of the Ascomycota. Our analyses suggest that these spliceosomal introns are of relatively recent origin, i.e., within the Euascomycetes, and have arisen through aberrant reverse-splicing (in trans) of free pre-mRNA introns into rRNAs. The spliceosome itself, and not an external agent (e.g., transposable elements, group II introns), may have given rise to these introns. A nonrandom sequence pattern was found at sites flanking the rRNA spliceosomal introns. This pattern (AG-intron-G) closely resembles the proto-splice site (MAG-intron-R) postulated for intron insertions in pre-mRNA genes. The clustered positions of spliceosomal introns on secondary structures suggest that particular rRNA regions are preferred sites for insertion through reverse-splicing.
Collapse
Affiliation(s)
- D Bhattacharya
- Department of Biological Sciences, University of Iowa, Iowa City, 52242-1324, USA.
| | | | | | | | | | | |
Collapse
|
39
|
Affiliation(s)
- I B Rogozin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, 8600 Rockville Pike, Bethesda, MD 20894, USA.
| | | | | |
Collapse
|
40
|
Saxonov S, Daizadeh I, Fedorov A, Gilbert W. EID: the Exon-Intron Database-an exhaustive database of protein-coding intron-containing genes. Nucleic Acids Res 2000; 28:185-90. [PMID: 10592221 PMCID: PMC102483 DOI: 10.1093/nar/28.1.185] [Citation(s) in RCA: 100] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/1999] [Revised: 10/25/1999] [Accepted: 10/25/1999] [Indexed: 11/13/2022] Open
Abstract
To aid studies of molecular evolution and to assist in gene prediction research, we have constructed an Exon-Intron Database (EID) in FASTA format. Currently, the database is derived from GenBank release 112, and it contains 51 289 protein-coding genes (287 209 exons) that harbor introns, along with extensive descriptions of each gene and its DNA and protein sequences, as well as splice motif information. There is 17% redundancy inherited from GenBank-a purge at the 99% identity level reduced the database to 42 460 genes (243 589 exons). We have created subdatabases of genes whose intron positions have been experimentally determined. One such database, constructed by comparing genomic and mRNA sequences, contains 11 242 genes (62 474 exons). A larger database of 22 196 genes (105 595 exons) was constructed by selecting on keywords to eliminate computer-predicted genes. By examining the two nucleotides adjacent to the intron boundary, we infer that there is a 2% rate of errors or other deviations from the standard GTellipsisAG motif in nuclear genes. This criterion can be used to eliminate 4921 genes from the overall database. Various tools are provided to enable generation of user-specific subsets of the EID. The EID distribution can be obtained from http://mcb.harvard.edu/gilbert/EID
Collapse
Affiliation(s)
- S Saxonov
- Department of Molecular Biology, Harvard University, Cambridge, MA 02138, USA
| | | | | | | |
Collapse
|
41
|
Samuelson J. Why metronidazole is active against both bacteria and parasites. Antimicrob Agents Chemother 1999; 43:1533-41. [PMID: 10390199 PMCID: PMC89320 DOI: 10.1128/aac.43.7.1533] [Citation(s) in RCA: 152] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Affiliation(s)
- J Samuelson
- Department of Immunology and Infectious Diseases, Harvard School of Public Health, Boston, Massachusetts 02115, USA.
| |
Collapse
|
42
|
Navarro-Aviño JP, Prasad R, Miralles VJ, Benito RM, Serrano R. A proposal for nomenclature of aldehyde dehydrogenases inSaccharomyces cerevisiae and characterization of the stress-inducibleALD2 andALD3 genes. Yeast 1999. [DOI: 10.1002/(sici)1097-0061(199907)15:10a%3c829::aid-yea423%3e3.0.co;2-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022] Open
|
43
|
Navarro-Aviño JP, Prasad R, Miralles VJ, Benito RM, Serrano R. A proposal for nomenclature of aldehyde dehydrogenases in Saccharomyces cerevisiae and characterization of the stress-inducible ALD2 and ALD3 genes. Yeast 1999; 15:829-42. [PMID: 10407263 DOI: 10.1002/(sici)1097-0061(199907)15:10a<829::aid-yea423>3.0.co;2-9] [Citation(s) in RCA: 100] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
The complete sequencing of the genome of Saccharomyces cerevisiae indicated that this organism contains five genes encoding aldehyde dehydrogenases. YOR374w and YER073w correspond to the mitochondrial isoforms and we propose as gene names ALD4 and ALD5, respectively. YPL061w has been described as the cytoplasmic constitutive isoform and named ALD6. We characterize here the tandem-repeated ORFs YMR170c and YMR169c as the cytoplasmic stress-inducible isoforms, with gene names ALD2 and ALD3, respectively. The expression of ALD2 and ALD3 is dependent on the general-stress transcription factors Msn2,4 but independent of the HOG MAP kinase pathway. ALD3 is induced by a variety of stresses, including osmotic shock, heat shock, glucose exhaustion, oxidative stress and drugs. ALD2 is only induced by osmotic stress and glucose exhaustion. A double null mutant, ald2 ald3, exhibited unchanged sensitivity to any of the above stresses. The only phenotype detected in this mutant was a reduced growth rate in ethanol medium as compared to the wild type.
Collapse
Affiliation(s)
- J P Navarro-Aviño
- Instituto de Biologia Molecular y Celular de Plantas, Universidad Politecnica de Valencia-C.S.I.C., Camino de Vera, 46022 Valencia, Spain
| | | | | | | | | |
Collapse
|
44
|
Zhang Y, Gorry MC, Post JC, Ehrlich GD. Genomic organization of the human fibroblast growth factor receptor 2 (FGFR2) gene and comparative analysis of the human FGFR gene family. Gene 1999; 230:69-79. [PMID: 10196476 DOI: 10.1016/s0378-1119(99)00047-5] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
The human fibroblast growth factor receptor (FGFR) genes play important roles in normal vertebrate development. Mutations in the human FGFR2 gene have been associated with many craniosynostotic syndromes and malformations, including Crouzon, Pfeiffer, Apert, Jackson-Weiss, Beare-Stevenson cutis gyrata, and Antley-Bixler syndromes, and Kleeblaatschadel (cloverleaf skull) deformity. The mutations identified to date are concentrated in the previously characterized region of FGFR2 that codes for the extracellular IgIII domain of the receptor protein. The search for mutations in other regions of the gene, however, has been hindered by lack of knowledge of the genomic structure. Using a combination of genomic library screening, long-range PCR, and genomic walking, we have characterized the genomic structure of nearly the entire human FGFR2 gene, including a delineation of the organization and size of all introns and exons and determination of the DNA sequences at the intron/exon boundaries. Comparative analysis of the human FGFR gene family reveals that the genomic organization of the FGFRs is relatively conserved. Moreover, alignment of the amino acid sequences shows that the four corresponding proteins share 46% identity overall, with up to 70% identity between individual pairs of FGFR proteins. However, the FGFR2 gene contains an additional exon not found in other members of the family, and it also has much larger intronic sequences throughout the gene. Remarkable similarities in genomic organization, intron/exon boundaries, and intron sizes are found between the human and mouse FGFR2 genes. Knowledge gained from this study of the human FGFR2 gene structure may prove useful in future screening studies designed to find additional mutations associated with craniosynostotic syndromes, and in understanding the molecular and cell biology of this receptor family.
Collapse
Affiliation(s)
- Y Zhang
- Center for Genomic Sciences, Allegheny University of the Health Sciences, 320 East North Avenue, 10th Floor, Pittsburgh, PA 15212, USA
| | | | | | | |
Collapse
|
45
|
Abstract
Does the intron/exon structure of eukaryotic genes belie their ancient assembly by exon-shuffling or have introns been inserted into preformed genes during eukaryotic evolution? These are the central questions in the ongoing 'introns-early' versus 'introns-late' controversy. The phylogenetic distribution of spliceosomal introns continues to strongly favor the intronslate theory. The introns-early theory, however, has claimed support from intron phase and protein structure correlations.
Collapse
Affiliation(s)
- J M Logsdon
- Department of Biochemistry, Dalhousie University, Halifax, Nova Scotia,B3H 4H7, Canada.
| |
Collapse
|
46
|
Abstract
The 'introns-late' theory holds that spliceosomal introns have been added to genes during eukaryotic evolution. Few clear examples of recent intron gains have been well documented, but two such cases have now been reported, one with possible identification of the source of the intron.
Collapse
Affiliation(s)
- J M Logsdon
- Program in Evolutionary Biology Canadian Institute for Advanced Research Department of Biochemistry Dalhousie University Halifax, Nova Scotia, Canada B3H 4H7.
| | | | | |
Collapse
|
47
|
Terao M, Kurosaki M, Demontis S, Zanotta S, Garattini E. Isolation and characterization of the human aldehyde oxidase gene: conservation of intron/exon boundaries with the xanthine oxidoreductase gene indicates a common origin. Biochem J 1998; 332 ( Pt 2):383-93. [PMID: 9601067 PMCID: PMC1219493 DOI: 10.1042/bj3320383] [Citation(s) in RCA: 45] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Aldehyde oxidase (AO) is a molybdo-flavo enzyme involved in the metabolism of various endogenous and exogenous N-heterocyclic compounds of pharmacological and toxicological importance. The enzyme is the product of a gene which is implicated in the aetio-pathogenesis of familial recessive amyotrophic lateral sclerosis. Here, we report the cloning and structural characterization of the human AO gene. AO is a single copy gene approximately 85 kb long with 35 transcribed exons. The transcription-initiation site and the sequence of the 5'-flanking region, containing several putative regulatory elements, were determined. The 5'-flanking region contains a functional promoter, as assessed by appropriate reporter constructs in transient transfection experiments. Comparison of the AO gene structure shows conservation of the position and type of exon/intron junctions relative to those observed in the gene coding for another molybdo-flavoprotein, i.e. xanthine oxidoreductase (XOR). As the two genes code for proteins with a high level of amino acid identity, our results strongly suggest that the AO and XOR genetic loci arose as the consequence of a duplication event. Southern blot analysis conducted on genomic DNA from various animal species with specific cDNA probes indicates that the AO gene is less conserved than the XOR gene during evolution.
Collapse
Affiliation(s)
- M Terao
- Laboratory of Molecular Biology, Centro Catullo e Daniela Borgomainerio, Istituto di Ricerche Farmacologiche 'Mario Negri', via Eritrea, 62, 20157 Italy
| | | | | | | | | |
Collapse
|
48
|
de Souza SJ, Long M, Klein RJ, Roy S, Lin S, Gilbert W. Toward a resolution of the introns early/late debate: only phase zero introns are correlated with the structure of ancient proteins. Proc Natl Acad Sci U S A 1998; 95:5094-9. [PMID: 9560234 PMCID: PMC20219 DOI: 10.1073/pnas.95.9.5094] [Citation(s) in RCA: 119] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
We present evidence that a well defined subset of intron positions shows a non-random distribution in ancient genes. We analyze a database of ancient conserved regions drawn from GenBank 101 to retest two predictions of the theory that the first genes were constructed by exon shuffling. These predictions are that there should be an excess of symmetric exons (and sets of exons) flanked by introns of the same phase (positions within the codon) and that intron positions in ancient proteins should correlate with the boundaries of compact protein modules. Both these predictions are supported by the data, with considerable statistical force (P values < 0.0001). Intron positions correlate to modules of diameters around 21, 27, and 33 A, and this correlation is due to phase zero introns. We suggest that 30-40% of present day intron positions in ancient genes correspond to phase zero introns originally present in the progenote, while almost all of the remaining intron positions correspond to introns added, or moved, appearing equally in all three intron phases. This proposal provides a resolution for many of the arguments of the introns-early/introns-late debate.
Collapse
Affiliation(s)
- S J de Souza
- Department of Molecular and Cellular Biology, The Biological Laboratories, Harvard University, Cambridge, MA 02138, USA
| | | | | | | | | | | |
Collapse
|
49
|
Gärtner J, Jimenez-Sanchez G, Roerig P, Valle D. Genomic organization of the 70-kDa peroxisomal membrane protein gene (PXMP1). Genomics 1998; 48:203-8. [PMID: 9521874 DOI: 10.1006/geno.1997.5177] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
The 70-kDa peroxisomal membrane protein (PMP70) is a member of a family of half-ATP-binding cassette (ABC) transporter proteins located in the human peroxisomal membrane. Other members include the PMP70-related peroxisomal membrane protein, the adrenoleukodystrophy protein (ALDP), and the adrenoleukodystrophy-related protein. The functions of ABC transporters in the peroxisomal membrane are poorly understood. Evidence from yeast and human mutants suggests that they are involved in the peroxisomal import of fatty acids and/or fatty acyl-CoAs into the organelle. We report the cloning and characterization of the human PMP70 structural gene (gene symbol: PXMP1) localized on human chromosome 1p21-p22. PXMP1 is approximately 65 kb in length, contains 23 exons, and is quite different in structure from the gene (ALD) that encodes the related protein, ALDP. We also analyzed the 5' flanking region of the human PXMP1 gene and the corresponding region of murine Pxmp-1. Both promoters have features of housekeeping genes, including a high GC content and multiple consensus Sp1 binding sequences. In more than 3 kb of Pxmp-1 5' flanking sequence we did not identify a consensus peroxisomal proliferator responsive element.
Collapse
Affiliation(s)
- J Gärtner
- Department of Pediatrics, Heinrich Heine University Düsseldorf, Germany.
| | | | | | | |
Collapse
|
50
|
Tarrío R, Rodríguez-Trelles F, Ayala FJ. New Drosophila introns originate by duplication. Proc Natl Acad Sci U S A 1998; 95:1658-62. [PMID: 9465072 PMCID: PMC19138 DOI: 10.1073/pnas.95.4.1658] [Citation(s) in RCA: 43] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
We have analyzed the phylogenetic distribution of introns in the gene coding for xanthine dehydrogenase in 37 species, including 31 dipterans sequenced by us. We have discovered three narrowly distributed novel introns, one in the medfly Ceratitis capitata, the second in the willistoni and saltans groups of Drosophila, and the third in two sibling species of the willistoni group. The phylogenetic distribution of these introns favors the "introns-late" theory of the origin of genes. Analysis of the nucleotide sequences indicates that all three introns have arisen by duplication of a preexisting intron, which is pervasive in Drosophila and other dipterans (and has a homologous position as an intron found in humans and other diverse organisms).
Collapse
Affiliation(s)
- R Tarrío
- Department of Ecology and Evolutionary Biology, University of California, Irvine, CA 92697-2525, USA
| | | | | |
Collapse
|