1
|
Ramallo E, Kalendar R, Schulman AH, Martínez-Izquierdo JA. Reme1, a Copia retrotransposon in melon, is transcriptionally induced by UV light. PLANT MOLECULAR BIOLOGY 2008; 66:137-50. [PMID: 18034313 DOI: 10.1007/s11103-007-9258-4] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/09/2007] [Accepted: 11/05/2007] [Indexed: 05/08/2023]
Abstract
For the first time, numerous sequences of Copia and Gypsy retrotransposons from the Cucumis melo genome have been obtained and analyzed. Phylogenetic analyses of sequences of both types of long terminal repeat (LTR) retrotransposons were carried out. The melon genome contains approximately 20,000 Gypsy and 6,800 Copia elements, comprising about 26% of its total size. Starting from a retrotransposon fragment, we have cloned and characterized an entire melon retrotransposon, named Reme1, which is 5,149 bp long. Reme1 belongs to the Superfamily Copia retrotransposons by its protein domain order and sequence similarity to other Copia elements of dicotyledons. The haploid genome of melon (var. "Piel de Sapo") contains about 120 copies of Reme1. Several copies of Reme1 are transcriptionally active, although at low levels, in melon leaves as analyzed by reverse-transcription PCR (RT-PCR) and sequencing. However, the transcript pool is considerably increased when melon leaves are treated with UV light, as has been seen for various retroelements in many organisms. The cDNAs of Reme1 transcripts showed less diversity than do Reme1 genomic sequences, suggesting that a subfamily of these elements is differentially responsive to UV.
Collapse
Affiliation(s)
- Elisenda Ramallo
- Department of Molecular Genetics, Consorci CSIC-IRTA, Jordi Girona 18-26, 08034 Barcelona, Spain.
| | | | | | | |
Collapse
|
2
|
Zhang H, Sreenivasulu N, Weschke W, Stein N, Rudd S, Radchuk V, Potokina E, Scholz U, Schweizer P, Zierold U, Langridge P, Varshney RK, Wobus U, Graner A. Large-scale analysis of the barley transcriptome based on expressed sequence tags. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2004; 40:276-90. [PMID: 15447653 DOI: 10.1111/j.1365-313x.2004.02209.x] [Citation(s) in RCA: 84] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/20/2023]
Abstract
To provide resources for barley genomics, 110,981 expressed sequence tags (ESTs) were generated from 22 cDNA libraries representing tissues at various developmental stages. This EST collection corresponds to approximately one-third of the 380,000 publicly available barley ESTs. Clustering and assembly resulted in 14,151 tentative consensi (TCs) and 11 073 singletons, altogether representing 25 224 putatively unique sequences. Of these, 17.5% showed no significant similarity to other barley ESTs present in dbEST. More than 41% of all barley genes are supposed to belong to multigene families and approximately 4% of the barley genes undergo alternative splicing. Based on the functional annotation of the set of unique sequences, the functional category 'Energy' was further analysed to reveal tissue- and stage-specific differences in gene expression. Hierarchical clustering of 362 differentially expressed TCs resulted in the identification of seven major clusters. The clusters reflect biochemical pathways predominantly activated in specific tissues and at various developmental stages. During seed germination glycolysis could be identified as the most predominant biochemical pathway. Germination-specific glycolysis is characterized by the coordinated expression of phosphoenolpyruvate carboxylase and phosphoenolpyruvate carboxykinase, whose antagonistic actions possibly regulate the flux of amino acids into protein biosynthesis and gluconeogenesis respectively. The expression of defence-related and antioxidant genes during germination might be controlled by the ethylene-signalling pathway as concluded from the coordinated expression of those genes and the transcription factors (TF) EIN3 and EREBPG. Moreover, because of their predominant expression in germinating seeds, TF of the AP2 and MYB type are presumably major regulators of germination.
Collapse
Affiliation(s)
- Hangning Zhang
- Institut für Pflanzengenetik und Kulturpflanzenforschung (IPK), Correnstrasse 3, D-06466 Gatersleben, Germany
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
3
|
Kato A, Suzuki M, Kuwahara A, Ooe H, Higano-Inaba K, Komeda Y. Isolation and analysis of cDNA within a 300 kb Arabidopsis thaliana genomic region located around the 100 map unit of chromosome 1. Gene 1999; 239:309-16. [PMID: 10548732 DOI: 10.1016/s0378-1119(99)00403-5] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
In order to analyze the organization of genes located at the 100 map unit of chromosome 1, we screened cDNAs hybridized with approximately 300kb contiguous DNA using four P1 clones and one YAC clone. A total of 40 kinds of cDNA were isolated, and their entire sequences were determined. A comparison with the GenBank/EMBL database indicated that three of the cDNAs have been found in Arabidopsis, and that similar sequences to 18 of the cDNAs had been detected in Arabidopsis or other organisms. cDNAs were aligned on a physical map of the contiguous DNA, and the transcriptional direction of each cDNA was determined. This contiguous DNA contains a large direct repeat, which contains five genes. In addition, identical or very similar sequences to two cDNAs are located in a narrow region. Thus, a total of 50 genes were identified, and the gene density was revealed to be approximately one gene every 6kb. In addition, cDNA sequencing revealed the existence of unusual transcripts. A sequence of seven cDNAs seemed to have no significant open reading frames. Furthermore, the existence of antisense RNA and the possibility of alternative splicing were also revealed.
Collapse
Affiliation(s)
- A Kato
- Division of Biological Sciences, Graduate School of Science, Hokkaido University, Sapporo, Japan.
| | | | | | | | | | | |
Collapse
|
4
|
Mathé C, Peresetsky A, Déhais P, Van Montagu M, Rouzé P. Classification of Arabidopsis thaliana gene sequences: clustering of coding sequences into two groups according to codon usage improves gene prediction. J Mol Biol 1999; 285:1977-91. [PMID: 9925779 DOI: 10.1006/jmbi.1998.2451] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
While genomic sequences are accumulating, finding the location of the genes remains a major issue that can be solved only for about a half of them by homology searches. Prediction methods are thus required, but unfortunately are not fully satisfying. Most prediction methods implicitly assume a unique model for genes. This is an oversimplification as demonstrated by the possibility to group coding sequences into several classes in Escherichia coli and other genomes. As no classification existed for Arabidopsis thaliana, we classified genes according to the statistical features of their coding sequences. A clustering algorithm using a codon usage model was developed and applied to coding sequences from A. thaliana, E. coli, and a mixture of both. By using it, Arabidopsis sequences were clustered into two classes. The CU1 and CU2 classes differed essentially by the choice of pyrimidine bases at the codon silent sites: CU2 genes often use C whereas CU1 genes prefer T. This classification discriminated the Arabidopsis genes according to their expressiveness, highly expressed genes being clustered in CU2 and genes expected to have a lower expression, such as the regulatory genes, in CU1. The algorithm separated the sequences of the Escherichia-Arabidopsis mixed data set into five classes according to the species, except for one class. This mixed class contained 89 % Arabidopsis genes from CU1 and 11 % E. coli genes, mostly horizontally transferred. Interestingly, most genes encoding organelle-targeted proteins, except the photosynthetic and photoassimilatory ones, were clustered in CU1. By tailoring the GeneMark CDS prediction algorithm to the observed coding sequence classes, its quality of prediction was greatly improved. Similar improvement can be expected with other prediction systems.
Collapse
Affiliation(s)
- C Mathé
- Laboratorium voor Genetica Department of Genetics, Flanders Interuniversity Institute for Biotechnology (VIB), Universiteit Gent, Gent, B-9000, Belgium
| | | | | | | | | |
Collapse
|
5
|
Terryn N, Gielen J, De Keyser A, Van Den Daele H, Ardiles W, Neyt P, De Clercq R, Coppieters J, Déhais P, Villarroel R, Rouzé P, Van Montagu M. Sequence analysis of a 40-kb Arabidopsis thaliana genomic region located at the top of chromosome 1. Gene 1998; 215:11-7. [PMID: 9666060 DOI: 10.1016/s0378-1119(98)00286-8] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Abstract
As a contribution to the European Scientists Sequencing Arabidopsis (BIOTECH ESSA) project, a contig of almost 40kb has been sequenced at the extreme top of chromosome 1, around the Arabidopsis thaliana gene coding for a member of the 1-aminocyclopropane-1-carboxylate synthesis gene family. The region contains, besides the ACS1 gene itself, 10 putative genes, all new for Arabidopsis. Among these are three genes encoding kinases, a late embryogenesis-abundant protein, a MADS box-containing protein, a dehydrogenase, and a Myb-related transcription factor. In addition, six cDNAs have been sequenced that correspond to this region.
Collapse
Affiliation(s)
- N Terryn
- Laboratorium voor Genetica, Departement Genetica, Vlaams Interuniversitair Instituut voor Biotechnologie, Universiteit Gent, K.L. Ledeganckstraat 35, B-9000, Gent, Belgium
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
6
|
McKinney EC, Meagher RB. Members of the Arabidopsis actin gene family are widely dispersed in the genome. Genetics 1998; 149:663-75. [PMID: 9611182 PMCID: PMC1460172 DOI: 10.1093/genetics/149.2.663] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Plant genomes are subjected to a variety of DNA turnover mechanisms that are thought to result in rapid expansion and presumable contraction of gene copy number. The evolutionary history of the 10 actin genes in Arabidopsis thaliana is well characterized and can be traced to the origin of vascular plant genomes. Knowledge about the genomic position of each actin gene may be the key to tracing landmark genomic duplication events that define plant families or genera and facilitate further mutant isolation. All 10 actin genes were mapped by following the segregation of cleaved amplified polymorphisms between two ecotypes and identifying actin gene locations among yeast artificial chromosomes. The Arabidopsis actin genes are widely dispersed on four different chromosomes (1, 2, 3, and 5). Even the members of three closely related and recently duplicated pairs of actin genes are unlinked. Several other cytoskeletal genes (profilins, tubulins) that might have evolved in concert with actins were also mapped, but showed few patterns consistent with that evolutionary history. Thus, the events that gave rise to the actin gene family have been obscured either by the duplication of very small genic fragments or by extensive rearrangement of the genome.
Collapse
Affiliation(s)
- E C McKinney
- Department of Genetics, University of Georgia, Athens, Georgia 30602, USA
| | | |
Collapse
|
7
|
Rounsley S, Lin X, Ketchum KA. Large-scale sequencing of plant genomes. CURRENT OPINION IN PLANT BIOLOGY 1998; 1:136-141. [PMID: 10066574 DOI: 10.1016/s1369-5266(98)80015-0] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
The large number of ESTs generated for Arabidopsis and rice in recent years now act as an important complement to whole genome sequencing projects. The Arabidopsis Genome Initiative has begun a coordinated effort to sequence the entire genome and, as a result, increasing numbers of large sequence entries can be found in the public databases. In addition, the mitochondrial genome of Arabidopsis has been completely sequenced. Genome sequencing studies and the public sequence databases have begun to influence the direction of diverse areas of research from physiology to evolution.
Collapse
Affiliation(s)
- S Rounsley
- The Institute for Genomic Research, 9712 Medical Center Drive, Rockville, MD 20850, USA.
| | | | | |
Collapse
|
8
|
Gy I, Aubourg S, Sherson S, Cobbett CS, Cheron A, Kreis M, Lecharny A. Analysis of a 14-kb fragment containing a putative cell wall gene and a candidate for the ARA1, arabinose kinase, gene from chromosome IV of Arabidopsis thaliana. Gene 1998; 209:201-10. [PMID: 9524266 DOI: 10.1016/s0378-1119(98)00049-3] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
An Arabidopsis thaliana genomic DNA fragment of 14kb has been characterized in the framework of the E.S.S.A. programme. Computational and molecular approaches identified three novel gene sequences coding, respectively, for a protein of unknown function, a putative membrane-anchored cell wall protein and an arabinose kinase gene corresponding to the locus ARA1. The latter two genes named AtSEB1 and AtISA1 have been characterized in detail. They are very different in their organization, codon usage and level of expression. Homologues of AtSEB1 and AtISA1 have been identified. Sequence comparisons showed that the former genes contained a long 5' extension coding for an N-terminal domain probably specifying subcellular localization. Cloning and sequencing of the cognate cDNA for the AtISA1 homologue in A. thaliana, named GAL1, indicate that it encodes for a galactokinase-like protein. Our results highlight the integrative outcome of a systematic sequencing project in which links between biochemically and genetically characterized mutants, ESTs and genomic sequence data are generated.
Collapse
Affiliation(s)
- I Gy
- Institut de Biotechnologie des Plantes, Laboratoire de Biologie du Développement des Plantes, Bâtiment 630, Université de Paris-Sud, CNRS-ERS 569, F-91405, Orsay, Cedex, France
| | | | | | | | | | | | | |
Collapse
|
9
|
Panstruga R, Büschges R, Piffanelli P, Schulze-Lefert P. A contiguous 60 kb genomic stretch from barley reveals molecular evidence for gene islands in a monocot genome. Nucleic Acids Res 1998; 26:1056-62. [PMID: 9461468 PMCID: PMC147355 DOI: 10.1093/nar/26.4.1056] [Citation(s) in RCA: 111] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
The contiguous DNA sequence of a 60 kb genomic interval of barley chromosome 4HL has been assembled. The region harbours a single and novel gypsy -like retrotransposon, designated BAGY-1. Only three genes appear to reside in the genomic stretch. One predicts a plant homologue of ribophorin I, a subunit of the oligosaccharyltransferase-protein complex located in the rough endoplasmatic reticulum. The second is similar to the Drosophila g1 gene encoding a ring finger protein involved in developmental processes. The observed gene density is approximately 5-fold lower than in the best characterized dicot genome of Arabidopsis but 6- to 10-fold higher than expected from an equidistant gene distribution in the complex barley genome. Our data suggest that the 60 kb genomic interval represents part of a gene island, a seemingly distinctive feature of grass genomes.
Collapse
Affiliation(s)
- R Panstruga
- The Sainsbury Laboratory, John Innes Centre, Norwich Research Park, Colney Lane, Norwich NR4 7UH, UK
| | | | | | | |
Collapse
|
10
|
Terryn N, Neyt P, De Clercq R, De Keyser A, Van Den Daele H, Ardiles W, Déhais P, Rouzé P, Gielen J, Villarroel R, Van Montagu M. Sequence analysis of a 24-kb contiguous genomic region at the Arabidopsis thaliana PFL locus on chromosome 1. FEBS Lett 1997; 416:156-60. [PMID: 9369203 DOI: 10.1016/s0014-5793(97)01190-3] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
As part of the European Union program of European Scientist Sequencing Arabidopsis (ESSA), the DNA sequence of a 24.053-bp insert of cosmid clone CC17J13 was determined. The cosmid is located on chromosome 1 at the PFL locus (position 30 cM). Analysis of the sequence and comparison to public databases predicts seven genes in this area, thus approximately one gene every 3.3 kb. Three cDNAs corresponding to genes in this region were also sequenced. The homologies and/or possible functions of the (putative) genes are discussed. Proteins encoded by genes in this region include a polyadenylate-binding protein (PAB-3) and a GTP-binding protein (Rab7) as well as a novel protein, possibly involved in double-stranded RNA unwinding and apoptosis. Intriguingly, the gene encoding the PAB-3 protein, which is very specifically expressed, is flanked by putative matrix attachment regions.
Collapse
Affiliation(s)
- N Terryn
- Department of Genetics, Flanders Interuniversity Institute for Biotechnology (VIB), Universiteit Gent, Belgium
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|