1
|
Guseva S, Szekely O, Geng A, Smith K, Pratihar S, Lee Y, Li L, Korn SM, Gu S, Al-Hashimi HM. An A-T Hoogsteen Base Pair in a Naked DNA Hairpin Motif: A Protein-Recognized Conformation. Angew Chem Int Ed Engl 2025:e202425067. [PMID: 40215227 DOI: 10.1002/anie.202425067] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2024] [Revised: 03/31/2025] [Accepted: 04/08/2025] [Indexed: 04/25/2025]
Abstract
In duplex DNA, A-T and G-C form Watson-Crick base pairs, and Hoogsteen pairing only dominates upon protein binding or DNA damage. Using NMR, we show that an A-T Hoogsteen base pair previously observed in crystal structures of transposon DNA hairpins bound to TnpA protein forms in solution even in the absence of TnpA. This Hoogsteen base pair, located adjacent to a dinucleotide apical loop, exists in dynamic equilibrium with a minor Watson-Crick conformation (population ∼11% and lifetime ∼55 µs). Extending the apical loop to three residues inverted the equilibrium, making Watson-Crick the dominant state and the Hoogsteen conformation recognized by TnpA a minor state (population ∼14% and lifetime ∼28 µs). The propensity for Hoogsteen pairing depended on apical loop residues, which form contacts directly or indirectly stabilizing the Hoogsteen conformation. A structure survey did not reveal Hoogsteen pairing near RNA apical loops making them unique to DNA. Our results demonstrate that Hoogsteen can be the dominant state even in naked unmodified duplex DNA and identify 5'-CTT(T/C)AG-3' as a DNA-specific apical loop motif stabilized by Hoogsteen pairing. Hoogsteen base pairs may be prevalent in DNA hairpins forming during replication and transcription, with broad implications for the genomic landscape.
Collapse
Affiliation(s)
- Serafima Guseva
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY, 10032, USA
| | - Or Szekely
- Department of Biology, Duke University, Durham, NC, 27710, USA
- Howard Hughes Medical Institute, Duke University, Durham, NC, 27710, USA
| | - Ainan Geng
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY, 10032, USA
- Department of Biochemistry, Duke University School of Medicine, Durham, NC, 27710, USA
| | - Kai Smith
- Department of Biomedical Engineering, University of Arizona, Tuscon, AZ, 85721, USA
| | - Supriya Pratihar
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY, 10032, USA
| | - Yeongjoon Lee
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY, 10032, USA
| | - Linshu Li
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY, 10032, USA
| | - Sophie M Korn
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY, 10032, USA
| | - Stephanie Gu
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY, 10032, USA
- Department of Biochemistry, Duke University School of Medicine, Durham, NC, 27710, USA
- Program in Cellular and Molecular Medicine, Boston Children's Hospital, Harvard Medical School, Boston, MA, 02115, USA
| | - Hashim M Al-Hashimi
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY, 10032, USA
| |
Collapse
|
2
|
Guseva S, Szekely O, Geng A, Smith K, Pratihar S, Gu S, Al-Hashimi HM. An A-T Hoogsteen base pair in a naked DNA hairpin motif: A Protein-Recognized Conformation. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.12.22.630000. [PMID: 39763874 PMCID: PMC11703281 DOI: 10.1101/2024.12.22.630000] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/25/2025]
Abstract
In duplex DNA, A-T and G-C form Watson-Crick base pairs, and Hoogsteen pairing only dominates upon protein binding or DNA damage. Using NMR, we show that an A-T Hoogsteen base pair previously observed in crystal structures of transposon DNA hairpins bound to TnpA protein forms in solution even in the absence of TnpA. This Hoogsteen base pair, located adjacent to a dinucleotide apical loop, exists in dynamic equilibrium with a minor Watson-Crick conformation (population ∼11% and lifetime ∼55 µs). Extending the apical loop to three residues inverted the equilibrium, making Watson-Crick the dominant state and the Hoogsteen conformation recognized by TnpA a minor state (population ∼14% and lifetime ∼28 µs). The propensity for Hoogsteen pairing depended on apical loop residues, which form contacts directly or indirectly stabilizing the Hoogsteen conformation. A structure survey did not reveal Hoogsteen pairing near RNA apical loops making them unique to DNA. Our results demonstrate that Hoogsteen can be the dominant state even in naked unmodified duplex DNA and identify 5'-CTT(T/C)AG-3' as a DNA-specific apical loop motif stabilized by Hoogsteen pairing. Hoogsteen base pairs may be prevalent in DNA hairpins forming during replication and transcription, with broad implications for the genomic landscape.
Collapse
|
3
|
Yu Y, Wang X, Fox J, Li Q, Yu Y, Hastings PJ, Chen K, Ira G. RPA and Rad27 limit templated and inverted insertions at DNA breaks. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.03.07.583931. [PMID: 38496432 PMCID: PMC10942419 DOI: 10.1101/2024.03.07.583931] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/19/2024]
Abstract
Formation of templated insertions at DNA double-strand breaks (DSBs) is very common in cancer cells. The mechanisms and enzymes regulating these events are largely unknown. Here, we investigated templated insertions in yeast at DSBs using amplicon sequencing across a repaired locus. We document very short (most ∼5-34 bp), templated inverted duplications at DSBs. They are generated through a foldback mechanism that utilizes microhomologies adjacent to the DSB. Enzymatic requirements suggest a hybrid mechanism wherein one end requires Polδ-mediated synthesis while the other end is captured by nonhomologous end joining (NHEJ). This process is exacerbated in mutants with low levels or mutated RPA ( rtt105 Δ; rfa1 -t33) or extensive resection mutant ( sgs1 Δ exo1 Δ). Templated insertions from various distant genomic locations also increase in these mutants as well as in rad27 Δ and originate from fragile regions of the genome. Among complex insertions, common events are insertions of two sequences, originating from the same locus and with inverted orientation. We propose that these inversions are also formed by microhomology-mediated template switching. Taken together, we propose that a shortage of RPA typical in cancer cells is one possible factor stimulating the formation of templated insertions.
Collapse
|
4
|
Verma A, Lin M, Smith D, Walker JC, Hewezi T, Davis EL, Hussey RS, Baum TJ, Mitchum MG. A novel sugar beet cyst nematode effector 2D01 targets the Arabidopsis HAESA receptor-like kinase. MOLECULAR PLANT PATHOLOGY 2022; 23:1765-1782. [PMID: 36069343 PMCID: PMC9644282 DOI: 10.1111/mpp.13263] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/18/2022] [Revised: 08/10/2022] [Accepted: 08/11/2022] [Indexed: 06/15/2023]
Abstract
Plant-parasitic cyst nematodes use a stylet to deliver effector proteins produced in oesophageal gland cells into root cells to cause disease in plants. These effectors are deployed to modulate plant defence responses and developmental programmes for the formation of a specialized feeding site called a syncytium. The Hg2D01 effector gene, coding for a novel 185-amino-acid secreted protein, was previously shown to be up-regulated in the dorsal gland of parasitic juveniles of the soybean cyst nematode Heterodera glycines, but its function has remained unknown. Genome analyses revealed that Hg2D01 belongs to a highly diversified effector gene family in the genomes of H. glycines and the sugar beet cyst nematode Heterodera schachtii. For functional studies using the model Arabidopsis thaliana-H. schachtii pathosystem, we cloned the orthologous Hs2D01 sequence from H. schachtii. We demonstrate that Hs2D01 is a cytoplasmic effector that interacts with the intracellular kinase domain of HAESA (HAE), a cell surface-associated leucine-rich repeat (LRR) receptor-like kinase (RLK) involved in signalling the activation of cell wall-remodelling enzymes important for cell separation during abscission and lateral root emergence. Furthermore, we show that AtHAE is expressed in the syncytium and, therefore, could serve as a viable host target for Hs2D01. Infective juveniles effectively penetrated the roots of HAE and HAESA-LIKE2 (HSL2) double mutant plants; however, fewer nematodes developed on the roots, consistent with a role for this receptor family in nematode infection. Taken together, our results suggest that the Hs2D01-AtHAE interaction may play an important role in sugar beet cyst nematode parasitism.
Collapse
Affiliation(s)
- Anju Verma
- Department of Plant Pathology and Institute of Plant Breeding, Genetics, and GenomicsUniversity of GeorgiaAthensGeorgiaUSA
- Division of Plant Sciences and Bond Life Sciences CenterUniversity of MissouriColumbiaMissouriUSA
| | - Marriam Lin
- Division of Plant Sciences and Bond Life Sciences CenterUniversity of MissouriColumbiaMissouriUSA
- Boyle Frederickson Intellectual Property LawMilwaukeeWisconsinUSA
| | - Dante Smith
- Division of Plant Sciences and Bond Life Sciences CenterUniversity of MissouriColumbiaMissouriUSA
- Conagra Brands, Inc., Corporate Microbiology, Research and DevelopmentOmahaNebraskaUSA
| | - John C. Walker
- Division of Biological SciencesUniversity of MissouriColumbiaMissouriUSA
| | - Tarek Hewezi
- Department of Plant SciencesUniversity of TennesseeKnoxvilleTennesseeUSA
| | - Eric L. Davis
- Department of Entomology and Plant PathologyNorth Carolina State UniversityRaleighNorth CarolinaUSA
| | - Richard S. Hussey
- Department of Plant Pathology and Institute of Plant Breeding, Genetics, and GenomicsUniversity of GeorgiaAthensGeorgiaUSA
| | - Thomas J. Baum
- Department of Plant Pathology and MicrobiologyIowa State UniversityAmesIowaUSA
| | - Melissa G. Mitchum
- Department of Plant Pathology and Institute of Plant Breeding, Genetics, and GenomicsUniversity of GeorgiaAthensGeorgiaUSA
- Division of Plant Sciences and Bond Life Sciences CenterUniversity of MissouriColumbiaMissouriUSA
| |
Collapse
|
5
|
Fajkus P, Peška V, Fajkus J, Sýkorová E. Origin and Fates of TERT Gene Copies in Polyploid Plants. Int J Mol Sci 2021; 22:1783. [PMID: 33670111 PMCID: PMC7916837 DOI: 10.3390/ijms22041783] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2020] [Revised: 02/03/2021] [Accepted: 02/05/2021] [Indexed: 12/14/2022] Open
Abstract
The gene coding for the telomerase reverse transcriptase (TERT) is essential for the maintenance of telomeres. Previously we described the presence of three TERT paralogs in the allotetraploid plant Nicotiana tabacum, while a single TERT copy was identified in the paleopolyploid model plant Arabidopsis thaliana. Here we examine the presence, origin and functional status of TERT variants in allotetraploid Nicotiana species of diverse evolutionary ages and their parental genome donors, as well as in other diploid and polyploid plant species. A combination of experimental and in silico bottom-up analyses of TERT gene copies in Nicotiana polyploids revealed various patterns of retention or loss of parental TERT variants and divergence in their functions. RT-qPCR results confirmed the expression of all the identified TERT variants. In representative plant and green algal genomes, our synteny analyses show that their TERT genes were located in a conserved locus that became advantageous after the divergence of eudicots, and the gene was later translocated in several plant groups. In various diploid and polyploid species, translocation of TERT became fixed in target loci that show ancient synapomorphy.
Collapse
Affiliation(s)
- Petr Fajkus
- Institute of Biophysics of the Czech Academy of Sciences, Královopolská 135, CZ-61265 Brno, Czech Republic; (P.F.); (V.P.)
| | - Vratislav Peška
- Institute of Biophysics of the Czech Academy of Sciences, Královopolská 135, CZ-61265 Brno, Czech Republic; (P.F.); (V.P.)
| | - Jiří Fajkus
- Institute of Biophysics of the Czech Academy of Sciences, Královopolská 135, CZ-61265 Brno, Czech Republic; (P.F.); (V.P.)
- Laboratory of Functional Genomics and Proteomics, NCBR, Faculty of Science, Masaryk University, Kotlářská 2, CZ-61137 Brno, Czech Republic
- Mendel Centre for Plant Genomics and Proteomics, CEITEC, Masaryk University, Kamenice 5, CZ-62500 Brno, Czech Republic
| | - Eva Sýkorová
- Institute of Biophysics of the Czech Academy of Sciences, Královopolská 135, CZ-61265 Brno, Czech Republic; (P.F.); (V.P.)
| |
Collapse
|
6
|
Fonseca PLC, Badotti F, De-Paula RB, Araújo DS, Bortolini DE, Del-Bem LE, Azevedo VA, Brenig B, Aguiar ERGR, Góes-Neto A. Exploring the Relationship Among Divergence Time and Coding and Non-coding Elements in the Shaping of Fungal Mitochondrial Genomes. Front Microbiol 2020; 11:765. [PMID: 32411111 PMCID: PMC7202290 DOI: 10.3389/fmicb.2020.00765] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2020] [Accepted: 03/30/2020] [Indexed: 12/24/2022] Open
Abstract
The order Hypocreales (Ascomycota) is composed of ubiquitous and ecologically diverse fungi such as saprobes, biotrophs, and pathogens. Despite their phylogenetic relationship, these species exhibit high variability in biomolecules production, lifestyle, and fitness. The mitochondria play an important role in the fungal biology, providing energy to the cells and regulating diverse processes, such as immune response. In spite of its importance, the mechanisms that shape fungal mitogenomes are still poorly understood. Herein, we investigated the variability and evolution of mitogenomes and its relationship with the divergence time using the order Hypocreales as a study model. We sequenced and annotated for the first time Trichoderma harzianum mitochondrial genome (mtDNA), which was compared to other 34 mtDNAs species that were publicly available. Comparative analysis revealed a substantial structural and size variation on non-coding mtDNA regions, despite the conservation of copy number, length, and structure of protein-coding elements. Interestingly, we observed a highly significant correlation between mitogenome length, and the number and size of non-coding sequences in mitochondrial genome. Among the non-coding elements, group I and II introns and homing endonucleases genes (HEGs) were the main contributors to discrepancies in mitogenomes structure and length. Several intronic sequences displayed sequence similarity among species, and some of them are conserved even at gene position, and were present in the majority of mitogenomes, indicating its origin in a common ancestor. On the other hand, we also identified species-specific introns that advocate for the origin by different mechanisms. Investigation of mitochondrial gene transfer to the nuclear genome revealed that nuclear copies of the nad5 are the most frequent while atp8, atp9, and cox3 could not be identified in any of the nuclear genomes analyzed. Moreover, we also estimated the divergence time of each species and investigated its relationship with coding and non-coding elements as well as with the length of mitogenomes. Altogether, our results demonstrated that introns and HEGs are key elements on mitogenome shaping and its presence on fast-evolving mtDNAs could be mostly explained by its divergence time, although the intron sharing profile suggests the involvement of other mechanisms on the mitochondrial genome evolution, such as horizontal transference.
Collapse
Affiliation(s)
- Paula L. C. Fonseca
- Molecular and Computational Biology of Fungi Laboratory, Department of Microbiology, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
| | - Fernanda Badotti
- Department of Chemistry, Centro Federal de Educação Tecnológica de Minas Gerais, Belo Horizonte, Brazil
| | - Ruth B. De-Paula
- Department of Molecular and Cellular Oncology, The University of Texas MD Anderson Cancer Center, Houston, TX, United States
| | - Daniel S. Araújo
- Molecular and Computational Biology of Fungi Laboratory, Department of Microbiology, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
| | - Dener E. Bortolini
- Program of Bioinformatics, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
| | - Luiz-Eduardo Del-Bem
- Program of Bioinformatics, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
- Department of Botany, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
| | - Vasco A. Azevedo
- Program of Bioinformatics, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
| | - Bertram Brenig
- Institute of Veterinary Medicine, Burckhardtweg, University of Göttingen, Göttingen, Germany
| | - Eric R. G. R. Aguiar
- Program of Bioinformatics, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
| | - Aristóteles Góes-Neto
- Molecular and Computational Biology of Fungi Laboratory, Department of Microbiology, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
- Program of Bioinformatics, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
| |
Collapse
|
7
|
Structural variation and its potential impact on genome instability: Novel discoveries in the EGFR landscape by long-read sequencing. PLoS One 2020; 15:e0226340. [PMID: 31940362 PMCID: PMC6961855 DOI: 10.1371/journal.pone.0226340] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2019] [Accepted: 11/25/2019] [Indexed: 12/29/2022] Open
Abstract
Structural variation (SV) is typically defined as variation within the human genome that exceeds 50 base pairs (bp). SV may be copy number neutral or it may involve duplications, deletions, and complex rearrangements. Recent studies have shown SV to be associated with many human diseases. However, studies of SV have been challenging due to technological constraints. With the advent of third generation (long-read) sequencing technology, exploration of longer stretches of DNA not easily examined previously has been made possible. In the present study, we utilized third generation (long-read) sequencing techniques to examine SV in the EGFR landscape of four haplotypes derived from two human samples. We analyzed the EGFR gene and its landscape (+/- 500,000 base pairs) using this approach and were able to identify a region of non-coding DNA with over 90% similarity to the most common activating EGFR mutation in non-small cell lung cancer. Based on previously published Alu-element genome instability algorithms, we propose a molecular mechanism to explain how this non-coding region of DNA may be interacting with and impacting the stability of the EGFR gene and potentially generating this cancer-driver gene. By these techniques, we were also able to identify previously hidden structural variation in the four haplotypes and in the human reference genome (hg38). We applied previously published algorithms to compare the relative stabilities of these five different EGFR gene landscape haplotypes to estimate their relative potentials to generate the EGFR exon 19, 15 bp canonical deletion. To our knowledge, the present study is the first to use the differences in genomic architecture between targeted cancer-linked phased haplotypes to estimate their relative potentials to form a common cancer-linked driver mutation.
Collapse
|
8
|
Shi J, Liang C. Generic Repeat Finder: A High-Sensitivity Tool for Genome-Wide De Novo Repeat Detection. PLANT PHYSIOLOGY 2019; 180:1803-1815. [PMID: 31152127 PMCID: PMC6670090 DOI: 10.1104/pp.19.00386] [Citation(s) in RCA: 61] [Impact Index Per Article: 10.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/28/2019] [Accepted: 05/17/2019] [Indexed: 05/25/2023]
Abstract
Comprehensive and accurate annotation of the repeatome, including transposons, is critical for deepening our understanding of repeat origins, biogenesis, regulatory mechanisms, and roles. Here, we developed Generic Repeat Finder (GRF), a tool for genome-wide repeat detection based on fast, exhaustive numerical calculation algorithms integrated with optimized dynamic programming strategies. GRF sensitively identifies terminal inverted repeats (TIRs), terminal direct repeats (TDRs), and interspersed repeats that bear both inverted and direct repeats. GRF also detects DNA or RNA transposable elements characterized by these repeats in plant and animal genomes. For TIRs and TDRs, GRF identifies spacers in the middle and mismatches/insertions or deletions in terminal repeats, showing their alignment or base-pairing information. GRF helps improve the annotation for various DNA transposons and retrotransposons, such as miniature inverted-repeat transposable elements (MITEs), long terminal repeat (LTR) retrotransposons, and non-LTR retrotransposons, including long interspersed nuclear elements and short interspersed nuclear elements in plants. We used GRF to perform TIR/TDR, interspersed-repeat, and MITE detection in several species, including Arabidopsis (Arabidopsis thaliana), rice (Oryza sativa), and mouse (Mus musculus). As a generic bioinformatics tool in repeat finding implemented as a parallelized C++ program, GRF was faster and more sensitive than the existing inverted repeat/MITE detection tools based on numerical approaches (i.e. detectIR and detectMITE) in Arabidopsis and mouse. GRF is more sensitive than Inverted Repeat Finder in TIR detection, LTR_FINDER in short TDR detection (≤1,000 nt), and phRAIDER in interspersed repeat detection in Arabidopsis and rice. GRF is an open source available from Github.
Collapse
Affiliation(s)
- Jieming Shi
- Department of Biology, Miami University, Oxford, Ohio 45056
| | - Chun Liang
- Department of Biology, Miami University, Oxford, Ohio 45056
- Department of Computer Science and Software Engineering, Miami University, Oxford, Ohio 45056
| |
Collapse
|
9
|
Natale F, Scholl A, Rapp A, Yu W, Rausch C, Cardoso MC. DNA replication and repair kinetics of Alu, LINE-1 and satellite III genomic repetitive elements. Epigenetics Chromatin 2018; 11:61. [PMID: 30352618 PMCID: PMC6198450 DOI: 10.1186/s13072-018-0226-9] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2018] [Accepted: 09/25/2018] [Indexed: 12/04/2022] Open
Abstract
Background Preservation of genome integrity by complete, error-free DNA duplication prior to cell division and by correct DNA damage repair is paramount for the development and maintenance of an organism. This holds true not only for protein-encoding genes, but also it applies to repetitive DNA elements, which make up more than half of the human genome. Here, we focused on the replication and repair kinetics of interspersed and tandem repetitive DNA elements. Results We integrated genomic population level data with a single cell immunofluorescence in situ hybridization approach to simultaneously label replication/repair and repetitive DNA elements. We found that: (1) the euchromatic Alu element was replicated during early S-phase; (2) LINE-1, which is associated with AT-rich genomic regions, was replicated throughout S-phase, with the majority being replicated according to their particular histone marks; (3) satellite III, which constitutes pericentromeric heterochromatin, was replicated exclusively during the mid-to-late S-phase. As for the DNA double-strand break repair process, we observed that Alu elements followed the global genome repair kinetics, while LINE-1 elements repaired at a slower rate. Finally, satellite III repeats were repaired at later time points. Conclusions We conclude that the histone modifications in the specific repeat element predominantly determine its replication and repair timing. Thus, Alu elements, which are characterized by euchromatic chromatin features, are repaired and replicated the earliest, followed by LINE-1 elements, including more variegated eu/heterochromatic features and, lastly, satellite tandem repeats, which are homogeneously characterized by heterochromatic features and extend over megabase-long genomic regions. Altogether, this work reemphasizes the need for complementary approaches to achieve an integrated and comprehensive investigation of genomic processes. Electronic supplementary material The online version of this article (10.1186/s13072-018-0226-9) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Francesco Natale
- Department of Biology, Technische Universität Darmstadt, 64287, Darmstadt, Germany.,Biology Unit, IRBM Science Park S. p. A., 80131, Naples, Italy
| | - Annina Scholl
- Department of Biology, Technische Universität Darmstadt, 64287, Darmstadt, Germany
| | - Alexander Rapp
- Department of Biology, Technische Universität Darmstadt, 64287, Darmstadt, Germany
| | - Wei Yu
- Department of Biology, Technische Universität Darmstadt, 64287, Darmstadt, Germany.,G5 Lymphocyte Development and Oncogenesis, Immunology Department, Pasteur Institute, 75724, Paris Cedex 15, France
| | - Cathia Rausch
- Department of Biology, Technische Universität Darmstadt, 64287, Darmstadt, Germany
| | - M Cristina Cardoso
- Department of Biology, Technische Universität Darmstadt, 64287, Darmstadt, Germany.
| |
Collapse
|
10
|
Raza MA, Yu N, Wang D, Cao L, Gan S, Chen L. Differential DNA methylation and gene expression in reciprocal hybrids between Solanum lycopersicum and S. pimpinellifolium. DNA Res 2018; 24:597-607. [PMID: 28679169 PMCID: PMC5726463 DOI: 10.1093/dnares/dsx028] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2017] [Accepted: 05/27/2017] [Indexed: 12/22/2022] Open
Abstract
Wide hybridization is a common and efficient breeding strategy for enhancing crop yield and quality. An interesting phenomenon is that the reciprocal hybrids usually show different phenotypes, and its underlying mechanism is not well understood. Here, we reported our comparative analysis of the DNA methylation patterns in Solanum lycopersicum, Solanum pimpinellifolium and their reciprocal hybrids by methylated DNA immunoprecipitation sequencing. The reciprocal hybrids had lower levels of DNA methylation in CpG islands and LTR retroelements when compared with those of their parents. Importantly, remarkable differences in DNA methylation patterns, mainly in introns and CDS regions, were revealed between the reciprocal hybrids. These different methylated regions were mapped to 79 genes, 14 of which were selected for analysis of gene expression levels. While there was an inverse correlation between DNA methylation and gene expression in promoter regions, the relationship was complicated in gene body regions. Further association analysis revealed that there were 15 differentially methylated genes associated with siRNAs, and that the methylation levels of these genes were inversely correlated with respective siRNAs. All these data raised the possibility that the direction of hybridization induced the divergent epigenomes leading to changes in the transcription levels of reciprocal hybrids.
Collapse
Affiliation(s)
- Muhammad Ammar Raza
- Department of Horticulture, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou 310058, P. R. China
| | - Ningning Yu
- Department of Horticulture, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou 310058, P. R. China
| | - Dan Wang
- Department of Horticulture, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou 310058, P. R. China
| | - Liwen Cao
- Department of Horticulture, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou 310058, P. R. China
| | - Susheng Gan
- Department of Horticulture, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou 310058, P. R. China.,Plant Biology Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, USA
| | - Liping Chen
- Department of Horticulture, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou 310058, P. R. China
| |
Collapse
|
11
|
Complex Analyses of Short Inverted Repeats in All Sequenced Chloroplast DNAs. BIOMED RESEARCH INTERNATIONAL 2018; 2018:1097018. [PMID: 30140690 PMCID: PMC6081594 DOI: 10.1155/2018/1097018] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/15/2018] [Revised: 06/19/2018] [Accepted: 07/12/2018] [Indexed: 01/14/2023]
Abstract
Chloroplasts are key organelles in the management of oxygen in algae and plants and are therefore crucial for all living beings that consume oxygen. Chloroplasts typically contain a circular DNA molecule with nucleus-independent replication and heredity. Using "palindrome analyser" we performed complete analyses of short inverted repeats (S-IRs) in all chloroplast DNAs (cpDNAs) available from the NCBI genome database. Our results provide basic parameters of cpDNAs including comparative information on localization, frequency, and differences in S-IR presence. In a total of 2,565 cpDNA sequences available, the average frequency of S-IRs in cpDNA genomes is 45 S-IRs/per kbp, significantly higher than that found in mitochondrial DNA sequences. The frequency of S-IRs in cpDNAs generally decreased with S-IR length, but not for S-IRs 15, 22, 24, or 27 bp long, which are significantly more abundant than S-IRs with other lengths. These results point to the importance of specific S-IRs in cpDNA genomes. Moreover, comparison by Levenshtein distance of S-IR similarities showed that a limited number of S-IR sequences are shared in the majority of cpDNAs. S-IRs are not located randomly in cpDNAs, but are length-dependently enriched in specific locations, including the repeat region, stem, introns, and tRNA regions. The highest enrichment was found for 12 bp and longer S-IRs in the stem-loop region followed by 12 bp and longer S-IRs located before the repeat region. On the other hand, S-IRs are relatively rare in rRNA sequences and around introns. These data show nonrandom and conserved arrangements of S-IRs in chloroplast genomes.
Collapse
|
12
|
Lavi B, Levy Karin E, Pupko T, Hazkani-Covo E. The Prevalence and Evolutionary Conservation of Inverted Repeats in Proteobacteria. Genome Biol Evol 2018; 10:918-927. [PMID: 29608719 PMCID: PMC5941160 DOI: 10.1093/gbe/evy044] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/21/2018] [Indexed: 12/11/2022] Open
Abstract
Perfect short inverted repeats (IRs) are known to be enriched in a variety of bacterial and eukaryotic genomes. Currently, it is unclear whether perfect IRs are conserved over evolutionary time scales. In this study, we aimed to characterize the prevalence and evolutionary conservation of IRs across 20 proteobacterial strains. We first identified IRs in Escherichia coli K-12 substr MG1655 and showed that they are overabundant. We next aimed to test whether this overabundance is reflected in the conservation of IRs over evolutionary time scales. To this end, for each perfect IR identified in E. coli MG1655, we collected orthologous sequences from related proteobacterial genomes. We next quantified the evolutionary conservation of these IRs, that is, the presence of the exact same IR across orthologous regions. We observed high conservation of perfect IRs: out of the 234 examined orthologous regions, 145 were more conserved than expected, which is statistically significant even after correcting for multiple testing. Our results together with previous experimental findings support a model in which imperfect IRs are corrected to perfect IRs in a preferential manner via a template switching mechanism.
Collapse
Affiliation(s)
- Bar Lavi
- Department of Cell Research and Immunology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Israel
- Department of Natural and Life Sciences, The Open University of Israel, Ra'anana, Israel
| | - Eli Levy Karin
- Department of Cell Research and Immunology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Israel
- Department of Molecular Biology & Ecology of Plants, George S. Wise Faculty of Life Sciences, Tel Aviv University, Israel
| | - Tal Pupko
- Department of Cell Research and Immunology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Israel
| | - Einat Hazkani-Covo
- Department of Natural and Life Sciences, The Open University of Israel, Ra'anana, Israel
| |
Collapse
|
13
|
Näsvall J. Direct and Inverted Repeat stimulated excision (DIRex): Simple, single-step, and scar-free mutagenesis of bacterial genes. PLoS One 2017; 12:e0184126. [PMID: 28854250 PMCID: PMC5576700 DOI: 10.1371/journal.pone.0184126] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2017] [Accepted: 08/18/2017] [Indexed: 11/29/2022] Open
Abstract
The need for generating precisely designed mutations is common in genetics, biochemistry, and molecular biology. Here, I describe a new λ Red recombineering method (Direct and Inverted Repeat stimulated excision; DIRex) for fast and easy generation of single point mutations, small insertions or replacements as well as deletions of any size, in bacterial genes. The method does not leave any resistance marker or scar sequence and requires only one transformation to generate a semi-stable intermediate insertion mutant. Spontaneous excision of the intermediate efficiently and accurately generates the final mutant. In addition, the intermediate is transferable between strains by generalized transductions, enabling transfer of the mutation into multiple strains without repeating the recombineering step. Existing methods that can be used to accomplish similar results are either (i) more complicated to design, (ii) more limited in what mutation types can be made, or (iii) require expression of extrinsic factors in addition to λ Red. I demonstrate the utility of the method by generating several deletions, small insertions/replacements, and single nucleotide exchanges in Escherichia coli and Salmonella enterica. Furthermore, the design parameters that influence the excision frequency and the success rate of generating desired point mutations have been examined to determine design guidelines for optimal efficiency.
Collapse
Affiliation(s)
- Joakim Näsvall
- Department of Medical Biochemistry and Microbiology, Uppsala University, Uppsala, Sweden
- * E-mail:
| |
Collapse
|
14
|
Putman AI, Tredway LP, Carbone I. Characterization and distribution of mating-type genes of the turfgrass pathogen Sclerotinia homoeocarpa on a global scale. Fungal Genet Biol 2015; 81:25-40. [PMID: 26049125 DOI: 10.1016/j.fgb.2015.05.012] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2014] [Revised: 05/27/2015] [Accepted: 05/28/2015] [Indexed: 01/23/2023]
Abstract
Sclerotinia homoeocarpa F.T. Bennett is a filamentous member of Ascomycota that causes dollar spot, the most economically important disease of turfgrass worldwide. We sequenced and characterized the mating-type (MAT) locus of four recently-collected contemporary strains causing dollar spot, four historical type strains used to describe the fungus, and three species of Rutstroemiaceae. Moreover, we developed a multiplex PCR assay to screen 1019 contemporary isolates for mating-type. The organization of the MAT loci of all strains examined could be classified into one of four categories: (1) putatively heterothallic, as exemplified by all contemporary strains and three of four historical type strains; (2) putatively heterothallic with a deleted putative gene in the MAT1-2 idiomorph, as detected in strains from two recently-collected populations in the United Kingdom that show more similarity to historical strains; (3) putatively homothallic with close physical linkage between MAT1-1-1 and MAT1-2-1, as found in one historical type strain of S. homoeocarpa and two strains of Rutstroemia cuniculi; and (4) an unresolved but apparently homothallic organization in which strains contained both MAT1-1-1 and MAT1-2-1 but linkage between these genes and between the two flanking genes could not be confirmed, as identified in R. paludosa and Poculum henningsianum. In contemporary S. homoeocarpa populations there was no significant difference in the frequency of the two mating types in clone-corrected samples when analyzed on regional and local scales, suggesting sex may be possible in this pathogen. However, two isolates from Italy and twenty from California were heterokaryotic for both complete heterothallic MAT idiomorphs. Results from this study contribute to knowledge about mating systems in filamentous fungi and enhance our understanding of the evolution and biology of an important plant pathogen.
Collapse
Affiliation(s)
- Alexander I Putman
- Department of Plant Pathology, North Carolina State University, Raleigh, NC 27695-7616, United States.
| | - Lane P Tredway
- Department of Plant Pathology, North Carolina State University, Raleigh, NC 27695-7616, United States
| | - Ignazio Carbone
- Center for Integrated Fungal Research, Department of Plant Pathology, North Carolina State University, Raleigh, NC 27695-7244, United States
| |
Collapse
|
15
|
Lu S, Wang G, Bacolla A, Zhao J, Spitser S, Vasquez KM. Short Inverted Repeats Are Hotspots for Genetic Instability: Relevance to Cancer Genomes. Cell Rep 2015; 10:1674-1680. [PMID: 25772355 DOI: 10.1016/j.celrep.2015.02.039] [Citation(s) in RCA: 88] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2014] [Revised: 01/26/2015] [Accepted: 02/16/2015] [Indexed: 12/25/2022] Open
Abstract
Analyses of chromosomal aberrations in human genetic disorders have revealed that inverted repeat sequences (IRs) often co-localize with endogenous chromosomal instability and breakage hotspots. Approximately 80% of all IRs in the human genome are short (<100 bp), yet the mutagenic potential of such short cruciform-forming sequences has not been characterized. Here, we find that short IRs are enriched at translocation breakpoints in human cancer and stimulate the formation of DNA double-strand breaks (DSBs) and deletions in mammalian and yeast cells. We provide evidence for replication-related mechanisms of IR-induced genetic instability and a novel XPF cleavage-based mechanism independent of DNA replication. These discoveries implicate short IRs as endogenous sources of DNA breakage involved in disease etiology and suggest that these repeats represent a feature of genome plasticity that may contribute to the evolution of the human genome by providing a means for diversity within the population.
Collapse
Affiliation(s)
- Steve Lu
- Division of Pharmacology and Toxicology, College of Pharmacy, The University of Texas at Austin - Dell Pediatric Research Institute, 1400 Barbara Jordan Boulevard R1800, Austin, TX 78723, USA
| | - Guliang Wang
- Division of Pharmacology and Toxicology, College of Pharmacy, The University of Texas at Austin - Dell Pediatric Research Institute, 1400 Barbara Jordan Boulevard R1800, Austin, TX 78723, USA
| | - Albino Bacolla
- Division of Pharmacology and Toxicology, College of Pharmacy, The University of Texas at Austin - Dell Pediatric Research Institute, 1400 Barbara Jordan Boulevard R1800, Austin, TX 78723, USA
| | - Junhua Zhao
- Division of Pharmacology and Toxicology, College of Pharmacy, The University of Texas at Austin - Dell Pediatric Research Institute, 1400 Barbara Jordan Boulevard R1800, Austin, TX 78723, USA
| | - Scott Spitser
- Division of Pharmacology and Toxicology, College of Pharmacy, The University of Texas at Austin - Dell Pediatric Research Institute, 1400 Barbara Jordan Boulevard R1800, Austin, TX 78723, USA
| | - Karen M Vasquez
- Division of Pharmacology and Toxicology, College of Pharmacy, The University of Texas at Austin - Dell Pediatric Research Institute, 1400 Barbara Jordan Boulevard R1800, Austin, TX 78723, USA.
| |
Collapse
|
16
|
Aygun N. Correlations between long inverted repeat (LIR) features, deletion size and distance from breakpoint in human gross gene deletions. Sci Rep 2015; 5:8300. [PMID: 25657065 PMCID: PMC4319165 DOI: 10.1038/srep08300] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2014] [Accepted: 01/14/2015] [Indexed: 11/09/2022] Open
Abstract
Long inverted repeats (LIRs) have been shown to induce genomic deletions in yeast. In this study, LIRs were investigated within ±10 kb spanning each breakpoint from 109 human gross deletions, using Inverted Repeat Finder (IRF) software. LIR number was significantly higher at the breakpoint regions, than in control segments (P < 0.001). In addition, it was found that strong correlation between 5' and 3' LIR numbers, suggesting contribution to DNA sequence evolution (r = 0.85, P < 0.001). 138 LIR features at ±3 kb breakpoints in 89 (81%) of 109 gross deletions were evaluated. Significant correlations were found between distance from breakpoint and loop length (r = -0.18, P < 0.05) and stem length (r = -0.18, P < 0.05), suggesting DNA strands are potentially broken in locations closer to bigger LIRs. In addition, bigger loops cause larger deletions (r = 0.19, P < 0.05). Moreover, loop length (r = 0.29, P < 0.02) and identity between stem copies (r = 0.30, P < 0.05) of 3' LIRs were more important in larger deletions. Consequently, DNA breaks may form via LIR-induced cruciform structure during replication. DNA ends may be later repaired by non-homologous end-joining (NHEJ), with following deletion.
Collapse
Affiliation(s)
- Nevim Aygun
- Department of Medical Biology, Faculty of Medicine, Dokuz Eylul University, Inciralti, Izmir, Turkey
| |
Collapse
|
17
|
Aguileta G, de Vienne DM, Ross ON, Hood ME, Giraud T, Petit E, Gabaldón T. High variability of mitochondrial gene order among fungi. Genome Biol Evol 2015; 6:451-65. [PMID: 24504088 PMCID: PMC3942027 DOI: 10.1093/gbe/evu028] [Citation(s) in RCA: 162] [Impact Index Per Article: 16.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
From their origin as an early alpha proteobacterial endosymbiont to their current state as cellular organelles, large-scale genomic reorganization has taken place in the mitochondria of all main eukaryotic lineages. So far, most studies have focused on plant and animal mitochondrial (mt) genomes (mtDNA), but fungi provide new opportunities to study highly differentiated mtDNAs. Here, we analyzed 38 complete fungal mt genomes to investigate the evolution of mtDNA gene order among fungi. In particular, we looked for evidence of nonhomologous intrachromosomal recombination and investigated the dynamics of gene rearrangements. We investigated the effect that introns, intronic open reading frames (ORFs), and repeats may have on gene order. Additionally, we asked whether the distribution of transfer RNAs (tRNAs) evolves independently to that of mt protein-coding genes. We found that fungal mt genomes display remarkable variation between and within the major fungal phyla in terms of gene order, genome size, composition of intergenic regions, and presence of repeats, introns, and associated ORFs. Our results support previous evidence for the presence of mt recombination in all fungal phyla, a process conspicuously lacking in most Metazoa. Overall, the patterns of rearrangements may be explained by the combined influences of recombination (i.e., most likely nonhomologous and intrachromosomal), accumulated repeats, especially at intergenic regions, and to a lesser extent, mobile element dynamics.
Collapse
Affiliation(s)
- Gabriela Aguileta
- Bioinformatics and Genomics Programme, Centre for Genomic Regulation (CRG), Barcelona, Spain
| | | | | | | | | | | | | |
Collapse
|
18
|
Adeno-associated virus inverted terminal repeats stimulate gene editing. Gene Ther 2014; 22:190-5. [PMID: 25503695 DOI: 10.1038/gt.2014.109] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2014] [Revised: 10/09/2014] [Accepted: 11/03/2014] [Indexed: 12/13/2022]
Abstract
Advancements in genome editing have relied on technologies to specifically damage DNA which, in turn, stimulates DNA repair including homologous recombination (HR). As off-target concerns complicate the therapeutic translation of site-specific DNA endonucleases, an alternative strategy to stimulate gene editing based on fragile DNA was investigated. To do this, an episomal gene-editing reporter was generated by a disruptive insertion of the adeno-associated virus (AAV) inverted terminal repeat (ITR) into the egfp gene. Compared with a non-structured DNA control sequence, the ITR induced DNA damage as evidenced by increased gamma-H2AX and Mre11 foci formation. As local DNA damage stimulates HR, ITR-mediated gene editing was investigated using DNA oligonucleotides as repair substrates. The AAV ITR stimulated gene editing >1000-fold in a replication-independent manner and was not biased by the polarity of the repair oligonucleotide. Analysis of additional human DNA sequences demonstrated stimulation of gene editing to varying degrees. In particular, inverted yet not direct, Alu repeats induced gene editing, suggesting a role for DNA structure in the repair event. Collectively, the results demonstrate that inverted DNA repeats stimulate gene editing via double-strand break repair in an episomal context and allude to efficient gene editing of the human chromosome using fragile DNA sequences.
Collapse
|
19
|
Ye C, Ji G, Li L, Liang C. detectIR: a novel program for detecting perfect and imperfect inverted repeats using complex numbers and vector calculation. PLoS One 2014; 9:e113349. [PMID: 25409465 PMCID: PMC4237412 DOI: 10.1371/journal.pone.0113349] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2014] [Accepted: 10/22/2014] [Indexed: 11/19/2022] Open
Abstract
Inverted repeats are present in abundance in both prokaryotic and eukaryotic genomes and can form DNA secondary structures--hairpins and cruciforms that are involved in many important biological processes. Bioinformatics tools for efficient and accurate detection of inverted repeats are desirable, because existing tools are often less accurate and time consuming, sometimes incapable of dealing with genome-scale input data. Here, we present a MATLAB-based program called detectIR for the perfect and imperfect inverted repeat detection that utilizes complex numbers and vector calculation and allows genome-scale data inputs. A novel algorithm is adopted in detectIR to convert the conventional sequence string comparison in inverted repeat detection into vector calculation of complex numbers, allowing non-complementary pairs (mismatches) in the pairing stem and a non-palindromic spacer (loop or gaps) in the middle of inverted repeats. Compared with existing popular tools, our program performs with significantly higher accuracy and efficiency. Using genome sequence data from HIV-1, Arabidopsis thaliana, Homo sapiens and Zea mays for comparison, detectIR can find lots of inverted repeats missed by existing tools whose outputs often contain many invalid cases. detectIR is open source and its source code is freely available at: https://sourceforge.net/projects/detectir.
Collapse
Affiliation(s)
- Congting Ye
- Department of Automation, Xiamen University, Xiamen, Fujian 361005, China; Department of Biology, Miami University, Oxford, Ohio 45056, United States of America
| | - Guoli Ji
- Department of Automation, Xiamen University, Xiamen, Fujian 361005, China; Innovation Center for Cell Biology, Xiamen University, Xiamen, Fujian 361005, China
| | - Lei Li
- Department of Automation, Xiamen University, Xiamen, Fujian 361005, China; Department of Biology, Miami University, Oxford, Ohio 45056, United States of America
| | - Chun Liang
- Department of Biology, Miami University, Oxford, Ohio 45056, United States of America; State Key Laboratory for Biology of Plant Diseases and Insect Pests, Institute of Plant Protection, Chinese Academy of Agricultural Sciences, Beijing 100193, China
| |
Collapse
|
20
|
Palindromic GOLGA8 core duplicons promote chromosome 15q13.3 microdeletion and evolutionary instability. Nat Genet 2014; 46:1293-302. [PMID: 25326701 PMCID: PMC4244265 DOI: 10.1038/ng.3120] [Citation(s) in RCA: 72] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2014] [Accepted: 09/25/2014] [Indexed: 12/14/2022]
Abstract
Recurrent deletions of chromosome 15q13.3 associate with intellectual disability, schizophrenia, autism and epilepsy. To gain insight into its instability, we sequenced the region in patients, normal individuals and nonhuman primates. We discovered five structural configurations of the human chromosome 15q13.3 region ranging in size from 2 to 3 Mbp. These configurations arose recently (~0.5–0.9 million years ago) as a result of human-specific expansions of segmental duplications and two independent inversion events. All inversion breakpoints map near GOLGA8 core duplicons—a ~14 kbp primate-specific chromosome 15 repeat that became organized into larger palindromic structures. GOLGA8-flanked palindromes also demarcate the breakpoints of recurrent 15q13.3 microdeletions, the expansion of chromosome 15 segmental duplications in the human lineage, and independent structural changes in apes. The significant clustering (p=0.002) of breakpoints provides mechanistic evidence for the role of this core duplicon and its palindromic architecture in promoting evolutionary and disease-related instability of chromosome 15.
Collapse
|
21
|
Mehrotra S, Goyal V. Repetitive sequences in plant nuclear DNA: types, distribution, evolution and function. GENOMICS, PROTEOMICS & BIOINFORMATICS 2014; 12:164-71. [PMID: 25132181 PMCID: PMC4411372 DOI: 10.1016/j.gpb.2014.07.003] [Citation(s) in RCA: 156] [Impact Index Per Article: 14.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/12/2014] [Revised: 06/29/2014] [Accepted: 07/03/2014] [Indexed: 12/27/2022]
Abstract
Repetitive DNA sequences are a major component of eukaryotic genomes and may account for up to 90% of the genome size. They can be divided into minisatellite, microsatellite and satellite sequences. Satellite DNA sequences are considered to be a fast-evolving component of eukaryotic genomes, comprising tandemly-arrayed, highly-repetitive and highly-conserved monomer sequences. The monomer unit of satellite DNA is 150-400 base pairs (bp) in length. Repetitive sequences may be species- or genus-specific, and may be centromeric or subtelomeric in nature. They exhibit cohesive and concerted evolution caused by molecular drive, leading to high sequence homogeneity. Repetitive sequences accumulate variations in sequence and copy number during evolution, hence they are important tools for taxonomic and phylogenetic studies, and are known as "tuning knobs" in the evolution. Therefore, knowledge of repetitive sequences assists our understanding of the organization, evolution and behavior of eukaryotic genomes. Repetitive sequences have cytoplasmic, cellular and developmental effects and play a role in chromosomal recombination. In the post-genomics era, with the introduction of next-generation sequencing technology, it is possible to evaluate complex genomes for analyzing repetitive sequences and deciphering the yet unknown functional potential of repetitive sequences.
Collapse
Affiliation(s)
- Shweta Mehrotra
- Department of Botany, University of Delhi, Delhi 110007, India.
| | - Vinod Goyal
- Department of Botany, University of Delhi, Delhi 110007, India
| |
Collapse
|
22
|
Lee JR, Hong CP, Moon JW, Jung YD, Kim DS, Kim TH, Gim JA, Bae JH, Choi Y, Eo J, Kwon YJ, Song S, Ko J, Yang YM, Lee HK, Park KD, Ahn K, Do KT, Ha HS, Han K, Yi JM, Cha HJ, Cho BW, Bhak J, Kim HS. Genome-wide analysis of DNA methylation patterns in horse. BMC Genomics 2014; 15:598. [PMID: 25027854 PMCID: PMC4117963 DOI: 10.1186/1471-2164-15-598] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2013] [Accepted: 07/07/2014] [Indexed: 12/23/2022] Open
Abstract
Background DNA methylation is an epigenetic regulatory mechanism that plays an essential role in mediating biological processes and determining phenotypic plasticity in organisms. Although the horse reference genome and whole transcriptome data are publically available the global DNA methylation data are yet to be known. Results We report the first genome-wide DNA methylation characteristics data from skeletal muscle, heart, lung, and cerebrum tissues of thoroughbred (TH) and Jeju (JH) horses, an indigenous Korea breed, respectively by methyl-DNA immunoprecipitation sequencing. The analysis of the DNA methylation patterns indicated that the average methylation density was the lowest in the promoter region, while the density in the coding DNA sequence region was the highest. Among repeat elements, a relatively high density of methylation was observed in long interspersed nuclear elements compared to short interspersed nuclear elements or long terminal repeat elements. We also successfully identified differential methylated regions through a comparative analysis of corresponding tissues from TH and JH, indicating that the gene body regions showed a high methylation density. Conclusions We provide report the first DNA methylation landscape and differentially methylated genomic regions (DMRs) of thoroughbred and Jeju horses, providing comprehensive DMRs maps of the DNA methylome. These data are invaluable resource to better understanding of epigenetics in the horse providing information for the further biological function analyses. Electronic supplementary material The online version of this article (doi:10.1186/1471-2164-15-598) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | - Jong Bhak
- Department of Biological Sciences, College of Natural Sciences, Pusan National University, Busan 609-735, Republic of Korea.
| | | |
Collapse
|
23
|
Kato T, Franconi CP, Sheridan MB, Hacker AM, Inagakai H, Glover TW, Arlt MF, Drabkin HA, Gemmill RM, Kurahashi H, Emanuel BS. Analysis of the t(3;8) of hereditary renal cell carcinoma: a palindrome-mediated translocation. Cancer Genet 2014; 207:133-40. [PMID: 24813807 DOI: 10.1016/j.cancergen.2014.03.004] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2013] [Revised: 02/07/2014] [Accepted: 03/10/2014] [Indexed: 12/01/2022]
Abstract
It has emerged that palindrome-mediated genomic instability generates DNA-based rearrangements. The presence of palindromic AT-rich repeats (PATRRs) at the translocation breakpoints suggested a palindrome-mediated mechanism in the generation of several recurrent constitutional rearrangements: the t(11;22), t(17;22), and t(8;22). To date, all reported PATRR-mediated translocations include the PATRR on chromosome 22 (PATRR22) as a translocation partner. Here, the constitutional rearrangement, t(3;8)(p14.2;q24.1), segregating with renal cell carcinoma in two families, is examined. The chromosome 8 breakpoint lies in PATRR8 in the first intron of the RNF139 (TRC8) gene, whereas the chromosome 3 breakpoint is located in an AT-rich palindromic sequence in intron 3 of the FHIT gene (PATRR3). Thus, the t(3;8) is the first PATRR-mediated, recurrent, constitutional translocation that does not involve PATRR22. Furthermore, we detect de novo translocations similar to the t(11;22) and t(8;22), involving PATRR3 in normal sperm. The breakpoint on chromosome 3 is in proximity to FRA3B, the most common fragile site in the human genome and a site of frequent deletions in tumor cells. However, the lack of involvement of PATRR3 sequence in numerous FRA3B-related deletions suggests that there are several different DNA sequence-based etiologies responsible for chromosome 3p14.2 genomic rearrangements.
Collapse
Affiliation(s)
- Takema Kato
- Division of Human Genetics, The Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Colleen P Franconi
- Division of Human Genetics, The Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Molly B Sheridan
- Division of Human Genetics, The Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - April M Hacker
- Division of Human Genetics, The Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Hidehito Inagakai
- Division of Molecular Genetics, Institute for Comprehensive Medical Science, Fujita Health University, Aichi, Japan
| | - Thomas W Glover
- Department of Human Genetics, University of Michigan, Ann Arbor, MI, USA
| | - Martin F Arlt
- Department of Human Genetics, University of Michigan, Ann Arbor, MI, USA
| | - Harry A Drabkin
- Division of Hematology-Oncology, Medical University of South Carolina, Charleston, SC, USA
| | - Robert M Gemmill
- Division of Hematology-Oncology, Medical University of South Carolina, Charleston, SC, USA
| | - Hiroki Kurahashi
- Division of Molecular Genetics, Institute for Comprehensive Medical Science, Fujita Health University, Aichi, Japan
| | - Beverly S Emanuel
- Division of Human Genetics, The Children's Hospital of Philadelphia, Philadelphia, PA, USA; Department of Pediatrics, The Perelman School of Medicine at the University of Pennsylvania, Philadelphia, PA, USA.
| |
Collapse
|
24
|
Zhang Y, Saini N, Sheng Z, Lobachev KS. Genome-wide screen reveals replication pathway for quasi-palindrome fragility dependent on homologous recombination. PLoS Genet 2013; 9:e1003979. [PMID: 24339793 PMCID: PMC3855049 DOI: 10.1371/journal.pgen.1003979] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2013] [Accepted: 10/12/2013] [Indexed: 02/07/2023] Open
Abstract
Inverted repeats capable of forming hairpin and cruciform structures present a threat to chromosomal integrity. They induce double strand breaks, which lead to gross chromosomal rearrangements, the hallmarks of cancers and hereditary diseases. Secondary structure formation at this motif has been proposed to be the driving force for the instability, albeit the mechanisms leading to the fragility are not well-understood. We carried out a genome-wide screen to uncover the genetic players that govern fragility of homologous and homeologous Alu quasi-palindromes in the yeast Saccharomyces cerevisiae. We found that depletion or lack of components of the DNA replication machinery, proteins involved in Fe-S cluster biogenesis, the replication-pausing checkpoint pathway, the telomere maintenance complex or the Sgs1-Top3-Rmi1 dissolvasome augment fragility at Alu-IRs. Rad51, a component of the homologous recombination pathway, was found to be required for replication arrest and breakage at the repeats specifically in replication-deficient strains. These data demonstrate that Rad51 is required for the formation of breakage-prone secondary structures in situations when replication is compromised while another mechanism operates in DSB formation in replication-proficient strains. Inverted repeats are found in many eukaryotic genomes including humans. They have a potential to cause chromosomal breakage and rearrangements that contribute to genome polymorphism and the development of diseases. Instability of inverted repeats is accounted for by their propensity to adopt DNA secondary structures that is negatively affected by the distance between the repeats and level of sequence divergence. However, the genetic factors that promote the abnormal structure formation or affect the ability of the repeats to break are largely unknown. Here, using a genome-wide screen we identified 38 mutants that destabilize imperfect human inverted Alu repeats and predispose them to breakage. The proteins that are required to maintain repeat stability belong to the core of the DNA replication machinery and to the accessory proteins that help replication fork to move through the difficult templates. Remarkably, when replication machinery is compromised, the proteins involved in homologous recombination promote the formation of secondary structures and replication block thereby triggering breakage at the inverted repeats. These results reveal a powerful pathway for the destabilization of chromosomes containing inverted repeats that requires the activity of homologous recombination.
Collapse
Affiliation(s)
- Yu Zhang
- School of Biology and Institute for Bioengineering and Bioscience, Georgia Institute of Technology, Atlanta, Georgia, United States of America
| | - Natalie Saini
- School of Biology and Institute for Bioengineering and Bioscience, Georgia Institute of Technology, Atlanta, Georgia, United States of America
| | - Ziwei Sheng
- School of Biology and Institute for Bioengineering and Bioscience, Georgia Institute of Technology, Atlanta, Georgia, United States of America
| | - Kirill S. Lobachev
- School of Biology and Institute for Bioengineering and Bioscience, Georgia Institute of Technology, Atlanta, Georgia, United States of America
- * E-mail:
| |
Collapse
|
25
|
Senda M, Nishimura S, Kasai A, Yumoto S, Takada Y, Tanaka Y, Ohnishi S, Kuroda T. Comparative analysis of the inverted repeat of a chalcone synthase pseudogene between yellow soybean and seed coat pigmented mutants. BREEDING SCIENCE 2013; 63:384-92. [PMID: 24399910 PMCID: PMC3859349 DOI: 10.1270/jsbbs.63.384] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/31/2013] [Accepted: 08/17/2013] [Indexed: 05/13/2023]
Abstract
In soybean, the I gene inhibits pigmentation over the entire seed coat, resulting in yellow seeds. It is thought that this suppression of seed coat pigmentation is due to naturally occurring RNA silencing of chalcone synthase genes (CHS silencing). Fully pigmented seeds can be found among harvested yellow seeds at a very low percentage. These seed coat pigmented (scp) mutants are generated from yellow soybeans by spontaneous recessive mutation of the I gene. A candidate for the I gene, GmIRCHS, contains a perfect inverted repeat (IR) of a CHS pseudogene (pseudoCHS3) and transcripts of GmIRCHS form a double-stranded CHS RNA that potentially triggers CHS silencing. One CHS gene, ICHS1, is located 680 bp downstream of GmIRCHS. Here, the GmIRCHS-ICHS1 cluster was compared in scp mutants of various origins. In these mutants, sequence divergence in the cluster resulted in complete or partial loss of GmIRCHS in at least the pseudoCHS3 region. This result is consistent with the notion that the IR of pseudoCHS3 is sufficient to induce CHS silencing, and further supports that GmIRCHS is the I gene.
Collapse
Affiliation(s)
- Mineo Senda
- Faculty of Agriculture and Life Sciences, Hirosaki University,
3 Bunkyo-cho, Hirosaki, Aomori 036-8561,
Japan
- Corresponding author (e-mail: )
| | - Satsuki Nishimura
- Faculty of Agriculture and Life Sciences, Hirosaki University,
3 Bunkyo-cho, Hirosaki, Aomori 036-8561,
Japan
| | - Atsushi Kasai
- Faculty of Agriculture and Life Sciences, Hirosaki University,
3 Bunkyo-cho, Hirosaki, Aomori 036-8561,
Japan
| | - Setsuzo Yumoto
- Research Support Center, National Agricultural Research Center for Tohoku Region,
Yotsuya, Daisen, Akita 014-0102,
Japan
| | - Yoshitake Takada
- National Agricultural Research Organization (NARO) Western Region Agricultural Research Center,
1-3-1 Senyu, Zentsuji, Kagawa 765-8508,
Japan
| | - Yoshinori Tanaka
- Hokkaido Research Organization Tokachi Agricultural Experiment Station,
S9-2 Shinsei, Memuro, Kasai, Hokkaido 082-0081,
Japan
| | - Shizen Ohnishi
- Hokkaido Research Organization Kitami Agricultural Experiment Station,
52 Yayoi, Kunneppu, Tokoro, Hokkaido 099-1406,
Japan
| | - Tomohisa Kuroda
- Niigata Agricultural Research Institute,
857 Nagakura-machi, Nagaoka, Niigata 940-0826,
Japan
| |
Collapse
|
26
|
Lim C, Luhe AL, JingYing CT, Balagurunathan B, Wu J, Zhao H. Size of gene specific inverted repeat--dependent gene deletion In Saccharomyces cerevisiae. PLoS One 2013; 8:e72137. [PMID: 23977230 PMCID: PMC3748122 DOI: 10.1371/journal.pone.0072137] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2013] [Accepted: 07/08/2013] [Indexed: 01/15/2023] Open
Abstract
We describe here an approach for rapidly producing scar-free and precise gene deletions in S. cerevisiae with high efficiency. Preparation of the disruption gene cassette in this approach was simply performed by overlap extension-PCR of an invert repeat of a partial or complete sequence of the targeted gene with URA3. Integration of the prepared disruption gene cassette to the designated position of a target gene leads to the formation of a mutagenesis cassette within the yeast genome, which consists of a URA3 gene flanked by the targeted gene and its inverted repeat between two short identical direct repeats. The inherent instability of the inverted sequences in close proximity facilitates the self-excision of the entire mutagenesis cassette deposited in the genome and promotes homologous recombination resulting in a seamless deletion via a single transformation. This rapid assembly circumvents the difficulty during preparation of disruption gene cassettes composed of two inverted repeats of the URA3, which requires the engineering of unique restriction sites for subsequent digestion and T4 DNA ligation in vitro. We further identified that the excision of the entire mutagenesis cassette flanked by two DRs in the transformed S. cerevisiae is dependent on the length of the inverted repeat of which a minimum of 800 bp is required for effective gene deletion. The deletion efficiency improves with the increase of the inverted repeat till 1.2 kb. Finally, the use of gene-specific inverted repeats of target genes enables simultaneous gene deletions. The procedure has the potential for application on other yeast strains to achieve precise and efficient removal of gene sequences.
Collapse
Affiliation(s)
- Chanyuen Lim
- Industrial Biotechnology Division, Institute of Chemical and Engineering Sciences, Agency for Science, Technology and Research (A*STAR), Jurong Island, Singapore
| | - Annette Lin Luhe
- Industrial Biotechnology Division, Institute of Chemical and Engineering Sciences, Agency for Science, Technology and Research (A*STAR), Jurong Island, Singapore
| | - Crystal Tear JingYing
- Industrial Biotechnology Division, Institute of Chemical and Engineering Sciences, Agency for Science, Technology and Research (A*STAR), Jurong Island, Singapore
| | - Balaji Balagurunathan
- Process Sciences and Modeling Division, Institute of Chemical and Engineering Sciences, Agency for Science, Technology and Research (A*STAR), Jurong Island, Singapore
| | - Jinchuan Wu
- Industrial Biotechnology Division, Institute of Chemical and Engineering Sciences, Agency for Science, Technology and Research (A*STAR), Jurong Island, Singapore
| | - Hua Zhao
- Industrial Biotechnology Division, Institute of Chemical and Engineering Sciences, Agency for Science, Technology and Research (A*STAR), Jurong Island, Singapore
- * E-mail:
| |
Collapse
|
27
|
St. Charles J, Petes TD. High-resolution mapping of spontaneous mitotic recombination hotspots on the 1.1 Mb arm of yeast chromosome IV. PLoS Genet 2013; 9:e1003434. [PMID: 23593029 PMCID: PMC3616911 DOI: 10.1371/journal.pgen.1003434] [Citation(s) in RCA: 69] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2012] [Accepted: 02/20/2013] [Indexed: 11/18/2022] Open
Abstract
Although homologous recombination is an important pathway for the repair of double-stranded DNA breaks in mitotically dividing eukaryotic cells, these events can also have negative consequences, such as loss of heterozygosity (LOH) of deleterious mutations. We mapped about 140 spontaneous reciprocal crossovers on the right arm of the yeast chromosome IV using single-nucleotide-polymorphism (SNP) microarrays. Our mapping and subsequent experiments demonstrate that inverted repeats of Ty retrotransposable elements are mitotic recombination hotspots. We found that the mitotic recombination maps on the two homologs were substantially different and were unrelated to meiotic recombination maps. Additionally, about 70% of the DNA lesions that result in LOH are likely generated during G1 of the cell cycle and repaired during S or G2. We also show that different genetic elements are associated with reciprocal crossover conversion tracts depending on the cell cycle timing of the initiating DSB. Double-strand breaks (DSBs) are DNA lesions that can be fatal to a cell if left unrepaired. They can be caused by exogenous sources, such as gamma radiation, or endogenous stresses, such as high levels of transcription. Yeast cells primarily repair DSBs that are initiated outside of meiosis by mitotic recombination, which can result in physical exchanges between chromosomes, known as crossovers. We created a mitotic recombination map of one chromosome arm, representing 10% of the genome. This recombination map allows us to determine which regions of the chromosome arm are more susceptible to DNA damage than other regions. We were able to determine that most DSBs that result in detectable genomic changes were initiated prior to DNA replication and that some secondary DNA structures can be recombination hotspots. Recombination can also occur during meiosis, as a method of ensuring proper chromosome segregation. However, previously reported meiotic recombination maps have no correlation with our mitotic recombination map.
Collapse
Affiliation(s)
- Jordan St. Charles
- Department of Molecular Genetics and Microbiology, Duke University Medical Center, Durham, North Carolina, United States of America
- Department of Pharmacology and Cancer Biology, Duke University Medical Center, Durham, North Carolina, United States of America
| | - Thomas D. Petes
- Department of Molecular Genetics and Microbiology, Duke University Medical Center, Durham, North Carolina, United States of America
- * E-mail:
| |
Collapse
|
28
|
Chen X, Shen Y, Zhang F, Chiang C, Pillalamarri V, Blumenthal I, Talkowski M, Wu BL, Gusella J. Molecular analysis of a deletion hotspot in the NRXN1 region reveals the involvement of short inverted repeats in deletion CNVs. Am J Hum Genet 2013; 92:375-86. [PMID: 23472757 PMCID: PMC3591860 DOI: 10.1016/j.ajhg.2013.02.006] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2012] [Revised: 12/04/2012] [Accepted: 02/12/2013] [Indexed: 01/07/2023] Open
Abstract
NRXN1 microdeletions occur at a relatively high frequency and confer increased risk for neurodevelopmental and neurobehavioral abnormalities. The mechanism that makes NRXN1 a deletion hotspot is unknown. Here, we identified deletions of the NRXN1 region in affected cohorts, confirming a strong association with the autism spectrum and other neurodevelopmental disorders. Interestingly, deletions in both affected and control individuals were clustered in the 5' portion of NRXN1 and its immediate upstream region. To explore the mechanism of deletion, we mapped and analyzed the breakpoints of 32 deletions. At the deletion breakpoints, frequent microhomology (68.8%, 2-19 bp) suggested predominant mechanisms of DNA replication error and/or microhomology-mediated end-joining. Long terminal repeat (LTR) elements, unique non-B-DNA structures, and MEME-defined sequence motifs were significantly enriched, but Alu and LINE sequences were not. Importantly, small-size inverted repeats (minus self chains, minus sequence motifs, and partial complementary sequences) were significantly overrepresented in the vicinity of NRXN1 region deletion breakpoints, suggesting that, although they are not interrupted by the deletion process, such inverted repeats can predispose a region to genomic instability by mediating single-strand DNA looping via the annealing of partially reverse complementary strands and the promoting of DNA replication fork stalling and DNA replication error. Our observations highlight the potential importance of inverted repeats of variable sizes in generating a rearrangement hotspot in which individual breakpoints are not recurrent. Mechanisms that involve short inverted repeats in initiating deletion may also apply to other deletion hotspots in the human genome.
Collapse
Affiliation(s)
- Xiaoli Chen
- Capital Institute of Pediatrics, Beijing 100020, China
- Center for Human Genetic Research, Massachusetts General Hospital, Boston, MA 02114, USA
- Department of Laboratory Medicine, Children’s Hospital Boston, Boston, MA 02115, USA
| | - Yiping Shen
- Center for Human Genetic Research, Massachusetts General Hospital, Boston, MA 02114, USA
- Department of Laboratory Medicine, Children’s Hospital Boston, Boston, MA 02115, USA
- Shanghai Children’s Medical Center, Shanghai Jiaotong University School of Medicine, Shanghai 200127, China
- Department of Pathology, Harvard Medical School, Boston, MA 02115, USA
| | - Feng Zhang
- State Key Laboratory of Genetic Engineering and Ministry of Education Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai 200433, China
| | - Colby Chiang
- Center for Human Genetic Research, Massachusetts General Hospital, Boston, MA 02114, USA
| | - Vamsee Pillalamarri
- Center for Human Genetic Research, Massachusetts General Hospital, Boston, MA 02114, USA
| | - Ian Blumenthal
- Center for Human Genetic Research, Massachusetts General Hospital, Boston, MA 02114, USA
| | - Michael Talkowski
- Center for Human Genetic Research, Massachusetts General Hospital, Boston, MA 02114, USA
- Department of Neurology, Harvard Medical School, Boston, MA 02115, USA
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Bai-Lin Wu
- Department of Laboratory Medicine, Children’s Hospital Boston, Boston, MA 02115, USA
- Department of Pathology, Harvard Medical School, Boston, MA 02115, USA
- Children’s Hospital and Institutes of Biomedical Science, Fudan University, Shanghai 200032, China
| | - James F. Gusella
- Center for Human Genetic Research, Massachusetts General Hospital, Boston, MA 02114, USA
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Department of Genetics, Harvard Medical School, Boston, MA 02115, USA
| |
Collapse
|
29
|
Vasquez KM, Wang G. The yin and yang of repair mechanisms in DNA structure-induced genetic instability. Mutat Res 2013; 743-744:118-131. [PMID: 23219604 PMCID: PMC3661696 DOI: 10.1016/j.mrfmmm.2012.11.005] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2012] [Revised: 11/21/2012] [Accepted: 11/24/2012] [Indexed: 01/14/2023]
Abstract
DNA can adopt a variety of secondary structures that deviate from the canonical Watson-Crick B-DNA form. More than 10 types of non-canonical or non-B DNA secondary structures have been characterized, and the sequences that have the capacity to adopt such structures are very abundant in the human genome. Non-B DNA structures have been implicated in many important biological processes and can serve as sources of genetic instability, implicating them in disease and evolution. Non-B DNA conformations interact with a wide variety of proteins involved in replication, transcription, DNA repair, and chromatin architectural regulation. In this review, we will focus on the interactions of DNA repair proteins with non-B DNA and their roles in genetic instability, as the proteins and DNA involved in such interactions may represent plausible targets for selective therapeutic intervention.
Collapse
Affiliation(s)
- Karen M Vasquez
- Division of Pharmacology and Toxicology, College of Pharmacy, The University of Texas at Austin, Dell Pediatric Research Institute, 1400 Barbara Jordan Blvd. R1800, Austin, TX 78723, United States.
| | - Guliang Wang
- Division of Pharmacology and Toxicology, College of Pharmacy, The University of Texas at Austin, Dell Pediatric Research Institute, 1400 Barbara Jordan Blvd. R1800, Austin, TX 78723, United States
| |
Collapse
|
30
|
Wojcik EA, Brzostek A, Bacolla A, Mackiewicz P, Vasquez KM, Korycka-Machala M, Jaworski A, Dziadek J. Direct and inverted repeats elicit genetic instability by both exploiting and eluding DNA double-strand break repair systems in mycobacteria. PLoS One 2012; 7:e51064. [PMID: 23251422 PMCID: PMC3519483 DOI: 10.1371/journal.pone.0051064] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2012] [Accepted: 10/29/2012] [Indexed: 12/02/2022] Open
Abstract
Repetitive DNA sequences with the potential to form alternative DNA conformations, such as slipped structures and cruciforms, can induce genetic instability by promoting replication errors and by serving as a substrate for DNA repair proteins, which may lead to DNA double-strand breaks (DSBs). However, the contribution of each of the DSB repair pathways, homologous recombination (HR), non-homologous end-joining (NHEJ) and single-strand annealing (SSA), to this sort of genetic instability is not fully understood. Herein, we assessed the genome-wide distribution of repetitive DNA sequences in the Mycobacterium smegmatis, Mycobacterium tuberculosis and Escherichia coli genomes, and determined the types and frequencies of genetic instability induced by direct and inverted repeats, both in the presence and in the absence of HR, NHEJ, and SSA. All three genomes are strongly enriched in direct repeats and modestly enriched in inverted repeats. When using chromosomally integrated constructs in M. smegmatis, direct repeats induced the perfect deletion of their intervening sequences ~1,000-fold above background. Absence of HR further enhanced these perfect deletions, whereas absence of NHEJ or SSA had no influence, suggesting compromised replication fidelity. In contrast, inverted repeats induced perfect deletions only in the absence of SSA. Both direct and inverted repeats stimulated excision of the constructs from the attB integration sites independently of HR, NHEJ, or SSA. With episomal constructs, direct and inverted repeats triggered DNA instability by activating nucleolytic activity, and absence of the DSB repair pathways (in the order NHEJ>HR>SSA) exacerbated this instability. Thus, direct and inverted repeats may elicit genetic instability in mycobacteria by 1) directly interfering with replication fidelity, 2) stimulating the three main DSB repair pathways, and 3) enticing L5 site-specific recombination.
Collapse
Affiliation(s)
- Ewelina A. Wojcik
- Institute of Medical Biology, Polish Academy of Sciences, Lodz, Poland
- Department of Genetics of Microorganisms, Institute of Microbiology and Immunology, University of Lodz, Lodz, Poland
| | - Anna Brzostek
- Institute of Medical Biology, Polish Academy of Sciences, Lodz, Poland
| | - Albino Bacolla
- The University of Texas at Austin, Division of Pharmacology and Toxicology, Dell Pediatric Research Institute, Austin, Texas, United States of America
| | - Pawel Mackiewicz
- Department of Genomics, Faculty of Biotechnology, University of Wroclaw, Wroclaw, Poland
| | - Karen M. Vasquez
- The University of Texas at Austin, Division of Pharmacology and Toxicology, Dell Pediatric Research Institute, Austin, Texas, United States of America
| | | | - Adam Jaworski
- Department of Genetics of Microorganisms, Institute of Microbiology and Immunology, University of Lodz, Lodz, Poland
| | - Jaroslaw Dziadek
- Institute of Medical Biology, Polish Academy of Sciences, Lodz, Poland
| |
Collapse
|
31
|
Tschudi C, Shi H, Franklin JB, Ullu E. Small interfering RNA-producing loci in the ancient parasitic eukaryote Trypanosoma brucei. BMC Genomics 2012; 13:427. [PMID: 22925482 PMCID: PMC3447711 DOI: 10.1186/1471-2164-13-427] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2012] [Accepted: 08/24/2012] [Indexed: 01/24/2023] Open
Abstract
Background At the core of the RNA interference (RNAi) pathway in Trypanosoma brucei is a single Argonaute protein, TbAGO1, with an established role in controlling retroposon and repeat transcripts. Recent evidence from higher eukaryotes suggests that a variety of genomic sequences with the potential to produce double-stranded RNA are sources for small interfering RNAs (siRNAs). Results To test whether such endogenous siRNAs are present in T. brucei and to probe the individual role of the two Dicer-like enzymes, we affinity purified TbAGO1 from wild-type procyclic trypanosomes, as well as from cells deficient in the cytoplasmic (TbDCL1) or nuclear (TbDCL2) Dicer, and subjected the bound RNAs to Illumina high-throughput sequencing. In wild-type cells the majority of reads originated from two classes of retroposons. We also considerably expanded the repertoire of trypanosome siRNAs to encompass a family of 147-bp satellite-like repeats, many of the regions where RNA polymerase II transcription converges, large inverted repeats and two pseudogenes. Production of these newly described siRNAs is strictly dependent on the nuclear DCL2. Notably, our data indicate that putative centromeric regions, excluding the CIR147 repeats, are not a significant source for endogenous siRNAs. Conclusions Our data suggest that endogenous RNAi targets may be as evolutionarily old as the mechanism itself.
Collapse
Affiliation(s)
- Christian Tschudi
- Department of Internal Medicine, School of Medicine, Yale University, New Haven, CT 06536, USA
| | | | | | | |
Collapse
|
32
|
Kato T, Kurahashi H, Emanuel BS. Chromosomal translocations and palindromic AT-rich repeats. Curr Opin Genet Dev 2012; 22:221-8. [PMID: 22402448 DOI: 10.1016/j.gde.2012.02.004] [Citation(s) in RCA: 59] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2011] [Revised: 02/03/2012] [Accepted: 02/06/2012] [Indexed: 10/28/2022]
Abstract
Repetitive DNA sequences constitute 30% of the human genome, and are often sites of genomic rearrangement. Recently, it has been found that several constitutional translocations, especially those that involve chromosome 22, take place utilizing palindromic sequences on 22q11 and on the partner chromosome. Analysis of translocation junction fragments shows that the breakpoints of such palindrome-mediated translocations are localized at the center of palindromic AT-rich repeats (PATRRs). The presence of PATRRs at the breakpoints indicates a palindrome-mediated mechanism involved in the generation of these constitutional translocations. Identification of these PATRR-mediated translocations suggests a universal pathway for gross chromosomal rearrangement in the human genome. De novo occurrences of PATRR-mediated translocations can be detected by PCR in normal sperm samples but not somatic cells. Polymorphisms of various PATRRs influence their propensity for adopting a secondary structure, which in turn affects de novo translocation frequency. We propose that the PATRRs form an unstable secondary structure, which leads to double-strand breaks at the center of the PATRR. The double-strand breaks appear to be followed by a non-homologous end-joining repair pathway, ultimately leading to the translocations. This review considers recent findings concerning the mechanism of meiosis-specific, PATRR-mediated translocations.
Collapse
Affiliation(s)
- Takema Kato
- Division of Human Genetics, The Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | | | | |
Collapse
|
33
|
Sati S, Tanwar VS, Kumar KA, Patowary A, Jain V, Ghosh S, Ahmad S, Singh M, Reddy SU, Chandak GR, Raghunath M, Sivasubbu S, Chakraborty K, Scaria V, Sengupta S. High resolution methylome map of rat indicates role of intragenic DNA methylation in identification of coding region. PLoS One 2012; 7:e31621. [PMID: 22355382 PMCID: PMC3280313 DOI: 10.1371/journal.pone.0031621] [Citation(s) in RCA: 70] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2011] [Accepted: 01/10/2012] [Indexed: 11/18/2022] Open
Abstract
DNA methylation is crucial for gene regulation and maintenance of genomic stability. Rat has been a key model system in understanding mammalian systemic physiology, however detailed rat methylome remains uncharacterized till date. Here, we present the first high resolution methylome of rat liver generated using Methylated DNA immunoprecipitation and high throughput sequencing (MeDIP-Seq) approach. We observed that within the DNA/RNA repeat elements, simple repeats harbor the highest degree of methylation. Promoter hypomethylation and exon hypermethylation were common features in both RefSeq genes and expressed genes (as evaluated by proteomic approach). We also found that although CpG islands were generally hypomethylated, about 6% of them were methylated and a large proportion (37%) of methylated islands fell within the exons. Notably, we obeserved significant differences in methylation of terminal exons (UTRs); methylation being more pronounced in coding/partially coding exons compared to the non-coding exons. Further, events like alternate exon splicing (cassette exon) and intron retentions were marked by DNA methylation and these regions are retained in the final transcript. Thus, we suggest that DNA methylation could play a crucial role in marking coding regions thereby regulating alternative splicing. Apart from generating the first high resolution methylome map of rat liver tissue, the present study provides several critical insights into methylome organization and extends our understanding of interplay between epigenome, gene expression and genome stability.
Collapse
Affiliation(s)
- Satish Sati
- CSIR-Institute of Genomics and Integrative Biology, Delhi, India
| | | | | | - Ashok Patowary
- CSIR-Institute of Genomics and Integrative Biology, Delhi, India
| | - Vaibhav Jain
- CSIR-Institute of Genomics and Integrative Biology, Delhi, India
| | - Sourav Ghosh
- CSIR-Institute of Genomics and Integrative Biology, Delhi, India
| | - Shadab Ahmad
- CSIR-Institute of Genomics and Integrative Biology, Delhi, India
| | - Meghna Singh
- CSIR-Institute of Genomics and Integrative Biology, Delhi, India
| | - S. Umakar Reddy
- CSIR-Centre for Cellular and Molecular Biology, Hyderabad, India
| | | | | | | | | | - Vinod Scaria
- CSIR-Institute of Genomics and Integrative Biology, Delhi, India
- * E-mail: (VS); (S. Sengupta)
| | - Shantanu Sengupta
- CSIR-Institute of Genomics and Integrative Biology, Delhi, India
- * E-mail: (VS); (S. Sengupta)
| |
Collapse
|
34
|
Zhu J, Nguyen MT, Nakamura E, Yang J, Mackem S. Cre-mediated recombination can induce apoptosis in vivo by activating the p53 DNA damage-induced pathway. Genesis 2012; 50:102-11. [PMID: 21913308 PMCID: PMC3273649 DOI: 10.1002/dvg.20799] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2010] [Revised: 08/30/2011] [Accepted: 09/01/2011] [Indexed: 01/06/2023]
Abstract
Cre-mediated apoptosis has been observed in many contexts in mice expressing Cre-recombinase and can confound the analysis of genetically engineered conditional mutant or transgenic alleles. Several mechanisms have been proposed to explain this phenomenon. We find that the degree of apoptosis induced correlates roughly with the copy number of loxP sites present in the genome and that some level of increased apoptosis accompanies the presence of even only a few loxP sites, as occurs in conditional floxed alleles. Cre-induced apoptosis in this context is completely p53-dependent, suggesting that the apoptosis is stimulated by p53 activation in response to DNA damage incurred during the process of Cre-mediated recombination.
Collapse
Affiliation(s)
| | | | | | - Junming Yang
- Cancer and Developmental Biology Laboratory, Center for Cancer Research, NCI-Frederick, Frederick, MD
| | - Susan Mackem
- author for correspondence: Susan Mackem, Center for Cancer Research, NCI-Frederick, Cancer & Developmental Biology Laboratory, 1050 Boyles St., Bldg 539, Rm 121A, Frederick, MD 21702
| |
Collapse
|
35
|
Sharma S. Non-B DNA Secondary Structures and Their Resolution by RecQ Helicases. J Nucleic Acids 2011; 2011:724215. [PMID: 21977309 PMCID: PMC3185257 DOI: 10.4061/2011/724215] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2011] [Accepted: 07/25/2011] [Indexed: 01/14/2023] Open
Abstract
In addition to the canonical B-form structure first described by Watson and Crick, DNA can adopt a number of alternative structures. These non-B-form DNA secondary structures form spontaneously on tracts of repeat sequences that are abundant in genomes. In addition, structured forms of DNA with intrastrand pairing may arise on single-stranded DNA produced transiently during various cellular processes. Such secondary structures have a range of biological functions but also induce genetic instability. Increasing evidence suggests that genomic instabilities induced by non-B DNA secondary structures result in predisposition to diseases. Secondary DNA structures also represent a new class of molecular targets for DNA-interactive compounds that might be useful for targeting telomeres and transcriptional control. The equilibrium between the duplex DNA and formation of multistranded non-B-form structures is partly dependent upon the helicases that unwind (resolve) these alternate DNA structures. With special focus on tetraplex, triplex, and cruciform, this paper summarizes the incidence of non-B DNA structures and their association with genomic instability and emphasizes the roles of RecQ-like DNA helicases in genome maintenance by resolution of DNA secondary structures. In future, RecQ helicases are anticipated to be additional molecular targets for cancer chemotherapeutics.
Collapse
Affiliation(s)
- Sudha Sharma
- Department of Biochemistry and Molecular Biology, College of Medicine, Howard University, 520 W Street, NW, Suite 3424A, Washington, DC 20059, USA
| |
Collapse
|
36
|
Mannaert A, Amemiya CT, Bossuyt F. Comparative analyses of vertebrate posterior HoxD clusters reveal atypical cluster architecture in the caecilian Typhlonectes natans. BMC Genomics 2010; 11:658. [PMID: 21106068 PMCID: PMC3091776 DOI: 10.1186/1471-2164-11-658] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2010] [Accepted: 11/24/2010] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The posterior genes of the HoxD cluster play a crucial role in the patterning of the tetrapod limb. This region is under the control of a global, long-range enhancer that is present in all vertebrates. Variation in limb types, as is the case in amphibians, can probably not only be attributed to variation in Hox genes, but is likely to be the product of differences in gene regulation. With a collection of vertebrate genome sequences available today, we used a comparative genomics approach to study the posterior HoxD cluster of amphibians. A frog and a caecilian were included in the study to compare coding sequences as well as to determine the gain and loss of putative regulatory sequences. RESULTS We sequenced the posterior end of the HoxD cluster of a caecilian and performed comparative analyses of this region using HoxD clusters of other vertebrates. We determined the presence of conserved non-coding sequences and traced gains and losses of these footprints during vertebrate evolution, with particular focus on amphibians. We found that the caecilian HoxD cluster is almost three times larger than its mammalian counterpart. This enlargement is accompanied with the loss of one gene and the accumulation of repeats in that area. A similar phenomenon was observed in the coelacanth, where a different gene was lost and expansion of the area where the gene was lost has occurred. At least one phylogenetic footprint present in all vertebrates was lost in amphibians. This conserved region is a known regulatory element and functions as a boundary element in neural tissue to prevent expression of Hoxd genes. CONCLUSION The posterior part of the HoxD cluster of Typhlonectes natans is among the largest known today. The loss of Hoxd-12 and the expansion of the intergenic region may exert an influence on the limb enhancer, by having to bypass a distance seven times that of regular HoxD clusters. Whether or not there is a correlation with the loss of limbs remains to be investigated. These results, together with data on other vertebrates show that the tetrapod Hox clusters are more variable than previously thought.
Collapse
Affiliation(s)
- An Mannaert
- Biology Department, ECOL, Amphibian Evolution Lab, Vrije Universiteit Brussel, Brussels, Belgium
| | - Chris T Amemiya
- Benaroya Research Institute at Virginia Mason and University of Washington, Seattle, USA
| | - Franky Bossuyt
- Biology Department, ECOL, Amphibian Evolution Lab, Vrije Universiteit Brussel, Brussels, Belgium
| |
Collapse
|
37
|
Cer RZ, Bruce KH, Mudunuri US, Yi M, Volfovsky N, Luke BT, Bacolla A, Collins JR, Stephens RM. Non-B DB: a database of predicted non-B DNA-forming motifs in mammalian genomes. Nucleic Acids Res 2010; 39:D383-91. [PMID: 21097885 PMCID: PMC3013731 DOI: 10.1093/nar/gkq1170] [Citation(s) in RCA: 72] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Although the capability of DNA to form a variety of non-canonical (non-B) structures has long been recognized, the overall significance of these alternate conformations in biology has only recently become accepted en masse. In order to provide access to genome-wide locations of these classes of predicted structures, we have developed non-B DB, a database integrating annotations and analysis of non-B DNA-forming sequence motifs. The database provides the most complete list of alternative DNA structure predictions available, including Z-DNA motifs, quadruplex-forming motifs, inverted repeats, mirror repeats and direct repeats and their associated subsets of cruciforms, triplex and slipped structures, respectively. The database also contains motifs predicted to form static DNA bends, short tandem repeats and homo(purine•pyrimidine) tracts that have been associated with disease. The database has been built using the latest releases of the human, chimp, dog, macaque and mouse genomes, so that the results can be compared directly with other data sources. In order to make the data interpretable in a genomic context, features such as genes, single-nucleotide polymorphisms and repetitive elements (SINE, LINE, etc.) have also been incorporated. The database is accessed through query pages that produce results with links to the UCSC browser and a GBrowse-based genomic viewer. It is freely accessible at http://nonb.abcc.ncifcrf.gov.
Collapse
Affiliation(s)
- Regina Z Cer
- Advanced Biomedical Computing Center, Information Systems Program, SAIC-Frederick, Inc, NCI-Frederick, Frederick, MD 21702, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
38
|
Kurahashi H, Inagaki H, Ohye T, Kogo H, Tsutsumi M, Kato T, Tong M, Emanuel BS. The constitutional t(11;22): implications for a novel mechanism responsible for gross chromosomal rearrangements. Clin Genet 2010; 78:299-309. [PMID: 20507342 PMCID: PMC3336963 DOI: 10.1111/j.1399-0004.2010.01445.x] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
The constitutional t(11;22)(q23;q11) is the most common recurrent non-Robertsonian translocation in humans. The breakpoint sequences of both chromosomes are characterized by several hundred base pairs of palindromic AT-rich repeats (PATRRs). Similar PATRRs have also been identified at the breakpoints of other nonrecurrent translocations, suggesting that PATRR-mediated chromosomal translocation represents one of the universal pathways for gross chromosomal rearrangement in the human genome. We propose that PATRRs have the potential to form cruciform structures through intrastrand-base pairing in single-stranded DNA, creating a source of genomic instability and leading to translocations. Indeed, de novo examples of the t(11;22) are detected at a high frequency in sperm from normal healthy males. This review synthesizes recent data illustrating a novel paradigm for an apparent spermatogenesis-specific translocation mechanism. This observation has important implications pertaining to the predominantly paternal origin of de novo gross chromosomal rearrangements in humans.
Collapse
Affiliation(s)
- H Kurahashi
- Division of Molecular Genetics, Institute for Comprehensive Medical Science, Fujita Health University, Toyoake, Aichi, Japan.
| | | | | | | | | | | | | | | |
Collapse
|
39
|
Strawbridge EM, Benson G, Gelfand Y, Benham CJ. The distribution of inverted repeat sequences in the Saccharomyces cerevisiae genome. Curr Genet 2010; 56:321-40. [PMID: 20446088 PMCID: PMC2908449 DOI: 10.1007/s00294-010-0302-6] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2010] [Revised: 04/05/2010] [Accepted: 04/08/2010] [Indexed: 02/06/2023]
Abstract
Although a variety of possible functions have been proposed for inverted repeat sequences (IRs), it is not known which of them might occur in vivo. We investigate this question by assessing the distributions and properties of IRs in the Saccharomyces cerevisiae (SC) genome. Using the IRFinder algorithm we detect 100,514 IRs having copy length greater than 6 bp and spacer length less than 77 bp. To assess statistical significance we also determine the IR distributions in two types of randomization of the S. cerevisiae genome. We find that the S. cerevisiae genome is significantly enriched in IRs relative to random. The S. cerevisiae IRs are significantly longer and contain fewer imperfections than those from the randomized genomes, suggesting that processes to lengthen and/or correct errors in IRs may be operative in vivo. The S. cerevisiae IRs are highly clustered in intergenic regions, while their occurrence in coding sequences is consistent with random. Clustering is stronger in the 3' flanks of genes than in their 5' flanks. However, the S. cerevisiae genome is not enriched in those IRs that would extrude cruciforms, suggesting that this is not a common event. Various explanations for these results are considered.
Collapse
Affiliation(s)
| | - Gary Benson
- Laboratory for Biocomputing and Informatics, Boston University, Boston, MA USA
| | - Yevgeniy Gelfand
- Laboratory for Biocomputing and Informatics, Boston University, Boston, MA USA
| | - Craig J. Benham
- Department of Mathematics, University of California, Davis, CA 95616 USA
| |
Collapse
|
40
|
Kurahashi H, Inagaki H, Kato T, Hosoba E, Kogo H, Ohye T, Tsutsumi M, Bolor H, Tong M, Emanuel BS. Impaired DNA replication prompts deletions within palindromic sequences, but does not induce translocations in human cells. Hum Mol Genet 2009; 18:3397-406. [PMID: 19520744 PMCID: PMC2729664 DOI: 10.1093/hmg/ddp279] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2009] [Accepted: 06/09/2009] [Indexed: 11/12/2022] Open
Abstract
Palindromic regions are unstable and susceptible to deletion in prokaryotes and eukaryotes possibly due to stalled or slow replication. In the human genome, they also appear to become partially or completely deleted, while two palindromic AT-rich repeats (PATRR) contribute to known recurrent constitutional translocations. To explore the mechanism that causes the development of palindrome instabilities in humans, we compared the incidence of de novo translocations and deletions at PATRRs in human cells. Using a highly sensitive PCR assay that can detect single molecules, de novo deletions were detected neither in human somatic cells nor in sperm. However, deletions were detected at low frequency in cultured cell lines. Inhibition of DNA replication by administration of siRNA against the DNA polymerase alpha 1 (POLA1) gene or introduction of POLA inhibitors increased the frequency. This is in contrast to PATRR-mediated translocations that were never detected in similar conditions but were observed frequently in human sperm samples. Further deletions were found to take place during both leading- and lagging-strand synthesis. Our data suggest that stalled or slow replication induces deletions within PATRRs, but that other mechanisms might contribute to PATRR-mediated recurrent translocations in humans.
Collapse
Affiliation(s)
- Hiroki Kurahashi
- Division of Molecular Genetics, Fujita Health University, Toyoake, Aichi, Japan.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
41
|
Wang G, Zhao J, Vasquez KM. Methods to determine DNA structural alterations and genetic instability. Methods 2009; 48:54-62. [PMID: 19245837 PMCID: PMC2693251 DOI: 10.1016/j.ymeth.2009.02.012] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2008] [Accepted: 02/15/2009] [Indexed: 11/16/2022] Open
Abstract
Chromosomal DNA is a dynamic structure that can adopt a variety of non-canonical (i.e., non-B) conformations. In this regard, at least 10 different forms of non-B DNA conformations have been identified; many of them have been found to be mutagenic, and associated with human disease development. Despite the importance of non-B DNA structures in genetic instability and DNA metabolic processes, mechanisms by which instability occurs remain largely undefined. The purpose of this review is to summarize current methodologies that are used to address questions in the field of non-B DNA structure-induced genetic instability. Advantages and disadvantages of each method will be discussed. A focused effort to further elucidate the mechanisms of non-B DNA-induced genetic instability will lead to a better understanding of how these structure-forming sequences contribute to the development of human disease.
Collapse
Affiliation(s)
- Guliang Wang
- Department of Carcinogenesis, University of Texas M.D. Anderson Cancer Center, Science Park-Research Division, 1808 Park Road 1-C, Smithville, TX 78957
| | - Junhua Zhao
- Department of Carcinogenesis, University of Texas M.D. Anderson Cancer Center, Science Park-Research Division, 1808 Park Road 1-C, Smithville, TX 78957
| | - Karen M. Vasquez
- Department of Carcinogenesis, University of Texas M.D. Anderson Cancer Center, Science Park-Research Division, 1808 Park Road 1-C, Smithville, TX 78957
| |
Collapse
|
42
|
Waller RF, Jackson CJ. Dinoflagellate mitochondrial genomes: stretching the rules of molecular biology. Bioessays 2009; 31:237-45. [PMID: 19204978 DOI: 10.1002/bies.200800164] [Citation(s) in RCA: 88] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
Mitochondrial genomes represent relict bacterial genomes derived from a progenitor alpha-proteobacterium that gave rise to all mitochondria through an ancient endosymbiosis. Evolution has massively reduced these genomes, yet despite relative simplicity their organization and expression has developed considerable novelty throughout eukaryotic evolution. Few organisms have reengineered their mitochondrial genomes as thoroughly as the protist lineage of dinoflagellates. Recent work reveals dinoflagellate mitochondrial genomes as likely the most gene-impoverished of any free-living eukaryote, encoding only two to three proteins. The organization and expression of these genomes, however, is far from the simplicity their gene content would suggest. Gene duplication, fragmentation, and scrambling have resulted in an inflated and complex genome organization. Extensive RNA editing then recodes gene transcripts, and trans-splicing is required to assemble full-length transcripts for at least one fragmented gene. Even after these processes, messenger RNAs (mRNAs) lack canonical start codons and most transcripts have abandoned stop codons altogether.
Collapse
Affiliation(s)
- Ross F Waller
- School of Botany, University of Melbourne, Melbourne, Victoria, Australia.
| | | |
Collapse
|
43
|
Wang Y, Leung FCC. A study on genomic distribution and sequence features of human long inverted repeats reveals species-specific intronic inverted repeats. FEBS J 2009; 276:1986-98. [PMID: 19243432 DOI: 10.1111/j.1742-4658.2009.06930.x] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
The inverted repeats present in a genome play dual roles. They can induce genomic instability and, on the other hand, regulate gene expression. In the present study, we report the distribution and sequence features of recombinogenic long inverted repeats (LIRs) that are capable of forming stable stem-loops or palindromes within the human genome. A total of 2551 LIRs were identified, and 37% of them were located in long introns (largely > 10 kb) of genes. Their distribution appears to be random in introns and is not restrictive, even for regions near intron-exon boundaries. Almost half of them comprise TG/CA-rich repeats, inversely arranged Alu repeats and MADE1 mariners. The remaining LIRs are mostly unique in their sequence features. Comparative studies of human, chimpanzee, rhesus monkey and mouse orthologous genes reveal that human genes have more recombinogenic LIRs than other orthologs, and over 80% are human-specific. The human genes associated with the human-specific LIRs are involved in the pathways of cell communication, development and the nervous system, as based on significantly over-represented Gene Ontology terms. The functional pathways related to the development and functions of the nervous system are not enriched in chimpanzee and mouse orthologs. The findings of the present study provide insight into the role of intronic LIRs in gene regulation and primate speciation.
Collapse
Affiliation(s)
- Yong Wang
- School of Biological Sciences, The University of Hong Kong, Hong Kong, China.
| | | |
Collapse
|
44
|
Lee J, Han K, Meyer TJ, Kim HS, Batzer MA. Chromosomal inversions between human and chimpanzee lineages caused by retrotransposons. PLoS One 2008; 3:e4047. [PMID: 19112500 PMCID: PMC2603318 DOI: 10.1371/journal.pone.0004047] [Citation(s) in RCA: 67] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2008] [Accepted: 11/22/2008] [Indexed: 02/02/2023] Open
Abstract
The long interspersed element-1 (LINE-1 or L1) and Alu elements are the most abundant mobile elements comprising 21% and 11% of the human genome, respectively. Since the divergence of human and chimpanzee lineages, these elements have vigorously created chromosomal rearrangements causing genomic difference between humans and chimpanzees by either increasing or decreasing the size of genome. Here, we report an exotic mechanism, retrotransposon recombination-mediated inversion (RRMI), that usually does not alter the amount of genomic material present. Through the comparison of the human and chimpanzee draft genome sequences, we identified 252 inversions whose respective inversion junctions can clearly be characterized. Our results suggest that L1 and Alu elements cause chromosomal inversions by either forming a secondary structure or providing a fragile site for double-strand breaks. The detailed analysis of the inversion breakpoints showed that L1 and Alu elements are responsible for at least 44% of the 252 inversion loci between human and chimpanzee lineages, including 49 RRMI loci. Among them, three RRMI loci inverted exonic regions in known genes, which implicates this mechanism in generating the genomic and phenotypic differences between human and chimpanzee lineages. This study is the first comprehensive analysis of mobile element bases inversion breakpoints between human and chimpanzee lineages, and highlights their role in primate genome evolution.
Collapse
Affiliation(s)
- Jungnam Lee
- Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana, United States of America
- Biological Computation and Visualization Center, Louisiana State University, Baton Rouge, Louisiana, United States of America
| | - Kyudong Han
- Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana, United States of America
- Biological Computation and Visualization Center, Louisiana State University, Baton Rouge, Louisiana, United States of America
| | - Thomas J. Meyer
- Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana, United States of America
- Biological Computation and Visualization Center, Louisiana State University, Baton Rouge, Louisiana, United States of America
| | - Heui-Soo Kim
- PBBRC, Interdisciplinary Research Program of Bioinformatics, College of Natural Sciences, Pusan National University, Busan, Korea
- Division of Biological Sciences, College of Natural Sciences, Pusan National University, Busan, Korea
| | - Mark A. Batzer
- Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana, United States of America
- Biological Computation and Visualization Center, Louisiana State University, Baton Rouge, Louisiana, United States of America
- * E-mail:
| |
Collapse
|
45
|
Abstract
Here we describe a one-step method to create precise modifications in the genome of Saccharomyces cerevisiae as a tool for synthetic biology, metabolic engineering, systems biology and genetic studies. Through homologous recombination, a mutagenesis cassette containing an inverted repeat of selection marker(s) is integrated into the genome. Due to its inherent instability in genomic DNA, the inverted repeat catalyzes spontaneous self-excision, resulting in precise genome modification. Since this excision occurs at very high frequencies, selection for the integration event can be followed immediately by counterselection, without the need for growth in permissive conditions. This is the first time a truly one-step method has been described for genome modification in any organism.
Collapse
Affiliation(s)
- Nikhil U Nair
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | | |
Collapse
|
46
|
Dishaw LJ, Mueller MG, Gwatney N, Cannon JP, Haire RN, Litman RT, Amemiya CT, Ota T, Rowen L, Glusman G, Litman GW. Genomic complexity of the variable region-containing chitin-binding proteins in amphioxus. BMC Genet 2008; 9:78. [PMID: 19046437 PMCID: PMC2632668 DOI: 10.1186/1471-2156-9-78] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2008] [Accepted: 12/01/2008] [Indexed: 01/11/2023] Open
Abstract
BACKGROUND The variable region-containing chitin-binding proteins (VCBPs) are found in protochordates and consist of two tandem immunoglobulin variable (V)-type domains and a chitin-binding domain. We previously have shown that these polymorphic genes, which primarily are expressed in the gut, exhibit characteristics of immune genes. In this report, we describe VCBP genomic organization and characterize adjacent and intervening genetic features which may influence both their polymorphism and complex transcriptional repertoire. RESULTS VCBP genes 1, 2, 4, and 5 are encoded in a single contiguous gene-rich chromosomal region and VCBP3 is encoded in a separate locus. The VCBPs exhibit extensive haplotype variation, including copy number variation (CNV), indel polymorphism and a markedly elevated variation in repeat type and density. In at least one haplotype, inverted repeats occur more frequently than elsewhere in the genome. Multi-animal cDNA screening, as well as transcriptional profilingusing a novel transfection system, suggests that haplotype-specific transcriptional variants may contribute to VCBP genetic diversity. CONCLUSION The availability of the Branchiostoma floridae genome (Joint Genome Institute, Brafl1), along with BAC and PAC screening and sequencing described here, reveal that the relatively limited number of VCBP genes present in the amphioxus genome exhibit exceptionally high haplotype variation. These VCBP haplotypes contribute a diverse pool of allelic variants, which includes gene copy number variation, pseudogenes, and other polymorphisms, while contributing secondary effects on gene transcription as well.
Collapse
Affiliation(s)
- Larry J Dishaw
- All Children's Hospital, Department of Molecular Genetics, 801 Sixth Street South, St. Petersburg, FL 33701, USA
- H. Lee Moffitt Cancer Center and Research Institute, 12902 Magnolia Avenue, Tampa, FL 33612, USA
| | - M Gail Mueller
- All Children's Hospital, Department of Molecular Genetics, 801 Sixth Street South, St. Petersburg, FL 33701, USA
| | - Natasha Gwatney
- Department of Pediatrics, University of South Florida College of Medicine, USF/ACH Children's Research Institute, 830 First Street South, St. Petersburg, FL 33701, USA
| | - John P Cannon
- H. Lee Moffitt Cancer Center and Research Institute, 12902 Magnolia Avenue, Tampa, FL 33612, USA
- Department of Pediatrics, University of South Florida College of Medicine, USF/ACH Children's Research Institute, 830 First Street South, St. Petersburg, FL 33701, USA
| | - Robert N Haire
- Department of Pediatrics, University of South Florida College of Medicine, USF/ACH Children's Research Institute, 830 First Street South, St. Petersburg, FL 33701, USA
| | - Ronda T Litman
- Department of Pediatrics, University of South Florida College of Medicine, USF/ACH Children's Research Institute, 830 First Street South, St. Petersburg, FL 33701, USA
| | - Chris T Amemiya
- Benaroya Research Institute, 1201 Ninth Avenue, Seattle, WA 98101, USA
| | - Tatsuya Ota
- Department of Evolutionary Studies of Biosystems, The Graduate University for Advanced Studies, Kamiyamaguchi 1560-35, Hayama 240-0193 Japan
| | - Lee Rowen
- Institute for Systems Biology, 1441 N. 34th St, Seattle, WA, 98103, USA
| | - Gustavo Glusman
- Institute for Systems Biology, 1441 N. 34th St, Seattle, WA, 98103, USA
| | - Gary W Litman
- All Children's Hospital, Department of Molecular Genetics, 801 Sixth Street South, St. Petersburg, FL 33701, USA
- H. Lee Moffitt Cancer Center and Research Institute, 12902 Magnolia Avenue, Tampa, FL 33612, USA
- Department of Pediatrics, University of South Florida College of Medicine, USF/ACH Children's Research Institute, 830 First Street South, St. Petersburg, FL 33701, USA
| |
Collapse
|
47
|
Intron splicing-mediated expression of AAV Rep and Cap genes and production of AAV vectors in insect cells. Mol Ther 2008; 16:924-30. [PMID: 18388928 DOI: 10.1038/mt.2008.35] [Citation(s) in RCA: 64] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
An artificial intron containing the insect cell polyhedrin (polh) promoter was designed, constructed, and inserted into the adeno-associated virus (AAV) Rep and Cap coding sequences to express the Rep and Cap proteins, respectively. The artificial intron was spliced out and full-length Rep78 or VP1 proteins were expressed from the insect promoters located upstream of their respective AUG start codons. The polh promoter located inside the artificial intron was functional, expressed the Rep52 or VP2/VP3 proteins located downstream of the artificial intron, and overlapped with the Rep78 or VP1 proteins. This is the first report that an artificial intron containing an insect cell promoter can be inserted into a coding sequence to express genes with overlapping open-reading frames (ORFs). A method was also established for AAV vector production in insect cells with these intron-containing Rep and Cap coding sequences, and the vectors produced thereby were infectious. These intron-containing AAV Rep and Cap coding sequences were very stable in recombinant baculoviruses and showed no apparent loss of protein expression even after five consecutive amplifications of the plaque-purified recombinant baculoviruses. This newly established AAV production method should prove to be a useful tool for large-scale AAV vector production.
Collapse
|
48
|
Zhao G, Chang KY, Varley K, Stormo GD. Evidence for active maintenance of inverted repeat structures identified by a comparative genomic approach. PLoS One 2007; 2:e262. [PMID: 17327921 PMCID: PMC1803023 DOI: 10.1371/journal.pone.0000262] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2006] [Accepted: 02/08/2007] [Indexed: 11/19/2022] Open
Abstract
Inverted repeats have been found to occur in both prokaryotic and eukaryotic genomes. Usually they are short and some have important functions in various biological processes. However, long inverted repeats are rare and can cause genome instability. Analyses of C. elegans genome identified long, nearly-perfect inverted repeat sequences involving both divergently and convergently oriented homologous gene pairs and complete intergenic sequences. Comparisons with the orthologous regions from the genomes of C. briggsae and C. remanei show that the inverted repeat structures are often far more conserved than the sequences. This observation implies that there is an active mechanism for maintaining the inverted repeat nature of the sequences.
Collapse
Affiliation(s)
- Guoyan Zhao
- Department of Genetics, Washington University School of Medicine, St. Louis, Missouri, United States of America
| | - Kuan Y. Chang
- Department of Pathology and Immunology, Washington University School of Medicine, St. Louis, Missouri, United States of America
| | - Katherine Varley
- Department of Genetics, Washington University School of Medicine, St. Louis, Missouri, United States of America
| | - Gary D. Stormo
- Department of Genetics, Washington University School of Medicine, St. Louis, Missouri, United States of America
- * To whom correspondence should be addressed. E-mail:
| |
Collapse
|
49
|
Girirajan S, Mendoza-Londono R, Vlangos CN, Dupuis L, Nowak NJ, Bunyan DJ, Hatchwell E, Elsea SH. Smith–Magenis syndrome and moyamoya disease in a patient with del(17)(p11.2p13.1). Am J Med Genet A 2007; 143A:999-1008. [PMID: 17431895 DOI: 10.1002/ajmg.a.31689] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
Chromosomal rearrangements causing microdeletions and microduplications are a major cause of congenital malformation and mental retardation. Because they are not visible by routine chromosome analysis, high resolution whole-genome technologies are required for the detection and diagnosis of small chromosomal abnormalities. Recently, array-comparative genomic hybridization (aCGH) and multiplex ligation-dependent probe amplification (MLPA) have been useful tools for the identification and mapping of deletions and duplications at higher resolution and throughput. Smith-Magenis syndrome (SMS) is a multiple congenital anomalies/mental retardation syndrome caused by deletion or mutation of the retinoic acid induced 1 (RAI1) gene and is often associated with a chromosome 17p11.2 deletion. We report here on the clinical and molecular analysis of a 10-year-old girl with SMS and moyamoya disease (occlusion of the circle of Willis). We have employed a combination of aCGH, FISH, and MLPA to characterize an approximately 6.3 Mb deletion spanning chromosome region 17p11.2-p13.1 in this patient, with the proximal breakpoint within the RAI1 gene. Further, investigation of the genomic architecture at the breakpoint intervals of this large deletion documented the presence of palindromic repeat elements that could potentially form recombination substrates leading to unequal crossover.
Collapse
Affiliation(s)
- Santhosh Girirajan
- Department of Human Genetics, Medical College of Virginia Campus, Virginia Commonwealth University, Richmond, VA 23298, USA
| | | | | | | | | | | | | | | |
Collapse
|
50
|
Arlt MF, Durkin SG, Ragland RL, Glover TW. Common fragile sites as targets for chromosome rearrangements. DNA Repair (Amst) 2006; 5:1126-35. [PMID: 16807141 DOI: 10.1016/j.dnarep.2006.05.010] [Citation(s) in RCA: 151] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Common fragile sites are large chromosomal regions that preferentially exhibit gaps or breaks after DNA synthesis is partially perturbed. Fragile site instability in cultured cells is well documented and includes gaps and breaks on metaphase chromosomes, translocation and deletions breakpoints, and sister chromosome exchanges. In recent years, much has been learned about the genomic structure at fragile sites and the cellular mechanisms that monitor their stability. The study of fragile sites has merged with that of cell cycle checkpoints and DNA repair, with multiple proteins from these pathways implicated in fragile site stability, including ATR, BRCA1, CHK1, and RAD51. Since their discovery, fragile sites have been implicated in constitutional and cancer chromosome rearrangements in vivo and recent studies suggest that common fragile sites may serve as markers of chromosome damage caused by replication stress during early tumorigenesis. Here we review the relationship of fragile sites to chromosome rearrangements, particularly in tumor cells, and discuss the mechanisms that may be involved.
Collapse
Affiliation(s)
- Martin F Arlt
- Department of Human, Genetics University of Michigan, 4909 Buhl Box 0618, 1241 E. Catherine Street, Ann Arbor, MI 48109, USA
| | | | | | | |
Collapse
|