1
|
Brewer BJ, Dunham MJ, Raghuraman MK. A unifying model that explains the origins of human inverted copy number variants. PLoS Genet 2024; 20:e1011091. [PMID: 38175827 PMCID: PMC10766186 DOI: 10.1371/journal.pgen.1011091] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2024] Open
Abstract
With the release of the telomere-to-telomere human genome sequence and the availability of both long-read sequencing and optical genome mapping techniques, the identification of copy number variants (CNVs) and other structural variants is providing new insights into human genetic disease. Different mechanisms have been proposed to account for the novel junctions in these complex architectures, including aberrant forms of DNA replication, non-allelic homologous recombination, and various pathways that repair DNA breaks. Here, we have focused on a set of structural variants that include an inverted segment and propose that they share a common initiating event: an inverted triplication with long, unstable palindromic junctions. The secondary rearrangement of these palindromes gives rise to the various forms of inverted structural variants. We postulate that this same mechanism (ODIRA: origin-dependent inverted-repeat amplification) that creates the inverted CNVs in inherited syndromes also generates the palindromes found in cancers.
Collapse
Affiliation(s)
- Bonita J. Brewer
- Department of Genome Sciences, University of Washington, Seattle, Washington, United States of America
| | - Maitreya J. Dunham
- Department of Genome Sciences, University of Washington, Seattle, Washington, United States of America
| | - M. K. Raghuraman
- Department of Genome Sciences, University of Washington, Seattle, Washington, United States of America
| |
Collapse
|
2
|
Ait Saada A, Guo W, Costa AB, Yang J, Wang J, Lobachev K. Widely spaced and divergent inverted repeats become a potent source of chromosomal rearrangements in long single-stranded DNA regions. Nucleic Acids Res 2023; 51:3722-3734. [PMID: 36919609 PMCID: PMC10164571 DOI: 10.1093/nar/gkad153] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2022] [Revised: 02/16/2023] [Accepted: 02/20/2023] [Indexed: 03/16/2023] Open
Abstract
DNA inverted repeats (IRs) are widespread across many eukaryotic genomes. Their ability to form stable hairpin/cruciform secondary structures is causative in triggering chromosome instability leading to several human diseases. Distance and sequence divergence between IRs are inversely correlated with their ability to induce gross chromosomal rearrangements (GCRs) because of a lesser probability of secondary structure formation and chromosomal breakage. In this study, we demonstrate that structural parameters that normally constrain the instability of IRs are overcome when the repeats interact in single-stranded DNA (ssDNA). We established a system in budding yeast whereby >73 kb of ssDNA can be formed in cdc13-707fs mutants. We found that in ssDNA, 12 bp or 30 kb spaced Alu-IRs show similarly high levels of GCRs, while heterology only beyond 25% suppresses IR-induced instability. Mechanistically, rearrangements arise after cis-interaction of IRs leading to a DNA fold-back and the formation of a dicentric chromosome, which requires Rad52/Rad59 for IR annealing as well as Rad1-Rad10, Slx4, Msh2/Msh3 and Saw1 proteins for nonhomologous tail removal. Importantly, using structural characteristics rendering IRs permissive to DNA fold-back in yeast, we found that ssDNA regions mapped in cancer genomes contain a substantial number of potentially interacting and unstable IRs.
Collapse
Affiliation(s)
- Anissia Ait Saada
- School of Biological Sciences and Institute for Bioengineering and Bioscience, Georgia Institute of Technology, Atlanta, GA 30332, USA
| | - Wenying Guo
- School of Biological Sciences and Institute for Bioengineering and Bioscience, Georgia Institute of Technology, Atlanta, GA 30332, USA
| | - Alex B Costa
- School of Biological Sciences and Institute for Bioengineering and Bioscience, Georgia Institute of Technology, Atlanta, GA 30332, USA
| | - Jiaxin Yang
- Department of Computational Mathematics, Science and Engineering, Michigan State University, East Lansing, MI 48824, USA
| | - Jianrong Wang
- Department of Computational Mathematics, Science and Engineering, Michigan State University, East Lansing, MI 48824, USA
| | - Kirill S Lobachev
- School of Biological Sciences and Institute for Bioengineering and Bioscience, Georgia Institute of Technology, Atlanta, GA 30332, USA
| |
Collapse
|
3
|
Ait Saada A, Costa AB, Sheng Z, Guo W, Haber JE, Lobachev K. Structural parameters of palindromic repeats determine the specificity of nuclease attack of secondary structures. Nucleic Acids Res 2021; 49:3932-3947. [PMID: 33772579 PMCID: PMC8053094 DOI: 10.1093/nar/gkab168] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2021] [Revised: 02/24/2021] [Accepted: 03/02/2021] [Indexed: 12/15/2022] Open
Abstract
Palindromic sequences are a potent source of chromosomal instability in many organisms and are implicated in the pathogenesis of human diseases. In this study, we investigate which nucleases are responsible for cleavage of the hairpin and cruciform structures and generation of double-strand breaks at inverted repeats in Saccharomyces cerevisiae. We demonstrate that the involvement of structure-specific nucleases in palindrome fragility depends on the distance between inverted repeats and their transcriptional status. The attack by the Mre11 complex is constrained to hairpins with loops <9 nucleotides. This restriction is alleviated upon RPA depletion, indicating that RPA controls the stability and/or formation of secondary structures otherwise responsible for replication fork stalling and DSB formation. Mus81-Mms4 cleavage of cruciforms occurs at divergently but not convergently transcribed or nontranscribed repeats. Our study also reveals the third pathway for fragility at perfect and quasi-palindromes, which involves cruciform resolution during the G2 phase of the cell cycle.
Collapse
Affiliation(s)
- Anissia Ait Saada
- School of Biological Sciences and Institute for Bioengineering and Bioscience, Georgia Institute of Technology, Atlanta, GE 30332, USA
| | - Alex B Costa
- School of Biological Sciences and Institute for Bioengineering and Bioscience, Georgia Institute of Technology, Atlanta, GE 30332, USA
| | - Ziwei Sheng
- School of Biological Sciences and Institute for Bioengineering and Bioscience, Georgia Institute of Technology, Atlanta, GE 30332, USA
| | - Wenying Guo
- School of Biological Sciences and Institute for Bioengineering and Bioscience, Georgia Institute of Technology, Atlanta, GE 30332, USA
| | - James E Haber
- Department of Biology and Rosenstiel Basic Medical Sciences Research Center, Waltham, MA 02454-9110, USA
| | - Kirill S Lobachev
- School of Biological Sciences and Institute for Bioengineering and Bioscience, Georgia Institute of Technology, Atlanta, GE 30332, USA
| |
Collapse
|
4
|
Svetec Miklenić M, Svetec IK. Palindromes in DNA-A Risk for Genome Stability and Implications in Cancer. Int J Mol Sci 2021; 22:2840. [PMID: 33799581 PMCID: PMC7999016 DOI: 10.3390/ijms22062840] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2021] [Revised: 03/04/2021] [Accepted: 03/08/2021] [Indexed: 02/07/2023] Open
Abstract
A palindrome in DNA consists of two closely spaced or adjacent inverted repeats. Certain palindromes have important biological functions as parts of various cis-acting elements and protein binding sites. However, many palindromes are known as fragile sites in the genome, sites prone to chromosome breakage which can lead to various genetic rearrangements or even cell death. The ability of certain palindromes to initiate genetic recombination lies in their ability to form secondary structures in DNA which can cause replication stalling and double-strand breaks. Given their recombinogenic nature, it is not surprising that palindromes in the human genome are involved in genetic rearrangements in cancer cells as well as other known recurrent translocations and deletions associated with certain syndromes in humans. Here, we bring an overview of current understanding and knowledge on molecular mechanisms of palindrome recombinogenicity and discuss possible implications of DNA palindromes in carcinogenesis. Furthermore, we overview the data on known palindromic sequences in the human genome and efforts to estimate their number and distribution, as well as underlying mechanisms of genetic rearrangements specific palindromic sequences cause.
Collapse
Affiliation(s)
| | - Ivan Krešimir Svetec
- Faculty of Food Technology and Biotechnology, University of Zagreb, Pierottijeva 6, 10000 Zagreb, Croatia;
| |
Collapse
|
5
|
A reference catalog of DNA palindromes in the human genome and their variations in 1000 Genomes. Hum Genome Var 2020; 7:40. [PMID: 33298903 PMCID: PMC7680136 DOI: 10.1038/s41439-020-00127-5] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2020] [Revised: 05/24/2020] [Accepted: 06/08/2020] [Indexed: 02/07/2023] Open
Abstract
A palindrome in DNA is like a palindrome in language, but when read backwards, it is a complement of the forward sequence; effectively, the two halves of a sequence complement each other from its midpoint like in a double strand of DNA. Palindromes are distributed throughout the human genome and play significant roles in gene expression and regulation. Palindromic mutations are linked to many human diseases, such as neuronal disorders, mental retardation, and various cancers. In this work, we computed and analyzed the palindromic sequences in the human genome and studied their conservation in personal genomes using 1000 Genomes data. We found that ~30% of the palindromes exhibit variation, some of which are caused by rare variants. The analysis of disease/trait-associated single-nucleotide polymorphisms in palindromic regions showed that disease-associated risk variants are 14 times more likely to be present in palindromic regions than in other regions. The catalog of palindromes in the reference genome and 1000 Genomes is being made available here with details on their variations in each individual genome to serve as a resource for future and retrospective whole-genome studies identifying statistically significant palindrome variations associated with diseases or traits and their roles in disease mechanisms.
Collapse
|
6
|
Wang Y, Huang JM. Lirex: A Package for Identification of Long Inverted Repeats in Genomes. GENOMICS PROTEOMICS & BIOINFORMATICS 2017; 15:141-146. [PMID: 28392477 PMCID: PMC5414712 DOI: 10.1016/j.gpb.2017.01.005] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/14/2016] [Revised: 01/04/2017] [Accepted: 01/22/2017] [Indexed: 11/30/2022]
Abstract
Long inverted repeats (LIRs) are evolutionarily and functionally important structures in genomes because of their involvement in RNA interference, DNA recombination, and gene duplication. Identification of LIRs is highly complicated when mismatches and indels between the repeats are permitted. Long inverted repeat explorer (Lirex) was developed and introduced in this report. Written in Java, Lirex provides a user-friendly interface and allows users to specify LIR searching criteria, such as length of the region, as well as pattern and size of the repeats. Recombinogenic LIRs can be selected on the basis of mismatch rate and internal spacer size from identified LIRs. Lirex, as a cross-platform tool to identify LIRs in a genome, may assist in designing following experiments to explore the function of LIRs. Our tool can identify more LIRs than other LIR searching tools. Lirex is publicly available at http://124.16.219.129/Lirex.
Collapse
Affiliation(s)
- Yong Wang
- Institute of Deep-Sea Science and Engineering, Chinese Academy of Sciences, Sanya 572000, China.
| | - Jiao-Mei Huang
- Institute of Deep-Sea Science and Engineering, Chinese Academy of Sciences, Sanya 572000, China
| |
Collapse
|
7
|
Subramanian S, Chaparala S, Avali V, Ganapathiraju MK. A pilot study on the prevalence of DNA palindromes in breast cancer genomes. BMC Med Genomics 2016; 9:73. [PMID: 28117658 PMCID: PMC5260791 DOI: 10.1186/s12920-016-0232-3] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
Background DNA palindromes are a unique pattern of repeat sequences that are present in the human genome. It consists of a sequence of nucleotides in which the second half is the complement of the first half but appearing in reverse order. These palindromic sequences may have a significant role in DNA replication, transcription and gene regulation processes. They occur frequently in human cancers by clustering at specific locations of the genome that undergo gene amplification and tumorigenesis. Moreover, some studies showed that palindromes are clustered in amplified regions of breast cancer genomes especially in chromosomes (chr) 8 and 11. With the large number of personal genomes and cancer genomes becoming available, it is now possible to study their association to diseases using computational methods. Here, we conducted a pilot study on chromosomes 8 and 11 of cancer genomes to identify computationally the differentially occurring palindromes. Methods We processed 69 breast cancer genomes from The Cancer Genome Atlas including serum-normal and tumor genomes, and 1000 Genomes to serve as control group. The Biological Language Modelling Toolkit (BLMT) computes palindromes in whole genomes. We developed a computational pipeline integrating BLMT to compute and compare prevalence of palindromes in personal genomes. Results We carried out a pilot study on chr 8 and chr 11 taking into account single nucleotide polymorphisms, insertions and deletions. Of all the palindromes that showed any variation in cancer genomes, 38% of what were near breast cancer genes happened to be the most differentiated palindromes in tumor (i.e. they ranked among the top 25% by our heuristic measure). Conclusions These results will shed light on the prevalence of palindromes in oncogenes and the mutations that are present in the palindromic regions that could contribute to genomic rearrangements, and breast cancer progression.
Collapse
Affiliation(s)
- Sandeep Subramanian
- Language Technologies Institute, Carnegie Mellon University, Pittsburgh, PA, 15213, USA
| | - Srilakshmi Chaparala
- Department of Biomedical Informatics, University of Pittsburgh, 5607 Baum Blvd, Suite 522, Pittsburgh, PA, 15206, USA
| | - Viji Avali
- Department of Biomedical Informatics, University of Pittsburgh, 5607 Baum Blvd, Suite 522, Pittsburgh, PA, 15206, USA
| | - Madhavi K Ganapathiraju
- Department of Biomedical Informatics, University of Pittsburgh, 5607 Baum Blvd, Suite 522, Pittsburgh, PA, 15206, USA. .,Language Technologies Institute, Carnegie Mellon University, Pittsburgh, PA, 15213, USA.
| |
Collapse
|
8
|
Lu S, Wang G, Bacolla A, Zhao J, Spitser S, Vasquez KM. Short Inverted Repeats Are Hotspots for Genetic Instability: Relevance to Cancer Genomes. Cell Rep 2015; 10:1674-1680. [PMID: 25772355 DOI: 10.1016/j.celrep.2015.02.039] [Citation(s) in RCA: 65] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2014] [Revised: 01/26/2015] [Accepted: 02/16/2015] [Indexed: 12/25/2022] Open
Abstract
Analyses of chromosomal aberrations in human genetic disorders have revealed that inverted repeat sequences (IRs) often co-localize with endogenous chromosomal instability and breakage hotspots. Approximately 80% of all IRs in the human genome are short (<100 bp), yet the mutagenic potential of such short cruciform-forming sequences has not been characterized. Here, we find that short IRs are enriched at translocation breakpoints in human cancer and stimulate the formation of DNA double-strand breaks (DSBs) and deletions in mammalian and yeast cells. We provide evidence for replication-related mechanisms of IR-induced genetic instability and a novel XPF cleavage-based mechanism independent of DNA replication. These discoveries implicate short IRs as endogenous sources of DNA breakage involved in disease etiology and suggest that these repeats represent a feature of genome plasticity that may contribute to the evolution of the human genome by providing a means for diversity within the population.
Collapse
Affiliation(s)
- Steve Lu
- Division of Pharmacology and Toxicology, College of Pharmacy, The University of Texas at Austin - Dell Pediatric Research Institute, 1400 Barbara Jordan Boulevard R1800, Austin, TX 78723, USA
| | - Guliang Wang
- Division of Pharmacology and Toxicology, College of Pharmacy, The University of Texas at Austin - Dell Pediatric Research Institute, 1400 Barbara Jordan Boulevard R1800, Austin, TX 78723, USA
| | - Albino Bacolla
- Division of Pharmacology and Toxicology, College of Pharmacy, The University of Texas at Austin - Dell Pediatric Research Institute, 1400 Barbara Jordan Boulevard R1800, Austin, TX 78723, USA
| | - Junhua Zhao
- Division of Pharmacology and Toxicology, College of Pharmacy, The University of Texas at Austin - Dell Pediatric Research Institute, 1400 Barbara Jordan Boulevard R1800, Austin, TX 78723, USA
| | - Scott Spitser
- Division of Pharmacology and Toxicology, College of Pharmacy, The University of Texas at Austin - Dell Pediatric Research Institute, 1400 Barbara Jordan Boulevard R1800, Austin, TX 78723, USA
| | - Karen M Vasquez
- Division of Pharmacology and Toxicology, College of Pharmacy, The University of Texas at Austin - Dell Pediatric Research Institute, 1400 Barbara Jordan Boulevard R1800, Austin, TX 78723, USA.
| |
Collapse
|
9
|
Yang H, Volfovsky N, Rattray A, Chen X, Tanaka H, Strathern J. GAP-Seq: a method for identification of DNA palindromes. BMC Genomics 2014; 15:394. [PMID: 24885769 PMCID: PMC4057610 DOI: 10.1186/1471-2164-15-394] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2013] [Accepted: 04/26/2014] [Indexed: 12/13/2022] Open
Abstract
Background Closely spaced long inverted repeats, also known as DNA palindromes, can undergo intrastrand annealing to form DNA hairpins. The ability to form these hairpins results in genome instability, difficulties in maintaining clones in Escherichia coli and major problems for most DNA sequencing approaches. Because of their role in genomic instability and gene amplification in some human cancers, it is important to develop systematic approaches to detect and characterize DNA palindromes. Results We developed a new protocol to identify palindromes that couples the S1 nuclease treated Cot0 DNA (GAPF) with high-throughput sequencing (GAP-Seq). Unlike earlier protocols, it does not involve restriction enzymatic digestion prior to DNA snap-back thereby preserving longer DNA sequences. It also indicates the location of the novel junction, which can then be recovered. Using MCF-7 breast cancer cell line as the proof-of-principle analysis, we have identified 35 palindrome candidates and physically characterized the top 5 candidates and their junctions. Because this protocol eliminates many of the false positives that plague earlier techniques, we have improved palindrome identification. Conclusions The GAP-Seq approach underscores the importance of developing new tools for identifying and characterizing palindromes, and provides a new strategy to systematically assess palindromes in genomes. It will be useful for studying human cancers and other diseases associated with palindromes. Electronic supplementary material The online version of this article (doi:10.1186/1471-2164-15-394) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
| | | | | | | | | | - Jeffrey Strathern
- Gene Regulation and Chromosome Biology Laboratory, Frederick National Laboratory for Cancer Research, Cancer Research and Development Center, Frederick, MD 21702, USA.
| |
Collapse
|
10
|
Smith DR, Hua J, Archibald JM, Lee RW. Palindromic genes in the linear mitochondrial genome of the nonphotosynthetic green alga Polytomella magna. Genome Biol Evol 2014; 5:1661-7. [PMID: 23940100 PMCID: PMC3787674 DOI: 10.1093/gbe/evt122] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
Organelle DNA is no stranger to palindromic repeats. But never has a mitochondrial or plastid genome been described in which every coding region is part of a distinct palindromic unit. While sequencing the mitochondrial DNA of the nonphotosynthetic green alga Polytomella magna, we uncovered precisely this type of genic arrangement. The P. magna mitochondrial genome is linear and made up entirely of palindromes, each containing 1–7 unique coding regions. Consequently, every gene in the genome is duplicated and in an inverted orientation relative to its partner. And when these palindromic genes are folded into putative stem-loops, their predicted translational start sites are often positioned in the apex of the loop. Gel electrophoresis results support the linear, 28-kb monomeric conformation of the P. magna mitochondrial genome. Analyses of other Polytomella taxa suggest that palindromic mitochondrial genes were present in the ancestor of the Polytomella lineage and lost or retained to various degrees in extant species. The possible origins and consequences of this bizarre genomic architecture are discussed.
Collapse
Affiliation(s)
- David Roy Smith
- Department of Biology, University of Western Ontario, London, Ontario, Canada
| | | | | | | |
Collapse
|
11
|
Zhang Y, Saini N, Sheng Z, Lobachev KS. Genome-wide screen reveals replication pathway for quasi-palindrome fragility dependent on homologous recombination. PLoS Genet 2013; 9:e1003979. [PMID: 24339793 PMCID: PMC3855049 DOI: 10.1371/journal.pgen.1003979] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2013] [Accepted: 10/12/2013] [Indexed: 02/07/2023] Open
Abstract
Inverted repeats capable of forming hairpin and cruciform structures present a threat to chromosomal integrity. They induce double strand breaks, which lead to gross chromosomal rearrangements, the hallmarks of cancers and hereditary diseases. Secondary structure formation at this motif has been proposed to be the driving force for the instability, albeit the mechanisms leading to the fragility are not well-understood. We carried out a genome-wide screen to uncover the genetic players that govern fragility of homologous and homeologous Alu quasi-palindromes in the yeast Saccharomyces cerevisiae. We found that depletion or lack of components of the DNA replication machinery, proteins involved in Fe-S cluster biogenesis, the replication-pausing checkpoint pathway, the telomere maintenance complex or the Sgs1-Top3-Rmi1 dissolvasome augment fragility at Alu-IRs. Rad51, a component of the homologous recombination pathway, was found to be required for replication arrest and breakage at the repeats specifically in replication-deficient strains. These data demonstrate that Rad51 is required for the formation of breakage-prone secondary structures in situations when replication is compromised while another mechanism operates in DSB formation in replication-proficient strains. Inverted repeats are found in many eukaryotic genomes including humans. They have a potential to cause chromosomal breakage and rearrangements that contribute to genome polymorphism and the development of diseases. Instability of inverted repeats is accounted for by their propensity to adopt DNA secondary structures that is negatively affected by the distance between the repeats and level of sequence divergence. However, the genetic factors that promote the abnormal structure formation or affect the ability of the repeats to break are largely unknown. Here, using a genome-wide screen we identified 38 mutants that destabilize imperfect human inverted Alu repeats and predispose them to breakage. The proteins that are required to maintain repeat stability belong to the core of the DNA replication machinery and to the accessory proteins that help replication fork to move through the difficult templates. Remarkably, when replication machinery is compromised, the proteins involved in homologous recombination promote the formation of secondary structures and replication block thereby triggering breakage at the inverted repeats. These results reveal a powerful pathway for the destabilization of chromosomes containing inverted repeats that requires the activity of homologous recombination.
Collapse
Affiliation(s)
- Yu Zhang
- School of Biology and Institute for Bioengineering and Bioscience, Georgia Institute of Technology, Atlanta, Georgia, United States of America
| | - Natalie Saini
- School of Biology and Institute for Bioengineering and Bioscience, Georgia Institute of Technology, Atlanta, Georgia, United States of America
| | - Ziwei Sheng
- School of Biology and Institute for Bioengineering and Bioscience, Georgia Institute of Technology, Atlanta, Georgia, United States of America
| | - Kirill S. Lobachev
- School of Biology and Institute for Bioengineering and Bioscience, Georgia Institute of Technology, Atlanta, Georgia, United States of America
- * E-mail:
| |
Collapse
|
12
|
Nichols M, Steinman RA. A recombinase-based palindrome generator capable of producing randomized shRNA libraries. J Biotechnol 2009; 143:79-84. [PMID: 19539675 DOI: 10.1016/j.jbiotec.2009.06.010] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2009] [Revised: 06/04/2009] [Accepted: 06/09/2009] [Indexed: 01/05/2023]
|
13
|
Hastings PJ, Lupski JR, Rosenberg SM, Ira G. Mechanisms of change in gene copy number. Nat Rev Genet 2009; 10:551-64. [PMID: 19597530 DOI: 10.1038/nrg2593] [Citation(s) in RCA: 836] [Impact Index Per Article: 55.7] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
Abstract
Deletions and duplications of chromosomal segments (copy number variants, CNVs) are a major source of variation between individual humans and are an underlying factor in human evolution and in many diseases, including mental illness, developmental disorders and cancer. CNVs form at a faster rate than other types of mutation, and seem to do so by similar mechanisms in bacteria, yeast and humans. Here we review current models of the mechanisms that cause copy number variation. Non-homologous end-joining mechanisms are well known, but recent models focus on perturbation of DNA replication and replication of non-contiguous DNA segments. For example, cellular stress might induce repair of broken replication forks to switch from high-fidelity homologous recombination to non-homologous repair, thus promoting copy number change.
Collapse
Affiliation(s)
- P J Hastings
- Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, Texas 77030, USA.
| | | | | | | |
Collapse
|
14
|
Dishaw LJ, Mueller MG, Gwatney N, Cannon JP, Haire RN, Litman RT, Amemiya CT, Ota T, Rowen L, Glusman G, Litman GW. Genomic complexity of the variable region-containing chitin-binding proteins in amphioxus. BMC Genet 2008; 9:78. [PMID: 19046437 PMCID: PMC2632668 DOI: 10.1186/1471-2156-9-78] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2008] [Accepted: 12/01/2008] [Indexed: 01/11/2023] Open
Abstract
BACKGROUND The variable region-containing chitin-binding proteins (VCBPs) are found in protochordates and consist of two tandem immunoglobulin variable (V)-type domains and a chitin-binding domain. We previously have shown that these polymorphic genes, which primarily are expressed in the gut, exhibit characteristics of immune genes. In this report, we describe VCBP genomic organization and characterize adjacent and intervening genetic features which may influence both their polymorphism and complex transcriptional repertoire. RESULTS VCBP genes 1, 2, 4, and 5 are encoded in a single contiguous gene-rich chromosomal region and VCBP3 is encoded in a separate locus. The VCBPs exhibit extensive haplotype variation, including copy number variation (CNV), indel polymorphism and a markedly elevated variation in repeat type and density. In at least one haplotype, inverted repeats occur more frequently than elsewhere in the genome. Multi-animal cDNA screening, as well as transcriptional profilingusing a novel transfection system, suggests that haplotype-specific transcriptional variants may contribute to VCBP genetic diversity. CONCLUSION The availability of the Branchiostoma floridae genome (Joint Genome Institute, Brafl1), along with BAC and PAC screening and sequencing described here, reveal that the relatively limited number of VCBP genes present in the amphioxus genome exhibit exceptionally high haplotype variation. These VCBP haplotypes contribute a diverse pool of allelic variants, which includes gene copy number variation, pseudogenes, and other polymorphisms, while contributing secondary effects on gene transcription as well.
Collapse
Affiliation(s)
- Larry J Dishaw
- All Children's Hospital, Department of Molecular Genetics, 801 Sixth Street South, St. Petersburg, FL 33701, USA
- H. Lee Moffitt Cancer Center and Research Institute, 12902 Magnolia Avenue, Tampa, FL 33612, USA
| | - M Gail Mueller
- All Children's Hospital, Department of Molecular Genetics, 801 Sixth Street South, St. Petersburg, FL 33701, USA
| | - Natasha Gwatney
- Department of Pediatrics, University of South Florida College of Medicine, USF/ACH Children's Research Institute, 830 First Street South, St. Petersburg, FL 33701, USA
| | - John P Cannon
- H. Lee Moffitt Cancer Center and Research Institute, 12902 Magnolia Avenue, Tampa, FL 33612, USA
- Department of Pediatrics, University of South Florida College of Medicine, USF/ACH Children's Research Institute, 830 First Street South, St. Petersburg, FL 33701, USA
| | - Robert N Haire
- Department of Pediatrics, University of South Florida College of Medicine, USF/ACH Children's Research Institute, 830 First Street South, St. Petersburg, FL 33701, USA
| | - Ronda T Litman
- Department of Pediatrics, University of South Florida College of Medicine, USF/ACH Children's Research Institute, 830 First Street South, St. Petersburg, FL 33701, USA
| | - Chris T Amemiya
- Benaroya Research Institute, 1201 Ninth Avenue, Seattle, WA 98101, USA
| | - Tatsuya Ota
- Department of Evolutionary Studies of Biosystems, The Graduate University for Advanced Studies, Kamiyamaguchi 1560-35, Hayama 240-0193 Japan
| | - Lee Rowen
- Institute for Systems Biology, 1441 N. 34th St, Seattle, WA, 98103, USA
| | - Gustavo Glusman
- Institute for Systems Biology, 1441 N. 34th St, Seattle, WA, 98103, USA
| | - Gary W Litman
- All Children's Hospital, Department of Molecular Genetics, 801 Sixth Street South, St. Petersburg, FL 33701, USA
- H. Lee Moffitt Cancer Center and Research Institute, 12902 Magnolia Avenue, Tampa, FL 33612, USA
- Department of Pediatrics, University of South Florida College of Medicine, USF/ACH Children's Research Institute, 830 First Street South, St. Petersburg, FL 33701, USA
| |
Collapse
|
15
|
Lee JY, Mogen JL, Chavez A, Johnson FB. Sgs1 RecQ helicase inhibits survival of Saccharomyces cerevisiae cells lacking telomerase and homologous recombination. J Biol Chem 2008; 283:29847-58. [PMID: 18757364 DOI: 10.1074/jbc.m804760200] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
In yeast telomerase mutants, the Sgs1 RecQ helicase slows the rate of senescence and also facilitates the appearance of certain types of survivors of critical telomere shortening via mechanisms dependent on Rad52-dependent homologous recombination (HR). Here we describe a third function for Sgs1 in telomerase-deficient cells, inhibition of survivors that grow independent of Rad52. Unlike tlc1 rad52 double mutants, which do not form survivors of telomere dysfunction, tlc1 rad52 sgs1 triple mutants readily generated survivors. After emerging from growth crisis, the triple mutants progressively lost telomeric and subtelomeric sequences, yet grew for more than 1 year. Analysis of cloned chromosome termini and of copy number changes of loci genome-wide using tiling arrays revealed terminal deletions extending up to 57 kb, as well as changes in Ty retrotransposon copy numbers. Amplification of the remaining terminal sequences generated large palindromes at some chromosome termini. Sgs1 helicase activity but not checkpoint function was essential for inhibiting the appearance of the survivors, and the continued absence of Sgs1 was required for the growth of the established survivors. Thus, in addition to facilitating the maintenance of telomere repeat sequences via HR-dependent mechanisms, a RecQ helicase can prevent the adoption of HR-independent mechanisms that stabilize chromosome termini without the use of natural telomere sequences. This provides a novel mechanism by which RecQ helicases may help maintain genome integrity and thus prevent age-related diseases and cancer.
Collapse
Affiliation(s)
- Julia Y Lee
- Department of Pathology and Laboratory Medicine, University of Pennsylvania School of Medicine, Philadelphia, Pennsylvania 19104, USA
| | | | | | | |
Collapse
|
16
|
Neiman PE, Elsaesser K, Loring G, Kimmel R. Myc oncogene-induced genomic instability: DNA palindromes in bursal lymphomagenesis. PLoS Genet 2008; 4:e1000132. [PMID: 18636108 PMCID: PMC2444050 DOI: 10.1371/journal.pgen.1000132] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2008] [Accepted: 06/18/2008] [Indexed: 12/16/2022] Open
Abstract
Genetic instability plays a key role in the formation of naturally occurring cancer. The formation of long DNA palindromes is a rate-limiting step in gene amplification, a common form of tumor-associated genetic instability. Genome-wide analysis of palindrome formation (GAPF) has detected both extensive palindrome formation and gene amplification, beginning early in tumorigenesis, in an experimental Myc-induced model tumor system in the chicken bursa of Fabricius. We determined that GAPF-detected palindromes are abundant and distributed nonrandomly throughout the genome of bursal lymphoma cells, frequently at preexisting short inverted repeats. By combining GAPF with chromatin immunoprecipitation (ChIP), we found a significant association between occupancy of gene-proximal Myc binding sites and the formation of palindromes. Numbers of palindromic loci correlate with increases in both levels of Myc over-expression and ChIP-detected occupancy of Myc binding sites in bursal cells. However, clonal analysis of chick DF-1 fibroblasts suggests that palindrome formation is a stochastic process occurring in individual cells at a small number of loci relative to much larger numbers of susceptible loci in the cell population and that the induction of palindromes is not involved in Myc-induced acute fibroblast transformation. GAPF-detected palindromes at the highly oncogenic bic/miR-155 locus in all of our preneoplastic and neoplastic bursal samples, but not in DNA from normal and other transformed cell types. This finding indicates very strong selection during bursal lymphomagenesis. Therefore, in addition to providing a platform for gene copy number change, palindromes may alter microRNA genes in a fashion that can contribute to cancer development.
Collapse
MESH Headings
- Animals
- Bursa of Fabricius/pathology
- Cell Line, Transformed
- Cell Line, Tumor
- Cell Transformation, Neoplastic
- Cell Transformation, Viral
- Chickens
- Chromatin Immunoprecipitation
- DNA, Complementary
- DNA, Neoplasm/genetics
- Gene Amplification
- Genes, myc
- Genetic Vectors
- Genomic Instability
- Lymphoma, B-Cell/etiology
- Lymphoma, B-Cell/genetics
- Nucleic Acid Hybridization
- Oligonucleotide Array Sequence Analysis
- Precancerous Conditions/genetics
- Repetitive Sequences, Nucleic Acid
- Retroviridae/genetics
- Stochastic Processes
Collapse
Affiliation(s)
- Paul E Neiman
- Division of Basic Science, Fred Hutchinson Cancer Research Center, Seattle, Washington, United States of America.
| | | | | | | |
Collapse
|
17
|
Inagaki K, Lewis SM, Wu X, Ma C, Munroe DJ, Fuess S, Storm TA, Kay MA, Nakai H. DNA palindromes with a modest arm length of greater, similar 20 base pairs are a significant target for recombinant adeno-associated virus vector integration in the liver, muscles, and heart in mice. J Virol 2007; 81:11290-303. [PMID: 17686840 PMCID: PMC2045527 DOI: 10.1128/jvi.00963-07] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
Our previous study has shown that recombinant adeno-associated virus (rAAV) vector integrates preferentially in genes, near transcription start sites and CpG islands in mouse liver (H. Nakai, X. Wu, S. Fuess, T. A. Storm, D. Munroe, E. Montini, S. M. Burgess, M. Grompe, and M. A. Kay, J. Virol. 79:3606-3614, 2005). However, the previous method relied on in vivo selection of rAAV integrants and could be employed for the liver but not for other tissues. Here, we describe a novel method for high-throughput rAAV integration site analysis that does not rely on marker gene expression, selection, or cell division, and therefore it can identify rAAV integration sites in nondividing cells without cell manipulations. Using this new method, we identified and characterized a total of 997 rAAV integration sites in mouse liver, skeletal muscle, and heart, transduced with rAAV2 or rAAV8 vector. The results support our previous observations, but notably they have revealed that DNA palindromes with an arm length of greater, similar 20 bp (total length, greater, similar 40 bp) are a significant target for rAAV integration. Up to approximately 30% of total integration events occurred in the vicinity of DNA palindromes with an arm length of greater, similar 20 bp. Considering that DNA palindromes may constitute fragile genomic sites, our results support the notion that rAAV integrates at chromosomal sites susceptible to breakage or preexisting breakage sites. The use of rAAV to label fragile genomic sites may provide an important new tool for probing the intrinsic source of ongoing genomic instability in various tissues in animals, studying DNA palindrome metabolism in vivo, and understanding their possible contributions to carcinogenesis and aging.
Collapse
Affiliation(s)
- Katsuya Inagaki
- Department of Molecular Genetics & Biochemistry, University of Pittsburgh School of Medicine, W1244 BSTWR, 200 Lothrop St., Pittsburgh, PA 15261, USA
| | | | | | | | | | | | | | | | | |
Collapse
|