1
|
He C, Zhu H. Evolutionary Nonindependence Between Human piRNAs and Their Potential Target Sites in Protein-Coding Genes. J Mol Evol 2025; 93:83-99. [PMID: 39621077 DOI: 10.1007/s00239-024-10220-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2024] [Accepted: 11/08/2024] [Indexed: 02/26/2025]
Abstract
PIWI-interacting RNAs (piRNAs) are the most diverse small RNAs in animals. These small RNAs have been known to play an important role in the suppression of transposable elements (TEs). Protein-coding genes (PCGs) are the most well-recognized functional genes in genomes. In the present study, we designed and performed a set of statistics-based evolutionary analyses to reveal nonrandom phenomena in the evolution of human piRNA-PCG targeting relationships. Through analyzing the occurrence of single nucleotide variants (SNVs) in potential piRNA target sites in human PCGs, we provide evidence that there exists a mutational force biased to strengthen piRNA-PCG targeting relationships. Through analyzing the allele frequencies of SNVs in potential piRNA target sites in human PCGs, we provide evidence that there exists a piRNA-dependent selective force acting on potential piRNA target sites in human PCGs. Because of these nonrandom evolutionary forces, human piRNAs and their potential target sites in PCGs are not independent in evolution. Additionally, we found evidence that potential piRNA target sites in human PCGs are particularly likely to be present in regions derived from Alu elements. This finding suggests that the aforementioned evolutionary forces acting on piRNA-PCG targeting relationships could be particularly prone to affect Alu-derived regions in human PCGs. Collectively, our findings provide new insights into the evolutionary interplay between piRNAs, PCGs, and Alu elements in the evolution of the human genome.
Collapse
Affiliation(s)
- Chong He
- Bioinformatics Section, School of Basic Medical Sciences, Southern Medical University, Guangzhou, 510515, China.
| | - Hao Zhu
- Bioinformatics Section, School of Basic Medical Sciences, Southern Medical University, Guangzhou, 510515, China
- Guangdong-Hong Kong-Macao Greater Bay Area Center for Brain Science and Brain-Inspired Intelligence, Southern Medical University, Guangzhou, 510515, China
- Guangdong Provincial Key Lab of Single Cell Technology and Application, Southern Medical University, Guangzhou, 510515, China
| |
Collapse
|
2
|
Balachandran P, Walawalkar IA, Flores JI, Dayton JN, Audano PA, Beck CR. Transposable element-mediated rearrangements are prevalent in human genomes. Nat Commun 2022; 13:7115. [PMID: 36402840 PMCID: PMC9675761 DOI: 10.1038/s41467-022-34810-8] [Citation(s) in RCA: 29] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2022] [Accepted: 11/08/2022] [Indexed: 11/21/2022] Open
Abstract
Transposable elements constitute about half of human genomes, and their role in generating human variation through retrotransposition is broadly studied and appreciated. Structural variants mediated by transposons, which we call transposable element-mediated rearrangements (TEMRs), are less well studied, and the mechanisms leading to their formation as well as their broader impact on human diversity are poorly understood. Here, we identify 493 unique TEMRs across the genomes of three individuals. While homology directed repair is the dominant driver of TEMRs, our sequence-resolved TEMR resource allows us to identify complex inversion breakpoints, triplications or other high copy number polymorphisms, and additional complexities. TEMRs are enriched in genic loci and can create potentially important risk alleles such as a deletion in TRIM65, a known cancer biomarker and therapeutic target. These findings expand our understanding of this important class of structural variation, the mechanisms responsible for their formation, and establish them as an important driver of human diversity.
Collapse
Affiliation(s)
| | | | - Jacob I Flores
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Jacob N Dayton
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Peter A Audano
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Christine R Beck
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA.
- Department of Genetics and Genome Sciences, University of Connecticut Health Center, Farmington, CT, USA.
- Institute for Systems Genomics, University of Connecticut, Storrs, CT, USA.
| |
Collapse
|
3
|
Jiang Y. Contribution of Microhomology to Genome Instability: Connection between DNA Repair and Replication Stress. Int J Mol Sci 2022; 23:12937. [PMID: 36361724 PMCID: PMC9657218 DOI: 10.3390/ijms232112937] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2022] [Revised: 10/23/2022] [Accepted: 10/23/2022] [Indexed: 11/23/2023] Open
Abstract
Microhomology-mediated end joining (MMEJ) is a highly mutagenic pathway to repair double-strand breaks (DSBs). MMEJ was thought to be a backup pathway of homologous recombination (HR) and canonical nonhomologous end joining (C-NHEJ). However, it attracts more attention in cancer research due to its special function of microhomology in many different aspects of cancer. In particular, it is initiated with DNA end resection and upregulated in homologous recombination-deficient cancers. In this review, I summarize the following: (1) the recent findings and contributions of MMEJ to genome instability, including phenotypes relevant to MMEJ; (2) the interaction between MMEJ and other DNA repair pathways; (3) the proposed mechanistic model of MMEJ in DNA DSB repair and a new connection with microhomology-mediated break-induced replication (MMBIR); and (4) the potential clinical application by targeting MMEJ based on synthetic lethality for cancer therapy.
Collapse
Affiliation(s)
- Yuning Jiang
- Department of Radiation Oncology, University of Virginia, Charlottesville, VA 22903, USA
| |
Collapse
|
4
|
Cosenza MR, Rodriguez-Martin B, Korbel JO. Structural Variation in Cancer: Role, Prevalence, and Mechanisms. Annu Rev Genomics Hum Genet 2022; 23:123-152. [DOI: 10.1146/annurev-genom-120121-101149] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Somatic rearrangements resulting in genomic structural variation drive malignant phenotypes by altering the expression or function of cancer genes. Pan-cancer studies have revealed that structural variants (SVs) are the predominant class of driver mutation in most cancer types, but because they are difficult to discover, they remain understudied when compared with point mutations. This review provides an overview of the current knowledge of somatic SVs, discussing their primary roles, prevalence in different contexts, and mutational mechanisms. SVs arise throughout the life history of cancer, and 55% of driver mutations uncovered by the Pan-Cancer Analysis of Whole Genomes project represent SVs. Leveraging the convergence of cell biology and genomics, we propose a mechanistic classification of somatic SVs, from simple to highly complex DNA rearrangement classes. The actions of DNA repair and DNA replication processes together with mitotic errors result in a rich spectrum of SV formation processes, with cascading effects mediating extensive structural diversity after an initiating DNA lesion has formed. Thanks to new sequencing technologies, including the sequencing of single-cell genomes, open questions about the molecular triggers and the biomolecules involved in SV formation as well as their mutational rates can now be addressed. Expected final online publication date for the Annual Review of Genomics and Human Genetics, Volume 23 is October 2022. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.
Collapse
Affiliation(s)
| | | | - Jan O. Korbel
- Genome Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany
- German Cancer Research Center (DKFZ), Heidelberg, Germany
| |
Collapse
|
5
|
Altamura R, Doshi J, Benenson Y. Rational design and construction of multi-copy biomanufacturing islands in mammalian cells. Nucleic Acids Res 2022; 50:561-578. [PMID: 34893882 PMCID: PMC8754653 DOI: 10.1093/nar/gkab1214] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2021] [Revised: 11/21/2021] [Accepted: 11/26/2021] [Indexed: 11/14/2022] Open
Abstract
Cell line development is a critical step in the establishment of a biopharmaceutical manufacturing process. Current protocols rely on random transgene integration and amplification. Due to considerable variability in transgene integration profiles, this workflow results in laborious screening campaigns before stable producers can be identified. Alternative approaches for transgene dosage increase and integration are therefore highly desirable. In this study, we present a novel strategy for the rapid design, construction, and genomic integration of engineered multiple-copy gene constructs consisting of up to 10 gene expression cassettes. Key to this strategy is the diversification, at the sequence level, of the individual gene cassettes without altering their protein products. We show a computational workflow for coding and regulatory sequence diversification and optimization followed by experimental assembly of up to nine gene copies and a sentinel reporter on a contiguous scaffold. Transient transfections in CHO cells indicates that protein expression increases with the gene copy number on the scaffold. Further, we stably integrate these cassettes into a pre-validated genomic locus. Altogether, our findings point to the feasibility of engineering a fully mapped multi-copy recombinant protein 'production island' in a mammalian cell line with greatly reduced screening effort, improved stability, and predictable product titers.
Collapse
Affiliation(s)
- Raffaele Altamura
- Department of Biosystems Science and Engineering, ETH Zurich, Mattenstrasse 26, Basel, 4058, Switzerland
| | - Jiten Doshi
- Department of Biosystems Science and Engineering, ETH Zurich, Mattenstrasse 26, Basel, 4058, Switzerland
| | - Yaakov Benenson
- Department of Biosystems Science and Engineering, ETH Zurich, Mattenstrasse 26, Basel, 4058, Switzerland
| |
Collapse
|
6
|
Zhang X, Li T, Ou J, Huang J, Liang P. Homology-based repair induced by CRISPR-Cas nucleases in mammalian embryo genome editing. Protein Cell 2021; 13:316-335. [PMID: 33945139 PMCID: PMC9008090 DOI: 10.1007/s13238-021-00838-7] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2020] [Accepted: 03/16/2021] [Indexed: 12/26/2022] Open
Abstract
Recent advances in genome editing, especially CRISPR-Cas nucleases, have revolutionized both laboratory research and clinical therapeutics. CRISPR-Cas nucleases, together with the DNA damage repair pathway in cells, enable both genetic diversification by classical non-homologous end joining (c-NHEJ) and precise genome modification by homology-based repair (HBR). Genome editing in zygotes is a convenient way to edit the germline, paving the way for animal disease model generation, as well as human embryo genome editing therapy for some life-threatening and incurable diseases. HBR efficiency is highly dependent on the DNA donor that is utilized as a repair template. Here, we review recent progress in improving CRISPR-Cas nuclease-induced HBR in mammalian embryos by designing a suitable DNA donor. Moreover, we want to provide a guide for producing animal disease models and correcting genetic mutations through CRISPR-Cas nuclease-induced HBR in mammalian embryos. Finally, we discuss recent developments in precise genome-modification technology based on the CRISPR-Cas system.
Collapse
Affiliation(s)
- Xiya Zhang
- Center for Reproductive Medicine, the Third Affiliated Hospital of Sun Yat-sen University, Sun Yat-sen University, Guangzhou, 510630, China
| | - Tao Li
- Center for Reproductive Medicine, the Third Affiliated Hospital of Sun Yat-sen University, Sun Yat-sen University, Guangzhou, 510630, China
| | - Jianping Ou
- Center for Reproductive Medicine, the Third Affiliated Hospital of Sun Yat-sen University, Sun Yat-sen University, Guangzhou, 510630, China.
| | - Junjiu Huang
- MOE Key Laboratory of Gene Function and Regulation, State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-sen University, Guangzhou, 510275, China. .,Key Laboratory of Reproductive Medicine of Guangdong Province, the First Affiliated Hospital and School of Life Sciences, Sun Yat-sen University, Guangzhou, 510275, China.
| | - Puping Liang
- MOE Key Laboratory of Gene Function and Regulation, State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-sen University, Guangzhou, 510275, China. .,Key Laboratory of Reproductive Medicine of Guangdong Province, the First Affiliated Hospital and School of Life Sciences, Sun Yat-sen University, Guangzhou, 510275, China.
| |
Collapse
|
7
|
Abelleyro MM, Radic CP, Marchione VD, Waisman K, Tetzlaff T, Neme D, Rossetti LC, De Brasi CD. Molecular insights into the mechanism of nonrecurrent F8 structural variants: Full breakpoint characterization and bioinformatics of DNA elements implicated in the upmost severe phenotype in hemophilia A. Hum Mutat 2020; 41:825-836. [PMID: 31898853 DOI: 10.1002/humu.23977] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2019] [Revised: 12/09/2019] [Accepted: 12/26/2019] [Indexed: 11/08/2022]
Abstract
Hemophilia A (HA) provides excellent models to analyze genotype-phenotype relationships and mutational mechanisms. NhF8ld's breakpoints were characterized using case-specific DNA-tags, direct- or inverse-polymerase chain reaction amplification, and Sanger sequencing. DNA-break's stimulators (n = 46), interspersed repeats, non-B-DNA, and secondary structures were analyzed around breakpoints versus null hypotheses (E-values) based on computer simulations and base-frequency probabilities. Nine of 18 (50%) severe-HA patients with nhF8lds developed inhibitors, 1/8 affecting one exon and 8/10 (80%) affecting multi-exons. NhF8lds range: 2-165 kb. Five (45%) nhF8lds involve F8-extragenic regions including three affecting vicinal genes (SMIM9 and BRCC3) but none shows an extra-phenotype not related to severe-HA. The contingency analysis of recombinogenic motifs at nhF8ld breakpoints indicated a significant involvement of several DNA-break stimulator elements. Most nhF8ld's breakpoint junctions showed microhomologies (1-7 bp). Three (27%) nhF8lds show complexities at the breakpoints: an 8-bp inverted-insertion, and the remnant two, inverted- and direct-insertions (46-68 bp) supporting replicative models microhomology-mediated break-induced replication/Fork Stalling and Template Switching. The remnant eight (73%) nhF8lds may support nonhomologous end joining/microhomology-mediated end joining models. Our study suggests the involvement of the retroposition machinery (e.g., Jurka-targets, Alu-elements, long interspersed nuclear elements, long terminal repeats), microhomologies, and secondary structures at breakpoints playing significant roles in the origin of the upmost severe phenotype in HA.
Collapse
Affiliation(s)
- Miguel Martín Abelleyro
- Instituto de Medicina Experimental (IMEX, CONICET-Academia Nacional de Medicina), Buenos Aires, Argentina
| | - Claudia Pamela Radic
- Instituto de Medicina Experimental (IMEX, CONICET-Academia Nacional de Medicina), Buenos Aires, Argentina
| | - Vanina Daniela Marchione
- Instituto de Medicina Experimental (IMEX, CONICET-Academia Nacional de Medicina), Buenos Aires, Argentina
| | - Karen Waisman
- Instituto de Medicina Experimental (IMEX, CONICET-Academia Nacional de Medicina), Buenos Aires, Argentina
| | - Tomas Tetzlaff
- Universidad Nacional de General Sarmiento, Los Polvorines, Argentina
| | - Daniela Neme
- Fundación de la Hemofilia Alfredo Pavlovsky, Buenos Aires, Argentina
| | - Liliana Carmen Rossetti
- Instituto de Medicina Experimental (IMEX, CONICET-Academia Nacional de Medicina), Buenos Aires, Argentina
| | - Carlos Daniel De Brasi
- Instituto de Medicina Experimental (IMEX, CONICET-Academia Nacional de Medicina), Buenos Aires, Argentina.,Instituto de Investigaciones Hematológicas Mariano R Castex (Academia Nacional de Medicina), Buenos Aires, Argentina
| |
Collapse
|
8
|
Advances in genome editing through control of DNA repair pathways. Nat Cell Biol 2019; 21:1468-1478. [PMID: 31792376 DOI: 10.1038/s41556-019-0425-z] [Citation(s) in RCA: 253] [Impact Index Per Article: 42.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2018] [Accepted: 10/25/2019] [Indexed: 02/06/2023]
Abstract
Eukaryotic cells deploy overlapping repair pathways to resolve DNA damage. Advancements in genome editing take advantage of these pathways to produce permanent genetic changes. Despite recent improvements, genome editing can produce diverse outcomes that can introduce risks in clinical applications. Although homology-directed repair is attractive for its ability to encode precise edits, it is particularly difficult in human cells. Here we discuss the DNA repair pathways that underlie genome editing and strategies to favour various outcomes.
Collapse
|
9
|
Beal MA, Meier MJ, Williams A, Rowan-Carroll A, Gagné R, Lindsay SJ, Fitzgerald T, Hurles ME, Marchetti F, Yauk CL. Paternal exposure to benzo(a)pyrene induces genome-wide mutations in mouse offspring. Commun Biol 2019; 2:228. [PMID: 31240266 PMCID: PMC6586636 DOI: 10.1038/s42003-019-0476-5] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2019] [Accepted: 05/23/2019] [Indexed: 01/14/2023] Open
Abstract
Understanding the effects of environmental exposures on germline mutation rates has been a decades-long pursuit in genetics. We used next-generation sequencing and comparative genomic hybridization arrays to investigate genome-wide mutations in the offspring of male mice exposed to benzo(a)pyrene (BaP), a common environmental pollutant. We demonstrate that offspring developing from sperm exposed during the mitotic or post-mitotic phases of spermatogenesis have significantly more de novo single nucleotide variants (1.8-fold; P < 0.01) than controls. Both phases of spermatogenesis are susceptible to the induction of heritable mutations, although mutations arising from post-fertilization events are more common after post-mitotic exposure. In addition, the mutation spectra in sperm and offspring of BaP-exposed males are consistent. Finally, we report a significant increase in transmitted copy number duplications (P = 0.001) in BaP-exposed sires. Our study demonstrates that germ cell mutagen exposures induce genome-wide mutations in the offspring that may be associated with adverse health outcomes.
Collapse
Affiliation(s)
- Marc A. Beal
- Carleton University, Ottawa, Ontario K1S 5B6 Canada
- Environmental Health Science and Research Bureau, Healthy Environments and Consumer Safety Branch, Health Canada, Ottawa, Ontario K1A 0K9 Canada
| | - Matthew J. Meier
- Environmental Health Science and Research Bureau, Healthy Environments and Consumer Safety Branch, Health Canada, Ottawa, Ontario K1A 0K9 Canada
| | - Andrew Williams
- Environmental Health Science and Research Bureau, Healthy Environments and Consumer Safety Branch, Health Canada, Ottawa, Ontario K1A 0K9 Canada
| | - Andrea Rowan-Carroll
- Environmental Health Science and Research Bureau, Healthy Environments and Consumer Safety Branch, Health Canada, Ottawa, Ontario K1A 0K9 Canada
| | - Rémi Gagné
- Environmental Health Science and Research Bureau, Healthy Environments and Consumer Safety Branch, Health Canada, Ottawa, Ontario K1A 0K9 Canada
| | - Sarah J. Lindsay
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA UK
| | - Tomas Fitzgerald
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA UK
| | - Matthew E. Hurles
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA UK
| | - Francesco Marchetti
- Environmental Health Science and Research Bureau, Healthy Environments and Consumer Safety Branch, Health Canada, Ottawa, Ontario K1A 0K9 Canada
| | - Carole L. Yauk
- Environmental Health Science and Research Bureau, Healthy Environments and Consumer Safety Branch, Health Canada, Ottawa, Ontario K1A 0K9 Canada
| |
Collapse
|
10
|
Pang D, Shang X, Cai D, Zhu F, Cheng Y, Zhong J, Yi S, Zhang Q, Xu X. Thalassaemia intermedia caused by coinheritance of a β‐thalassaemia mutation and a
de novo
duplication of α‐globin genes in the paternal allele. Br J Haematol 2019; 186:620-624. [DOI: 10.1111/bjh.15958] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/01/2019] [Accepted: 03/05/2019] [Indexed: 12/11/2022]
Affiliation(s)
- Dejian Pang
- Department of Medical Genetics School of Basic Medical Sciences Southern Medical UniversityGuangzhou Guangdong China
| | - Xuan Shang
- Department of Medical Genetics School of Basic Medical Sciences Southern Medical UniversityGuangzhou Guangdong China
- Guangdong Genetics Testing Engineering Research CentreGuangzhou Guangdong China
- Guangdong Provincial Key Laboratory of Single Cell Technology and Application Guangzhou GuangdongChina
| | - Decheng Cai
- Department of Medical Genetics School of Basic Medical Sciences Southern Medical UniversityGuangzhou Guangdong China
| | - Fei Zhu
- Department of Medical Genetics School of Basic Medical Sciences Southern Medical UniversityGuangzhou Guangdong China
| | - Yi Cheng
- Department of Medical Genetics School of Basic Medical Sciences Southern Medical UniversityGuangzhou Guangdong China
| | - Jianmei Zhong
- Department of Medical Genetics School of Basic Medical Sciences Southern Medical UniversityGuangzhou Guangdong China
| | - Sheng Yi
- Prenatal Diagnostic Centre Guangxi Zhuang Autonomous Region Women and Children Care Hospital Nanning Guangxi China
| | - Qianqian Zhang
- Department of Medical Genetics School of Basic Medical Sciences Southern Medical UniversityGuangzhou Guangdong China
| | - Xiangmin Xu
- Department of Medical Genetics School of Basic Medical Sciences Southern Medical UniversityGuangzhou Guangdong China
- Guangdong Genetics Testing Engineering Research CentreGuangzhou Guangdong China
- Guangdong Provincial Key Laboratory of Single Cell Technology and Application Guangzhou GuangdongChina
| |
Collapse
|
11
|
Korunes KL, Noor MAF. Pervasive gene conversion in chromosomal inversion heterozygotes. Mol Ecol 2018; 28:1302-1315. [PMID: 30387889 DOI: 10.1111/mec.14921] [Citation(s) in RCA: 44] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2018] [Revised: 09/27/2018] [Accepted: 10/22/2018] [Indexed: 12/30/2022]
Abstract
Chromosomal inversions shape recombination landscapes, and species differing by inversions may exhibit reduced gene flow in these regions of the genome. Though single crossovers within inversions are not usually recovered from inversion heterozygotes, the recombination barrier imposed by inversions is nuanced by noncrossover gene conversion. Here, we provide a genomewide empirical analysis of gene conversion rates both within species and in species hybrids. We estimate that gene conversion occurs at a rate of 1 × 10-5 to 2.5 × 10-5 converted sites per bp per generation in experimental crosses within Drosophila pseudoobscura and between D. pseudoobscura and its naturally hybridizing sister species D. persimilis. This analysis is the first direct empirical assessment of gene conversion rates within inversions of a species hybrid. Our data show that gene conversion rates in interspecies hybrids are at least as high as within-species estimates of gene conversion rates, and gene conversion occurs regularly within and around inverted regions of species hybrids, even near inversion breakpoints. We also found that several gene conversion events appeared to be mitotic rather than meiotic in origin. Finally, we observed that gene conversion rates are higher in regions of lower local sequence divergence, yet our observed gene conversion rates in more divergent inverted regions were at least as high as in less divergent collinear regions. Given our observed high rates of gene conversion despite the sequence differentiation between species, especially in inverted regions, gene conversion has the potential to reduce the efficacy of inversions as barriers to recombination over evolutionary time.
Collapse
|
12
|
Amico G, Grossi S, Vijzelaar R, Lanza F, Mazzotti R, Corsolini F, Ketema M, Filocamo M. MLPA-based approach for initial and simultaneous detection of GBA deletions and recombinant alleles in patients affected by Gaucher Disease. Mol Genet Metab 2016; 119:329-337. [PMID: 27802905 DOI: 10.1016/j.ymgme.2016.10.008] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/16/2016] [Revised: 10/24/2016] [Accepted: 10/24/2016] [Indexed: 11/23/2022]
Abstract
The chromosomal region, in which the GBA gene is located, is structurally subject to misalignments, reciprocal and nonreciprocal homologous recombination events, leading to structural defects such as deletions, duplications and gene-pseudogene complex rearrangements causing Gaucher Disease (GD). Interestingly deletions and duplications, belonging to the heterogeneous group of structural defects collectively termed Copy Number Variations (CNVs), together with gene-pseudogene complex rearrangements represent the main cause of pitfalls in GD mutational analysis. In the present study, we set up and validate a Multiplex Ligation-dependent Probe Amplification (MLPA)-based approach to simultaneously investigate the potential occurrence of CNVs and complex rearrangements in 8 unrelated GD patients who had still not-well-characterized or uncharacterized alleles. The findings allowed us to complete the mutational analysis in 4 patients, identifying a rare deletion (g.-3100_+834del3934) and 2 novel recombinant alleles (g.4356_7031conJ03060.1:g.2544_4568; g.1942_7319conJ03060.1:g.1092_4856). These results demonstrate the diagnostic usefulness of MLPA in the detection of GBA deletions and recombinations. In addition, MLPA findings have also served as a basis for developing molecular approaches to precisely pinpoint the breakpoints and characterize the underlying mechanism of copy number variations.
Collapse
Affiliation(s)
- Giulia Amico
- Centro di Diagnostica Genetica e Biochimica delle Malattie Metaboliche, Istituto G. Gaslini, Via Gaslini 5, 16147 Genova, Italy.
| | - Serena Grossi
- Centro di Diagnostica Genetica e Biochimica delle Malattie Metaboliche, Istituto G. Gaslini, Via Gaslini 5, 16147 Genova, Italy.
| | - Raymon Vijzelaar
- MRC-Holland, Willem Schoutenstraat 1, 1057 DL Amsterdam, The Netherlands.
| | - Federica Lanza
- Centro di Diagnostica Genetica e Biochimica delle Malattie Metaboliche, Istituto G. Gaslini, Via Gaslini 5, 16147 Genova, Italy.
| | - Raffaella Mazzotti
- Centro di Diagnostica Genetica e Biochimica delle Malattie Metaboliche, Istituto G. Gaslini, Via Gaslini 5, 16147 Genova, Italy.
| | - Fabio Corsolini
- Centro di Diagnostica Genetica e Biochimica delle Malattie Metaboliche, Istituto G. Gaslini, Via Gaslini 5, 16147 Genova, Italy.
| | - Mirjam Ketema
- MRC-Holland, Willem Schoutenstraat 1, 1057 DL Amsterdam, The Netherlands.
| | - Mirella Filocamo
- Centro di Diagnostica Genetica e Biochimica delle Malattie Metaboliche, Istituto G. Gaslini, Via Gaslini 5, 16147 Genova, Italy.
| |
Collapse
|
13
|
Duplication of chicken defensin7 gene generated by gene conversion and homologous recombination. Proc Natl Acad Sci U S A 2016; 113:13815-13820. [PMID: 27849592 DOI: 10.1073/pnas.1616948113] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023] Open
Abstract
Defensins constitute an evolutionary conserved family of cationic antimicrobial peptides that play a key role in host innate immune responses to infection. Defensin genes generally reside in complex genomic regions that are prone to structural variation, and defensin genes exhibit extensive copy number variation in humans and in other species. Copy number variation of defensin genes was examined in inbred lines of Leghorn and Fayoumi chickens, and a duplication of defensin7 was discovered in the Fayoumi breed. Analysis of junction sequences confirmed the occurrence of a simple tandem duplication of defensin7 with sequence identity at the junction, suggesting nonallelic homologous recombination between defensin7 and defensin6 The duplication event generated two chimeric promoters that are best explained by gene conversion followed by homologous recombination. Expression of defensin7 was not elevated in animals with two genes despite both genes being transcribed in the tissues examined. Computational prediction of promoter regions revealed the presence of several putative transcription factor binding sites generated by the duplication event. These data provide insight into the evolution and possible function of large gene families and specifically, the defensins.
Collapse
|
14
|
Weatherly DB, Peng D, Tarleton RL. Recombination-driven generation of the largest pathogen repository of antigen variants in the protozoan Trypanosoma cruzi. BMC Genomics 2016; 17:729. [PMID: 27619017 PMCID: PMC5020489 DOI: 10.1186/s12864-016-3037-z] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2016] [Accepted: 08/24/2016] [Indexed: 12/15/2022] Open
Abstract
Background The protozoan parasite Trypanosoma cruzi, causative agent of Chagas disease, depends upon a cell surface-expressed trans-sialidase (ts) to avoid activation of complement-mediated lysis and to enhance intracellular invasion. However these functions alone fail to account for the size of this gene family in T. cruzi, especially considering that most of these genes encode proteins lacking ts enzyme activity. Previous whole genome sequencing of the CL Brener clone of T. cruzi identified ~1400 ts variants, but left many partially assembled sequences unannotated. Results In the current study we reevaluated the trans-sialidase-like sequences in this reference strain, identifying an additional 1779 full-length and partial ts genes with their important features annotated, and confirming the expression of previously annotated “pseudogenes” and newly annotated ts family members. Multiple EM for Motif Elicitation (MEME) analysis allowed us to generate a model T. cruzi ts (TcTS) based upon the most conserved motif patterns and demonstrated that a common motif order is highly conserved among ts family members. Using a newly developed pipeline for the analysis of recombination within large gene families, we further demonstrate that TcTS family members are undergoing frequent recombination, generating new variants from the thousands of functional and non-functional ts gene segments but retaining the overall structure of the core TcTS family members. Conclusions The number and variety as well as high recombination frequency of TcTS family members supports strong evolutionary pressure, probably exerted by immune selection, for continued variation in ts sequences in T. cruzi, and thus for a unique immune evasion mechanism for the large ts gene family. Electronic supplementary material The online version of this article (doi:10.1186/s12864-016-3037-z) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- D Brent Weatherly
- Center for Tropical and Emerging Global Diseases, Institute of Bioinformatics and Department of Cellular Biology, University of Georgia, Athens, GA, 30602, USA.,Center for Complex Carbohydrate Research, University of Georgia, Athens, GA, 30602, USA
| | - Duo Peng
- Center for Tropical and Emerging Global Diseases, Institute of Bioinformatics and Department of Cellular Biology, University of Georgia, Athens, GA, 30602, USA
| | - Rick L Tarleton
- Center for Tropical and Emerging Global Diseases, Institute of Bioinformatics and Department of Cellular Biology, University of Georgia, Athens, GA, 30602, USA.
| |
Collapse
|
15
|
Korunes KL, Noor MAF. Gene conversion and linkage: effects on genome evolution and speciation. Mol Ecol 2016; 26:351-364. [DOI: 10.1111/mec.13736] [Citation(s) in RCA: 43] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2016] [Revised: 06/07/2016] [Accepted: 06/22/2016] [Indexed: 12/12/2022]
|
16
|
Guo X, Delio M, Haque N, Castellanos R, Hestand MS, Vermeesch JR, Morrow BE, Zheng D. Variant discovery and breakpoint region prediction for studying the human 22q11.2 deletion using BAC clone and whole genome sequencing analysis. Hum Mol Genet 2016; 25:3754-3767. [PMID: 27436579 DOI: 10.1093/hmg/ddw221] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2016] [Revised: 06/28/2016] [Accepted: 07/01/2016] [Indexed: 11/13/2022] Open
Abstract
Velo-cardio-facial syndrome/DiGeorge syndrome/22q11.2 deletion syndrome (22q11.2DS) is caused by meiotic non-allelic homologous recombination events between flanking low copy repeats termed LCR22A and LCR22D, resulting in a 3 million base pair (Mb) deletion. Due to their complex structure, large size and high sequence identity, genetic variation within LCR22s among different individuals has not been well characterized. In this study, we sequenced 13 BAC clones derived from LCR22A/D and aligned them with 15 previously available BAC sequences to create a new genetic variation map. The thousands of variants identified by this analysis were not uniformly distributed in the two LCR22s. Moreover, shared single nucleotide variants between LCR22A and LCR22D were enriched in the Breakpoint Cluster Region pseudogene (BCRP) block, suggesting the existence of a possible recombination hotspot there. Interestingly, breakpoints for atypical 22q11.2 rearrangements have previously been located to BCRPs To further explore this finding, we carried out in-depth analyses of whole genome sequence (WGS) data from two unrelated probands harbouring a de novo 3Mb 22q11.2 deletion and their normal parents. By focusing primarily on WGS reads uniquely mapped to LCR22A, using the variation map from our BAC analysis to help resolve allele ambiguity, and by performing PCR analysis, we infer that the deletion breakpoints were most likely located near or within the BCRP module. In summary, we found a high degree of sequence variation in LCR22A and LCR22D and a potential recombination breakpoint near or within the BCRP block, providing a starting point for future breakpoint mapping using additional trios.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | - Deyou Zheng
- Department of Neurology .,Department of Genetics.,Department of Neuroscience, Albert Einstein College of Medicine, Bronx, NY, USA
| |
Collapse
|
17
|
Lanning DK, Knight KL. Diversification of the Primary Antibody Repertoire by AID-Mediated Gene Conversion. Results Probl Cell Differ 2016; 57:279-93. [PMID: 26537386 DOI: 10.1007/978-3-319-20819-0_12] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Gene conversion, mediated by activation-induced cytidine deaminase (AID), has been found to contribute to generation of the primary antibody repertoire in several vertebrate species. Generation of the primary antibody repertoire by gene conversion of immunoglobulin (Ig) genes occurs primarily in gut-associated lymphoid tissues (GALT) and is best described in chicken and rabbit. Here, we discuss current knowledge of the mechanism of gene conversion as well as the contribution of the microbiota in promoting gene conversion of Ig genes. Finally, we propose that the antibody diversification strategy used in GALT species, such as chicken and rabbit, is conserved in a subset of human and mouse B cells.
Collapse
Affiliation(s)
- Dennis K Lanning
- Department of Microbiology and Immunology, Loyola University Chicago, 2160 S. First Avenue, Maywood, IL, 60153, USA
| | - Katherine L Knight
- Department of Microbiology and Immunology, Loyola University Chicago, 2160 S. First Avenue, Maywood, IL, 60153, USA.
| |
Collapse
|
18
|
Abstract
The ends of linear chromosomes are capped by nucleoprotein structures called telomeres. A dysfunctional telomere may resemble a DNA double-strand break (DSB), which is a severe form of DNA damage. The presence of one DSB is sufficient to drive cell cycle arrest and cell death. Therefore cells have evolved mechanisms to repair DSBs such as homologous recombination (HR). HR-mediated repair of telomeres can lead to genome instability, a hallmark of cancer cells, which is why such repair is normally inhibited. However, some HR-mediated processes are required for proper telomere function. The need for some recombination activities at telomeres but not others necessitates careful and complex regulation, defects in which can lead to catastrophic consequences. Furthermore, some cell types can maintain telomeres via telomerase-independent, recombination-mediated mechanisms. In humans, these mechanisms are called alternative lengthening of telomeres (ALT) and are used in a subset of human cancer cells. In this review, we summarize the different recombination activities occurring at telomeres and discuss how they are regulated. Much of the current knowledge is derived from work using yeast models, which is the focus of this review, but relevant studies in mammals are also included.
Collapse
Affiliation(s)
- Clémence Claussin
- European Research Institute for the Biology of Ageing, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
| | - Michael Chang
- European Research Institute for the Biology of Ageing, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
| |
Collapse
|
19
|
Su P, Wang Y, Cooper DN, Zhu W, Huang D, Férec C, Wang Y, Chen JM. Disclosing the Hidden Structure and Underlying Mutational Mechanism of a Novel Type of Duplication CNV Responsible for Hereditary Multiple Osteochondromas. Hum Mutat 2015; 36:758-63. [PMID: 25990786 DOI: 10.1002/humu.22815] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2015] [Accepted: 05/01/2015] [Indexed: 02/05/2023]
Abstract
The additional mutational complexity associated with copy number variation (CNV) can provide important clues as to the underlying mechanisms of CNV formation. Correct annotation of the additional mutational complexity is, however, a prerequisite for establishing the mutational mechanism. We illustrate this point through the characterization of a novel ∼230 kb EXT1 duplication CNV causing autosomal dominant hereditary multiple osteochondromas. Whole-genome sequencing initially identified the CNV as having a 22-bp insertion at the breakpoint junction and, unprecedentedly, multiple breakpoint-flanking micromutations on both sides of the duplication. Further investigation revealed that this genomic rearrangement had a duplication-inverted triplication-duplication structure, the inverted triplication being a 41-bp sequence synthesized from a nearby template. This permitted the identification of the sequence determinants of both the initiation (an inverted Alu repeat) and termination (a triplex-forming sequence) of break-induced replication and suggested a possible model for the repair of replication-associated double-strand breaks.
Collapse
Affiliation(s)
- Peiqiang Su
- Department of Orthopedics, First Affiliated Hospital, Sun Yat-Sen University, Guangzhou, China
| | - Ye Wang
- Department of Medical Genetics, Zhongshan School of Medicine and Center for Genome Research, Sun Yat-Sen University, Guangzhou, China
| | - David N Cooper
- Institute of Medical Genetics, School of Medicine, Cardiff University, Cardiff, United Kingdom
| | - Wenjuan Zhu
- Beijing Genomics Institute (BGI)-Shenzhen, Shenzhen, China
| | - Dongsheng Huang
- Department of Orthopedics, Sun Yat-Sen Memorial Hospital, Sun Yat-Sen University, Guangzhou, China
| | - Claude Férec
- Institut National de la Santé et de la Recherche Médicale (INSERM), U1078, Brest, France.,Etablissement Français du Sang (EFS)-Bretagne, Brest, France.,Faculté de Médecine et des Sciences de la Santé, Université de Bretagne Occidentale (UBO), Brest, France.,Laboratoire de Génétique Moléculaire et d'Histocompatibilité, Centre Hospitalier Universitaire (CHU) Brest, Hôpital Morvan, Brest, France
| | - Yiming Wang
- Department of Medical Genetics, Zhongshan School of Medicine and Center for Genome Research, Sun Yat-Sen University, Guangzhou, China.,Beijing Genomics Institute (BGI)-Shenzhen, Shenzhen, China
| | - Jian-Min Chen
- Institut National de la Santé et de la Recherche Médicale (INSERM), U1078, Brest, France.,Etablissement Français du Sang (EFS)-Bretagne, Brest, France.,Faculté de Médecine et des Sciences de la Santé, Université de Bretagne Occidentale (UBO), Brest, France
| |
Collapse
|
20
|
Minimum length of direct repeat sequences required for efficient homologous recombination induced by zinc finger nuclease in yeast. Mol Biol Rep 2014; 41:6939-48. [DOI: 10.1007/s11033-014-3579-6] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2013] [Accepted: 07/01/2014] [Indexed: 10/25/2022]
|
21
|
Abstract
Gene conversion (conversion), the unidirectional transfer of DNA sequence information, occurs as a byproduct of recombinational repair of broken or damaged DNA molecules. Whereas excision repair processes replace damaged DNA by copying the complementary sequence from the undamaged strand of duplex DNA, recombinational mechanisms copy similar sequence, usually in another molecule, to replace the damaged sequence. In mitotic cells the other molecule is usually a sister chromatid, and the repair does not lead to genetic change. Less often a homologous chromosome or homologous sequence in an ectopic position is used. Conversion results from repair in two ways. First, if there was a double-strand gap at the site of a break, homologous sequence will be used as the template for synthesis to fill the gap, thus transferring sequence information in both strands. Second, recombinational repair uses complementary base pairing, and the heteroduplex molecule so formed is a source of conversion, both as heteroduplex and when donor (undamaged template) information is retained after correction of mismatched bases in heteroduplex. There are mechanisms that favour the use of sister molecules that must fail before ectopic homology can be used. Meiotic recombination events lead to the formation of crossovers required in meiosis for orderly segregation of pairs of homologous chromosomes. These events result from recombinational repair of programmed double-strand breaks, but in contrast with mitotic recombination, meiotic recombinational events occur predominantly between homologous chromosomes, so that transfer of sequence differences by conversion is very frequent. Transient recombination events that do not form crossovers form both between homologous chromosomes and between regions of ectopic homology, and leave their mark in the occurrence of frequent non-crossover conversion, including ectopic conversion.
Collapse
|
22
|
Nascent DNA synthesis during homologous recombination is synergistically promoted by the rad51 recombinase and DNA homology. Genetics 2014; 197:107-19. [PMID: 24583581 DOI: 10.1534/genetics.114.161455] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
In this study, we exploited a plasmid-based assay that detects the new DNA synthesis (3' extension) that accompanies Rad51-mediated homology searching and strand invasion steps of homologous recombination to investigate the interplay between Rad51 concentration and homology length. Mouse hybridoma cells that express endogenous levels of Rad51 display an approximate linear increase in the frequency of 3' extension for homology lengths of 500 bp to 2 kb. At values below ∼500 bp, the frequency of 3' extension declines markedly, suggesting that this might represent the minimal efficient processing segment for 3' extension. Overexpression of wild-type Rad51 stimulated the frequency of 3' extension by ∼3-fold for homology lengths <900 bp, but when homology was >2 kb, 3' extension frequency increased by as much as 10-fold. Excess wild-type Rad51 did not increase the average 3' extension tract length. Analysis of cell lines expressing N-terminally FLAG-tagged Rad51 polymerization mutants F86E, A89E, or F86E/A89E established that the 3' extension process requires Rad51 polymerization activity. Mouse hybridoma cells that have reduced Brca2 (Breast cancer susceptibility 2) due to stable expression of small interfering RNA show a significant reduction in 3' extension efficiency; expression of wild-type human BRCA2, but not a BRCA2 variant devoid of BRC repeats 1-8, rescues the 3' extension defect in these cells. Our results suggest that increased Rad51 concentration and homology length interact synergistically to promote 3' extension, presumably as a result of enhanced Brca2-mediated Rad51 polymerization.
Collapse
|
23
|
Gjini E, Haydon DT, David Barry J, Cobbold CA. Revisiting the diffusion approximation to estimate evolutionary rates of gene family diversification. J Theor Biol 2014; 341:111-22. [PMID: 24120993 DOI: 10.1016/j.jtbi.2013.10.001] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2012] [Revised: 06/21/2013] [Accepted: 10/02/2013] [Indexed: 11/18/2022]
Abstract
Genetic diversity in multigene families is shaped by multiple processes, including gene conversion and point mutation. Because multi-gene families are involved in crucial traits of organisms, quantifying the rates of their genetic diversification is important. With increasing availability of genomic data, there is a growing need for quantitative approaches that integrate the molecular evolution of gene families with their higher-scale function. In this study, we integrate a stochastic simulation framework with population genetics theory, namely the diffusion approximation, to investigate the dynamics of genetic diversification in a gene family. Duplicated genes can diverge and encode new functions as a result of point mutation, and become more similar through gene conversion. To model the evolution of pairwise identity in a multigene family, we first consider all conversion and mutation events in a discrete manner, keeping track of their details and times of occurrence; second we consider only the infinitesimal effect of these processes on pairwise identity accounting for random sampling of genes and positions. The purely stochastic approach is closer to biological reality and is based on many explicit parameters, such as conversion tract length and family size, but is more challenging analytically. The population genetics approach is an approximation accounting implicitly for point mutation and gene conversion, only in terms of per-site average probabilities. Comparison of these two approaches across a range of parameter combinations reveals that they are not entirely equivalent, but that for certain relevant regimes they do match. As an application of this modelling framework, we consider the distribution of nucleotide identity among VSG genes of African trypanosomes, representing the most prominent example of a multi-gene family mediating parasite antigenic variation and within-host immune evasion.
Collapse
Affiliation(s)
- Erida Gjini
- Instituto Gulbenkian de Ciência Oeiras, Portugal.
| | - Daniel T Haydon
- Institute of Biodiversity, Animal Health and Comparative Medicine, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, United Kingdom; The Boyd Orr Centre for Population and Ecosystem Health, University of Glasgow, Glasgow, United Kingdom; Wellcome Trust Centre for Molecular Parasitology, Institute of Infection, Immunity and Inflammation, University of Glasgow, Glasgow, United Kingdom
| | - J David Barry
- Wellcome Trust Centre for Molecular Parasitology, Institute of Infection, Immunity and Inflammation, University of Glasgow, Glasgow, United Kingdom
| | - Christina A Cobbold
- School of Mathematics and Statistics, College of Science and Engineering, University of Glasgow, Glasgow, United Kingdom; The Boyd Orr Centre for Population and Ecosystem Health, University of Glasgow, Glasgow, United Kingdom
| |
Collapse
|
24
|
Yang L, Luquette LJ, Gehlenborg N, Xi R, Haseley PS, Hsieh CH, Zhang C, Ren X, Protopopov A, Chin L, Kucherlapati R, Lee C, Park PJ. Diverse mechanisms of somatic structural variations in human cancer genomes. Cell 2013; 153:919-29. [PMID: 23663786 DOI: 10.1016/j.cell.2013.04.010] [Citation(s) in RCA: 254] [Impact Index Per Article: 21.2] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2012] [Revised: 02/28/2013] [Accepted: 03/29/2013] [Indexed: 01/09/2023]
Abstract
Identification of somatic rearrangements in cancer genomes has accelerated through analysis of high-throughput sequencing data. However, characterization of complex structural alterations and their underlying mechanisms remains inadequate. Here, applying an algorithm to predict structural variations from short reads, we report a comprehensive catalog of somatic structural variations and the mechanisms generating them, using high-coverage whole-genome sequencing data from 140 patients across ten tumor types. We characterize the relative contributions of different types of rearrangements and their mutational mechanisms, find that ~20% of the somatic deletions are complex deletions formed by replication errors, and describe the differences between the mutational mechanisms in somatic and germline alterations. Importantly, we provide detailed reconstructions of the events responsible for loss of CDKN2A/B and gain of EGFR in glioblastoma, revealing that these alterations can result from multiple mechanisms even in a single genome and that both DNA double-strand breaks and replication errors drive somatic rearrangements.
Collapse
Affiliation(s)
- Lixing Yang
- Center for Biomedical Informatics, Harvard Medical School, Boston, MA 02115, USA
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
25
|
Katju V. In with the old, in with the new: the promiscuity of the duplication process engenders diverse pathways for novel gene creation. INTERNATIONAL JOURNAL OF EVOLUTIONARY BIOLOGY 2012; 2012:341932. [PMID: 23008799 PMCID: PMC3449122 DOI: 10.1155/2012/341932] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/18/2012] [Accepted: 06/03/2012] [Indexed: 01/26/2023]
Abstract
The gene duplication process has exhibited far greater promiscuity in the creation of paralogs with novel exon-intron structures than anticipated even by Ohno. In this paper I explore the history of the field, from the neo-Darwinian synthesis through Ohno's formulation of the canonical model for the evolution of gene duplicates and culminating in the present genomic era. I delineate the major tenets of Ohno's model and discuss its failure to encapsulate the full complexity of the duplication process as revealed in the era of genomics. I discuss the diverse classes of paralogs originating from both DNA- and RNA-mediated duplication events and their evolutionary potential for assuming radically altered functions, as well as the degree to which they can function unconstrained from the pressure of gene conversion. Lastly, I explore theoretical population-genetic considerations of how the effective population size (N(e)) of a species may influence the probability of emergence of genes with radically altered functions.
Collapse
Affiliation(s)
- Vaishali Katju
- Department of Biology, University of New Mexico, Albuquerque, NM 87131, USA
| |
Collapse
|
26
|
Simmons AD, Carvalho CMB, Lupski JR. What have studies of genomic disorders taught us about our genome? Methods Mol Biol 2012; 838:1-27. [PMID: 22228005 DOI: 10.1007/978-1-61779-507-7_1] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
Abstract
The elucidation of genomic disorders began with molecular technologies that enabled detection of genomic changes which were (a) smaller than those resolved by traditional cytogenetics (less than 5 Mb) and (b) larger than what could be determined by conventional gel electrophoresis. Methods such as pulsed field gel electrophoresis (PFGE) and fluorescent in situ hybridization (FISH) could resolve such changes but were limited to locus-specific studies. The study of genomic disorders has rapidly advanced with the development of array-based techniques. These enabled examination of the entire human genome at a higher level of resolution, thus allowing elucidation of the basis of many new disorders, mechanisms that result in genomic changes that can result in copy number variation (CNV), and most importantly, a deeper understanding of the characteristics, features, and plasticity of our genome. In this chapter, we focus on the structural and architectural features of the genome, which can potentially result in genomic instability, delineate how mechanisms, such as NAHR, NHEJ, and FoSTeS/MMBIR lead to disease-causing rearrangements, and briefly describe the relationship between the leading methods presently used in studying genomic disorders. We end with a discussion on our new understanding about our genome including: the contribution of new mutation CNV to disease, the abundance of mosaicism, the extent of subtelomeric rearrangements, the frequency of de novo rearrangements associated with sporadic birth defects, the occurrence of balanced and unbalanced translocations, the increasing discovery of insertional translocations, the exploration of complex rearrangements and exonic CNVs. In the postgenomic era, our understanding of the genome has advanced very rapidly as the level of technical resolution has become higher. This leads to a greater understanding of the effects of rearrangements present both in healthy subjects and individuals with clinically relevant phenotypes.
Collapse
|
27
|
Grabarz A, Barascu A, Guirouilh-Barbat J, Lopez BS. Initiation of DNA double strand break repair: signaling and single-stranded resection dictate the choice between homologous recombination, non-homologous end-joining and alternative end-joining. Am J Cancer Res 2012; 2:249-268. [PMID: 22679557 PMCID: PMC3365807] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2012] [Accepted: 04/15/2012] [Indexed: 06/01/2023] Open
Abstract
A DNA double strand break (DSB) is a highly toxic lesion, which can generate genetic instability and profound genome rearrangements. However, DSBs are required to generate diversity during physiological processes such as meiosis or the establishment of the immune repertoire. Thus, the precise regulation of a complex network of processes is necessary for the maintenance of genomic stability, allowing genetic diversity but protecting against genetic instability and its consequences on oncogenesis. Two main strategies are employed for DSB repair: homologous recombination (HR) and non-homologous end-joining (NHEJ). HR is initiated by single-stranded DNA (ssDNA) resection and requires sequence homology with an intact partner, while NHEJ requires neither resection at initiation nor a homologous partner. Thus, resection is an pivotal step at DSB repair initiation, driving the choice of the DSB repair pathway employed. However, an alternative end-joining (A-EJ) pathway, which is highly mutagenic, has recently been described; A-EJ is initiated by ssDNA resection but does not require a homologous partner. The choice of the appropriate DSB repair system, for instance according the cell cycle stage, is essential for genome stability maintenance. In this context, controlling the initial events of DSB repair is thus an essential step that may be irreversible, and the wrong decision should lead to dramatic consequences. Here, we first present the main DSB repair mechanisms and then discuss the importance of the choice of the appropriate DSB repair pathway according to the cell cycle phase. In a third section, we present the early steps of DSB repair i.e., DSB signaling, chromatin remodeling, and the regulation of ssDNA resection. In the last part, we discuss the competition between the different DSB repair mechanisms. Finally, we conclude with the importance of the fine tuning of this network for genome stability maintenance and for tumor protection in fine.
Collapse
Affiliation(s)
- Anastazja Grabarz
- Université Paris Sud, Laboratoire « Stabilité Génétique et Oncogenèse » CNRS, UMR 8200 and Institut de Cancérologie Gustave-Roussy PR2, 114 Rue Edouard Vaillant, 94805 VILLEJUIF. CNRS, France
| | | | | | | |
Collapse
|
28
|
Mechanisms for recurrent and complex human genomic rearrangements. Curr Opin Genet Dev 2012; 22:211-20. [PMID: 22440479 DOI: 10.1016/j.gde.2012.02.012] [Citation(s) in RCA: 245] [Impact Index Per Article: 18.8] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2011] [Revised: 02/20/2012] [Accepted: 02/21/2012] [Indexed: 01/07/2023]
Abstract
During the last two decades, the importance of human genome copy number variation (CNV) in disease has become widely recognized. However, much is not understood about underlying mechanisms. We show how, although model organism research guides molecular understanding, important insights are gained from study of the wealth of information available in the clinic. We describe progress in explaining nonallelic homologous recombination (NAHR), a major cause of copy number change occurring when control of allelic recombination fails, highlight the growing importance of replicative mechanisms to explain complex events, and describe progress in understanding extreme chromosome reorganization (chromothripsis). Both nonhomologous end-joining and aberrant replication have significant roles in chromothripsis. As we study CNV, the processes underlying human genome evolution are revealed.
Collapse
|
29
|
Liu P, Lacaria M, Zhang F, Withers M, Hastings P, Lupski J. Frequency of nonallelic homologous recombination is correlated with length of homology: evidence that ectopic synapsis precedes ectopic crossing-over. Am J Hum Genet 2011; 89:580-8. [PMID: 21981782 DOI: 10.1016/j.ajhg.2011.09.009] [Citation(s) in RCA: 83] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2011] [Revised: 09/14/2011] [Accepted: 09/15/2011] [Indexed: 10/16/2022] Open
Abstract
Genomic disorders constitute a class of diseases that are associated with DNA rearrangements resulting from region-specific genome instability, that is, genome architecture incites genome instability. Nonallelic homologous recombination (NAHR) or crossing-over in meiosis between sequences that are not in allelic positions (i.e., paralogous sequences) can result in recurrent deletions or duplications causing genomic disorders. Previous studies of NAHR have focused on description of the phenomenon, but it remains unclear how NAHR occurs during meiosis and what factors determine its frequency. Here we assembled two patient cohorts with reciprocal genomic disorders; deletion associated Smith-Magenis syndrome and duplication associated Potocki-Lupski syndrome. By assessing the full spectrum of rearrangement types from the two cohorts, we find that complex rearrangements (those with more than one breakpoint) are more prevalent in copy-number gains (17.7%) than in copy-number losses (2.3%); an observation that supports a role for replicative mechanisms in complex rearrangement formation. Interestingly, for NAHR-mediated recurrent rearrangements, we show that crossover frequency is positively associated with the flanking low-copy repeat (LCR) length and inversely influenced by the inter-LCR distance. To explain this, we propose that the probability of ectopic chromosome synapsis increases with increased LCR length, and that ectopic synapsis is a necessary precursor to ectopic crossing-over.
Collapse
|
30
|
Cooper DN, Bacolla A, Férec C, Vasquez KM, Kehrer-Sawatzki H, Chen JM. On the sequence-directed nature of human gene mutation: the role of genomic architecture and the local DNA sequence environment in mediating gene mutations underlying human inherited disease. Hum Mutat 2011; 32:1075-99. [PMID: 21853507 PMCID: PMC3177966 DOI: 10.1002/humu.21557] [Citation(s) in RCA: 90] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2011] [Accepted: 06/17/2011] [Indexed: 12/21/2022]
Abstract
Different types of human gene mutation may vary in size, from structural variants (SVs) to single base-pair substitutions, but what they all have in common is that their nature, size and location are often determined either by specific characteristics of the local DNA sequence environment or by higher order features of the genomic architecture. The human genome is now recognized to contain "pervasive architectural flaws" in that certain DNA sequences are inherently mutation prone by virtue of their base composition, sequence repetitivity and/or epigenetic modification. Here, we explore how the nature, location and frequency of different types of mutation causing inherited disease are shaped in large part, and often in remarkably predictable ways, by the local DNA sequence environment. The mutability of a given gene or genomic region may also be influenced indirectly by a variety of noncanonical (non-B) secondary structures whose formation is facilitated by the underlying DNA sequence. Since these non-B DNA structures can interfere with subsequent DNA replication and repair and may serve to increase mutation frequencies in generalized fashion (i.e., both in the context of subtle mutations and SVs), they have the potential to serve as a unifying concept in studies of mutational mechanisms underlying human inherited disease.
Collapse
Affiliation(s)
- David N Cooper
- Institute of Medical Genetics, School of Medicine, Cardiff University, Cardiff, United Kingdom.
| | | | | | | | | | | |
Collapse
|
31
|
Chang M, Dittmar JC, Rothstein R. Long telomeres are preferentially extended during recombination-mediated telomere maintenance. Nat Struct Mol Biol 2011; 18:451-6. [PMID: 21441915 PMCID: PMC3071861 DOI: 10.1038/nsmb.2034] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2010] [Accepted: 01/20/2011] [Indexed: 01/01/2023]
Abstract
Most human somatic cells do not express telomerase. Consequently, with each cell division their telomeres progressively shorten until replicative senescence is induced. Approximately 15% of human cancers maintain their telomeres using telomerase-independent, recombination-based mechanisms collectively termed Alternative Lengthening of Telomeres (ALT). In the yeast Saccharomyces cerevisiae, ALT cells are referred to as “survivors”. One type of survivor (type II) resembles human ALT cells in that both are defined by the amplification of telomeric repeats. We analyzed recombination-mediated telomere extension events at individual telomeres in telomerase-negative yeast during type II survivor formation and find that long telomeres are preferentially extended. Furthermore, we find that senescent cells with long telomeres are more efficient at bypassing senescence via the type II pathway. We speculate that telomere length may be important in determining whether cancer cells utilize telomerase or ALT to bypass replicative senescence.
Collapse
Affiliation(s)
- Michael Chang
- Department of Genetics and Development, Columbia University Medical Center, New York, New York, USA.
| | | | | |
Collapse
|
32
|
Chen JM, Férec C, Cooper DN. Gene conversion in human genetic disease. Genes (Basel) 2010; 1:550-63. [PMID: 24710102 PMCID: PMC3966225 DOI: 10.3390/genes1030550] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2010] [Revised: 11/12/2010] [Accepted: 11/17/2010] [Indexed: 01/01/2023] Open
Abstract
Gene conversion is a specific type of homologous recombination that involves the unidirectional transfer of genetic material from a ‘donor’ sequence to a highly homologous ‘acceptor’. We have recently reviewed the molecular mechanisms underlying gene conversion, explored the key part that this process has played in fashioning extant human genes, and performed a meta-analysis of gene-conversion events known to have caused human genetic disease. Here we shall briefly summarize some of the latest developments in the study of pathogenic gene conversion events, including (i) the emerging idea of minimal efficient sequence homology (MESH) for homologous recombination, (ii) the local DNA sequence features that appear to predispose to gene conversion, (iii) a mechanistic comparison of gene conversion and transient hypermutability, and (iv) recently reported examples of pathogenic gene conversion events.
Collapse
Affiliation(s)
- Jian-Min Chen
- Institut National de la Santé et de la Recherche Médicale (INSERM), U613, Brest, France.
| | - Claude Férec
- Institut National de la Santé et de la Recherche Médicale (INSERM), U613, Brest, France.
| | - David N Cooper
- Institute of Medical Genetics, School of Medicine, Cardiff University, Heath Park, Cardiff CF14 4XN, UK.
| |
Collapse
|
33
|
Ezawa K, Ikeo K, Gojobori T, Saitou N. Evolutionary Pattern of Gene Homogenization between Primate-Specific Paralogs after Human and Macaque Speciation Using the 4-2-4 Method. Mol Biol Evol 2010; 27:2152-71. [DOI: 10.1093/molbev/msq109] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
|
34
|
Zepeda-Mendoza CJ, Lemus T, Yáñez O, García D, Valle-García D, Meza-Sosa KF, Gutiérrez-Arcelus M, Márquez-Ortiz Y, Domínguez-Vidaña R, Gonzaga-Jauregui C, Flores M, Palacios R. Identical repeated backbone of the human genome. BMC Genomics 2010; 11:60. [PMID: 20096123 PMCID: PMC2845111 DOI: 10.1186/1471-2164-11-60] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2009] [Accepted: 01/23/2010] [Indexed: 11/10/2022] Open
Abstract
Background Identical sequences with a minimal length of about 300 base pairs (bp) have been involved in the generation of various meiotic/mitotic genomic rearrangements through non-allelic homologous recombination (NAHR) events. Genomic disorders and structural variation, together with gene remodelling processes have been associated with many of these rearrangements. Based on these observations, we identified and integrated all the 100% identical repeats of at least 300 bp in the NCBI version 36.2 human genome reference assembly into non-overlapping regions, thus defining the Identical Repeated Backbone (IRB) of the reference human genome. Results The IRB sequences are distributed all over the genome in 66,600 regions, which correspond to ~2% of the total NCBI human genome reference assembly. Important structural and functional elements such as common repeats, segmental duplications, and genes are contained in the IRB. About 80% of the IRB bp overlap with known copy-number variants (CNVs). By analyzing the genes embedded in the IRB, we were able to detect some identical genes not previously included in the Ensembl release 50 annotation of human genes. In addition, we found evidence of IRB gene copy-number polymorphisms in raw sequence reads of two diploid sequenced genomes. Conclusions In general, the IRB offers new insight into the complex organization of the identical repeated sequences of the human genome. It provides an accurate map of potential NAHR sites which could be used in targeting the study of novel CNVs, predicting DNA copy-number variation in newly sequenced genomes, and improve genome annotation.
Collapse
Affiliation(s)
- Cinthya J Zepeda-Mendoza
- Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, 62210, México.
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
35
|
Kijima TE, Innan H. On the estimation of the insertion time of LTR retrotransposable elements. Mol Biol Evol 2009; 27:896-904. [PMID: 19955475 DOI: 10.1093/molbev/msp295] [Citation(s) in RCA: 56] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
It has been proposed that the insertion time of a long terminal repeat (LTR) retrotransposon can be estimated by the divergence between the two LTRs at the both ends because their sequences were identical at the insertion event. This method is based on the assumption that the two LTRs accumulate point mutations independently; therefore, the divergence reflects the time since the insertion event. However, if gene conversion occurs between LTRs, the nucleotide divergence will be much smaller than expected with the assumption of the independent accumulation of point mutations. To examine this assumption, we investigated the extent of gene conversion between LTRs by applying a comparative genomic approach to primates (humans and rhesus macaques) and rodents (mice and rats). We found that gene conversion plays a significant role in the molecular evolution of LTRs in primates and rodents, but the extent is quite different. In rodents, most LTRs are subject to extensive gene conversion that reduces the divergence, so that the divergence-based method results in a serious underestimation of the insertion time. In primates, this effect is limited to a small proportion of LTRs. The most likely explanation of the difference involves the minimum length of the interacting sequence (minimal efficient processing segment [MEPS]) for interlocus gene conversion. An empirical estimate of MEPS in human is 300-500 bp, which exceeds the length of most of the analyzed LTRs. In contrast, MEPS for mice should be much smaller. Thus, MEPS can be an important factor to determine the susceptibility of LTRs to gene conversion, although there are many other factors involved. It is concluded that the divergence method to estimate the insertion time should be applied with special caution because at least some LTRs undergo gene conversion.
Collapse
Affiliation(s)
- T E Kijima
- Graduate University for Advanced Studies, Hayama, Kanagawa, Japan
| | | |
Collapse
|
36
|
Abstract
Maintenance of genome stability is essential for the accurate propagation of genetic information and cell growth and survival. Organisms have therefore developed efficient strategies to prevent DNA lesions and rearrangements. Much of the information concerning these strategies has been obtained through the study of bacterial and nuclear genomes. Comparatively, little is known about how organelle genomes maintain a stable structure. Here, we report that the plastid-localized Whirly ssDNA-binding proteins are required for plastid genome stability in Arabidopsis. We show that a double KO of the genes AtWhy1 and AtWhy3 leads to the appearance of plants with variegated green/white/yellow leaves, symptomatic of nonfunctional chloroplasts. This variegation is maternally inherited, indicating defects in the plastid genome. Indeed, in all variegated lines examined, reorganized regions of plastid DNA are amplified as circular and/or head-tail concatemers. All amplified regions are delimited by short direct repeats of 10-18 bp, strongly suggesting that these regions result from illegitimate recombination between repeated sequences. This type of recombination occurs frequently in plants lacking both Whirlies, to a lesser extent in single KO plants and rarely in WT individuals. Maize mutants for the ZmWhy1 Whirly protein also show an increase in the frequency of illegitimate recombination. We propose a model where Whirlies contribute to plastid genome stability by protecting against illegitimate repeat-mediated recombination.
Collapse
|
37
|
Hastings PJ, Lupski JR, Rosenberg SM, Ira G. Mechanisms of change in gene copy number. Nat Rev Genet 2009. [PMID: 19597530 DOI: 10.1038/nrg2593.mechanisms] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/11/2023]
Abstract
Deletions and duplications of chromosomal segments (copy number variants, CNVs) are a major source of variation between individual humans and are an underlying factor in human evolution and in many diseases, including mental illness, developmental disorders and cancer. CNVs form at a faster rate than other types of mutation, and seem to do so by similar mechanisms in bacteria, yeast and humans. Here we review current models of the mechanisms that cause copy number variation. Non-homologous end-joining mechanisms are well known, but recent models focus on perturbation of DNA replication and replication of non-contiguous DNA segments. For example, cellular stress might induce repair of broken replication forks to switch from high-fidelity homologous recombination to non-homologous repair, thus promoting copy number change.
Collapse
Affiliation(s)
- P J Hastings
- Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, Texas 77030, USA.
| | | | | | | |
Collapse
|
38
|
Hastings PJ, Lupski JR, Rosenberg SM, Ira G. Mechanisms of change in gene copy number. Nat Rev Genet 2009; 10:551-64. [PMID: 19597530 DOI: 10.1038/nrg2593] [Citation(s) in RCA: 886] [Impact Index Per Article: 55.4] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
Abstract
Deletions and duplications of chromosomal segments (copy number variants, CNVs) are a major source of variation between individual humans and are an underlying factor in human evolution and in many diseases, including mental illness, developmental disorders and cancer. CNVs form at a faster rate than other types of mutation, and seem to do so by similar mechanisms in bacteria, yeast and humans. Here we review current models of the mechanisms that cause copy number variation. Non-homologous end-joining mechanisms are well known, but recent models focus on perturbation of DNA replication and replication of non-contiguous DNA segments. For example, cellular stress might induce repair of broken replication forks to switch from high-fidelity homologous recombination to non-homologous repair, thus promoting copy number change.
Collapse
Affiliation(s)
- P J Hastings
- Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, Texas 77030, USA.
| | | | | | | |
Collapse
|
39
|
A microhomology-mediated break-induced replication model for the origin of human copy number variation. PLoS Genet 2009; 5:e1000327. [PMID: 19180184 PMCID: PMC2621351 DOI: 10.1371/journal.pgen.1000327] [Citation(s) in RCA: 627] [Impact Index Per Article: 39.2] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022] Open
Abstract
Chromosome structural changes with nonrecurrent endpoints associated with genomic disorders offer windows into the mechanism of origin of copy number variation (CNV). A recent report of nonrecurrent duplications associated with Pelizaeus-Merzbacher disease identified three distinctive characteristics. First, the majority of events can be seen to be complex, showing discontinuous duplications mixed with deletions, inverted duplications, and triplications. Second, junctions at endpoints show microhomology of 2–5 base pairs (bp). Third, endpoints occur near pre-existing low copy repeats (LCRs). Using these observations and evidence from DNA repair in other organisms, we derive a model of microhomology-mediated break-induced replication (MMBIR) for the origin of CNV and, ultimately, of LCRs. We propose that breakage of replication forks in stressed cells that are deficient in homologous recombination induces an aberrant repair process with features of break-induced replication (BIR). Under these circumstances, single-strand 3′ tails from broken replication forks will anneal with microhomology on any single-stranded DNA nearby, priming low-processivity polymerization with multiple template switches generating complex rearrangements, and eventual re-establishment of processive replication.
Collapse
|
40
|
Abstract
Mammalian cells frequently depend on homologous recombination (HR) to repair DNA damage accurately and to help rescue stalled or collapsed replication forks. The essence of HR is an exchange of nucleotides between identical or nearly identical sequences. Although HR fulfills important biological roles, recombination between inappropriate sequence partners can lead to translocations or other deleterious rearrangements and such events must be avoided. For example, the recombination machinery must follow stringent rules to preclude recombination between the many repetitive elements in a mammalian genome that share significant but imperfect homology. This paper takes a conceptual approach in addressing the homology requirements for recombination in mammalian genomes as well as the general strategy used by cells to reject recombination between similar but imperfectly matched sequences. A mechanism of heteroduplex rejection that involves the unwinding of recombination intermediates that may form between mismatched sequences is discussed.
Collapse
Affiliation(s)
- Alan S Waldman
- Department of Biological Sciences, University of South Carolina, Biological Sciences, 700 Sumter Street, Columbia, SC 29208, USA.
| |
Collapse
|
41
|
Ectopic gene conversions in the human genome. Genomics 2008; 93:27-32. [PMID: 18848875 DOI: 10.1016/j.ygeno.2008.09.007] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2008] [Revised: 08/23/2008] [Accepted: 09/18/2008] [Indexed: 11/22/2022]
Abstract
We used the GENCONV method to characterize the gene conversions that occurred amongst the 1434 protein coding human gene families with three or more genes. Conversions occur at a frequency of 0.88% (483 conversion events/55,050 gene pairs compared) and have an average length of 371+/-752 bp (+/-standard deviation). Both the size and the frequency of conversions are positively correlated with the similarity of the sequences involved in these conversions. The frequency of conversions and the local recombination rate are also positively correlated. Intrachromosomal conversions are almost 5 times more frequent than interchromosomal conversions and the frequency of intrachromosomal conversions increases as the distance between genes decreases. However, the higher frequency of conversions between nearby genes with the same transcriptional orientation is due to the fact that most functional duplicated genes are found next to one another and in the same transcriptional orientation. The average length of a conversion spanning only an intron region is significantly smaller than conversions spanning both exons and introns or only exons. This suggests that the smaller degree of sequence similarity of introns limits the size of conversions between duplicated human genes. The significant excess of conversions at the 3'-end of human genes suggests that incomplete cDNA molecules are often involved in conversions with chromosomal gene copies.
Collapse
|
42
|
Chen JM, Cooper DN, Chuzhanova N, Férec C, Patrinos GP. Gene conversion: mechanisms, evolution and human disease. Nat Rev Genet 2007; 8:762-75. [PMID: 17846636 DOI: 10.1038/nrg2193] [Citation(s) in RCA: 472] [Impact Index Per Article: 26.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
Gene conversion, one of the two mechanisms of homologous recombination, involves the unidirectional transfer of genetic material from a 'donor' sequence to a highly homologous 'acceptor'. Considerable progress has been made in understanding the molecular mechanisms that underlie gene conversion, its formative role in human genome evolution and its implications for human inherited disease. Here we assess current thinking about how gene conversion occurs, explore the key part it has played in fashioning extant human genes, and carry out a meta-analysis of gene-conversion events that are known to have caused human genetic disease.
Collapse
|
43
|
Teich N, Nemoda Z, Köhler H, Heinritz W, Mössner J, Keim V, Sahin-Tóth M. Gene conversion between functional trypsinogen genes PRSS1 and PRSS2 associated with chronic pancreatitis in a six-year-old girl. Hum Mutat 2006; 25:343-7. [PMID: 15776435 PMCID: PMC2752332 DOI: 10.1002/humu.20148] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
Abstract
Gene conversion--the substitution of genetic material from another gene--is recognized as the underlying cause of a growing number of genetic diseases. While in most cases conversion takes place between a normal gene and its pseudogene, here we report an occurrence of disease-associated gene conversion between two functional genes. Chronic pancreatitis in childhood is frequently associated with mutations of the cationic trypsinogen gene (serine protease 1; PRSS1). We have analyzed PRSS1 in 1106 patients with chronic pancreatitis, and identified a novel conversion event affecting exon 2 and the subsequent intron. The recombination replaced at least 289 nucleotides with the paralogous sequence from the anionic trypsinogen gene (serine protease 2; PRSS2), and resulted in the PRSS1 mutations c.86A > T and c.161A > G, causing the amino acid substitutions N29I and N54S, respectively. Analysis of the recombinant N29I-N54S double mutant cationic trypsinogen revealed increased autocatalytic activation, which was solely due to the N29I mutation. In conclusion, we have demonstrated that gene conversion between two functional paralogous trypsinogen genes can occur and cause genetically determined chronic pancreatitis.
Collapse
Affiliation(s)
- Niels Teich
- Medizinische Klinik und Poliklinik II, Universität Leipzig, Leipzig, Germany.
| | | | | | | | | | | | | |
Collapse
|
44
|
Abstract
Rearrangements of our genome can be responsible for inherited as well as sporadic traits. The analyses of chromosome breakpoints in the proximal short arm of Chromosome 17 (17p) reveal nonallelic homologous recombination (NAHR) as a major mechanism for recurrent rearrangements whereas nonhomologous end-joining (NHEJ) can be responsible for many of the nonrecurrent rearrangements. Genome architectural features consisting of low-copy repeats (LCRs), or segmental duplications, can stimulate and mediate NAHR, and there are hotspots for the crossovers within the LCRs. Rearrangements introduce variation into our genome for selection to act upon and as such serve an evolutionary function analogous to base pair changes. Genomic rearrangements may cause Mendelian diseases, produce complex traits such as behaviors, or represent benign polymorphic changes. The mechanisms by which rearrangements convey phenotypes are diverse and include gene dosage, gene interruption, generation of a fusion gene, position effects, unmasking of recessive coding region mutations (single nucleotide polymorphisms, SNPs, in coding DNA) or other functional SNPs, and perhaps by effects on transvection.
Collapse
Affiliation(s)
- James R Lupski
- Department of Molecular and Human Genetics, Baylor College of Medicine, and at the Texas Children's Hospital, Houston, Texas, United States of America.
| | | |
Collapse
|
45
|
BOERKOEL CF, INOUE K, REITER LT, WARNER LE, LUPSKI JR. Molecular Mechanisms for CMT1A Duplication and HNPP Deletion. Ann N Y Acad Sci 2006; 883:22-35. [DOI: 10.1111/j.1749-6632.1999.tb08563.x] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
46
|
Ezawa K, OOta S, Saitou N. Genome-Wide Search of Gene Conversions in Duplicated Genes of Mouse and Rat. Mol Biol Evol 2006; 23:927-40. [PMID: 16407460 DOI: 10.1093/molbev/msj093] [Citation(s) in RCA: 66] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Gene conversion is considered to play important roles in the formation of genomic makeup such as homogenization of multigene families and diversification of alleles. We devised two statistical tests on quartets for detecting gene conversion events. Each "quartet" consists of two pairs of orthologous sequences supposed to have been generated by a duplication event and a subsequent speciation of two closely related species. As example data, EnsEMBL mouse and rat cDNA sequences were used to obtain a genome-wide picture of gene conversion events. We extensively sampled 2,641 quartets that appear to have resulted from duplications after the divergence of primates and rodents and before mouse-rat speciation. Combination of our new tests with Sawyer's and Takahata's tests enhanced the detection sensitivity while keeping false positives as few as possible. About 18% (488 quartets) were shown to be highly positive for gene conversion using this combined test. Out of them, 340 (13% of the total) showed signs of gene conversion in mouse sequence pairs. Those gene conversion-positive gene pairs are mostly linked in the same chromosomes, with the proportion of positive pairs in the linked and unlinked categories being 15% and 1%, respectively. Statistical analyses showed that (1) the susceptibility to gene conversion correlates negatively with the physical distance, especially the frequency of 29% was observed for gene pairs whose distances are smaller than 55 kb; (2) the occurrence of gene conversions does not depend on the transcriptional direction; (3) small gene families consisting of between three and six contiguous genes are highly prone to gene conversion; and (4) frequency of gene conversions greatly varies depending on functional categories, and cadherins favor gene conversion, while vomeronasal receptors type 1 and immunoglobulin V-type proteins disfavor it. These findings will be useful to deepen the understanding of the roles of gene conversion.
Collapse
Affiliation(s)
- Kiyoshi Ezawa
- Division of Population Genetics, National Institute of Genetics, Mishima, Japan
| | | | | |
Collapse
|
47
|
Opperman R, Emmanuel E, Levy AA. The effect of sequence divergence on recombination between direct repeats in Arabidopsis. Genetics 2005; 168:2207-15. [PMID: 15611187 PMCID: PMC1448723 DOI: 10.1534/genetics.104.032896] [Citation(s) in RCA: 73] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
It is well established that sequence divergence has an inhibitory effect on homologous recombination. However, a detailed analysis of this relationship is missing for most higher eukaryotes. We have measured the rate of somatic recombination between direct repeats as a function of the number, type, and position of divergent nucleotides in Arabidopsis. We show that a minor divergence level of 0.16% (one mutation in otherwise identical 618 bp) has a profound effect, decreasing the recombination rate approximately threefold. A further increase in the divergence level affects the recombination rate to a smaller extent until a "divergence saturation" effect is reached at relatively low levels of divergence ( approximately 0.5%). The type of mismatched nucleotide does not affect recombination rates. The decrease in the rate of recombination caused by a single mismatch was not affected by the position of the mismatch along the repeat. This suggests that most recombination intermediate tracts contain a mismatch and thus are as long as the full length of the 618-bp repeats. Finally, we could deduce an antirecombination efficiency of approximately 66% for the first mismatch in the repeat. Altogether, this work shows some degree of conservation across kingdoms when compared to previous reports in yeast; it also provides new insight into the effect of sequence divergence on homologous recombination.
Collapse
Affiliation(s)
- Roy Opperman
- Plant Sciences Department, Weizmann Institute of Science, Rehovot, 76100 Israel
| | | | | |
Collapse
|
48
|
Scully R, Xie A. In my end is my beginning: control of end resection and DSBR pathway 'choice' by cyclin-dependent kinases. Oncogene 2005; 24:2871-6. [PMID: 15838521 PMCID: PMC2977996 DOI: 10.1038/sj.onc.1208609] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
The genome is constantly subjected to chemical alterations that have the potential to cause genetic mutation, chromosomal rearrangements and, in the case of multicellular organisms, cancer. Particular vulnerability exists during DNA replication, when the two DNA strands of a chromosome separate to form templates for the synthesis of sister chromatids. Attempted replication across a damaged or nicked DNA template can result in the formation of a double-strand break (DSB), arguably the most dangerous of DNA lesions. DSBs can also arise directly at any cell cycle stage following exposure to ionizing radiation or radiomimetic agents. To combat these recurrent threats of genomic instability, numerous distinct enzyme systems have evolved that sense DNA damage and coordinate its repair. Part of this coordination involves the activation of signal transduction cascades that target repair proteins, trigger DNA damage-dependent cell cycle checkpoints and profoundly affect chromatin neighboring a DSB. Here, we discuss current models of how lesion processing itself helps to coordinate these signals in dividing cells. Recent evidence in yeast of a role for cyclin-dependent kinases in DNA end resection suggests a possible solution to the long-standing puzzle of how DSBR pathway 'choice' is regulated through the cell cycle.
Collapse
Affiliation(s)
- Ralph Scully
- Department of Medicine, Harvard Medical School and Beth Israel Deaconess Medical Center, 330 Brookline Avenue, Boston, MA 02115, USA.
| | | |
Collapse
|
49
|
Puget N, Knowlton M, Scully R. Molecular analysis of sister chromatid recombination in mammalian cells. DNA Repair (Amst) 2005; 4:149-61. [PMID: 15590323 PMCID: PMC2967438 DOI: 10.1016/j.dnarep.2004.08.010] [Citation(s) in RCA: 65] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2004] [Indexed: 01/19/2023]
Abstract
Sister chromatid recombination (SCR) is a potentially error-free pathway for the repair of double-strand breaks arising during replication and is thought to be important for the prevention of genomic instability and cancer. Analysis of sister chromatid recombination at a molecular level has been limited by the difficulty of selecting specifically for these events. To overcome this, we have developed a novel "nested intron" reporter that allows the positive selection in mammalian cells of "long tract" gene conversion events arising between sister chromatids. We show that these events arise spontaneously in cycling cells and are strongly induced by a site-specific double-strand break (DSB) caused by the restriction endonuclease, I-SceI. Notably, some I-SceI-induced sister chromatid recombination events entailed multiple rounds of gene amplification within the reporter, with the generation of a concatemer of amplified gene segments. Thus, there is an intimate relationship between sister chromatid recombination control and certain types of gene amplification. Dysregulated sister chromatid recombination may contribute to cancer progression, in part, by promoting gene amplification.
Collapse
Affiliation(s)
| | | | - Ralph Scully
- Corresponding author. Tel.: +1 617 667 4252; fax: +1 617 667 0980. (R. Scully)
| |
Collapse
|
50
|
Kajon AE, Xu W, Erdman DD. Sequence polymorphism in the E3 7.7K ORF of subspecies B1 human adenoviruses. Virus Res 2005; 107:11-9. [PMID: 15567028 DOI: 10.1016/j.virusres.2004.06.005] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2003] [Revised: 06/16/2004] [Accepted: 06/16/2004] [Indexed: 10/26/2022]
Abstract
Sequences corresponding to the 7.7K open-reading frame (ORF) of the E3 region of subspecies B1 adenoviruses (Ads) were compared with prototype strains of Ad3, Ad7, Ad16, Ad21, and Ad50 and field isolates representing a variety of genome restriction types of Ad3 and Ad7 to better assess the extent of genetic variation in this intriguing region of the viral genome encoding a product whose function is still unknown. Alignment of 55 species B1 Ad sequences revealed a marked polymorphism in the 7.7K ORF and allowed the identification of eight distinct sequence profiles (SPs) characterized by (1) deletions that retain or change the reading frame, (2) single-base mutations (SBMs) that change the start codon (ATG to ATT or ATC), and (3) other SBMs. mRNAs of expected size for the observed sequence polymorphisms were identified by RT-PCR from DNAse I-treated total RNA extracts of infected cells. Predicted proteins ranged from 0 to 94 amino acids corresponding to molecular masses of 0-11 K. Together with the hypervariable regions of the hexon gene, the E3 7.7K ORF appears to be another area of the Ad genome in which genetic diversity may be generated by illegitimate recombination.
Collapse
Affiliation(s)
- A E Kajon
- Lovelace Respiratory Research Institute, Albuquerque, NM, USA
| | | | | |
Collapse
|