1
|
Meiotic DNA breaks drive multifaceted mutagenesis in the human germ line. Science 2023; 382:eadh2531. [PMID: 38033082 PMCID: PMC7615360 DOI: 10.1126/science.adh2531] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2023] [Accepted: 09/29/2023] [Indexed: 12/02/2023]
Abstract
Meiotic recombination commences with hundreds of programmed DNA breaks; however, the degree to which they are accurately repaired remains poorly understood. We report that meiotic break repair is eightfold more mutagenic for single-base substitutions than was previously understood, leading to de novo mutation in one in four sperm and one in 12 eggs. Its impact on indels and structural variants is even higher, with 100- to 1300-fold increases in rates per break. We uncovered new mutational signatures and footprints relative to break sites, which implicate unexpected biochemical processes and error-prone DNA repair mechanisms, including translesion synthesis and end joining in meiotic break repair. We provide evidence that these mechanisms drive mutagenesis in human germ lines and lead to disruption of hundreds of genes genome wide.
Collapse
|
2
|
Most large structural variants in cancer genomes can be detected without long reads. Nat Genet 2023; 55:2139-2148. [PMID: 37945902 PMCID: PMC10703688 DOI: 10.1038/s41588-023-01540-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2021] [Accepted: 09/19/2023] [Indexed: 11/12/2023]
Abstract
Short-read sequencing is the workhorse of cancer genomics yet is thought to miss many structural variants (SVs), particularly large chromosomal alterations. To characterize missing SVs in short-read whole genomes, we analyzed 'loose ends'-local violations of mass balance between adjacent DNA segments. In the landscape of loose ends across 1,330 high-purity cancer whole genomes, most large (>10-kb) clonal SVs were fully resolved by short reads in the 87% of the human genome where copy number could be reliably measured. Some loose ends represent neotelomeres, which we propose as a hallmark of the alternative lengthening of telomeres phenotype. These pan-cancer findings were confirmed by long-molecule profiles of 38 breast cancer and melanoma cases. Our results indicate that aberrant homologous recombination is unlikely to drive the majority of large cancer SVs. Furthermore, analysis of mass balance in short-read whole genome data provides a surprisingly complete picture of cancer chromosomal structure.
Collapse
|
3
|
Identification and characterization of extrachromosomal circular DNA in patients with high myopia and cataract. Epigenetics 2023; 18:2192324. [PMID: 36945837 PMCID: PMC10038054 DOI: 10.1080/15592294.2023.2192324] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/23/2023] Open
Abstract
To explore the presence of extrachromosomal circular DNA (eccDNA) in the anterior capsule of the lens in the eyes of patients with cataract and with high myopia. Circle-Seq was performed to identify differences in the eccDNA and gene expression between the anterior capsule of the lens of patients with simple nuclear cataract (C, n = 6 cases) and patients with nuclear cataract along with high myopia (HM, n = 6 cases). The expression of eccDNA was confirmed using routine quantitative polymerase chain reaction. The eccDNA ranked in C and HM ranged in length from 0.017 kb - 9.9 Mb with two distinctive peaks detected at 0.2 kb and 0.5 kb, while eccDNA that were differentially expressed ranged in size from 0.05 kb - 57.8 kb with two distinctive peaks observed at 0.1 kb and 0.5 kb. Only 2.5% of the eccDNA in C and 2% in HM were>25 kb in size. The gene-rich chromosomes contributed to more number of eccDNA/Mb, while several well-known high myopia candidate genes, including catenin delta 2 (CTNND2) and ubiquitin-like with PHD, exhibited significantly increased levels of eccDNA in the anterior capsule of the lens in patients with high myopia. This study highlighted the topologic analysis of the anterior capsule of eyes with high myopia, which is an emerging direction for research and clinical applications. These findings suggested that eccDNA was commonly detected in eyes with high myopia and cataracts, and the candidate genes for high myopia identified in previous studies were also observed in the eccDNA.
Collapse
|
4
|
Genome-Wide Extrachromosomal Circular DNA Profiling of Paired Hepatocellular Carcinoma and Adjacent Liver Tissues. Cancers (Basel) 2023; 15:5309. [PMID: 38001569 PMCID: PMC10670553 DOI: 10.3390/cancers15225309] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2023] [Revised: 10/27/2023] [Accepted: 11/03/2023] [Indexed: 11/26/2023] Open
Abstract
Hepatocellular carcinoma (HCC) develops through multiple mechanisms. While recent studies have shown the presence of extrachromosomal circular DNA (eccDNA) in most cancer types, the eccDNA expression pattern and its association with HCC remain obscure. We aimed to investigate this problem. The genome-wide eccDNA profiles of eight paired HCC and adjacent non-tumor tissue samples were comprehensively elucidated based on Circle-seq, and they were further cross-analyzed with the RNA sequencing data to determine the association between eccDNA expression and transcriptome dysregulation. A total of 60,423 unique eccDNA types were identified. Most of the detected eccDNAs were smaller than 1 kb, with a length up to 182,363 bp and a mean sizes of 674 bp (non-tumor) and 813 bp (tumor), showing a greater association with gene-rich rather than with gene-poor regions. Although there was no statistical difference in length and chromosome distribution, the eccDNA patterns between HCC and adjacent non-tumor tissues showed significant differences at both the chromosomal and single gene levels. Five of the eight HCC tissues showed significantly higher amounts of chromosome 22-derived eccDNA expression compared to the non-tumor tissue. Furthermore, two genes, SLC16A3 and BAIAP2L2, with a higher transcription level in tumor tissues, were related to eccDNAs exclusively detected in three HCC samples and were negatively associated with survival rates in HCC cohorts from public databases. These results indicate the existence and massive heterogeneity of eccDNAs in HCC and adjacent liver tissues, and suggest their potential association with dysregulated gene expression.
Collapse
|
5
|
Read-depth based approach on whole genome resequencing data reveals important insights into the copy number variation (CNV) map of major global buffalo breeds. BMC Genomics 2023; 24:616. [PMID: 37845620 PMCID: PMC10580622 DOI: 10.1186/s12864-023-09720-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2023] [Accepted: 10/05/2023] [Indexed: 10/18/2023] Open
Abstract
BACKGROUND Elucidating genome-wide structural variants including copy number variations (CNVs) have gained increased significance in recent times owing to their contribution to genetic diversity and association with important pathophysiological states. The present study aimed to elucidate the high-resolution CNV map of six different global buffalo breeds using whole genome resequencing data at two coverages (10X and 30X). Post-quality control, the sequence reads were aligned to the latest draft release of the Bubaline genome. The genome-wide CNVs were elucidated using a read-depth approach in CNVnator with different bin sizes. Adjacent CNVs were concatenated into copy number variation regions (CNVRs) in different breeds and their genomic coverage was elucidated. RESULTS Overall, the average size of CNVR was lower at 30X coverage, providing finer details. Most of the CNVRs were either deletion or duplication type while the occurrence of mixed events was lesser in number on a comparative basis in all breeds. The average CNVR size was lower at 30X coverage (0.201 Mb) as compared to 10X (0.013 Mb) with the finest variants in Banni buffaloes. The maximum number of CNVs was observed in Murrah (2627) and Pandharpuri (25,688) at 10X and 30X coverages, respectively. Whereas the minimum number of CNVs were scored in Surti at both coverages (2092 and 17,373). On the other hand, the highest and lowest number of CNVRs were scored in Jaffarabadi (833 and 10,179 events) and Surti (783 and 7553 events) at both coverages. Deletion events overnumbered duplications in all breeds at both coverages. Gene profiling of common overlapped genes and longest CNVRs provided important insights into the evolutionary history of these breeds and indicate the genomic regions under selection in respective breeds. CONCLUSION The present study is the first of its kind to elucidate the high-resolution CNV map in major buffalo populations using a read-depth approach on whole genome resequencing data. The results revealed important insights into the divergence of major global buffalo breeds along the evolutionary timescale.
Collapse
|
6
|
The more the merrier? Gene duplications in the coevolution of primate lentiviruses with their hosts. Curr Opin Virol 2023; 62:101350. [PMID: 37651832 DOI: 10.1016/j.coviro.2023.101350] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2023] [Revised: 07/20/2023] [Accepted: 07/29/2023] [Indexed: 09/02/2023]
Abstract
Gene duplications are a major source of genetic diversity and evolutionary innovation. Newly formed, duplicated genes can provide a selection advantage in constantly changing environments. One such example is the arms race of HIV and related lentiviruses with innate immune responses of their hosts. In recent years, it has become clear that both sides have benefited from multiple gene duplications. For example, amplifications of antiretroviral factors such as apolipoprotein-B mRNA-editing enzyme catalytic polypeptide-3 (APOBEC3), interferon-induced transmembrane protein (IFITM), and tripartite motif-containing (TRIM) proteins have expanded the repertoire of cell-intrinsic defense mechanisms and increased the barriers to retroviral replication and cross-species transmission. Conversely, recent studies have also shed light on how duplications of accessory lentiviral genes and Long terminal repeat (LTR) elements can provide a selection advantage in the coevolution with antiviral host proteins.
Collapse
|
7
|
Homozygous 22q11.2 distal type II microdeletion is associated with syndromic neurodevelopmental delay. Am J Med Genet A 2023; 191:2623-2630. [PMID: 37365930 DOI: 10.1002/ajmg.a.63326] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2022] [Revised: 05/19/2023] [Accepted: 06/01/2023] [Indexed: 06/28/2023]
Abstract
Genomic disorders result from heterozygous copy number variants (CNVs). Homozygous deletions spanning numerous genes are rare, despite the potential contribution of consanguinity to such instances. CNVs in the 22q11.2 region are mediated by nonallelic homologous recombination between pairs of low copy repeats (LCRs), from amongst eight LCRs designated A-H. Heterozygous distal type II deletions (LCR-E to LCR-F) have incomplete penetrance and variable expressivity, and can lead to neurodevelopmental issues, minor craniofacial anomalies, and congenital abnormalities. We report siblings with global developmental delay, hypotonia, minor craniofacial anomalies, ocular abnormalities, and minor skeletal issues, in whom chromosomal microarray identified a homozygous distal type II deletion. The deletion was brought to homozygosity as a result of a consanguineous marriage between two heterozygous carriers of the deletion. The phenotype of the children was strikingly more severe and complex than that of the parents. This report suggests that the distal type II deletion harbors a dosage-sensitive gene or regulatory element, which leads to a more severe phenotype when deleted on both chromosomes.
Collapse
|
8
|
Population-based Risk of Psychiatric Disorders Associated with Recurrent CNVs. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.09.04.23294975. [PMID: 37886536 PMCID: PMC10602037 DOI: 10.1101/2023.09.04.23294975] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/28/2023]
Abstract
Recurrent copy number variants (rCNVs) are associated with increased risk of neuropsychiatric disorders but their pathogenic population-level impact is unknown. We provide population-based estimates of rCNV-associated risk of neuropsychiatric disorders for 34 rCNVs in the iPSYCH2015 case-cohort sample (n=120,247). Most observed significant increases in rCNV-associated risk for ADHD, autism or schizophrenia were moderate (HR:1.42-5.00), and risk estimates were highly correlated across these disorders, the most notable exception being high autism-associated risk with Prader-Willi/Angelman Syndrome duplications (HR=20.8). No rCNV was associated with significant increase in depression risk. Also, rCNV-associated risk was positively correlated with locus size and gene constraint. Comparison with published rCNV studies suggests that prevalence of some rCNVs is higher, and risk of psychiatric disorders lower, than previously estimated. In an era where genetics is increasingly being clinically applied, our results highlight the importance of population-based risk estimates for genetics-based predictions.
Collapse
|
9
|
Copy Number Variation Discovery in South African Nguni-Sired and Bonsmara-Sired Crossbred Cattle. Animals (Basel) 2023; 13:2513. [PMID: 37570321 PMCID: PMC10417447 DOI: 10.3390/ani13152513] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2023] [Revised: 07/26/2023] [Accepted: 08/01/2023] [Indexed: 08/13/2023] Open
Abstract
Crossbreeding forms part of Climate-Smart beef production and is one of the strategies to mitigate the effects of climate change. Two Nguni-sired and three Bonsmara-sired crossbred animals underwent whole genome sequencing. Following quality control and file preparation, the sequence data were investigated for genome-wide copy number variation (CNV) using the panelcn.MOPS tool. A total of 355 CNVs were identified in the crossbreds, of which 274 were unique in Bonsmara-sired crossbreds and 81 unique in the Nguni-sired crossbreds. Genes that differed in copy number in both crossbreds included genes related to growth (SCRN2, LOC109572916) and fertility-related factors (RPS28, LOC1098562432, LOC109570037). Genes that were present only in the Bonsmara-sired crossbreds included genes relating to lipid metabolism (MAF1), olfaction (LOC109569114), body size (HES7), immunity (LOC10957335, LOC109877039) and disease (DMBT1). Genes that were present only in the Nguni-sired crossbreds included genes relating to ketosis (HMBOX1) and amino acid transport (LOC109572916). Results of this study indicate that Nguni and Bonsmara cattle can be utilized in crossbreeding programs as they may enhance the presence of economically important traits associated with both breeds. This will produce crossbred animals that are good meat producers, grow faster, have high fertility, strong immunity and a better chance of producing in South Africa's harsh climate conditions. Ultimately, this study provides new genetic insights into the adaptability of Nguni and Bonsmara crossbred cattle.
Collapse
|
10
|
A novel extrachromosomal circular DNA related genes signature for overall survival prediction in patients with ovarian cancer. BMC Med Genomics 2023; 16:140. [PMID: 37337170 DOI: 10.1186/s12920-023-01576-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2022] [Accepted: 06/09/2023] [Indexed: 06/21/2023] Open
Abstract
OBJECTIVE Ovarian cancer (OV) has a high mortality rate all over the world, and extrachromosomal circular DNA (eccDNA) plays a key role in carcinogenesis. We wish to study more about the molecular structure of eccDNA in the UACC-1598-4 cell line and how its genes are associated with ovarian cancer prognosis. METHODS We sequenced and annotated the eccDNA by Circle_seq of the OV cell line UACC-1598-4. To acquire the amplified genes of OV on eccDNA, the annotated eccDNA genes were intersected with the overexpression genes of OV in TCGA. Univariate Cox regression was used to find the genes on eccDNA that were linked to OV prognosis. The least absolute shrinkage and selection operator (LASSO) and cox regression models were used to create the OV prognostic model, as well as the receiver operating characteristic curve (ROC) curve and nomogram of the prediction model. By applying the median value of the risk score, the samples were separated into high-risk and low-risk groups, and the differences in immune infiltration between the two groups were examined using ssGSEA. RESULTS EccDNA in UACC-1598-4 has a length of 0-2000 bp, and some of them include the whole genes or gene fragments. These eccDNA originated from various parts of chromosomes, especially enriched in repeatmasker, introns, and coding regions. They were annotated with 2188 genes by Circle_seq. Notably, the TCGA database revealed that a total of 198 of these eccDNA genes were overexpressed in OV (p < 0.05). They were mostly enriched in pathways associated with cell adhesion, ECM receptors, and actin cytoskeleton. Univariate Cox analysis showed 13 genes associated with OV prognosis. LASSO and Cox regression analysis were used to create a risk model based on remained 9 genes. In both the training (TCGA database) and validation (International Cancer Genome Consortium, ICGC) cohorts, a 9-gene signature could successfully discriminate high-risk individuals (all p < 0.01). Immune infiltration differed significantly between the high-risk and low-risk groups. The model's area under the ROC curve was 0.67, and a nomograph was created to assist clinician. CONCLUSION EccDNA is found in UACC-1598-4, and part of its genes linked to OV prognosis. Patients with OV may be efficiently evaluated using a prognostic model based on eccDNA genes, including SLC7A1, NTN1, ADORA1, PADI2, SULT2B1, LINC00665, CILP2, EFNA5, TOMM.
Collapse
|
11
|
SNV/indel hypermutator phenotype in biallelic RAD51C variant: Fanconi anemia. Hum Genet 2023; 142:721-733. [PMID: 37031326 PMCID: PMC10996436 DOI: 10.1007/s00439-023-02550-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2023] [Accepted: 03/24/2023] [Indexed: 04/10/2023]
Abstract
We previously reported a fetus with Fanconi anemia (FA), complementation group O due to compound heterozygous variants involving RAD51C. Interestingly, the trio exome sequencing analysis also detected eight apparent de novo mosaic variants with variant allele fraction (VAF) ranging between 11.5 and 37%. Here, using whole genome sequencing and a 'home-brew' variant filtering pipeline and DeepMosaic module, we investigated the number and signature of de novo heterozygous and mosaic variants and the hypothesis of a rare phenomenon of hypermutation. Eight-hundred-thirty apparent de novo SNVs and 21 de novo indels had VAFs below 37.41% and were considered postzygotic somatic mosaic variants. The VAFs showed a bimodal distribution, with one component having an average VAF of 25% (range: 18.7-37.41%) (n = 446), representing potential postzygotic first mitotic events, and the other component with an average VAF of 12.5% (range 9.55-18.69%) (n = 384), describing potential second mitotic events. No increased rate of CNV formation was observed. The mutational pattern analysis for somatic single base substitution showed SBS40, SBS5, and SBS3 as the top recognized signatures. SBS3 is a known signature associated with homologous recombination-based DNA damage repair error. Our data demonstrate that biallelic RAD51C variants show evidence for defective genomic DNA damage repair and thereby result in a hypermutator phenotype with the accumulation of postzygotic de novo mutations, at least in the prenatal period. This 'genome hypermutator phenomenon' might contribute to the observed hematological manifestations and the predisposition to tumors in patients with FA. We propose that other FA groups should be investigated for genome-wide de novo variants.
Collapse
|
12
|
Did circular DNA shape the evolution of mammalian genomes? Trends Biochem Sci 2023; 48:317-320. [PMID: 36280496 DOI: 10.1016/j.tibs.2022.09.010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2022] [Revised: 09/28/2022] [Accepted: 09/29/2022] [Indexed: 11/05/2022]
Abstract
Extrachromosomal circular DNA (eccDNA) can shape the genomes of somatic cells, but how it impacts genomes across generations is largely unexplored. We propose that genomes can rearrange via circular intermediates across generations and show that up to 6% of a mammalian genome can have changed gene order through eccDNA.
Collapse
|
13
|
SNV/indel hypermutator phenotype in biallelic RAD51C variant - Fanconi anemia. RESEARCH SQUARE 2023:rs.3.rs-2628288. [PMID: 36909564 PMCID: PMC10002829 DOI: 10.21203/rs.3.rs-2628288/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/06/2023]
Abstract
We previously reported a fetus with Fanconi anemia (FA), complementation group O due to compound heterozygous variants involving RAD51C . Interestingly, the trio exome sequencing analysis also detected eight apparent de novo mosaic variants with variant allele fraction (VAF) ranging between 11.5%-37%. Here, using whole genome sequencing and a 'home-brew' variant filtering pipeline and DeepMosaic module, we investigated the number and signature of de novo heterozygous and mosaic variants and the rare phenomenon of hypermutation. Eight-hundred-thirty apparent dnSNVs and 21 de novo indels had VAFs below 37.41% and were considered postzygotic somatic mosaic variants. The VAFs showed a bimodal distribution, with one component with an average VAF of 25% (range: 18.7-37.41%) (n=446), representing potential postzygotic first mitotic events, and the other component with an average VAF of 12.5% (range: 9.55-18.69%) (n=384), describing potential second mitotic events. No increased rate of CNV formation was observed. The mutational pattern analysis for somatic single base substitution showed SBS40, SBS5, and SBS3 as the top recognized signatures. SBS3 is a known signature associated with homologous recombination-based DNA damage repair error. Our data demonstrate that biallelic RAD51C variants show evidence for defective genomic DNA damage repair and thereby result in a hypermutator phenotype with the accumulation of postzygotic de novo mutations, at least in the prenatal period. This 'genome hypermutator phenomenon' might contribute to the observed hematological manifestations and the predisposition to tumors in patients with FA, and pregnancy loss in general. We propose that other FA groups should be investigated for genome-wide de novo variants.
Collapse
|
14
|
Diagnosis of prenatal 22q11.2 duplication syndrome: a two-case study. J Genet 2022. [DOI: 10.1007/s12041-022-01406-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
|
15
|
Abstract
Structural variations, including recurrent Copy Number Variants (CNVs) at specific genomic loci, have been found to be associated with increased risk of several diseases and syndromes. CNV carrier status can be determined in large collections of samples using SNP arrays and, more recently, sequencing data. Although there is some consensus among researchers about the essential steps required in such analysis (i.e., CNV calling, filtering of putative carriers, and visual validation using intensity data plots of the genomic region), standard methodologies and processes to control the quality and consistency of the results are lacking. Here, we present a comprehensive and user-friendly protocol that we have refined from our extensive research experience in the field. We cover every aspect of the analysis, from input data curation to final results. For each step, we highlight which parameters affect the analysis the most and how different settings may lead to different results. We provide a pipeline to run the complete analysis with effective (but customizable) pre-sets. We present software that we developed to better handle and filter putative CNV carriers and perform visual inspection to validate selected candidates. Finally, we describe methods to evaluate the critical sections and actions to counterbalance potential problems. The current implementation is focused on Illumina SNP array data. All the presented software is freely available and provided in a ready-to-use docker container. © 2022 The Authors. Current Protocols published by Wiley Periodicals LLC. Basic Protocol 1: From raw intensity data files to CNV calls Basic Protocol 2: From CNV calls to validated CNV carriers. Basic Protocol 3: Quality control and quality assessment Basic Protocol 4: Install the necessary software.
Collapse
|
16
|
Sequencing individual genomes with recurrent genomic disorder deletions: an approach to characterize genes for autosomal recessive rare disease traits. Genome Med 2022; 14:113. [PMID: 36180924 PMCID: PMC9526336 DOI: 10.1186/s13073-022-01113-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2021] [Accepted: 09/02/2022] [Indexed: 11/25/2022] Open
Abstract
BACKGROUND In medical genetics, discovery and characterization of disease trait contributory genes and alleles depends on genetic reasoning, study design, and patient ascertainment; we suggest a segmental haploid genetics approach to enhance gene discovery and molecular diagnostics. METHODS We constructed a genome-wide map for nonallelic homologous recombination (NAHR)-mediated recurrent genomic deletions and used this map to estimate population frequencies of NAHR deletions based on large-scale population cohorts and region-specific studies. We calculated recessive disease carrier burden using high-quality pathogenic or likely pathogenic variants from ClinVar and gnomAD. We developed a NIRD (NAHR deletion Impact to Recessive Disease) score for recessive disorders by quantifying the contribution of NAHR deletion to the overall allele load that enumerated all pairwise combinations of disease-causing alleles; we used a Punnett square approach based on an assumption of random mating. Literature mining was conducted to identify all reported patients with defects in a gene with a high NIRD score; meta-analysis was performed on these patients to estimate the representation of NAHR deletions in recessive traits from contemporary human genomics studies. Retrospective analyses of extant clinical exome sequencing (cES) were performed for novel rare recessive disease trait gene and allele discovery from individuals with NAHR deletions. RESULTS We present novel genomic insights regarding the genome-wide impact of NAHR recurrent segmental variants on recessive disease burden; we demonstrate the utility of NAHR recurrent deletions to enhance discovery in the challenging context of autosomal recessive (AR) traits and biallelic variation. Computational results demonstrate new mutations mediated by NAHR, involving recurrent deletions at 30 genomic regions, likely drive recessive disease burden for over 74% of loci within these segmental deletions or at least 2% of loci genome-wide. Meta-analyses on 170 literature-reported patients implicate that NAHR deletions are depleted from the ascertained pool of AR trait alleles. Exome reanalysis of personal genomes from subjects harboring recurrent deletions uncovered new disease-contributing variants in genes including COX10, ERCC6, PRRT2, and OTUD7A. CONCLUSIONS Our results demonstrate that genomic sequencing of personal genomes with NAHR deletions could dramatically improve allele and gene discovery and enhance clinical molecular diagnosis. Moreover, results suggest NAHR events could potentially enable human haploid genetic screens as an approach to experimental inquiry into disease biology.
Collapse
|
17
|
Assembly of a pangenome for global cattle reveals missing sequences and novel structural variations, providing new insights into their diversity and evolutionary history. Genome Res 2022; 32:gr.276550.122. [PMID: 35977842 PMCID: PMC9435747 DOI: 10.1101/gr.276550.122] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2022] [Accepted: 07/21/2022] [Indexed: 02/03/2023]
Abstract
A cattle pangenome representation was created based on the genome sequences of 898 cattle representing 57 breeds. The pangenome identified 83 Mb of sequence not found in the cattle reference genome, representing 3.1% novel sequence compared with the 2.71-Gb reference. A catalog of structural variants developed from this cattle population identified 3.3 million deletions, 0.12 million inversions, and 0.18 million duplications. Estimates of breed ancestry and hybridization between cattle breeds using insertion/deletions as markers were similar to those produced by single nucleotide polymorphism-based analysis. Hundreds of deletions were observed to have stratification based on subspecies and breed. For example, an insertion of a Bov-tA1 repeat element was identified in the first intron of the APPL2 gene and correlated with cattle breed geographic distribution. This insertion falls within a segment overlapping predicted enhancer and promoter regions of the gene, and could affect important traits such as immune response, olfactory functions, cell proliferation, and glucose metabolism in muscle. The results indicate that pangenomes are a valuable resource for studying diversity and evolutionary history, and help to delineate how domestication, trait-based breeding, and adaptive introgression have shaped the cattle genome.
Collapse
|
18
|
Extrachromosomal circular DNA (eccDNA): an emerging star in cancer. Biomark Res 2022; 10:53. [PMID: 35883211 PMCID: PMC9327165 DOI: 10.1186/s40364-022-00399-9] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2022] [Accepted: 07/13/2022] [Indexed: 02/08/2023] Open
Abstract
Extrachromosomal circular DNA (eccDNA) is defined as a type of circular DNA that exists widely in nature and is independent of chromosomes. EccDNA has attracted the attention of researchers due to its broad, random distribution, complex biogenesis and tumor-relevant functions. EccDNA can carry complete gene information, especially the oncogenic driver genes that are often carried in tumors, with increased copy number and high transcriptional activity. The high overexpression of oncogenes by eccDNA leads to malignant growth of tumors. Regardless, the exact generation and functional mechanisms of eccDNA in disease progression are not yet clear. There is, however, an emerging body of evidence characterizing that eccDNA can be generated from multiple pathways, including DNA damage repair pathways, breakage-fusion-bridge (BFB) mechanisms, chromothripsis and cell apoptosis, and participates in the regulation of tumor progression with multiplex functions. This up-to-date review summarizes and discusses the origins, biogenesis and functions of eccDNA, including its contribution to the formation of oncogene instability and mutations, the heterogeneity and cellular senescence of tumor cells, and the proinflammatory response of tumors. We highlight the possible cancer-related applications of eccDNA, such as its potential use in the diagnosis, targeted therapy and prognostic assessment of cancer.
Collapse
|
19
|
The landscape of submicroscopic structural variants at the OPN1LW/OPN1MW gene cluster on Xq28 underlying blue cone monochromacy. Proc Natl Acad Sci U S A 2022; 119:e2115538119. [PMID: 35759666 PMCID: PMC9271157 DOI: 10.1073/pnas.2115538119] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
Blue cone monochromacy (BCM) is an X-linked retinal disorder characterized by low vision, photoaversion, and poor color discrimination. BCM is due to the lack of long-wavelength-sensitive and middle-wavelength-sensitive cone photoreceptor function and caused by mutations in the OPN1LW/OPN1MW gene cluster on Xq28. Here, we investigated the prevalence and the landscape of submicroscopic structural variants (SVs) at single-base resolution in BCM patients. We found that about one-third (n = 73) of the 213 molecularly confirmed BCM families carry an SV, most commonly deletions restricted to the OPN1LW/OPN1MW gene cluster. The structure and precise breakpoints of the SVs were resolved in all but one of the 73 families. Twenty-two families-all from the United States-showed the same SV, and we confirmed a common ancestry of this mutation. In total, 42 distinct SVs were identified, including 40 previously unreported SVs, thereby quadrupling the number of precisely mapped SVs underlying BCM. Notably, there was no "region of overlap" among these SVs. However, 90% of SVs encompass the upstream locus control region, an essential enhancer element. Its minimal functional extent based on deletion mapping in patients was refined to 358 bp. Breakpoint analyses suggest diverse mechanisms underlying SV formation as well as in one case the gene conversion-based exchange of a 142-bp deletion between opsin genes. Using parsimonious assumptions, we reconstructed the composition and copy number of the OPN1LW/OPN1MW gene cluster prior to the mutation event and found evidence that large gene arrays may be predisposed to the occurrence of SVs at this locus.
Collapse
|
20
|
RecombineX: A generalized computational framework for automatic high-throughput gamete genotyping and tetrad-based recombination analysis. PLoS Genet 2022; 18:e1010047. [PMID: 35533184 PMCID: PMC9119626 DOI: 10.1371/journal.pgen.1010047] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2022] [Revised: 05/19/2022] [Accepted: 04/14/2022] [Indexed: 01/09/2023] Open
Abstract
Meiotic recombination is an essential biological process that ensures faithful chromosome segregation and promotes parental allele shuffling. Tetrad analysis is a powerful approach to quantify the genetic makeups and recombination landscapes of meiotic products. Here we present RecombineX (https://github.com/yjx1217/RecombineX), a generalized computational framework that automates the full workflow of marker identification, gamete genotyping, and tetrad-based recombination profiling based on any organism or genetic background with batch processing capability. Aside from conventional reference-based analysis, RecombineX can also perform analysis based on parental genome assemblies, which facilitates analyzing meiotic recombination landscapes in their native genomic contexts. Additional features such as copy number variation profiling and missing genotype inference further enhance downstream analysis. RecombineX also includes a dedicate module for simulating the genomes and reads of recombinant tetrads, which enables fine-tuned simulation-based hypothesis testing. This simulation module revealed the power and accuracy of RecombineX even when analyzing tetrads with very low sequencing depths (e.g., 1-2X). Tetrad sequencing data from the budding yeast Saccharomyces cerevisiae and green alga Chlamydomonas reinhardtii were further used to demonstrate the accuracy and robustness of RecombineX for organisms with both small and large genomes, manifesting RecombineX as an all-around one stop solution for future tetrad analysis. Interestingly, our re-analysis of the budding yeast tetrad sequencing data with RecombineX and Oxford Nanopore sequencing revealed two unusual structural rearrangement events that were not noticed before, which exemplify the occasional genome instability triggered by meiosis.
Collapse
|
21
|
Editorial: Copy Number Variation in Rare Disorders. Front Genet 2022; 13:898059. [PMID: 35450215 PMCID: PMC9016220 DOI: 10.3389/fgene.2022.898059] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2022] [Accepted: 03/18/2022] [Indexed: 02/06/2023] Open
|
22
|
Chromosome Changes in Soma and Germ Line: Heritability and Evolutionary Outcome. Genes (Basel) 2022; 13:genes13040602. [PMID: 35456408 PMCID: PMC9029507 DOI: 10.3390/genes13040602] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2022] [Revised: 03/25/2022] [Accepted: 03/26/2022] [Indexed: 12/13/2022] Open
Abstract
The origin and inheritance of chromosome changes provide the essential foundation for natural selection and evolution. The evolutionary fate of chromosome changes depends on the place and time of their emergence and is controlled by checkpoints in mitosis and meiosis. Estimating whether the altered genome can be passed to subsequent generations should be central when we consider a particular genome rearrangement. Through comparative analysis of chromosome rearrangements in soma and germ line, the potential impact of macromutations such as chromothripsis or chromoplexy appears to be fascinating. What happens with chromosomes during the early development, and which alterations lead to mosaicism are other poorly studied but undoubtedly essential issues. The evolutionary impact can be gained most effectively through chromosome rearrangements arising in male meiosis I and in female meiosis II, which are the last divisions following fertilization. The diversity of genome organization has unique features in distinct animals; the chromosome changes, their internal relations, and some factors safeguarding genome maintenance in generations under natural selection were considered for mammals.
Collapse
|
23
|
Genomic structural variation in tomato and its role in plant immunity. MOLECULAR HORTICULTURE 2022; 2:7. [PMID: 37789472 PMCID: PMC10515242 DOI: 10.1186/s43897-022-00029-w] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/17/2021] [Accepted: 02/22/2022] [Indexed: 10/05/2023]
Abstract
It is well known that large genomic variations can greatly impact the phenotype of an organism. Structural Variants (SVs) encompass any genomic variation larger than 30 base pairs, and include changes caused by deletions, inversions, duplications, transversions, and other genome modifications. Due to their size and complex nature, until recently, it has been difficult to truly capture these variations. Recent advances in sequencing technology and computational analyses now permit more extensive studies of SVs in plant genomes. In tomato, advances in sequencing technology have allowed researchers to sequence hundreds of genomes from tomatoes, and tomato relatives. These studies have identified SVs related to fruit size and flavor, as well as plant disease response, resistance/susceptibility, and the ability of plants to detect pathogens (immunity). In this review, we discuss the implications for genomic structural variation in plants with a focus on its role in tomato immunity. We also discuss how advances in sequencing technology have led to new discoveries of SVs in more complex genomes, the current evidence for the role of SVs in biotic and abiotic stress responses, and the outlook for genetic modification of SVs to advance plant breeding objectives.
Collapse
|
24
|
Circular DNA in the human germline and its association with recombination. Mol Cell 2022; 82:209-217.e7. [PMID: 34951964 PMCID: PMC10707452 DOI: 10.1016/j.molcel.2021.11.027] [Citation(s) in RCA: 24] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2021] [Revised: 08/24/2021] [Accepted: 11/23/2021] [Indexed: 12/24/2022]
Abstract
Extrachromosomal circular DNA (eccDNA) is common in somatic tissue, but its existence and effects in the human germline are unexplored. We used microscopy, long-read DNA sequencing, and new analytic methods to document thousands of eccDNAs from human sperm. EccDNAs derived from all genomic regions and mostly contained a single DNA fragment, although some consisted of multiple fragments. The generation of eccDNA inversely correlates with the meiotic recombination rate, and chromosomes with high coding-gene density and Alu element abundance form the least eccDNA. Analysis of insertions in human genomes further indicates that eccDNA can persist in the human germline when the circular molecules reinsert themselves into the chromosomes. Our results suggest that eccDNA has transient and permanent effects on the germline. They explain how differences in the physical and genetic map might arise and offer an explanation of how Alu elements coevolved with genes to protect genome integrity against deleterious mutations producing eccDNA.
Collapse
|
25
|
Genome-Wide Assessment Characteristics of Genes Overlapping Copy Number Variation Regions in Duroc Purebred Population. Front Genet 2021; 12:753748. [PMID: 34721540 PMCID: PMC8552909 DOI: 10.3389/fgene.2021.753748] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2021] [Accepted: 09/23/2021] [Indexed: 11/13/2022] Open
Abstract
Copy number variations (CNVs) are important structural variations that can cause significant phenotypic diversity. Reliable CNVs mapping can be achieved by identification of CNVs from different genetic backgrounds. Investigations on the characteristics of overlapping between CNV regions (CNVRs) and protein-coding genes (CNV genes) or miRNAs (CNV-miRNAs) can reveal the potential mechanisms of their regulation. In this study, we used 50 K SNP arrays to detect CNVs in Duroc purebred pig. A total number of 211 CNVRs were detected with a total length of 118.48 Mb, accounting for 5.23% of the autosomal genome sequence. Of these CNVRs, 32 were gains, 175 losses, and four contained both types (loss and gain within the same region). The CNVRs we detected were non-randomly distributed in the swine genome and were significantly enriched in the segmental duplication and gene density region. Additionally, these CNVRs were overlapping with 1,096 protein-coding genes (CNV-genes), and 39 miRNAs (CNV-miRNAs), respectively. The CNV-genes were enriched in terms of dosage-sensitive gene list. The expression of the CNV genes was significantly higher than that of the non-CNV genes in the adult Duroc prostate. Of all detected CNV genes, 22.99% genes were tissue-specific (TSI > 0.9). Strong negative selection had been underway in the CNV-genes as the ones that were located entirely within the loss CNVRs appeared to be evolving rapidly as determined by the median dN plus dS values. Non-CNV genes tended to be miRNA target than CNV-genes. Furthermore, CNV-miRNAs tended to target more genes compared to non-CNV-miRNAs, and a combination of two CNV-miRNAs preferentially synergistically regulated the same target genes. We also focused our efforts on examining CNV genes and CNV-miRNAs functions, which were also involved in the lipid metabolism, including DGAT1, DGAT2, MOGAT2, miR143, miR335, and miRLET7. Further molecular experiments and independent large studies are needed to confirm our findings.
Collapse
|
26
|
Clan genomics: From OMIM phenotypic traits to genes and biology. Am J Med Genet A 2021; 185:3294-3313. [PMID: 34405553 PMCID: PMC8530976 DOI: 10.1002/ajmg.a.62434] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2021] [Revised: 06/29/2021] [Accepted: 07/04/2021] [Indexed: 12/20/2022]
Abstract
Clinical characterization of a patient phenotype has been the quintessential approach for elucidating a differential diagnosis and a hypothesis to explore a potential clinical diagnosis. This has resulted in a language of medicine and a semantic ontology, with both specialty- and subspecialty-specific lexicons, that can be challenging to translate and interpret. There is no 'Rosetta Stone' of clinical medicine such as the genetic code that can assist translation and interpretation of the language of genetics. Nevertheless, the information content embodied within a clinical diagnosis can guide management, therapeutic intervention, and potentially prognostic outlook of disease enabling anticipatory guidance for patients and families. Clinical genomics is now established firmly in medical practice. The granularity and informative content of a personal genome is immense. Yet, we are limited in our utility of much of that personal genome information by the lack of functional characterization of the overwhelming majority of computationally annotated genes in the haploid human reference genome sequence. Whereas DNA and the genetic code have provided a 'Rosetta Stone' to translate genetic variant information, clinical medicine, and clinical genomics provide the context to understand human biology and disease. A path forward will integrate deep phenotyping, such as available in a clinical synopsis in the Online Mendelian Inheritance in Man (OMIM) entries, with personal genome analyses.
Collapse
|
27
|
Copy number variations of chromosome 17p11.2 region in children with development delay and in fetuses with abnormal imaging findings. BMC Med Genomics 2021; 14:215. [PMID: 34470638 PMCID: PMC8411507 DOI: 10.1186/s12920-021-01065-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2021] [Accepted: 08/20/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Deletion and duplication of the 3.7 Mb region in 17p11.2 result in two syndromes, Smith-Magenis syndrome and Potocki-Lupski syndrome, which are well-known development disorders. The purpose of this study was to determine the prevalence, genetic characteristics and clinical phenotypes of 17p11.2 deletion/duplication in Chinese children with development delay and in fetuses with potential congenital defects. METHODS 7077 children with development delay and/or intellectual disability were screened by multiplex ligation-dependent probe amplification P245 assay. 7319 fetuses with potential congenital defects were tested using next generation sequencing technique. RESULTS 417 of 7077 pediatric patients were determined to carry chromosome imbalance. 28 (28/7077, 0.4%) cases had imbalance at chromosome 17p11.2. Among them, 12 cases (42.9%) had heterozygous deletions and 16 cases (57.1%) had heterozygous duplications. The clinical phenotypes were variable, including neurobehavioral disorders, craniofacial/skeletal anomalies, immunologic defects, ocular problems and organ malformations. 263 of 7319 fetuses were recognized to have genomic copy number variations. Only 2 of them were found to harbor 17p11.2 imbalance. The fetus with deletion presented with ventricular septal defect and the fetus with duplication had cerebral ventricle dilation. CONCLUSION Our study highlights the phenotypic variability associated with 17p11.2 variations in China. The results further expand the phenotypic spectrum of SMS/PTLS and increase awareness of these disruptive mutations among clinicians.
Collapse
|
28
|
Introgression contributes to distribution of structural variations in cattle. Genomics 2021; 113:3092-3102. [PMID: 34242710 DOI: 10.1016/j.ygeno.2021.07.005] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2021] [Revised: 06/24/2021] [Accepted: 07/03/2021] [Indexed: 11/19/2022]
Abstract
Structural variations (SVs) are an important source of phenotypic diversity in cattle. Here, 72 whole genome sequences representing taurine and zebu cattle were used to identify SVs. Applying multiple approaches, 16,738 SVs were identified. A comparison against the Database of Genomic Variants archives revealed that 1575 SVs were novel in our data. A novel duplication covering the entire GALNT15 gene, was observed only in N'Dama. A duplication, which was previously reported only in zebu and associated with navel length, was also observed in N'Dama. Investigation of a novel deletion located upstream of CAST13 gene and identified only in Italian cattle and zebu, revealed its introgressed origin in the former. Overall, our data highlights how the SVs distribution in cattle is also shaped by forces such as demographical differences and gene flow. The cattle SVs of this study and its meta-data can be visualized on an interactive genome browser at https://tinyurl.com/svCowArs.
Collapse
|
29
|
Identification of microduplications at Xp21.2 and Xq13.1 in neurodevelopmental disorders. Mol Genet Genomic Med 2021; 9:e1703. [PMID: 33982443 PMCID: PMC8683627 DOI: 10.1002/mgg3.1703] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2021] [Revised: 04/05/2021] [Accepted: 04/20/2021] [Indexed: 11/21/2022] Open
Abstract
Background Microduplications are a rare cause of disease in X‐linked neurodevelopmental disorders but likely have been under reported due challenges in detection and interpretation. Methods We performed exome sequencing and subsequent microarray analysis in two families with a neurodevelopmental disorder. Results Here, we report on two families each with unique inherited microduplications at Xp21.2 and Xq13.1, respectively. In the first family, a 562.8‐kb duplication at Xq13.1 covering DLG3, TEX11, SLC7A3, GDPD2, and part KIF4A was identified in a boy whose phenotype was characterized by delayed speech development, mild intellectual disability (ID), mild dysmorphic facial features, a heart defect, and neuropsychiatric symptoms. By interrogating all reported Xq13.1 duplications in individuals affected with a neurodevelopmental disorder, we provide evidence that this genomic region and particularly DLG3 might be sensitive to an increased dosage. In the second family with four affected males, we found a noncontinuous 223‐ and 204‐kb duplication at Xp21.2, of which the first duplication covers exon 6 of IL1RAPL1. The phenotype of the male patients was characterized by delayed speech development, mild to moderate ID, strabismus, and neurobehavioral symptoms. The carrier daughter and her mother had learning difficulties. IL1RAPL1 shows nonrecurrent causal structural variation and is located at a common fragile site (FRAXC), prone to re‐arrangement. Conclusion In conclusion, we show that comprehensive clinical and genetic examination of microduplications on the X‐chromosome can be helpful in undiagnosed cases of neurodevelopmental disease.
Collapse
|
30
|
Genome diversity and instability in human germ cells and preimplantation embryos. Semin Cell Dev Biol 2021; 113:132-147. [PMID: 33500205 PMCID: PMC8097364 DOI: 10.1016/j.semcdb.2020.12.007] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2020] [Accepted: 12/18/2020] [Indexed: 12/26/2022]
Abstract
Genome diversity is essential for evolution and is of fundamental importance to human health. Generating genome diversity requires phases of DNA damage and repair that can cause genome instability. Humans have a high incidence of de novo congenital disorders compared to other organisms. Recent access to eggs, sperm and preimplantation embryos is revealing unprecedented rates of genome instability that may result in infertility and de novo mutations that cause genomic imbalance in at least 70% of conceptions. The error type and incidence of de novo mutations differ during developmental stages and are influenced by differences in male and female meiosis. In females, DNA repair is a critical factor that determines fertility and reproductive lifespan. In males, aberrant meiotic recombination causes infertility, embryonic failure and pregnancy loss. Evidence suggest germ cells are remarkably diverse in the type of genome instability that they display and the DNA damage responses they deploy. Additionally, the initial embryonic cell cycles are characterized by a high degree of genome instability that cause congenital disorders and may limit the use of CRISPR-Cas9 for heritable genome editing.
Collapse
|
31
|
Abstract
Mutation of the human genome results in three classes of genomic variation: single nucleotide variants; short insertions or deletions; and large structural variants (SVs). Some mutations occur during normal processes, such as meiotic recombination or B cell development, and others result from DNA replication or aberrant repair of breaks in sequence-specific contexts. Regardless of mechanism, mutations are subject to selection, and some hotspots can manifest in disease. Here, we discuss genomic regions prone to mutation, mechanisms contributing to mutation susceptibility, and the processes leading to their accumulation in normal and somatic genomes. With further, more accurate human genome sequencing, additional mutation hotspots, mechanistic details of their formation, and the relevance of hotspots to evolution and disease are likely to be discovered.
Collapse
|
32
|
Replicative aging is associated with loss of genetic heterogeneity from extrachromosomal circular DNA in Saccharomyces cerevisiae. Nucleic Acids Res 2020; 48:7883-7898. [PMID: 32609810 PMCID: PMC7430651 DOI: 10.1093/nar/gkaa545] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2019] [Revised: 05/28/2020] [Accepted: 06/17/2020] [Indexed: 12/26/2022] Open
Abstract
Circular DNA can arise from all parts of eukaryotic chromosomes. In yeast, circular ribosomal DNA (rDNA) accumulates dramatically as cells age, however little is known about the accumulation of other chromosome-derived circles or the contribution of such circles to genetic variation in aged cells. We profiled circular DNA in Saccharomyces cerevisiae populations sampled when young and after extensive aging. Young cells possessed highly diverse circular DNA populations but 94% of the circular DNA were lost after ∼15 divisions, whereas rDNA circles underwent massive accumulation to >95% of circular DNA. Circles present in both young and old cells were characterized by replication origins including circles from unique regions of the genome and repetitive regions: rDNA and telomeric Y' regions. We further observed that circles can have flexible inheritance patterns: [HXT6/7circle] normally segregates to mother cells but in low glucose is present in up to 50% of cells, the majority of which must have inherited this circle from their mother. Interestingly, [HXT6/7circle] cells are eventually replaced by cells carrying stable chromosomal HXT6 HXT6/7 HXT7 amplifications, suggesting circular DNAs are intermediates in chromosomal amplifications. In conclusion, the heterogeneity of circular DNA offers flexibility in adaptation, but this heterogeneity is remarkably diminished with age.
Collapse
|
33
|
Prenatal diagnosis of rearrangements in the fetal 22q11.2 region. Mol Cytogenet 2020; 13:28. [PMID: 32670410 PMCID: PMC7346507 DOI: 10.1186/s13039-020-00498-y] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2020] [Accepted: 06/24/2020] [Indexed: 12/28/2022] Open
Abstract
BACKGROUND 22q11.2 deletion syndrome (22q11.2DS) and 22q11.2 duplication syndrome (22q11.2DupS) are the most common copy number variations in humans. The clinical phenotypes of these two syndromes are variable, and there are no large sample data on the prenatal detection rate for these two syndromes in the Chinese population. RESULTS We recruited 411 pregnant women who showed either abnormal prenatal ultrasound findings or positive prenatal BoBs™ results or who had given birth to a child with chromosomal abnormalities. SNP-array analysis and interphase FISH analysis identified five fetuses with 22q11.2 copy number variants (CNVs), three of which were 22q11.2 deletion syndrome (22q11.2DS) (3/411) and two of which were 22q11.2 duplication syndrome (22q11.2DupS). In all 5 cases of diagnosed 22q11.2 abnormalities, inheritance could not be identified because the parents did not undergo further testing. CONCLUSION Our case reports provide a detection rate of 22q11.2 CNVs for fetuses with prenatal diagnostic indications, and early diagnosis of these two syndromes was essential for prenatal intervention in these cases. SNP-array technology is an effective tool in the prenatal diagnosis of 22q11.2 CNVs. The prenatal diagnosis of these two syndromes is helpful for early intervention, which is of great clinical significance.
Collapse
|
34
|
Autism risk in offspring can be assessed through quantification of male sperm mosaicism. Nat Med 2020; 26:143-150. [PMID: 31873310 PMCID: PMC7032648 DOI: 10.1038/s41591-019-0711-0] [Citation(s) in RCA: 57] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2019] [Accepted: 11/21/2019] [Indexed: 01/28/2023]
Abstract
De novo mutations arising on the paternal chromosome make the largest known contribution to autism risk, and correlate with paternal age at the time of conception. The recurrence risk for autism spectrum disorders is substantial, leading many families to decline future pregnancies, but the potential impact of assessing parental gonadal mosaicism has not been considered. We measured sperm mosaicism using deep-whole-genome sequencing, for variants both present in an offspring and evident only in father's sperm, and identified single-nucleotide, structural and short tandem-repeat variants. We found that mosaicism quantification can stratify autism spectrum disorders recurrence risk due to de novo mutations into a vast majority with near 0% recurrence and a small fraction with a substantially higher and quantifiable risk, and we identify novel mosaic variants at risk for transmission to a future offspring. This suggests, therefore, that genetic counseling would benefit from the addition of sperm mosaicism assessment.
Collapse
|
35
|
De novo copy number variants and parental age: Is there an association? Eur J Med Genet 2019; 63:103829. [PMID: 31883480 DOI: 10.1016/j.ejmg.2019.103829] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2018] [Revised: 10/10/2019] [Accepted: 12/21/2019] [Indexed: 11/30/2022]
Abstract
PURPOSE To investigate whether increased parental age is associated with an increased risk for de novo copy number variant (CNV) formation in offspring. METHODS CNV calls from 2323 individuals referred to Signature Genomic Laboratories for clinical microarray-based comparative genomic hybridization were investigated; 17% of the samples were prenatal and 83% were postnatal. The de novo CNV data were further split into de novo CNVs bound by low copy repeats (LCRs) and those not bound by LCRs. RESULTS No association was found between CNV occurrence and paternal age in both the prenatal (p = 0.6795) and postnatal (p = 0.1741) cohorts. Maternal age was significantly higher with de novo CNV occurrence in our postnatal cohort (p = 0.0126), an effect which may be driven by formation of de novo CNVs that are bound by LCRs (p = 0.0026). Furthermore, a significant positive correlation was observed between maternal age and de novo CNVs (Point-Biserial R2 = 0.0503, p = 0.0152). CONCLUSIONS This large-scale study did not find any evidence for the influence of increased paternal age on de novo CNV formation, while increased maternal age appeared to increase risk for de novo, non-complex CNV occurrence in offspring with intellectual disability/developmental delay. Further studies and continued technological advances will help yield more information on the risk factors for de novo CNVs.
Collapse
|
36
|
High frequency DNA rearrangement at qγ27 creates a novel allele for Quality Protein Maize breeding. Commun Biol 2019; 2:460. [PMID: 31840105 PMCID: PMC6904753 DOI: 10.1038/s42003-019-0711-0] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2019] [Accepted: 11/21/2019] [Indexed: 12/18/2022] Open
Abstract
Copy number variation (CNV) is a major source of genetic variation and often contributes to phenotypic variation in maize. The duplication at the 27-kDa γ-zein locus (qγ27) is essential to convert soft endosperm into hard endosperm in quality protein maize (QPM). This duplication is unstable and generally produces CNV at this locus. We conducted genetic experiments designed to directly measure DNA rearrangement frequencies occurring in males and females of different genetic backgrounds. The average frequency with which the duplication rearranges to single copies is 1.27 × 10-3 and varies among different lines. A triplication of γ27 gene was screened and showed a better potential than the duplication for the future QPM breeding. Our results highlight a novel approach to directly determine the frequency of DNA rearrangements, in this case resulting in CNV at the qγ27 locus. Furthermore, this provides a highly effective way to test suitable parents in QPM breeding.
Collapse
|
37
|
Impact of copy number variation on human neurocognitive deficits and congenital heart defects: A systematic review. Neurosci Biobehav Rev 2019; 108:83-93. [PMID: 31682886 DOI: 10.1016/j.neubiorev.2019.10.020] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2019] [Accepted: 10/26/2019] [Indexed: 12/16/2022]
Abstract
Copy number variant (CNV) syndromes are often associated with both neurocognitive deficits (NCDs) and congenital heart defects (CHDs). Children and adults with cardiac developmental defects likely to have NCDs leading to increased risk of hospitalisation and reduced level of independence. To date, the association between these two phenotypes have not been explored in relation to CNV syndromes. In order to address this question, we systematically reviewed the prevalence of CHDs in a range of CNV syndromes associated with NCDs. A meta-analysis showed a relationship with the size of CNV and its association with both NCDs and CHDs, and also inheritance pattern. To our knowledge, this is the first review to establish association between NCD and CHDs in CNV patients, specifically in relation to the severity of NCD. Importantly, we also found specific types of CHDs were associated with severe neurocognitive deficits. Finally, we discuss the implications of these results for patients in the clinical setting which warrants further exploration of this association in order to lead an improvement in the quality of patient's life.
Collapse
|
38
|
Additive Diagnostic Yield of Homozygosity Regions Identified During Chromosomal microarray Testing in Children with Developmental Delay, Dysmorphic Features or Congenital Anomalies. Biochem Genet 2019; 58:74-101. [PMID: 31273557 DOI: 10.1007/s10528-019-09931-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2019] [Accepted: 06/27/2019] [Indexed: 11/26/2022]
Abstract
Chromosomal microarray (CMA) has emerged as a robust tool for identifying microdeletions and microduplications, termed copy number variants (CNVs). Nevertheless, data regarding its utility in different patient populations with developmental delay (DD), dysmorphic features (DF) and congenital anomalies (CA), is a matter of dense debate. Although regions of homozygosity (ROH) are not diagnostic of a specific condition, they may have pathogenic implications. Certain CNVs and ROH have ethnically specific occurrences and frequencies. We aimed to determine whether CMA testing offers additional diagnostic information over classical cytogenetics for identifying genomic imbalances in a pediatric cohort with idiopathic DD, DF, or CA. One hundred sixty-nine patients were offered cytogenetics and CMA simultaneously for etiological diagnosis of DD (n = 67), DF (n = 52) and CA (n = 50). CMA could identify additional, clinically significant anomalies as compared with cytogenetics. CMA detected 61 CNVs [21 (34.4%) pathogenic CNVs, 37 (60.7%) variants of uncertain clinical significance and 3 (4.9%) benign CNVs] in 44 patients. CMA identified one or more ROH in 116/169 (68.6%) patients. When considering pathogenic CNVs and aneuploidies as positive findings, 9/169 (5.3%) received a genetic diagnosis from cytogenetics, while 25/169 (14.8%) could have a genetic diagnosis from CMA. The identification of ROH was clinically significant in two cases (2/169), thereby, adding 1.2% to the diagnostic yield of CMA (16% vs. 5.3%, p < 0.001). CMA uncovers additional genetic diagnoses over cytogenetics, thereby, offering a much higher diagnostic yield. Our findings convincingly demonstrate the additive diagnostic value of clinically significant ROH identified during CMA testing, highlighting the need for careful clinical interpretation of these ROH.
Collapse
|
39
|
Experience of chromosomal microarray applied in prenatal and postnatal settings in Hong Kong. AMERICAN JOURNAL OF MEDICAL GENETICS PART C-SEMINARS IN MEDICAL GENETICS 2019; 181:196-207. [PMID: 30903683 DOI: 10.1002/ajmg.c.31697] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/13/2018] [Revised: 03/06/2019] [Accepted: 03/07/2019] [Indexed: 12/14/2022]
Abstract
Chromosomal microarray (CMA) is recommended as a first tier investigation for patients with developmental delay (DD), intellectual disability (ID), autistic spectrum disorder (ASD), and multiple congenital anomalies (MCA). It is widely used in the prenatal and postnatal settings for detection of chromosomal aberrations. This is a retrospective review of all array comparative genomic hybridization (aCGH/ array CGH) findings ascertained in two major prenatal and postnatal genetic diagnostic centers in Hong Kong from June 2012 to December 2017. Medical records were reviewed for cases with pathogenic and variants of uncertain clinical significance (VUS). Classification of copy number variants (CNVs) was based on current knowledge and experience by August 2018. The aims of this review are to study the diagnostic yield of array CGH application in prenatal and postnatal settings in Hong Kong and to describe the spectrum of abnormalities found. Prenatal indications included abnormal ultrasound findings, positive Down syndrome screening, abnormal noninvasive prenatal test results, advanced maternal age and family history of chromosomal or genetic abnormalities. Postnatal indications included unexplained DD, ID, ASD, and MCA. A total of 1,261 prenatal subjects and 3,096 postnatal patients were reviewed. The prenatal diagnostic yield of pathogenic CNV and VUS (excluding those detectable by karyotype) was 3.5%. The postnatal diagnostic yield of pathogenic CNV was 15.2%. The detection rates for well-defined microdeletion and microduplication syndromes were 4.6% in prenatal and 6.1% (1 in 16 index patients) in postnatal cases, respectively. Chromosomes 15, 16, and 22 accounted for over 21 and 25% of pathogenic CNVs detected in prenatal and postnatal cohorts, respectively. This review provides the first large scale overview of genomic imbalance of mostly Chinese patients in prenatal and postnatal settings.
Collapse
|
40
|
2018 Victor A. McKusick Leadership Award: Molecular Mechanisms for Genomic and Chromosomal Rearrangements. Am J Hum Genet 2019; 104:391-406. [PMID: 30849326 DOI: 10.1016/j.ajhg.2018.12.018] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
|
41
|
Dysregulation of TBX1 dosage in the anterior heart field results in congenital heart disease resembling the 22q11.2 duplication syndrome. Hum Mol Genet 2019; 27:1847-1857. [PMID: 29509905 DOI: 10.1093/hmg/ddy078] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2017] [Accepted: 02/27/2018] [Indexed: 12/27/2022] Open
Abstract
Non-allelic homologous recombination events on chromosome 22q11.2 during meiosis can result in either the deletion (22q11.2DS) or duplication (22q11.2DupS) syndrome. Although the spectrum and frequency of congenital heart disease (CHD) are known for 22q11.2DS, there is less known for 22q11.2DupS. We now evaluated cardiac phenotypes in 235 subjects with 22q11.2DupS including 102 subjects we collected and 133 subjects that were previously reported as a confirmation and found 25% have CHD, mostly affecting the cardiac outflow tract (OFT). Previous studies have shown that global loss or gain of function (LOF; GOF) of mouse Tbx1, encoding a T-box transcription factor mapping to the region of synteny to 22q11.2, results in similar OFT defects. To further evaluate Tbx1 function in the progenitor cells forming the cardiac OFT, termed the anterior heart field, Tbx1 was overexpressed using the Mef2c-AHF-Cre driver (Tbx1 GOF). Here we found that all resulting conditional GOF embryos had a persistent truncus arteriosus (PTA), similar to what was previously reported for conditional Tbx1 LOF mutant embryos. To understand the basis for the PTA in the conditional GOF embryos, we found that proliferation in the Mef2c-AHF-Cre lineage cells before migrating to the heart, was reduced and critical genes were oppositely changed in this tissue in Tbx1 GOF embryos versus conditional LOF embryos. These results suggest that a major function of TBX1 in the AHF is to maintain the normal balance of expression of key cardiac developmental genes required to form the aorta and pulmonary trunk, which is disrupted in 22q11.2DS and 22q11.2DupS.
Collapse
|
42
|
Genomic copy number variation of the CHKB gene alters gene expression and affects growth traits of Chinese domestic yak (Bos grunniens) breeds. Mol Genet Genomics 2019; 294:549-561. [DOI: 10.1007/s00438-018-01530-y] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2018] [Accepted: 12/29/2018] [Indexed: 12/22/2022]
|
43
|
Age-Dependent De Novo Mutations During Spermatogenesis and Their Consequences. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2019; 1166:29-46. [DOI: 10.1007/978-3-030-21664-1_2] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
|
44
|
Characterization of copy number variants in a large multibreed population of beef and dairy cattle using high-density single nucleotide polymorphism genotype data. J Anim Sci 2018; 96:4112-4124. [PMID: 30239746 DOI: 10.1093/jas/sky302] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2018] [Accepted: 08/07/2018] [Indexed: 02/07/2023] Open
Abstract
Copy number variants (CNVs) are a form of genomic variation that changes the structure of the genome through deletion or duplication of stretches of DNA. The objective of the present study was to characterize CNVs in a large multibreed population of beef and dairy bulls. The CNVs were called on the autosomes of 5,551 cattle from 22 different beef and dairy breeds, using 2 freely available software suites, QuantiSNP and PennCNV. All CNVs were classified into either deletions or duplications. The median concordance between PennCNV and QuantiSNP, per animal, was 18.5% for deletions and 0% for duplications. The low concordance rate between PennCNV and QuantiSNP indicated that neither algorithm, by itself, could identify all CNVs in the population. In total, PennCNV and QuantiSNP collectively identified 747,129 deletions and 432,523 duplications; 80.2% of all duplications and 69.1% of all deletions were present only once in the population. Only 0.154% of all CNVs identified were present in more than 50 animals in the population. The distribution of the percentage of the autosomes that were composed of deletions, per animal, was positively skewed, as was the distribution for the percentage of the autosomes that were composed of duplications, per animal. The first quartile, median, and third quartile of the distribution of the percentage of the autosomes that were composed of deletions were 0.019%, 0.037%, and 0.201%, respectively. The first quartile, median, and third quartile of the distribution of the percentage of the autosomes that were composed of duplications were 0.013%, 0.028%, and 0.076%, respectively. The distributions of the number of deletions and duplications per animal were both positively skewed. The interquartile range for the number of deletions per animal in the population was between 16 and 117, whereas for duplications it was between 8 and 23. Per animal, there tended to be twice as many deletions as duplications. The distribution of the length of deletions was positively skewed, as was the distribution of the length of duplications. The interquartile range for the length of deletions in the population was between 25 and 101 kb, and for duplications the interquartile range was between 46 and 235 kb. Per animal, duplications tended to be twice as long as deletions. This study provides a description of the characteristics and distribution of CNVs in a large multibreed population of beef and dairy cattle.
Collapse
|
45
|
The other side of the coin: Hypersociability. GENES BRAIN AND BEHAVIOR 2018; 18:e12512. [PMID: 30101538 DOI: 10.1111/gbb.12512] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/10/2018] [Revised: 07/18/2018] [Accepted: 08/07/2018] [Indexed: 12/29/2022]
Abstract
Affiliative social motivation and behavior, that is, sociability that includes attachment, prosocial behavior (sharing, caring and helping) and empathy (the ability to understand and share the feelings of others), has high variability in the human population, with a portion of people outside of the normal range. While psychiatric disorders and autism spectrum disorders are typically associated with a deficit in social behavior, the opposite trait of hypersociability and indiscriminate friendliness are exhibited by individual with specific neurodevelopmental disorders and following early adverse care. Here we discuss both genetic and environmental factors that cause or increase the risk for developing pathological hypersociability from human to rodent models.
Collapse
|
46
|
Extreme clustering of type-1 NF1 deletion breakpoints co-locating with G-quadruplex forming sequences. Hum Genet 2018; 137:511-520. [PMID: 29992513 DOI: 10.1007/s00439-018-1904-1] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2018] [Accepted: 07/05/2018] [Indexed: 01/02/2023]
Abstract
The breakpoints of type-1 NF1 deletions encompassing 1.4-Mb are located within NF1-REPa and NF1-REPc, which exhibit a complex structure comprising different segmental duplications in direct and inverted orientation. Here, we systematically assessed the proportion of type-1 NF1 deletions caused by nonallelic homologous recombination (NAHR) and those mediated by other mutational mechanisms. To this end, we analyzed 236 unselected type-1 deletions and observed that 179 of them (75.8%) had breakpoints located within the NAHR hotspot PRS2, whereas 39 deletions (16.5%) had breakpoints located within PRS1. Sixteen deletions exhibited breakpoints located outside of these NAHR hotspots but were also mediated by NAHR. Taken together, the breakpoints of 234 (99.2%) of the 236 type-1 NF1 deletions were mediated by NAHR. Thus, NF1-REPa and NF1-REPc are strongly predisposed to recurrent NAHR, the main mechanism underlying type-1 NF1 deletions. We also observed a non-random overlap between type-1 NF1-deletion breakpoints and G-quadruplex forming sequences (GQs) as well as regions flanking PRDM9A binding-sites. These findings imply that GQs and PRDM9A binding-sites contribute to the clustering of type-1 deletion breakpoints. The co-location of both types of sequence was at its highest within PRS2, indicative of their synergistic contribution to the greatly increased NAHR activity within this hotspot.
Collapse
|
47
|
Landscape of copy number variations in Bos taurus: individual - and inter-breed variability. BMC Genomics 2018; 19:410. [PMID: 29843606 PMCID: PMC5975385 DOI: 10.1186/s12864-018-4815-6] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2018] [Accepted: 05/22/2018] [Indexed: 11/24/2022] Open
Abstract
Background The number of studies of Copy Number Variation in cattle has increased in recent years. This has been prompted by the increased availability of data on polymorphisms and their relationship with phenotypes. In addition, livestock species are good models for some human phenotypes. In the present study, we described the landscape of CNV driven genetic variation in a large population of 146 individuals representing 13 cattle breeds, using whole genome DNA sequence. Results A highly significant variation among all individuals and within each breed was observed in the number of duplications (P < 10−15) and in the number of deletions (P < 10−15). We also observed significant differences between breeds for duplication (P = 0.01932) and deletion (P = 0.01006) counts. The same variation CNV length - inter-individual and inter-breed differences were significant for duplications (P < 10−15) and deletions (P < 10−15). Moreover, breed-specific variants were identified, with the largest proportion of breed-specific duplications (9.57%) found for Fleckvieh and breed-specific deletions found for Brown Swiss (5.00%). Such breed-specific CNVs were predominantly located in intragenic regions, however in Simmental, one deletion present in five individuals was found in the coding sequence of a novel gene ENSBTAG00000000688 on chromosome 18. In Brown Swiss, Norwegian Red and Simmental breed-specific deletions were located within KIT and MC1R genes, which are responsible for a coat colour. The functional annotation of coding regions underlying the breed-specific CNVs showed that in Norwegian Red, Guernsey, and Simmental significantly under- and overrepresented GO terms were related to chemical stimulus involved in sensory perception of smell and the KEGG pathways for olfactory transduction. In addition, specifically for the Norwegian Red breed, the dopaminergic synapse KEGG pathway was significantly enriched within deleted parts of the genome. Conclusions The CNV landscape in Bos taurus genome revealed by this study was highly complex, with inter-breed differences, but also a significant variation within breeds. The former, may explain some of the phenotypic differences among analysed breeds, and the latter contributes to within-breed variation available for selection. Electronic supplementary material The online version of this article (10.1186/s12864-018-4815-6) contains supplementary material, which is available to authorized users.
Collapse
|
48
|
Multi-Invasion-Induced Rearrangements as a Pathway for Physiological and Pathological Recombination. Bioessays 2018; 40:e1700249. [PMID: 29578583 PMCID: PMC6072258 DOI: 10.1002/bies.201700249] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2017] [Revised: 02/08/2018] [Indexed: 01/24/2023]
Abstract
Cells mitigate the detrimental consequences of DNA damage on genome stability by attempting high fidelity repair. Homologous recombination templates DNA double-strand break (DSB) repair on an identical or near identical donor sequence in a process that can in principle access the entire genome. Other physiological processes, such as homolog recognition and pairing during meiosis, also harness the HR machinery using programmed DSBs to physically link homologs and generate crossovers. A consequence of the homology search process by a long nucleoprotein filament is the formation of multi-invasions (MI), a joint molecule in which the damaged ssDNA has invaded more than one donor molecule. Processing of MI joint molecules can compromise the integrity of both donor sites and lead to their rearrangement. Here, two mechanisms for the generation of rearrangements as a pathological consequence of MI processing are detailed and the potential relevance for non-allelic homologous recombination discussed. Finally, it is proposed that MI-induced crossover formation may be a feature of physiological recombination.
Collapse
|
49
|
An estimation of the prevalence of genomic disorders using chromosomal microarray data. J Hum Genet 2018; 63:795-801. [PMID: 29691480 PMCID: PMC6019170 DOI: 10.1038/s10038-018-0451-x] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2017] [Revised: 02/20/2018] [Accepted: 03/14/2018] [Indexed: 02/04/2023]
Abstract
Multiple genomic disorders result from recurrent deletions or duplications between low-copy repeat (LCR) clusters, mediated by nonallelic homologous recombination (NAHR). These copy number variants (CNVs) often exhibit variable expressivity and/or incomplete penetrance. However, the population prevalence of many genomic disorders has not been estimated accurately. A subset of genomic disorders similarly characterized by CNVs between LCRs have been studied epidemiologically, including Williams-Beuren syndrome (7q11.23), Smith-Magenis syndrome (17p11.2), Velocardiofacial syndrome (22q11.21), Prader-Willi/Angelman syndromes (15q11.2q12), 17q12 deletion syndrome, and Charcot-Marie-Tooth Neuropathy type 1 (CMT1A)/hereditary neuropathy with liability to pressure palsy (HNPP) (PMP22, 17q11.2). We have generated a method to estimate prevalence of highly penetrant genomic disorders by (1) leveraging epidemiological data for genomic disorders with previously reported prevalence estimates, (2) obtaining chromosomal microarray data on genomic disorders from a large medical genetics clinic; and (3) utilizing these in a linear regression model to determine the prevalence of this syndromic copy number change among the general population. Using our algorithm, the prevalence for five clinically relevant recurrent genomic disorders: 1q21.1 microdeletion (1/6,882 live births) and microduplication syndromes (1/6,309), 15q13.3 microdeletion syndrome (1/5,525), and 16p11.2 microdeletion (1/3,021) and microduplication syndromes (1/4,216), were determined. These findings will inform epidemiological strategies for evaluating those conditions, and our method may be useful to evaluate the prevalence of other highly penetrant genomic disorders.
Collapse
|
50
|
Small de novo CNVs as biomarkers of parental exposure to low doses of ionizing radiation of caesium-137. Sci Rep 2018; 8:5914. [PMID: 29651024 PMCID: PMC5897394 DOI: 10.1038/s41598-018-23813-5] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2017] [Accepted: 03/20/2018] [Indexed: 12/21/2022] Open
Abstract
The radiological accident in Goiania in 1987 caused a trail of human contamination, animal, plant and environmental by a radionuclide. Exposure to ionizing radiation results in different types of DNA lesions. The mutagenic effects of ionizing radiation on the germline are special concern because they can endures for several generations, leading to an increase in the rate of mutations in children of irradiated parents. Thus, to evaluate the biological mechanisms of ionizing radiation in somatic and germline cells, with consequent determination of the rate mutations, is extremely important for the estimation of genetic risks. Recently it was established that Chromosomal Microarray Analysis is an important tool for detecting wide spectra of gains or losses in the human genome. Here we present the results of the effect of accidental exposure to low doses of ionizing radiation on the formation of CNVs in the progeny of a human population accidentally exposed to Caesium-137 during the radiological accident in Goiânia, Brazil.
Collapse
|