1
|
Krishna Murthy SB, Yang S, Bheda S, Tomar N, Li H, Yaghoobi A, Khan A, Kiryluk K, Motelow JE, Ren N, Gharavi AG, Milo Rasouly H. Assisting the analysis of insertions and deletions using regional allele frequencies. Funct Integr Genomics 2024; 24:104. [PMID: 38764005 PMCID: PMC11414712 DOI: 10.1007/s10142-024-01358-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2024] [Revised: 04/02/2024] [Accepted: 04/12/2024] [Indexed: 05/21/2024]
Abstract
Accurate estimation of population allele frequency (AF) is crucial for gene discovery and genetic diagnostics. However, determining AF for frameshift-inducing small insertions and deletions (indels) faces challenges due to discrepancies in mapping and variant calling methods. Here, we propose an innovative approach to assess indel AF. We developed CRAFTS-indels (Calculating Regional Allele Frequency Targeting Small indels), an algorithm that combines AF of distinct indels within a given region and provides "regional AF" (rAF). We tested and validated CRAFTS-indels using three independent datasets: gnomAD v2 (n=125,748 samples), an internal dataset (IGM; n=39,367), and the UK BioBank (UKBB; n=469,835). By comparing rAF against standard AF, we identified rare indels with rAF exceeding standard AF (sAF≤10-4 and rAF>10-4) as "rAF-hi" indels. Notably, a high percentage of rare indels were "rAF-hi", with a higher proportion in gnomAD v2 (11-20%) and IGM (11-22%) compared to the UKBB (5-9% depending on the CRAFTS-indels' parameters). Analysis of the overlap of regions based on their rAF with low complexity regions and with ClinVar classification supported the pertinence of rAF. Using the internal dataset, we illustrated the utility of CRAFTS-indel in the analysis of de novo variants and the potential negative impact of rAF-hi indels in gene discovery. In summary, annotation of indels with cohort specific rAF can be used to handle some of the limitations of current annotation pipelines and facilitate detection of novel gene disease associations. CRAFTS-indels offers a user-friendly approach to providing rAF annotation. It can be integrated into public databases such as gnomAD, UKBB and used by ClinVar to revise indel classifications.
Collapse
Affiliation(s)
- Sarath Babu Krishna Murthy
- Center for Precision Genetics and Genomics, Department of Medicine, Columbia University, New York, NY, USA
| | - Sandy Yang
- Center for Precision Genetics and Genomics, Department of Medicine, Columbia University, New York, NY, USA
| | - Shiraz Bheda
- Center for Precision Genetics and Genomics, Department of Medicine, Columbia University, New York, NY, USA
| | - Nikita Tomar
- Center for Precision Genetics and Genomics, Department of Medicine, Columbia University, New York, NY, USA
| | - Haiyue Li
- Center for Precision Genetics and Genomics, Department of Medicine, Columbia University, New York, NY, USA
| | - Amir Yaghoobi
- Center for Precision Genetics and Genomics, Department of Medicine, Columbia University, New York, NY, USA
| | - Atlas Khan
- Division of Nephrology, Department of Medicine, Columbia University, New York, NY, USA
| | - Krzysztof Kiryluk
- Division of Nephrology, Department of Medicine, Columbia University, New York, NY, USA
| | - Joshua E Motelow
- Division of Critical Care and Hospital Medicine, Department of Pediatrics, Columbia University Irving Medical Center, New York-Presbyterian Morgan Stanley Children's Hospital, New York, New York, USA
| | - Nick Ren
- Institute for Genomic Medicine, Columbia University, New York, NY, USA
| | - Ali G Gharavi
- Center for Precision Genetics and Genomics, Department of Medicine, Columbia University, New York, NY, USA
- Division of Nephrology, Department of Medicine, Columbia University, New York, NY, USA
| | - Hila Milo Rasouly
- Center for Precision Genetics and Genomics, Department of Medicine, Columbia University, New York, NY, USA.
- Division of Nephrology, Department of Medicine, Columbia University, New York, NY, USA.
| |
Collapse
|
2
|
Bertrand A, Ba I, Kermasson L, Pirabakaran V, Chable N, Lainey E, Ménard C, Kallel F, Picard C, Hadiji S, Coolen-Allou N, Blanchard E, de Villartay JP, Moshous D, Roelens M, Callebaut I, Kannengiesser C, Revy P. Characterization of novel mutations in the TEL-patch domain of the telomeric factor TPP1 associated with telomere biology disorders. Hum Mol Genet 2024; 33:612-623. [PMID: 38176734 DOI: 10.1093/hmg/ddad210] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2023] [Revised: 12/04/2023] [Accepted: 12/13/2023] [Indexed: 01/06/2024] Open
Abstract
Telomeres are nucleoprotein structures that protect the chromosome ends from degradation and fusion. Telomerase is a ribonucleoprotein complex essential to maintain the length of telomeres. Germline defects that lead to short and/or dysfunctional telomeres cause telomere biology disorders (TBDs), a group of rare and heterogeneous Mendelian diseases including pulmonary fibrosis, dyskeratosis congenita, and Høyeraal-Hreidarsson syndrome. TPP1, a telomeric factor encoded by the gene ACD, recruits telomerase at telomere and stimulates its activity via its TEL-patch domain that directly interacts with TERT, the catalytic subunit of telomerase. TBDs due to TPP1 deficiency have been reported only in 11 individuals. We here report four unrelated individuals with a wide spectrum of TBD manifestations carrying either heterozygous or homozygous ACD variants consisting in the recurrent and previously described in-frame deletion of K170 (K170∆) and three novel missense mutations G179D, L184R, and E215V. Structural and functional analyses demonstrated that the four variants affect the TEL-patch domain of TPP1 and impair telomerase activity. In addition, we identified in the ACD gene several motifs associated with small deletion hotspots that could explain the recurrence of the K170∆ mutation. Finally, we detected in a subset of blood cells from one patient, a somatic TERT promoter-activating mutation that likely provides a selective advantage over non-modified cells, a phenomenon known as indirect somatic genetic rescue. Together, our results broaden the genetic and clinical spectrum of TPP1 deficiency and specify new residues in the TEL-patch domain that are crucial for length maintenance and stability of human telomeres in vivo.
Collapse
Affiliation(s)
- Alexis Bertrand
- Laboratory of Genome Dynamics in the Immune System, Equipe Labellisée Ligue contre le Cancer, INSERM UMR 1163, Imagine Institute, 24 boulevard du Montparnasse, Paris 75015, France
- Université Paris Cité, Imagine Institute, Paris 75015, France
| | - Ibrahima Ba
- Laboratory of Genome Dynamics in the Immune System, Equipe Labellisée Ligue contre le Cancer, INSERM UMR 1163, Imagine Institute, 24 boulevard du Montparnasse, Paris 75015, France
- Université Paris Cité, Imagine Institute, Paris 75015, France
- Assistance Publique des Hôpitaux de Paris, Hôpital Bichat, Service de Génétique, Université Paris Diderot, Paris 75018, France
| | - Laëtitia Kermasson
- Laboratory of Genome Dynamics in the Immune System, Equipe Labellisée Ligue contre le Cancer, INSERM UMR 1163, Imagine Institute, 24 boulevard du Montparnasse, Paris 75015, France
- Université Paris Cité, Imagine Institute, Paris 75015, France
| | - Vithura Pirabakaran
- Laboratory of Genome Dynamics in the Immune System, Equipe Labellisée Ligue contre le Cancer, INSERM UMR 1163, Imagine Institute, 24 boulevard du Montparnasse, Paris 75015, France
- Université Paris Cité, Imagine Institute, Paris 75015, France
| | - Noémie Chable
- Laboratory of Genome Dynamics in the Immune System, Equipe Labellisée Ligue contre le Cancer, INSERM UMR 1163, Imagine Institute, 24 boulevard du Montparnasse, Paris 75015, France
- Université Paris Cité, Imagine Institute, Paris 75015, France
| | - Elodie Lainey
- Hematology Laboratory, Robert Debré Hospital-AssistancePublique-Hôpitaux de Paris (APHP), INSERM UMR 1131-Hematology University Institute-Denis Diderot School of Medicine, Paris 75019, France
| | - Christelle Ménard
- Assistance Publique des Hôpitaux de Paris, Hôpital Bichat, Service de Génétique, Université Paris Diderot, Paris 75018, France
| | - Faten Kallel
- Hematology Department, Hedi Chaker Hospital, 3029, Sfax, Tunisia
| | - Capucine Picard
- Université Paris Cité, Imagine Institute, Paris 75015, France
- Department of Pediatric Immunology, Hematology and Rheumatology, Necker-Enfants Malades Hospital, (APHP), Paris 75015, France
- Centre de références des déficits immunitaires Héréditaires (CEREDIH), Necker-Enfants Malades Hospital APHP, Paris 75015, France
- Laboratory of Lymphocyte Activation and Susceptibility to EBV infection, Inserm UMR 1163, Institut Imagine, Paris 75015, France
| | - Sondes Hadiji
- Hematology Department, Hedi Chaker Hospital, 3029, Sfax, Tunisia
| | - Nathalie Coolen-Allou
- Service de Pneumologie, Hôpital Félix Guyon, CHU Réunion, Saint-Denis de la Réunion 97400, France
| | - Elodie Blanchard
- Service de Pneumologie, Hôpital Haut-Lévêque, CHU Bordeaux, Bordeaux 33604, France
| | - Jean-Pierre de Villartay
- Laboratory of Genome Dynamics in the Immune System, Equipe Labellisée Ligue contre le Cancer, INSERM UMR 1163, Imagine Institute, 24 boulevard du Montparnasse, Paris 75015, France
- Université Paris Cité, Imagine Institute, Paris 75015, France
| | - Despina Moshous
- Laboratory of Genome Dynamics in the Immune System, Equipe Labellisée Ligue contre le Cancer, INSERM UMR 1163, Imagine Institute, 24 boulevard du Montparnasse, Paris 75015, France
- Université Paris Cité, Imagine Institute, Paris 75015, France
- Department of Pediatric Immunology, Hematology and Rheumatology, Necker-Enfants Malades Hospital, (APHP), Paris 75015, France
| | - Marie Roelens
- Université Paris Cité, Imagine Institute, Paris 75015, France
- Centre de références des déficits immunitaires Héréditaires (CEREDIH), Necker-Enfants Malades Hospital APHP, Paris 75015, France
| | - Isabelle Callebaut
- Sorbonne Université, Muséum National d'Histoire Naturelle, UMR CNRS 7590, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, Paris 75005, France
| | - Caroline Kannengiesser
- Assistance Publique des Hôpitaux de Paris, Hôpital Bichat, Service de Génétique, Université Paris Diderot, Paris 75018, France
| | - Patrick Revy
- Laboratory of Genome Dynamics in the Immune System, Equipe Labellisée Ligue contre le Cancer, INSERM UMR 1163, Imagine Institute, 24 boulevard du Montparnasse, Paris 75015, France
- Université Paris Cité, Imagine Institute, Paris 75015, France
| |
Collapse
|
3
|
Rutkowska L, Sałacińska K, Salachna D, Matusik P, Pinkier I, Kępczyński Ł, Piotrowicz M, Starostecka E, Lewiński A, Gach A. Identification of New Genetic Determinants in Pediatric Patients with Familial Hypercholesterolemia Using a Custom NGS Panel. Genes (Basel) 2022; 13:genes13060999. [PMID: 35741760 PMCID: PMC9223034 DOI: 10.3390/genes13060999] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2022] [Revised: 05/30/2022] [Accepted: 05/30/2022] [Indexed: 02/04/2023] Open
Abstract
The most common form of inherited lipid disorders is familial hypercholesterolemia (FH). It is characterized primarily by high concentrations of the clinical triad of low-density lipoprotein cholesterol, tendon xanthomas and premature CVD. The well-known genetic background are mutations in LDLR, APOB and PCSK9 gene. Causative mutations can be found in 60−80% of definite FH patients and 20−30% of those with possible FH. Their occurrence could be attributed to the activity of minor candidate genes, whose causal mechanism has not been fully discovered. The aim of the conducted study was to identify disease-causing mutations in FH-related and candidate genes in pediatric patients from Poland using next generation sequencing (NGS). An NGS custom panel was designed to cover 21 causative and candidate genes linked to primary dyslipidemia. Recruitment was performed using Simon Broome diagnostic criteria. Targeted next generation sequencing was performed on a MiniSeq sequencer (Illumina, San Diego, CA, USA) using a 2 × 150 bp paired-end read module. Sequencing data analysis revealed pathogenic and possibly pathogenic variants in 33 out of 57 studied children. The affected genes were LDLR, APOB, ABCG5 and LPL. A novel pathogenic 7bp frameshift deletion c.373_379delCAGTTCG in the exon 4 of the LDLR gene was found. Our findings are the first to identify the c.373_379delCAGTTCG mutation in the LDLR gene. Furthermore, the double heterozygous carrier of frameshift insertion c.2416dupG in the LDLR gene and missense variant c.10708C>T in the APOB gene was identified. The c.2416dupG variant was defined as pathogenic, as confirmed by its cosegregation with hypercholesterolemia in the proband’s family. Although the APOB c.10708C>T variant was previously detected in hypercholesterolemic patients, our data seem to demonstrate no clinical impact. Two missense variants in the LPL gene associated with elevated triglyceride plasma level (c.106G>A and c.953A>G) were also identified. The custom NGS panel proved to be an effective research tool for identifying new causative aberrations in a genetically heterogeneous disease as familial hypercholesterolemia (FH). Our findings expand the spectrum of variants associated with the FH loci and will be of value in genetic counseling among patients with the disease.
Collapse
Affiliation(s)
- Lena Rutkowska
- Department of Genetics, Polish Mother’s Memorial Hospital—Research Institute, 93-338 Lodz, Poland; (K.S.); (D.S.); (I.P.); (Ł.K.); (M.P.)
- Correspondence: (L.R.); (A.L.); (A.G.)
| | - Kinga Sałacińska
- Department of Genetics, Polish Mother’s Memorial Hospital—Research Institute, 93-338 Lodz, Poland; (K.S.); (D.S.); (I.P.); (Ł.K.); (M.P.)
| | - Dominik Salachna
- Department of Genetics, Polish Mother’s Memorial Hospital—Research Institute, 93-338 Lodz, Poland; (K.S.); (D.S.); (I.P.); (Ł.K.); (M.P.)
| | - Paweł Matusik
- Department of Pediatrics, Pediatric Obesity and Metabolic Bone Diseases, Faculty of Medical Sciences in Katowice, Medical University of Silesia, 40-055 Katowice, Poland;
| | - Iwona Pinkier
- Department of Genetics, Polish Mother’s Memorial Hospital—Research Institute, 93-338 Lodz, Poland; (K.S.); (D.S.); (I.P.); (Ł.K.); (M.P.)
| | - Łukasz Kępczyński
- Department of Genetics, Polish Mother’s Memorial Hospital—Research Institute, 93-338 Lodz, Poland; (K.S.); (D.S.); (I.P.); (Ł.K.); (M.P.)
| | - Małgorzata Piotrowicz
- Department of Genetics, Polish Mother’s Memorial Hospital—Research Institute, 93-338 Lodz, Poland; (K.S.); (D.S.); (I.P.); (Ł.K.); (M.P.)
| | - Ewa Starostecka
- Department of Endocrinology and Metabolic Disease, Polish Mother’s Memorial Hospital—Research Institute, 93-338 Lodz, Poland;
| | - Andrzej Lewiński
- Department of Endocrinology and Metabolic Disease, Polish Mother’s Memorial Hospital—Research Institute, 93-338 Lodz, Poland;
- Department of Endocrinology and Metabolic Diseases, Medical University of Lodz, 90-419 Lodz, Poland
- Correspondence: (L.R.); (A.L.); (A.G.)
| | - Agnieszka Gach
- Department of Genetics, Polish Mother’s Memorial Hospital—Research Institute, 93-338 Lodz, Poland; (K.S.); (D.S.); (I.P.); (Ł.K.); (M.P.)
- Correspondence: (L.R.); (A.L.); (A.G.)
| |
Collapse
|
4
|
Qi M, Stenson PD, Ball EV, Tainer JA, Bacolla A, Kehrer-Sawatzki H, Cooper DN, Zhao H. Distinct sequence features underlie microdeletions and gross deletions in the human genome. Hum Mutat 2021; 43:328-346. [PMID: 34918412 PMCID: PMC9069542 DOI: 10.1002/humu.24314] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2021] [Revised: 11/02/2021] [Accepted: 12/14/2021] [Indexed: 11/18/2022]
Abstract
Microdeletions and gross deletions are important causes (~20%) of human inherited disease and their genomic locations are strongly influenced by the local DNA sequence environment. This notwithstanding, no study has systematically examined their underlying generative mechanisms. Here, we obtained 42,098 pathogenic microdeletions and gross deletions from the Human Gene Mutation Database (HGMD) that together form a continuum of germline deletions ranging in size from 1 to 28,394,429 bp. We analyzed the DNA sequence within 1 kb of the breakpoint junctions and found that the frequencies of non‐B DNA‐forming repeats, GC‐content, and the presence of seven of 78 specific sequence motifs in the vicinity of pathogenic deletions correlated with deletion length for deletions of length ≤30 bp. Further, we found that the presence of DR, GQ, and STR repeats is important for the formation of longer deletions (>30 bp) but not for the formation of shorter deletions (≤30 bp) while significantly (χ2, p < 2E−16) more microhomologies were identified flanking short deletions than long deletions (length >30 bp). We provide evidence to support a functional distinction between microdeletions and gross deletions. Finally, we propose that a deletion length cut‐off of 25–30 bp may serve as an objective means to functionally distinguish microdeletions from gross deletions.
Collapse
Affiliation(s)
- Mengling Qi
- Department of Medical Research Center, Sun Yat-sen Memorial Hospital; Guangdong Provincial Key Laboratory of Malignant Tumor Epigenetics and Gene Regulation, Guangzhou, China
| | - Peter D Stenson
- Institute of Medical Genetics, School of Medicine, Cardiff University, Heath Park, Cardiff, CF14 4XN, UK
| | - Edward V Ball
- Institute of Medical Genetics, School of Medicine, Cardiff University, Heath Park, Cardiff, CF14 4XN, UK
| | - John A Tainer
- Departments of Cancer Biology and of Molecular and Cellular Oncology, University of Texas MD Anderson Cancer Center, Houston, TX, 77030, USA
| | - Albino Bacolla
- Departments of Cancer Biology and of Molecular and Cellular Oncology, University of Texas MD Anderson Cancer Center, Houston, TX, 77030, USA
| | | | - David N Cooper
- Institute of Medical Genetics, School of Medicine, Cardiff University, Heath Park, Cardiff, CF14 4XN, UK
| | - Huiying Zhao
- Department of Medical Research Center, Sun Yat-sen Memorial Hospital; Guangdong Provincial Key Laboratory of Malignant Tumor Epigenetics and Gene Regulation, Guangzhou, China
| |
Collapse
|
5
|
Kley RA, Leber Y, Schrank B, Zhuge H, Orfanos Z, Kostan J, Onipe A, Sellung D, Güttsches AK, Eggers B, Jacobsen F, Kress W, Marcus K, Djinovic-Carugo K, van der Ven PFM, Fürst DO, Vorgerd M. FLNC-Associated Myofibrillar Myopathy: New Clinical, Functional, and Proteomic Data. NEUROLOGY-GENETICS 2021; 7:e590. [PMID: 34235269 PMCID: PMC8237399 DOI: 10.1212/nxg.0000000000000590] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/23/2020] [Accepted: 12/28/2020] [Indexed: 11/15/2022]
Abstract
Objective To determine whether a new indel mutation in the dimerization domain of filamin C (FLNc) causes a hereditary myopathy with protein aggregation in muscle fibers, we clinically and molecularly studied a German family with autosomal dominant myofibrillar myopathy (MFM). Methods We performed mutational analysis in 3 generations, muscle histopathology, and proteomic studies of IM protein aggregates. Functional consequences of the FLNC mutation were investigated with interaction and transfection studies and biophysics molecular analysis. Results Eight patients revealed clinical features of slowly progressive proximal weakness associated with a heterozygous c.8025_8030delCAAGACinsA (p.K2676Pfs*3) mutation in FLNC. Two patients exhibited a mild cardiomyopathy. MRI of skeletal muscle revealed lipomatous changes typical for MFM with FLNC mutations. Muscle biopsies showed characteristic MFM findings with protein aggregation and lesion formation. The proteomic profile of aggregates was specific for MFM-filaminopathy and indicated activation of the ubiquitin-proteasome system (UPS) and autophagic pathways. Functional studies revealed that mutant FLNc is misfolded, unstable, and incapable of forming homodimers and heterodimers with wild-type FLNc. Conclusions This new MFM-filaminopathy family confirms that expression of mutant FLNC leads to an adult-onset muscle phenotype with intracellular protein accumulation. Mutant FLNc protein is biochemically compromised and leads to dysregulation of protein quality control mechanisms. Proteomic analysis of MFM protein aggregates is a potent method to identify disease-relevant proteins, differentiate MFM subtypes, evaluate the relevance of gene variants, and identify novel MFM candidate genes.
Collapse
Affiliation(s)
- Rudolf Andre Kley
- Department of Neurology (R.A.K., H.Z., D.S., A.K.G., F.J., M.V.), Heimer Institute for Muscle Research, University Hospital Bergmannsheil, Ruhr-University Bochum, Bochum, Germany; Department of Neurology and Clinical Neurophysiology (R.A.K.), St. Marien-Hospital Borken, Borken, Germany; Department of Molecular Cell Biology (Y.L., Z.O., P.F.M.V., D.O.F.), Institute for Cell Biology, University of Bonn, Bonn, Germany; Department of Neurology (B.S.), DKD HELIOS Klinik Wiesbaden, Wiesbaden, Germany; Department of Structural and Computational Biology (J.K., A.O., K.D.-C.), Max Perutz Laboratories, University of Vienna, Vienna, Austria; Medizinisches Proteom-Center (B.E., K.M.), Ruhr-University Bochum, Bochum, Germany; Institute of Human Genetics (W.K.), University of Würzburg, Würzburg, Germany; and Department of Biochemistry (K.D.-C.), Faculty of Chemistry and Chemical Technology, University of Ljubljana, Ljubljana, Slovenia
| | - Yvonne Leber
- Department of Neurology (R.A.K., H.Z., D.S., A.K.G., F.J., M.V.), Heimer Institute for Muscle Research, University Hospital Bergmannsheil, Ruhr-University Bochum, Bochum, Germany; Department of Neurology and Clinical Neurophysiology (R.A.K.), St. Marien-Hospital Borken, Borken, Germany; Department of Molecular Cell Biology (Y.L., Z.O., P.F.M.V., D.O.F.), Institute for Cell Biology, University of Bonn, Bonn, Germany; Department of Neurology (B.S.), DKD HELIOS Klinik Wiesbaden, Wiesbaden, Germany; Department of Structural and Computational Biology (J.K., A.O., K.D.-C.), Max Perutz Laboratories, University of Vienna, Vienna, Austria; Medizinisches Proteom-Center (B.E., K.M.), Ruhr-University Bochum, Bochum, Germany; Institute of Human Genetics (W.K.), University of Würzburg, Würzburg, Germany; and Department of Biochemistry (K.D.-C.), Faculty of Chemistry and Chemical Technology, University of Ljubljana, Ljubljana, Slovenia
| | - Bertold Schrank
- Department of Neurology (R.A.K., H.Z., D.S., A.K.G., F.J., M.V.), Heimer Institute for Muscle Research, University Hospital Bergmannsheil, Ruhr-University Bochum, Bochum, Germany; Department of Neurology and Clinical Neurophysiology (R.A.K.), St. Marien-Hospital Borken, Borken, Germany; Department of Molecular Cell Biology (Y.L., Z.O., P.F.M.V., D.O.F.), Institute for Cell Biology, University of Bonn, Bonn, Germany; Department of Neurology (B.S.), DKD HELIOS Klinik Wiesbaden, Wiesbaden, Germany; Department of Structural and Computational Biology (J.K., A.O., K.D.-C.), Max Perutz Laboratories, University of Vienna, Vienna, Austria; Medizinisches Proteom-Center (B.E., K.M.), Ruhr-University Bochum, Bochum, Germany; Institute of Human Genetics (W.K.), University of Würzburg, Würzburg, Germany; and Department of Biochemistry (K.D.-C.), Faculty of Chemistry and Chemical Technology, University of Ljubljana, Ljubljana, Slovenia
| | - Heidi Zhuge
- Department of Neurology (R.A.K., H.Z., D.S., A.K.G., F.J., M.V.), Heimer Institute for Muscle Research, University Hospital Bergmannsheil, Ruhr-University Bochum, Bochum, Germany; Department of Neurology and Clinical Neurophysiology (R.A.K.), St. Marien-Hospital Borken, Borken, Germany; Department of Molecular Cell Biology (Y.L., Z.O., P.F.M.V., D.O.F.), Institute for Cell Biology, University of Bonn, Bonn, Germany; Department of Neurology (B.S.), DKD HELIOS Klinik Wiesbaden, Wiesbaden, Germany; Department of Structural and Computational Biology (J.K., A.O., K.D.-C.), Max Perutz Laboratories, University of Vienna, Vienna, Austria; Medizinisches Proteom-Center (B.E., K.M.), Ruhr-University Bochum, Bochum, Germany; Institute of Human Genetics (W.K.), University of Würzburg, Würzburg, Germany; and Department of Biochemistry (K.D.-C.), Faculty of Chemistry and Chemical Technology, University of Ljubljana, Ljubljana, Slovenia
| | - Zacharias Orfanos
- Department of Neurology (R.A.K., H.Z., D.S., A.K.G., F.J., M.V.), Heimer Institute for Muscle Research, University Hospital Bergmannsheil, Ruhr-University Bochum, Bochum, Germany; Department of Neurology and Clinical Neurophysiology (R.A.K.), St. Marien-Hospital Borken, Borken, Germany; Department of Molecular Cell Biology (Y.L., Z.O., P.F.M.V., D.O.F.), Institute for Cell Biology, University of Bonn, Bonn, Germany; Department of Neurology (B.S.), DKD HELIOS Klinik Wiesbaden, Wiesbaden, Germany; Department of Structural and Computational Biology (J.K., A.O., K.D.-C.), Max Perutz Laboratories, University of Vienna, Vienna, Austria; Medizinisches Proteom-Center (B.E., K.M.), Ruhr-University Bochum, Bochum, Germany; Institute of Human Genetics (W.K.), University of Würzburg, Würzburg, Germany; and Department of Biochemistry (K.D.-C.), Faculty of Chemistry and Chemical Technology, University of Ljubljana, Ljubljana, Slovenia
| | - Julius Kostan
- Department of Neurology (R.A.K., H.Z., D.S., A.K.G., F.J., M.V.), Heimer Institute for Muscle Research, University Hospital Bergmannsheil, Ruhr-University Bochum, Bochum, Germany; Department of Neurology and Clinical Neurophysiology (R.A.K.), St. Marien-Hospital Borken, Borken, Germany; Department of Molecular Cell Biology (Y.L., Z.O., P.F.M.V., D.O.F.), Institute for Cell Biology, University of Bonn, Bonn, Germany; Department of Neurology (B.S.), DKD HELIOS Klinik Wiesbaden, Wiesbaden, Germany; Department of Structural and Computational Biology (J.K., A.O., K.D.-C.), Max Perutz Laboratories, University of Vienna, Vienna, Austria; Medizinisches Proteom-Center (B.E., K.M.), Ruhr-University Bochum, Bochum, Germany; Institute of Human Genetics (W.K.), University of Würzburg, Würzburg, Germany; and Department of Biochemistry (K.D.-C.), Faculty of Chemistry and Chemical Technology, University of Ljubljana, Ljubljana, Slovenia
| | - Adekunle Onipe
- Department of Neurology (R.A.K., H.Z., D.S., A.K.G., F.J., M.V.), Heimer Institute for Muscle Research, University Hospital Bergmannsheil, Ruhr-University Bochum, Bochum, Germany; Department of Neurology and Clinical Neurophysiology (R.A.K.), St. Marien-Hospital Borken, Borken, Germany; Department of Molecular Cell Biology (Y.L., Z.O., P.F.M.V., D.O.F.), Institute for Cell Biology, University of Bonn, Bonn, Germany; Department of Neurology (B.S.), DKD HELIOS Klinik Wiesbaden, Wiesbaden, Germany; Department of Structural and Computational Biology (J.K., A.O., K.D.-C.), Max Perutz Laboratories, University of Vienna, Vienna, Austria; Medizinisches Proteom-Center (B.E., K.M.), Ruhr-University Bochum, Bochum, Germany; Institute of Human Genetics (W.K.), University of Würzburg, Würzburg, Germany; and Department of Biochemistry (K.D.-C.), Faculty of Chemistry and Chemical Technology, University of Ljubljana, Ljubljana, Slovenia
| | - Dominik Sellung
- Department of Neurology (R.A.K., H.Z., D.S., A.K.G., F.J., M.V.), Heimer Institute for Muscle Research, University Hospital Bergmannsheil, Ruhr-University Bochum, Bochum, Germany; Department of Neurology and Clinical Neurophysiology (R.A.K.), St. Marien-Hospital Borken, Borken, Germany; Department of Molecular Cell Biology (Y.L., Z.O., P.F.M.V., D.O.F.), Institute for Cell Biology, University of Bonn, Bonn, Germany; Department of Neurology (B.S.), DKD HELIOS Klinik Wiesbaden, Wiesbaden, Germany; Department of Structural and Computational Biology (J.K., A.O., K.D.-C.), Max Perutz Laboratories, University of Vienna, Vienna, Austria; Medizinisches Proteom-Center (B.E., K.M.), Ruhr-University Bochum, Bochum, Germany; Institute of Human Genetics (W.K.), University of Würzburg, Würzburg, Germany; and Department of Biochemistry (K.D.-C.), Faculty of Chemistry and Chemical Technology, University of Ljubljana, Ljubljana, Slovenia
| | - Anne Katrin Güttsches
- Department of Neurology (R.A.K., H.Z., D.S., A.K.G., F.J., M.V.), Heimer Institute for Muscle Research, University Hospital Bergmannsheil, Ruhr-University Bochum, Bochum, Germany; Department of Neurology and Clinical Neurophysiology (R.A.K.), St. Marien-Hospital Borken, Borken, Germany; Department of Molecular Cell Biology (Y.L., Z.O., P.F.M.V., D.O.F.), Institute for Cell Biology, University of Bonn, Bonn, Germany; Department of Neurology (B.S.), DKD HELIOS Klinik Wiesbaden, Wiesbaden, Germany; Department of Structural and Computational Biology (J.K., A.O., K.D.-C.), Max Perutz Laboratories, University of Vienna, Vienna, Austria; Medizinisches Proteom-Center (B.E., K.M.), Ruhr-University Bochum, Bochum, Germany; Institute of Human Genetics (W.K.), University of Würzburg, Würzburg, Germany; and Department of Biochemistry (K.D.-C.), Faculty of Chemistry and Chemical Technology, University of Ljubljana, Ljubljana, Slovenia
| | - Britta Eggers
- Department of Neurology (R.A.K., H.Z., D.S., A.K.G., F.J., M.V.), Heimer Institute for Muscle Research, University Hospital Bergmannsheil, Ruhr-University Bochum, Bochum, Germany; Department of Neurology and Clinical Neurophysiology (R.A.K.), St. Marien-Hospital Borken, Borken, Germany; Department of Molecular Cell Biology (Y.L., Z.O., P.F.M.V., D.O.F.), Institute for Cell Biology, University of Bonn, Bonn, Germany; Department of Neurology (B.S.), DKD HELIOS Klinik Wiesbaden, Wiesbaden, Germany; Department of Structural and Computational Biology (J.K., A.O., K.D.-C.), Max Perutz Laboratories, University of Vienna, Vienna, Austria; Medizinisches Proteom-Center (B.E., K.M.), Ruhr-University Bochum, Bochum, Germany; Institute of Human Genetics (W.K.), University of Würzburg, Würzburg, Germany; and Department of Biochemistry (K.D.-C.), Faculty of Chemistry and Chemical Technology, University of Ljubljana, Ljubljana, Slovenia
| | - Frank Jacobsen
- Department of Neurology (R.A.K., H.Z., D.S., A.K.G., F.J., M.V.), Heimer Institute for Muscle Research, University Hospital Bergmannsheil, Ruhr-University Bochum, Bochum, Germany; Department of Neurology and Clinical Neurophysiology (R.A.K.), St. Marien-Hospital Borken, Borken, Germany; Department of Molecular Cell Biology (Y.L., Z.O., P.F.M.V., D.O.F.), Institute for Cell Biology, University of Bonn, Bonn, Germany; Department of Neurology (B.S.), DKD HELIOS Klinik Wiesbaden, Wiesbaden, Germany; Department of Structural and Computational Biology (J.K., A.O., K.D.-C.), Max Perutz Laboratories, University of Vienna, Vienna, Austria; Medizinisches Proteom-Center (B.E., K.M.), Ruhr-University Bochum, Bochum, Germany; Institute of Human Genetics (W.K.), University of Würzburg, Würzburg, Germany; and Department of Biochemistry (K.D.-C.), Faculty of Chemistry and Chemical Technology, University of Ljubljana, Ljubljana, Slovenia
| | - Wolfram Kress
- Department of Neurology (R.A.K., H.Z., D.S., A.K.G., F.J., M.V.), Heimer Institute for Muscle Research, University Hospital Bergmannsheil, Ruhr-University Bochum, Bochum, Germany; Department of Neurology and Clinical Neurophysiology (R.A.K.), St. Marien-Hospital Borken, Borken, Germany; Department of Molecular Cell Biology (Y.L., Z.O., P.F.M.V., D.O.F.), Institute for Cell Biology, University of Bonn, Bonn, Germany; Department of Neurology (B.S.), DKD HELIOS Klinik Wiesbaden, Wiesbaden, Germany; Department of Structural and Computational Biology (J.K., A.O., K.D.-C.), Max Perutz Laboratories, University of Vienna, Vienna, Austria; Medizinisches Proteom-Center (B.E., K.M.), Ruhr-University Bochum, Bochum, Germany; Institute of Human Genetics (W.K.), University of Würzburg, Würzburg, Germany; and Department of Biochemistry (K.D.-C.), Faculty of Chemistry and Chemical Technology, University of Ljubljana, Ljubljana, Slovenia
| | - Katrin Marcus
- Department of Neurology (R.A.K., H.Z., D.S., A.K.G., F.J., M.V.), Heimer Institute for Muscle Research, University Hospital Bergmannsheil, Ruhr-University Bochum, Bochum, Germany; Department of Neurology and Clinical Neurophysiology (R.A.K.), St. Marien-Hospital Borken, Borken, Germany; Department of Molecular Cell Biology (Y.L., Z.O., P.F.M.V., D.O.F.), Institute for Cell Biology, University of Bonn, Bonn, Germany; Department of Neurology (B.S.), DKD HELIOS Klinik Wiesbaden, Wiesbaden, Germany; Department of Structural and Computational Biology (J.K., A.O., K.D.-C.), Max Perutz Laboratories, University of Vienna, Vienna, Austria; Medizinisches Proteom-Center (B.E., K.M.), Ruhr-University Bochum, Bochum, Germany; Institute of Human Genetics (W.K.), University of Würzburg, Würzburg, Germany; and Department of Biochemistry (K.D.-C.), Faculty of Chemistry and Chemical Technology, University of Ljubljana, Ljubljana, Slovenia
| | - Kristina Djinovic-Carugo
- Department of Neurology (R.A.K., H.Z., D.S., A.K.G., F.J., M.V.), Heimer Institute for Muscle Research, University Hospital Bergmannsheil, Ruhr-University Bochum, Bochum, Germany; Department of Neurology and Clinical Neurophysiology (R.A.K.), St. Marien-Hospital Borken, Borken, Germany; Department of Molecular Cell Biology (Y.L., Z.O., P.F.M.V., D.O.F.), Institute for Cell Biology, University of Bonn, Bonn, Germany; Department of Neurology (B.S.), DKD HELIOS Klinik Wiesbaden, Wiesbaden, Germany; Department of Structural and Computational Biology (J.K., A.O., K.D.-C.), Max Perutz Laboratories, University of Vienna, Vienna, Austria; Medizinisches Proteom-Center (B.E., K.M.), Ruhr-University Bochum, Bochum, Germany; Institute of Human Genetics (W.K.), University of Würzburg, Würzburg, Germany; and Department of Biochemistry (K.D.-C.), Faculty of Chemistry and Chemical Technology, University of Ljubljana, Ljubljana, Slovenia
| | - Peter F M van der Ven
- Department of Neurology (R.A.K., H.Z., D.S., A.K.G., F.J., M.V.), Heimer Institute for Muscle Research, University Hospital Bergmannsheil, Ruhr-University Bochum, Bochum, Germany; Department of Neurology and Clinical Neurophysiology (R.A.K.), St. Marien-Hospital Borken, Borken, Germany; Department of Molecular Cell Biology (Y.L., Z.O., P.F.M.V., D.O.F.), Institute for Cell Biology, University of Bonn, Bonn, Germany; Department of Neurology (B.S.), DKD HELIOS Klinik Wiesbaden, Wiesbaden, Germany; Department of Structural and Computational Biology (J.K., A.O., K.D.-C.), Max Perutz Laboratories, University of Vienna, Vienna, Austria; Medizinisches Proteom-Center (B.E., K.M.), Ruhr-University Bochum, Bochum, Germany; Institute of Human Genetics (W.K.), University of Würzburg, Würzburg, Germany; and Department of Biochemistry (K.D.-C.), Faculty of Chemistry and Chemical Technology, University of Ljubljana, Ljubljana, Slovenia
| | - Dieter O Fürst
- Department of Neurology (R.A.K., H.Z., D.S., A.K.G., F.J., M.V.), Heimer Institute for Muscle Research, University Hospital Bergmannsheil, Ruhr-University Bochum, Bochum, Germany; Department of Neurology and Clinical Neurophysiology (R.A.K.), St. Marien-Hospital Borken, Borken, Germany; Department of Molecular Cell Biology (Y.L., Z.O., P.F.M.V., D.O.F.), Institute for Cell Biology, University of Bonn, Bonn, Germany; Department of Neurology (B.S.), DKD HELIOS Klinik Wiesbaden, Wiesbaden, Germany; Department of Structural and Computational Biology (J.K., A.O., K.D.-C.), Max Perutz Laboratories, University of Vienna, Vienna, Austria; Medizinisches Proteom-Center (B.E., K.M.), Ruhr-University Bochum, Bochum, Germany; Institute of Human Genetics (W.K.), University of Würzburg, Würzburg, Germany; and Department of Biochemistry (K.D.-C.), Faculty of Chemistry and Chemical Technology, University of Ljubljana, Ljubljana, Slovenia
| | - Matthias Vorgerd
- Department of Neurology (R.A.K., H.Z., D.S., A.K.G., F.J., M.V.), Heimer Institute for Muscle Research, University Hospital Bergmannsheil, Ruhr-University Bochum, Bochum, Germany; Department of Neurology and Clinical Neurophysiology (R.A.K.), St. Marien-Hospital Borken, Borken, Germany; Department of Molecular Cell Biology (Y.L., Z.O., P.F.M.V., D.O.F.), Institute for Cell Biology, University of Bonn, Bonn, Germany; Department of Neurology (B.S.), DKD HELIOS Klinik Wiesbaden, Wiesbaden, Germany; Department of Structural and Computational Biology (J.K., A.O., K.D.-C.), Max Perutz Laboratories, University of Vienna, Vienna, Austria; Medizinisches Proteom-Center (B.E., K.M.), Ruhr-University Bochum, Bochum, Germany; Institute of Human Genetics (W.K.), University of Würzburg, Würzburg, Germany; and Department of Biochemistry (K.D.-C.), Faculty of Chemistry and Chemical Technology, University of Ljubljana, Ljubljana, Slovenia
| |
Collapse
|
6
|
Di Iorio V, Karali M, Melillo P, Testa F, Brunetti-Pierri R, Musacchia F, Condroyer C, Neidhardt J, Audo I, Zeitz C, Banfi S, Simonelli F. Spectrum of Disease Severity in Patients With X-Linked Retinitis Pigmentosa Due to RPGR Mutations. Invest Ophthalmol Vis Sci 2021; 61:36. [PMID: 33372982 PMCID: PMC7774109 DOI: 10.1167/iovs.61.14.36] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023] Open
Abstract
Purpose The purpose of this study was to perform a detailed longitudinal phenotyping of X-linked retinitis pigmentosa (RP) caused by mutations in the RPGR gene during a long follow-up period. Methods An Italian cohort of 48 male patients (from 31 unrelated families) with RPGR-associated RP was clinically assessed at a single center (mean follow-up = 6.5 years), including measurements of best-corrected visual acuity (BCVA), Goldmann visual field (GVF), optical coherence tomography (OCT), fundus autofluorescence (FAF), microperimetry, and full-field electroretinography (ERG). Results Patients (29.6 ± 15.2 years) showed a mean BCVA of 0.6 ± 0.7 logMAR, mostly with myopic refraction (79.2%). Thirty patients (62.5%) presented a typical RP fundus, while the remaining sine pigmento RP. Over the follow-up, BCVA significantly declined at a mean rate of 0.025 logMAR/year. Typical RP and high myopia were associated with a significantly faster decline of BCVA. Blindness was driven primarily by GVF loss. ERG responses with a rod-cone pattern of dysfunction were detectable in patients (50%) that were significantly younger and more frequently presented sine pigmento RP. Thirteen patients (27.1%) had macular abnormalities without cystoid macular edema. Patients (50%) with a perimacular hyper-FAF ring were significantly younger, had a higher BCVA and a better-preserved ellipsoid zone band than those with markedly decreased FAF. Patients harboring pathogenic variants in exons 1 to 14 showed a milder phenotype compared to those with ORF15 mutations. Conclusions Our monocentric, longitudinal retrospective study revealed a spectrum disease progression in male patients with RPGR-associated RP. Slow disease progression correlated with sine pigmento RP, absence of high myopia, and mutations in RPGR exons 1 to 14.
Collapse
Affiliation(s)
- Valentina Di Iorio
- Eye Clinic, Multidisciplinary Department of Medical, Surgical and Dental Sciences, Università degli Studi della Campania "Luigi Vanvitelli," Naples, Italy
| | - Marianthi Karali
- Eye Clinic, Multidisciplinary Department of Medical, Surgical and Dental Sciences, Università degli Studi della Campania "Luigi Vanvitelli," Naples, Italy.,Telethon Institute of Genetics and Medicine, Pozzuoli, Italy
| | - Paolo Melillo
- Eye Clinic, Multidisciplinary Department of Medical, Surgical and Dental Sciences, Università degli Studi della Campania "Luigi Vanvitelli," Naples, Italy
| | - Francesco Testa
- Eye Clinic, Multidisciplinary Department of Medical, Surgical and Dental Sciences, Università degli Studi della Campania "Luigi Vanvitelli," Naples, Italy
| | - Raffaella Brunetti-Pierri
- Eye Clinic, Multidisciplinary Department of Medical, Surgical and Dental Sciences, Università degli Studi della Campania "Luigi Vanvitelli," Naples, Italy
| | | | | | - John Neidhardt
- Human Genetics, Faculty of Medicine and Health Sciences, University of Oldenburg, Oldenburg, Germany.,Research Center Neurosensory Science, University Oldenburg, Oldenburg, Germany
| | - Isabelle Audo
- Sorbonne Université, INSERM, CNRS, Institut de la Vision, Paris, France.,CHNO des Quinze-Vingts, DHU Sight Restore, INSERM-DGOS CIC, France.,Institute of Ophthalmology, University College of London, London, United Kingdom
| | - Christina Zeitz
- Sorbonne Université, INSERM, CNRS, Institut de la Vision, Paris, France
| | - Sandro Banfi
- Telethon Institute of Genetics and Medicine, Pozzuoli, Italy.,Medical Genetics, Department of Precision Medicine, Università degli Studi della Campania "Luigi Vanvitelli," Naples, Italy
| | - Francesca Simonelli
- Eye Clinic, Multidisciplinary Department of Medical, Surgical and Dental Sciences, Università degli Studi della Campania "Luigi Vanvitelli," Naples, Italy
| |
Collapse
|
7
|
Ding Y, Xue H, Ding X, Zhao Y, Zhao Z, Wang D, Wu J. On the complexity measures of mutation hotspots in human TP53 protein. CHAOS (WOODBURY, N.Y.) 2020; 30:073118. [PMID: 32752620 DOI: 10.1063/1.5143584] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/27/2019] [Accepted: 06/15/2020] [Indexed: 06/11/2023]
Abstract
The role of sequence complexity in 23 051 somatic missense mutations including 73 well-known mutation hotspots across 22 major cancers was studied in human TP53 proteins. A role for sequence complexity in TP53 protein mutations is suggested since (i) the mutation rate significantly increases in low amino acid pair bias complexity; (ii) probability distribution complexity increases following single point substitution mutations and strikingly increases after mutation at the mutation hotspots including six detectable hotspot mutations (R175, G245, R248, R249, R273, and R282); and (iii) the degree of increase in distribution complexity is significantly correlated with the frequency of missense mutations (r = -0.5758, P < 0.0001) across 20 major types of solid tumors. These results are consistent with the hypothesis that amino acid pair bias and distribution probability may be used as novel measures for protein sequence complexity, and the degree of complexity is related to its susceptibility to mutation, as such, it may be used as a predictor for modeling protein mutations in human cancers.
Collapse
Affiliation(s)
- Yan Ding
- Department of Pediatrics, Boston Children's Hospital, Harvard Medical School, Boston, Massachusetts 02115, USA
| | - Hongsheng Xue
- Institute for Translational Medicine, The Affiliated Zhongshan Hospital of Dalian University, Dalian 116001, China
| | - Xinjia Ding
- Department of Urology, The Second Affiliated Hospital of Dalian Medical University, Dalian 116023, China
| | - Yuqing Zhao
- Department of Urology, The Second Affiliated Hospital of Dalian Medical University, Dalian 116023, China
| | - Zhilong Zhao
- Institute for Translational Medicine, The Affiliated Zhongshan Hospital of Dalian University, Dalian 116001, China
| | - Dazhi Wang
- Department of Pediatrics, Boston Children's Hospital, Harvard Medical School, Boston, Massachusetts 02115, USA
| | - Jianlin Wu
- Institute for Translational Medicine, The Affiliated Zhongshan Hospital of Dalian University, Dalian 116001, China
| |
Collapse
|
8
|
Reyna-Fabián ME, Hernández-Martínez NL, Alcántara-Ortigoza MA, Ayala-Sumuano JT, Enríquez-Flores S, Velázquez-Aragón JA, Varela-Echavarría A, Todd-Quiñones CG, González-Del Angel A. First comprehensive TSC1/TSC2 mutational analysis in Mexican patients with Tuberous Sclerosis Complex reveals numerous novel pathogenic variants. Sci Rep 2020; 10:6589. [PMID: 32313033 PMCID: PMC7170856 DOI: 10.1038/s41598-020-62759-5] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2019] [Accepted: 03/11/2020] [Indexed: 12/12/2022] Open
Abstract
The aim of this study was to improve knowledge of the mutational spectrum causing tuberous sclerosis complex (TSC) in a sample of Mexican patients, given the limited information available regarding this disease in Mexico and Latin America. Four different molecular techniques were implemented to identify from single nucleotide variants to large rearrangements in the TSC1 and TSC2 genes of 66 unrelated Mexican-descent patients that clinically fulfilled the criteria for a definitive TSC diagnosis. The mutation detection rate was 94%, TSC2 pathogenic variants (PV) prevailed over TSC1 PV (77% vs. 23%) and a recurrent mutation site (hotspot) was observed in TSC1 exon 15. Interestingly, 40% of the identified mutations had not been previously reported. The wide range of novels PV made it difficult to establish any genotype-phenotype correlation, but most of the PV conditioned neurological involvement (intellectual disability and epilepsy). Our 3D protein modeling of two variants classified as likely pathogenic demonstrated that they could alter the structure and function of the hamartin (TSC1) or tuberin (TSC2) proteins. Molecular analyses of parents and first-degree affected family members of the index cases enabled us to distinguish familial (18%) from sporadic (82%) cases and to identify one case of apparent gonadal mosaicism.
Collapse
Affiliation(s)
- Miriam E Reyna-Fabián
- Laboratorio de Biología Molecular, Instituto Nacional de Pediatría, Secretaría de Salud, Ciudad de México, México
| | - Nancy L Hernández-Martínez
- Laboratorio de Biología Molecular, Instituto Nacional de Pediatría, Secretaría de Salud, Ciudad de México, México
| | - Miguel A Alcántara-Ortigoza
- Laboratorio de Biología Molecular, Instituto Nacional de Pediatría, Secretaría de Salud, Ciudad de México, México
| | | | - Sergio Enríquez-Flores
- Grupo de Investigación en Biomoléculas y Salud Infantil, Laboratorio de Errores Innatos del Metabolismo y Tamiz, Instituto Nacional de Pediatría, Ciudad de México, México
| | - José A Velázquez-Aragón
- Laboratorio de Biología Molecular, Instituto Nacional de Pediatría, Secretaría de Salud, Ciudad de México, México
| | - Alfredo Varela-Echavarría
- Departamento de Neurobiología del Desarrollo y Neurofisiología, Instituto de Neurobiología, Universidad Nacional Autónoma de México, Querétaro, México
| | - Carlos G Todd-Quiñones
- Posgrado en Biología Experimental, Universidad Autónoma Metropolitana-Iztapalapa, Ciudad de México, México
- Laboratorio de Biología Molecular, Departamento de Genética Humana, Hospital de Alta Especialidad de Veracruz, Veracruz, México
| | - Ariadna González-Del Angel
- Laboratorio de Biología Molecular, Instituto Nacional de Pediatría, Secretaría de Salud, Ciudad de México, México.
| |
Collapse
|
9
|
Karolak JA, Bacolla A, Liu Q, Lantz PE, Petty J, Trapane P, Panzer K, Totapally BR, Niu Z, Xiao R, Xie NG, Wu LR, Szafranski P, Zhang DY, Stankiewicz P. A recurrent 8 bp frameshifting indel in FOXF1 defines a novel mutation hotspot associated with alveolar capillary dysplasia with misalignment of pulmonary veins. Am J Med Genet A 2019; 179:2272-2276. [PMID: 31436901 DOI: 10.1002/ajmg.a.61338] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2019] [Revised: 08/01/2019] [Accepted: 08/05/2019] [Indexed: 12/11/2022]
Abstract
Alveolar capillary dysplasia with misalignment of pulmonary veins (ACDMPV) is a rare lethal lung developmental disease. Affected infants manifest with severe respiratory distress and refractory pulmonary hypertension and uniformly die in the first month of life. Heterozygous point mutations or copy-number variant deletions involving FOXF1 and/or its upstream lung-specific enhancer on 16q24.1 have been identified in the vast majority of ACDMPV patients. We have previously described two unrelated families with a de novo pathogenic frameshift variant c.691_698del (p.Ala231Argfs*61) in the exon 1 of FOXF1. Here, we present a third unrelated ACDMPV family with the same de novo variant and propose that a direct tandem repeat of eight consecutive nucleotides GCGGCGGC within the ~4 kb CpG island in FOXF1 exon 1 is a novel mutation hotspot causative for ACDMPV.
Collapse
Affiliation(s)
- Justyna A Karolak
- Department of Molecular & Human Genetics, Baylor College of Medicine, Houston, Texas.,Department of Genetics and Pharmaceutical Microbiology, Poznan University of Medical Sciences, Poznan, Poland
| | - Albino Bacolla
- Department of Cancer Biology, The University of Texas M.D. Anderson Cancer Center, Houston, Texas.,Department of Molecular and Cellular Oncology, The University of Texas M.D. Anderson Cancer Center, Houston, Texas
| | - Qian Liu
- Department of Molecular & Human Genetics, Baylor College of Medicine, Houston, Texas
| | - Patrick E Lantz
- Department of Pathology, Wake Forest School of Medicine, Baptist Medical Center, Winston-Salem, North Carolina
| | - John Petty
- Department of General Surgery, Wake Forest School of Medicine, Baptist Medical Center, Winston-Salem, North Carolina
| | - Pamela Trapane
- Department of Pediatrics, Division of Pediatric Genetics, University of Florida College of Medicine-Jacksonville, Jacksonville, Florida
| | - Karin Panzer
- Department of Pediatrics, University of Iowa Stead Family Children's Hospital, Iowa City, Iowa
| | - Balagangadhar R Totapally
- Department of Pediatrics, Florida International University, Nicklaus Children's Hospital, Miami, Florida
| | - Zhiyv Niu
- Department of Molecular & Human Genetics, Baylor College of Medicine, Houston, Texas
| | | | - Nina G Xie
- Department of Bioengineering, Rice University, Houston, Texas
| | - Lucia R Wu
- Department of Bioengineering, Rice University, Houston, Texas
| | - Przemyslaw Szafranski
- Department of Molecular & Human Genetics, Baylor College of Medicine, Houston, Texas
| | - David Y Zhang
- Department of Bioengineering, Rice University, Houston, Texas
| | - Paweł Stankiewicz
- Department of Molecular & Human Genetics, Baylor College of Medicine, Houston, Texas
| |
Collapse
|
10
|
Yue Z, Zhao L, Cheng N, Yan H, Xia J. dbCID: a manually curated resource for exploring the driver indels in human cancer. Brief Bioinform 2019; 20:1925-1933. [DOI: 10.1093/bib/bby059] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2018] [Revised: 05/22/2018] [Indexed: 12/12/2022] Open
Abstract
Abstract
While recent advances in next-generation sequencing technologies have enabled the creation of a multitude of databases in cancer genomic research, there is no comprehensive database focusing on the annotation of driver indels (insertions and deletions) yet. Therefore, we have developed the database of Cancer driver InDels (dbCID), which is a collection of known coding indels that likely to be engaged in cancer development, progression or therapy. dbCID contains experimentally supported and putative driver indels derived from manual curation of literature and is freely available online at http://bioinfo.ahu.edu.cn:8080/dbCID. Using the data deposited in dbCID, we summarized features of driver indels in four levels (gene, DNA, transcript and protein) through comparing with putative neutral indels. We found that most of the genes containing driver indels in dbCID are known cancer genes playing a role in tumorigenesis. Contrary to the expectation, the sequences affected by driver frameshift indels are not larger than those by neutral ones. In addition, the frameshift and inframe driver indels prefer to disrupt high-conservative regions both in DNA sequences and protein domains. Finally, we developed a computational method for discriminating cancer driver from neutral frameshift indels based on the deposited data in dbCID. The proposed method outperformed other widely used non-cancer-specific predictors on an external test set, which demonstrated the usefulness of the data deposited in dbCID. We hope dbCID will be a benchmark for improving and evaluating prediction algorithms, and the characteristics summarized here may assist with investigating the mechanism of indel–cancer association.
Collapse
Affiliation(s)
- Zhenyu Yue
- Institute of Physical Science and Information Technology, School of Computer Science and Technology, Anhui University, Hefei, Anhui, China
| | - Le Zhao
- Institute of Physical Science and Information Technology, School of Computer Science and Technology, Anhui University, Hefei, Anhui, China
| | - Na Cheng
- Institute of Physical Science and Information Technology, School of Computer Science and Technology, Anhui University, Hefei, Anhui, China
| | - Hua Yan
- School of Life Sciences, Anhui University, Hefei, Anhui, China
| | - Junfeng Xia
- Institute of Physical Science and Information Technology, School of Computer Science and Technology, Anhui University, Hefei, Anhui, China
| |
Collapse
|
11
|
Boschiero C, Moreira GCM, Gheyas AA, Godoy TF, Gasparin G, Mariani PDSC, Paduan M, Cesar ASM, Ledur MC, Coutinho LL. Genome-wide characterization of genetic variants and putative regions under selection in meat and egg-type chicken lines. BMC Genomics 2018; 19:83. [PMID: 29370772 PMCID: PMC5785814 DOI: 10.1186/s12864-018-4444-0] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2017] [Accepted: 01/10/2018] [Indexed: 12/13/2022] Open
Abstract
Background Meat and egg-type chickens have been selected for several generations for different traits. Artificial and natural selection for different phenotypes can change frequency of genetic variants, leaving particular genomic footprints throghtout the genome. Thus, the aims of this study were to sequence 28 chickens from two Brazilian lines (meat and white egg-type) and use this information to characterize genome-wide genetic variations, identify putative regions under selection using Fst method, and find putative pathways under selection. Results A total of 13.93 million SNPs and 1.36 million INDELs were identified, with more variants detected from the broiler (meat-type) line. Although most were located in non-coding regions, we identified 7255 intolerant non-synonymous SNPs, 512 stopgain/loss SNPs, 1381 frameshift and 1094 non-frameshift INDELs that may alter protein functions. Genes harboring intolerant non-synonymous SNPs affected metabolic pathways related mainly to reproduction and endocrine systems in the white-egg layer line, and lipid metabolism and metabolic diseases in the broiler line. Fst analysis in sliding windows, using SNPs and INDELs separately, identified over 300 putative regions of selection overlapping with more than 250 genes. For the first time in chicken, INDEL variants were considered for selection signature analysis, showing high level of correlation in results between SNP and INDEL data. The putative regions of selection signatures revealed interesting candidate genes and pathways related to important phenotypic traits in chicken, such as lipid metabolism, growth, reproduction, and cardiac development. Conclusions In this study, Fst method was applied to identify high confidence putative regions under selection, providing novel insights into selection footprints that can help elucidate the functional mechanisms underlying different phenotypic traits relevant to meat and egg-type chicken lines. In addition, we generated a large catalog of line-specific and common genetic variants from a Brazilian broiler and a white egg layer line that can be used for genomic studies involving association analysis with phenotypes of economic interest to the poultry industry. Electronic supplementary material The online version of this article (10.1186/s12864-018-4444-0) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Clarissa Boschiero
- Animal Biotechnology Laboratory, Animal Science Department, Luiz de Queiroz College of Agriculture (ESALQ), University of São Paulo (USP), Piracicaba, SP, 13418-900, Brazil. .,Noble Reserch Institute, 2510 Sam Noble Parkway, Ardmore, Oklahoma, 73401, USA.
| | - Gabriel Costa Monteiro Moreira
- Animal Biotechnology Laboratory, Animal Science Department, Luiz de Queiroz College of Agriculture (ESALQ), University of São Paulo (USP), Piracicaba, SP, 13418-900, Brazil
| | - Almas Ara Gheyas
- Department of Genetics and Genomics, The Roslin Institute and Royal School of Veterinary Studies, University of Edinburgh, Easter Bush Campus, Midlothian, EH25 9RG, UK
| | - Thaís Fernanda Godoy
- Animal Biotechnology Laboratory, Animal Science Department, Luiz de Queiroz College of Agriculture (ESALQ), University of São Paulo (USP), Piracicaba, SP, 13418-900, Brazil
| | - Gustavo Gasparin
- Animal Biotechnology Laboratory, Animal Science Department, Luiz de Queiroz College of Agriculture (ESALQ), University of São Paulo (USP), Piracicaba, SP, 13418-900, Brazil
| | - Pilar Drummond Sampaio Corrêa Mariani
- Animal Biotechnology Laboratory, Animal Science Department, Luiz de Queiroz College of Agriculture (ESALQ), University of São Paulo (USP), Piracicaba, SP, 13418-900, Brazil
| | - Marcela Paduan
- Animal Biotechnology Laboratory, Animal Science Department, Luiz de Queiroz College of Agriculture (ESALQ), University of São Paulo (USP), Piracicaba, SP, 13418-900, Brazil
| | - Aline Silva Mello Cesar
- Animal Biotechnology Laboratory, Animal Science Department, Luiz de Queiroz College of Agriculture (ESALQ), University of São Paulo (USP), Piracicaba, SP, 13418-900, Brazil
| | | | - Luiz Lehmann Coutinho
- Animal Biotechnology Laboratory, Animal Science Department, Luiz de Queiroz College of Agriculture (ESALQ), University of São Paulo (USP), Piracicaba, SP, 13418-900, Brazil
| |
Collapse
|
12
|
Sadeghipour F, Basiratnia M, Derakhshan A, Fardaei M. Mutation analysis of the CTNS gene in Iranian patients with infantile nephropathic cystinosis: identification of two novel mutations. Hum Genome Var 2017; 4:17038. [PMID: 28983406 PMCID: PMC5628181 DOI: 10.1038/hgv.2017.38] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2017] [Revised: 07/07/2017] [Accepted: 07/09/2017] [Indexed: 12/02/2022] Open
Abstract
Nephropathic cystinosis is an inherited lysosomal transport disorder caused by mutations in the CTNS gene that encodes for a lysosomal membrane transporter, cystinosin. Dysfunction in this protein leads to cystine accumulation in the cells of different organs. The accumulation of cystine in the kidneys becomes apparent with renal tubular Fanconi syndrome between 6 and 12 months of age and leads to renal failure in the first decade of life. The aim of this study was to analyze the CTNS mutations in 20 Iranian patients, from 20 unrelated families, all of whom were afflicted with infantile nephropathic cystinosis. In these patients, seven different mutant alleles were found, including two new mutations, c.517T>C; p.Y173H and c.492_515del, that have not been previously reported. In addition, we observed that c.681G>A, the common Middle Eastern mutation, was the most common mutation in our patients. Moreover, a new minisatellite or variable number of tandem repeat marker (KX499495) was identified at the CTNS gene. Seven different alleles were found for this marker, and its allele frequency and heterozygosity degree were calculated in cystinosis patients and healthy individuals.
Collapse
Affiliation(s)
- Forough Sadeghipour
- Department of Medical Genetics, Shiraz University of Medical Sciences, Shiraz, Iran
| | - Mitra Basiratnia
- Department of Pediatric Nephrology, Namazi Hospital, Shiraz University of Medical Sciences, Shiraz, Iran
| | - Ali Derakhshan
- Department of Pediatric Nephrology, Namazi Hospital, Shiraz University of Medical Sciences, Shiraz, Iran
| | - Majid Fardaei
- Department of Medical Genetics, Shiraz University of Medical Sciences, Shiraz, Iran.,Transgenic Technology Research Center, Shiraz University of Medical Sciences, Shiraz, Iran
| |
Collapse
|
13
|
Okada A, Kohmoto T, Naruto T, Yokota I, Kotani Y, Shimada A, Miyamoto Y, Takahashi R, Goji A, Masuda K, Kagami S, Imoto I. The first Japanese patient with mandibular hypoplasia, deafness, progeroid features and lipodystrophy diagnosed via POLD1 mutation detection. Hum Genome Var 2017; 4:17031. [PMID: 28791128 PMCID: PMC5540733 DOI: 10.1038/hgv.2017.31] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2017] [Revised: 06/19/2017] [Accepted: 06/19/2017] [Indexed: 02/07/2023] Open
Abstract
Mandibular hypoplasia, deafness, progeroid features and lipodystrophy (MDPL) syndrome is a rare autosomal dominant disorder caused by heterozygous POLD1 mutations. To date, 13 patients affected by POLD1 mutation-caused MDPL have been described. We report a clinically undiagnosed 11-year-old male who noted joint contractures at 6 years of age. Targeted exome sequencing identified a known POLD1 mutation [NM_002691.3:c.1812_1814del, p.(Ser605del)] that diagnosed him as the first Japanese/East Asian MDPL case.
Collapse
Affiliation(s)
- Asami Okada
- Department of Paediatrics, Graduate School of Biomedical Sciences, Tokushima University, Tokushima, Japan
| | - Tomohiro Kohmoto
- Department of Human Genetics, Graduate School of Biomedical Sciences, Tokushima University, Tokushima, Japan
| | - Takuya Naruto
- Department of Human Genetics, Graduate School of Biomedical Sciences, Tokushima University, Tokushima, Japan
| | - Ichiro Yokota
- Department of Paediatrics, Graduate School of Biomedical Sciences, Tokushima University, Tokushima, Japan.,Department of Pediatrics, Division of Pediatric Endocrinology and Metabolism, Shikoku Medical Center for Children and Adults, Zentsuji, Japan
| | - Yumiko Kotani
- Department of Paediatrics, Graduate School of Biomedical Sciences, Tokushima University, Tokushima, Japan
| | - Aki Shimada
- Department of Pediatrics, Division of Pediatric Endocrinology and Metabolism, Shikoku Medical Center for Children and Adults, Zentsuji, Japan.,Department of Otolaryngology, Graduate School of Biomedical Sciences, Tokushima University, Tokushima, Japan
| | - Yoko Miyamoto
- Department of Human Genetics, Graduate School of Biomedical Sciences, Tokushima University, Tokushima, Japan
| | - Rizu Takahashi
- Department of Human Genetics, Graduate School of Biomedical Sciences, Tokushima University, Tokushima, Japan
| | - Aya Goji
- Department of Paediatrics, Graduate School of Biomedical Sciences, Tokushima University, Tokushima, Japan
| | - Kiyoshi Masuda
- Department of Human Genetics, Graduate School of Biomedical Sciences, Tokushima University, Tokushima, Japan
| | - Shoji Kagami
- Department of Paediatrics, Graduate School of Biomedical Sciences, Tokushima University, Tokushima, Japan
| | - Issei Imoto
- Department of Human Genetics, Graduate School of Biomedical Sciences, Tokushima University, Tokushima, Japan
| |
Collapse
|
14
|
Improved Diagnosis of Inherited Retinal Dystrophies by High-Fidelity PCR of ORF15 followed by Next-Generation Sequencing. J Mol Diagn 2016; 18:817-824. [PMID: 27620828 DOI: 10.1016/j.jmoldx.2016.06.007] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2016] [Revised: 06/10/2016] [Accepted: 06/16/2016] [Indexed: 11/21/2022] Open
Abstract
Retinitis pigmentosa (RP) is the most common form of retinal dystrophy. The disease is characterized by the progressive degeneration of photoreceptors, ultimately leading to blindness. The exon ORF15 of RP GTPase regulator (RPGR) is a mutation hot spot for X-linked RP and one form of cone dystrophy. However, accurate molecular testing of ORF15 is challenging because of a large segment of highly repetitive purine-rich sequence in this exon. ORF15 performs poorly in next-generation sequencing-based panels or whole exome sequencing analysis, whereas Sanger sequencing of ORF15 requires special reagents and PCR conditions with multiple pairs of overlapping primers that often do not provide a clean sequence. Because of these technical difficulties, molecular analysis of ORF15 is performed mostly in research laboratories without validation for clinical application. Herein, we report the development of a single step of high-fidelity PCR followed by next-generation sequencing for accurate mutation detection, which is easily integrated into routine clinical practice. Our approach has improved coverage depth of ORF15 with the ability to detect single-nucleotide variants and deletions/duplications. Using this method, we were able to identify ORF15 pathogenic variants in approximately 31% of undiagnosed RP patients. Our results underline the clinical importance of complete and accurate sequence analysis of ORF15 for patients with retinal dystrophies.
Collapse
|
15
|
Hasan MS, Zhang L. SPAI: an interactive platform for indel analysis. BMC Genomics 2016; 17 Suppl 5:496. [PMID: 27585593 PMCID: PMC5009558 DOI: 10.1186/s12864-016-2824-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
BACKGROUND Insertions and Deletions (Indels) are the most common form of structural variation in human genome. Indels not only contribute to genetic diversity but also cause diseases. Therefore assessing indels in human genome has become an interesting topic to the research community. This increasing interest on indel calling research has resulted into the development of a good number of indel calling tools. However, all of these tools are command line based and require expertise from Computer Science (CS) to execute them which makes it challenging for researchers from non-CS background. METHODS In this paper, we describe an interactive platform named SPAI which stands for Single Platform for Analyzing Indels. RESULTS Being a Graphical User Interface (GUI) tool, SPAI facilitates users to run several popular indel calling tools and perform several analyses on the indel calling results without knowing any command line programming. CONCLUSIONS SPAI is written in Java and tested in Linux operating system.
Collapse
Affiliation(s)
| | - Liqing Zhang
- Department of Computer Science, Virginia Tech, Blacksburg, VA 24061 USA
| |
Collapse
|
16
|
Raimondi D, Gazzo AM, Rooman M, Lenaerts T, Vranken WF. Multilevel biological characterization of exomic variants at the protein level significantly improves the identification of their deleterious effects. Bioinformatics 2016; 32:1797-804. [DOI: 10.1093/bioinformatics/btw094] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2015] [Accepted: 02/15/2016] [Indexed: 11/14/2022] Open
|
17
|
Kamat MA, Bacolla A, Cooper DN, Chuzhanova N. A Role for Non-B DNA Forming Sequences in Mediating Microlesions Causing Human Inherited Disease. Hum Mutat 2015; 37:65-73. [PMID: 26466920 DOI: 10.1002/humu.22917] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2015] [Accepted: 09/22/2015] [Indexed: 12/25/2022]
Abstract
Missense/nonsense mutations and microdeletions/microinsertions (<21 bp) represent ∼ 76% of all mutations causing human inherited disease, and their occurrence has been associated with sequence motifs (direct, inverted, and mirror repeats; G-quartets) capable of adopting non-B DNA structures. We found that a significant proportion (∼ 21%) of both microdeletions and microinsertions occur within direct repeats, and are explicable by slipped misalignment. A novel mutational mechanism, DNA triplex formation followed by DNA repair, may explain ∼ 5% of microdeletions and microinsertions at mirror repeats. Further, G-quartets, direct, and inverted repeats also appear to play a prominent role in mediating missense mutations, whereas only direct and inverted repeats mediate nonsense mutations. We suggest a mutational mechanism involving slipped strand mispairing, slipped structure formation, and DNA repair, to explain ∼ 15% of missense and ∼ 12% of nonsense mutations yielding perfect direct repeats from imperfect repeats, or the extension of existing direct repeats. Similar proportions of missense and nonsense mutations were explicable by hairpin/loop formation and DNA repair, yielding perfect inverted repeats from imperfect repeats. We also propose a model for single base-pair substitution based on one-electron oxidation reactions at G-quadruplex DNA. Overall, the proposed mechanisms provide support for a role for non-B DNA structures in human gene mutagenesis.
Collapse
Affiliation(s)
- Mihir Anant Kamat
- School of Science and Technology, Nottingham Trent University, Clifton Lane, Nottingham, UK
| | - Albino Bacolla
- Institute of Medical Genetics, School of Medicine, Cardiff University, Heath Park, Cardiff, UK
| | - David N Cooper
- Institute of Medical Genetics, School of Medicine, Cardiff University, Heath Park, Cardiff, UK
| | - Nadia Chuzhanova
- School of Science and Technology, Nottingham Trent University, Clifton Lane, Nottingham, UK
| |
Collapse
|
18
|
Boschiero C, Gheyas AA, Ralph HK, Eory L, Paton B, Kuo R, Fulton J, Preisinger R, Kaiser P, Burt DW. Detection and characterization of small insertion and deletion genetic variants in modern layer chicken genomes. BMC Genomics 2015; 16:562. [PMID: 26227840 PMCID: PMC4563830 DOI: 10.1186/s12864-015-1711-1] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2014] [Accepted: 06/22/2015] [Indexed: 01/17/2023] Open
Abstract
BACKGROUND Small insertions and deletions (InDels) constitute the second most abundant class of genetic variants and have been found to be associated with many traits and diseases. The present study reports on the detection and characterisation of about 883 K high quality InDels from the whole-genome analysis of several modern layer chicken lines from diverse breeds. RESULTS To reduce the error rates seen in InDel detection, this study used the consensus set from two InDel-calling packages: SAMtools and Dindel, as well as stringent post-filtering criteria. By analysing sequence data from 163 chickens from 11 commercial and 5 experimental layer lines, this study detected about 883 K high quality consensus InDels with 93% validation rate and an average density of 0.78 InDels/kb over the genome. Certain chromosomes, viz, GGAZ, 16, 22 and 25 showed very low densities of InDels whereas the highest rate was observed on GGA6. In spite of the higher recombination rates on microchromosomes, the InDel density on these chromosomes was generally lower relative to macrochromosomes possibly due to their higher gene density. About 43-87% of the InDels were found to be fixed within each line. The majority of detected InDels (86%) were 1-5 bases and about 63% were non-repetitive in nature while the rest were tandem repeats of various motif types. Functional annotation identified 613 frameshift, 465 non-frameshift and 10 stop-gain/loss InDels. Apart from the frameshift and stopgain/loss InDels that are expected to affect the translation of protein sequences and their biological activity, 33% of the non-frameshift were predicted as evolutionary intolerant with potential impact on protein functions. Moreover, about 2.5% of the InDels coincided with the most-conserved elements previously mapped on the chicken genome and are likely to define functional elements. InDels potentially affecting protein function were found to be enriched for certain gene-classes e.g. those associated with cell proliferation, chromosome and Golgi organization, spermatogenesis, and muscle contraction. CONCLUSIONS The large catalogue of InDels presented in this study along with their associated information such as functional annotation, estimated allele frequency, etc. are expected to serve as a rich resource for application in future research and breeding in the chicken.
Collapse
Affiliation(s)
- Clarissa Boschiero
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush Campus, Midlothian, EH25 9RG, UK. .,Current Address: Departamento de Zootecnia, University of Sao Paulo/ESALQ, Piracicaba, SP, 13418-900, Brazil.
| | - Almas A Gheyas
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush Campus, Midlothian, EH25 9RG, UK.
| | - Hannah K Ralph
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush Campus, Midlothian, EH25 9RG, UK.
| | - Lel Eory
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush Campus, Midlothian, EH25 9RG, UK.
| | - Bob Paton
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush Campus, Midlothian, EH25 9RG, UK.
| | - Richard Kuo
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush Campus, Midlothian, EH25 9RG, UK.
| | | | | | - Pete Kaiser
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush Campus, Midlothian, EH25 9RG, UK.
| | - David W Burt
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush Campus, Midlothian, EH25 9RG, UK.
| |
Collapse
|
19
|
Magi A, D'Aurizio R, Palombo F, Cifola I, Tattini L, Semeraro R, Pippucci T, Giusti B, Romeo G, Abbate R, Gensini GF. Characterization and identification of hidden rare variants in the human genome. BMC Genomics 2015; 16:340. [PMID: 25903059 PMCID: PMC4416239 DOI: 10.1186/s12864-015-1481-9] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2015] [Accepted: 03/23/2015] [Indexed: 12/11/2022] Open
Abstract
Background By examining the genotype calls generated by the 1000 Genomes Project we discovered that the human reference genome GRCh37 contains almost 20,000 loci in which the reference allele has never been observed in healthy individuals and around 70,000 loci in which it has been observed only in the heterozygous state. Results We show that a large fraction of this rare reference allele (RRA) loci belongs to coding, functional and regulatory elements of the genome and could be linked to rare Mendelian disorders as well as cancer. We also demonstrate that classical germline and somatic variant calling tools are not capable to recognize the rare allele when present in these loci. To overcome such limitations, we developed a novel tool, named RAREVATOR, that is able to identify and call the rare allele in these genomic positions. By using a small cancer dataset we compared our tool with two state-of-the-art callers and we found that RAREVATOR identified more than 1,500 germline and 22 somatic RRA variants missed by the two methods and which belong to significantly mutated pathways. Conclusions These results show that, to date, the investigation of around 100,000 loci of the human genome has been missed by re-sequencing experiments based on the GRCh37 assembly and that our tool can fill the gap left by other methods. Moreover, the investigation of the latest version of the human reference genome, GRCh38, showed that although the GRC corrected almost all insertions and a small part of SNVs and deletions, a large number of functionally relevant RRAs still remain unchanged. For this reason, also future resequencing experiments, based on GRCh38, will benefit from RAREVATOR analysis results. RAREVATOR is freely available at http://sourceforge.net/projects/rarevator. Electronic supplementary material The online version of this article (doi:10.1186/s12864-015-1481-9) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Alberto Magi
- Department of Experimental and Clinical Medicine, University of Florence, Florence, Italy.
| | - Romina D'Aurizio
- Laboratory of Integrative Systems Medicine (LISM), Institute of Informatics and Telematics and Institute of Clinical Physiology, National Research Council, Pisa, Italy.
| | - Flavia Palombo
- Medical Genetics Unit, Department of Medical and Surgical Sciences, University of Bologna, Bologna, Italy.
| | - Ingrid Cifola
- Institute for Biomedical Technologies, National Research Council, Milan, Italy.
| | - Lorenzo Tattini
- Department of Neuroscience, Pharmacology and Child Health, University of Florence, Florence, Italy.
| | - Roberto Semeraro
- Department of Experimental and Clinical Medicine, University of Florence, Florence, Italy.
| | - Tommaso Pippucci
- Medical Genetics Unit, Department of Medical and Surgical Sciences, University of Bologna, Bologna, Italy.
| | - Betti Giusti
- Department of Experimental and Clinical Medicine, University of Florence, Florence, Italy.
| | - Giovanni Romeo
- Medical Genetics Unit, Department of Medical and Surgical Sciences, University of Bologna, Bologna, Italy.
| | - Rosanna Abbate
- Department of Experimental and Clinical Medicine, University of Florence, Florence, Italy.
| | - Gian Franco Gensini
- Department of Experimental and Clinical Medicine, University of Florence, Florence, Italy.
| |
Collapse
|
20
|
Sun C, Mueller RL. Hellbender genome sequences shed light on genomic expansion at the base of crown salamanders. Genome Biol Evol 2015; 6:1818-29. [PMID: 25115007 PMCID: PMC4122941 DOI: 10.1093/gbe/evu143] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Among animals, genome sizes range from 20 Mb to 130 Gb, with 380-fold variation across vertebrates. Most of the largest vertebrate genomes are found in salamanders, an amphibian clade of 660 species. Thus, salamanders are an important system for studying causes and consequences of genomic gigantism. Previously, we showed that plethodontid salamander genomes accumulate higher levels of long terminal repeat (LTR) retrotransposons than do other vertebrates, although the evolutionary origins of such sequences remained unexplored. We also showed that some salamanders in the family Plethodontidae have relatively slow rates of DNA loss through small insertions and deletions. Here, we present new data from Cryptobranchus alleganiensis, the hellbender. Cryptobranchus and Plethodontidae span the basal phylogenetic split within salamanders; thus, analyses incorporating these taxa can shed light on the genome of the ancestral crown salamander lineage, which underwent expansion. We show that high levels of LTR retrotransposons likely characterize all crown salamanders, suggesting that disproportionate expansion of this transposable element (TE) class contributed to genomic expansion. Phylogenetic and age distribution analyses of salamander LTR retrotransposons indicate that salamanders' high TE levels reflect persistence and diversification of ancestral TEs rather than horizontal transfer events. Finally, we show that relatively slow DNA loss rates through small indels likely characterize all crown salamanders, suggesting that a decreased DNA loss rate contributed to genomic expansion at the clade's base. Our identification of shared genomic features across phylogenetically distant salamanders is a first step toward identifying the evolutionary processes underlying accumulation and persistence of high levels of repetitive sequence in salamander genomes.
Collapse
|
21
|
Wu Z, Tembrock LR, Ge S. Are differences in genomic data sets due to true biological variants or errors in genome assembly: an example from two chloroplast genomes. PLoS One 2015; 10:e0118019. [PMID: 25658309 PMCID: PMC4320078 DOI: 10.1371/journal.pone.0118019] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2014] [Accepted: 01/07/2015] [Indexed: 01/01/2023] Open
Abstract
DNA sequencing has been revolutionized by the development of high-throughput sequencing technologies. Plummeting costs and the massive throughput capacities of second and third generation sequencing platforms have transformed many fields of biological research. Concurrently, new data processing pipelines made rapid de novo genome assemblies possible. However, high quality data are critically important for all investigations in the genomic era. We used chloroplast genomes of one Oryza species (O. australiensis) to compare differences in sequence quality: one genome (GU592209) was obtained through Illumina sequencing and reference-guided assembly and the other genome (KJ830774) was obtained via target enrichment libraries and shotgun sequencing. Based on the whole genome alignment, GU592209 was more similar to the reference genome (O. sativa: AY522330) with 99.2% sequence identity (SI value) compared with the 98.8% SI values in the KJ830774 genome; whereas the opposite result was obtained when the SI values in coding and noncoding regions of GU592209 and KJ830774 were compared. Additionally, the junctions of two single copies and repeat copies in the chloroplast genome exhibited differences. Phylogenetic analyses were conducted using these sequences, and the different data sets yielded dissimilar topologies: phylogenetic replacements of the two individuals were remarkably different based on whole genome sequencing or SNP data and insertions and deletions (indels) data. Thus, we concluded that the genomic composition of GU592209 was heterogeneous in coding and non-coding regions. These findings should impel biologists to carefully consider the quality of sequencing and assembly when working with next-generation data.
Collapse
Affiliation(s)
- Zhiqiang Wu
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing, China
- Department of Biology, Colorado State University, Fort Collins, Colorado, United States of America
| | - Luke R. Tembrock
- Department of Biology, Colorado State University, Fort Collins, Colorado, United States of America
| | - Song Ge
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing, China
| |
Collapse
|
22
|
Folkman L, Yang Y, Li Z, Stantic B, Sattar A, Mort M, Cooper DN, Liu Y, Zhou Y. DDIG-in: detecting disease-causing genetic variations due to frameshifting indels and nonsense mutations employing sequence and structural properties at nucleotide and protein levels. ACTA ACUST UNITED AC 2015; 31:1599-606. [PMID: 25573915 DOI: 10.1093/bioinformatics/btu862] [Citation(s) in RCA: 50] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2014] [Accepted: 12/23/2014] [Indexed: 12/15/2022]
Abstract
MOTIVATION Frameshifting (FS) indels and nonsense (NS) variants disrupt the protein-coding sequence downstream of the mutation site by changing the reading frame or introducing a premature termination codon, respectively. Despite such drastic changes to the protein sequence, FS indels and NS variants have been discovered in healthy individuals. How to discriminate disease-causing from neutral FS indels and NS variants is an understudied problem. RESULTS We have built a machine learning method called DDIG-in (FS) based on real human genetic variations from the Human Gene Mutation Database (inherited disease-causing) and the 1000 Genomes Project (GP) (putatively neutral). The method incorporates both sequence and predicted structural features and yields a robust performance by 10-fold cross-validation and independent tests on both FS indels and NS variants. We showed that human-derived NS variants and FS indels derived from animal orthologs can be effectively employed for independent testing of our method trained on human-derived FS indels. DDIG-in (FS) achieves a Matthews correlation coefficient (MCC) of 0.59, a sensitivity of 86%, and a specificity of 72% for FS indels. Application of DDIG-in (FS) to NS variants yields essentially the same performance (MCC of 0.43) as a method that was specifically trained for NS variants. DDIG-in (FS) was shown to make a significant improvement over existing techniques.
Collapse
Affiliation(s)
- Lukas Folkman
- School of Information and Communication Technology, Griffith University, Parklands Drive, Southport, Queensland 4222, Australia, Institute for Integrated and Intelligent Systems, Griffith University, 170 Kessels Road, Brisbane, Queensland 4111, Australia, Queensland Research Laboratory, NICTA - National ICT Australia, 70-72 Bowen Street, Spring Hill, Queensland 4000, Australia, Institute for Glycomics, Griffith University, Parklands Drive, Southport, Queensland 4222, Australia, Institute of Medical Genetics, Cardiff University, Heath Park, Cardiff CF14 4XN, UK and Department of Medical and Molecular Genetics, Indiana University School of Medicine, 975 West Walnut Street, MRL Bldg IB130, Indianapolis, IN 46202, USA School of Information and Communication Technology, Griffith University, Parklands Drive, Southport, Queensland 4222, Australia, Institute for Integrated and Intelligent Systems, Griffith University, 170 Kessels Road, Brisbane, Queensland 4111, Australia, Queensland Research Laboratory, NICTA - National ICT Australia, 70-72 Bowen Street, Spring Hill, Queensland 4000, Australia, Institute for Glycomics, Griffith University, Parklands Drive, Southport, Queensland 4222, Australia, Institute of Medical Genetics, Cardiff University, Heath Park, Cardiff CF14 4XN, UK and Department of Medical and Molecular Genetics, Indiana University School of Medicine, 975 West Walnut Street, MRL Bldg IB130, Indianapolis, IN 46202, USA School of Information and Communication Technology, Griffith University, Parklands Drive, Southport, Queensland 4222, Australia, Institute for Integrated and Intelligent Systems, Griffith University, 170 Kessels Road, Brisbane, Queensland 4111, Australia, Queensland Research Laboratory, NICTA - National ICT Australia, 70-72 Bowen Street, Spring Hill, Queensland 4000, Australia, Institute for Glycomics, Griffith University, Parklands Drive, Southport, Queensland 4222, Australia, Institute of Medical Genetics, Cardiff University, Heath Park, Cardif
| | - Yuedong Yang
- School of Information and Communication Technology, Griffith University, Parklands Drive, Southport, Queensland 4222, Australia, Institute for Integrated and Intelligent Systems, Griffith University, 170 Kessels Road, Brisbane, Queensland 4111, Australia, Queensland Research Laboratory, NICTA - National ICT Australia, 70-72 Bowen Street, Spring Hill, Queensland 4000, Australia, Institute for Glycomics, Griffith University, Parklands Drive, Southport, Queensland 4222, Australia, Institute of Medical Genetics, Cardiff University, Heath Park, Cardiff CF14 4XN, UK and Department of Medical and Molecular Genetics, Indiana University School of Medicine, 975 West Walnut Street, MRL Bldg IB130, Indianapolis, IN 46202, USA School of Information and Communication Technology, Griffith University, Parklands Drive, Southport, Queensland 4222, Australia, Institute for Integrated and Intelligent Systems, Griffith University, 170 Kessels Road, Brisbane, Queensland 4111, Australia, Queensland Research Laboratory, NICTA - National ICT Australia, 70-72 Bowen Street, Spring Hill, Queensland 4000, Australia, Institute for Glycomics, Griffith University, Parklands Drive, Southport, Queensland 4222, Australia, Institute of Medical Genetics, Cardiff University, Heath Park, Cardiff CF14 4XN, UK and Department of Medical and Molecular Genetics, Indiana University School of Medicine, 975 West Walnut Street, MRL Bldg IB130, Indianapolis, IN 46202, USA School of Information and Communication Technology, Griffith University, Parklands Drive, Southport, Queensland 4222, Australia, Institute for Integrated and Intelligent Systems, Griffith University, 170 Kessels Road, Brisbane, Queensland 4111, Australia, Queensland Research Laboratory, NICTA - National ICT Australia, 70-72 Bowen Street, Spring Hill, Queensland 4000, Australia, Institute for Glycomics, Griffith University, Parklands Drive, Southport, Queensland 4222, Australia, Institute of Medical Genetics, Cardiff University, Heath Park, Cardif
| | - Zhixiu Li
- School of Information and Communication Technology, Griffith University, Parklands Drive, Southport, Queensland 4222, Australia, Institute for Integrated and Intelligent Systems, Griffith University, 170 Kessels Road, Brisbane, Queensland 4111, Australia, Queensland Research Laboratory, NICTA - National ICT Australia, 70-72 Bowen Street, Spring Hill, Queensland 4000, Australia, Institute for Glycomics, Griffith University, Parklands Drive, Southport, Queensland 4222, Australia, Institute of Medical Genetics, Cardiff University, Heath Park, Cardiff CF14 4XN, UK and Department of Medical and Molecular Genetics, Indiana University School of Medicine, 975 West Walnut Street, MRL Bldg IB130, Indianapolis, IN 46202, USA School of Information and Communication Technology, Griffith University, Parklands Drive, Southport, Queensland 4222, Australia, Institute for Integrated and Intelligent Systems, Griffith University, 170 Kessels Road, Brisbane, Queensland 4111, Australia, Queensland Research Laboratory, NICTA - National ICT Australia, 70-72 Bowen Street, Spring Hill, Queensland 4000, Australia, Institute for Glycomics, Griffith University, Parklands Drive, Southport, Queensland 4222, Australia, Institute of Medical Genetics, Cardiff University, Heath Park, Cardiff CF14 4XN, UK and Department of Medical and Molecular Genetics, Indiana University School of Medicine, 975 West Walnut Street, MRL Bldg IB130, Indianapolis, IN 46202, USA School of Information and Communication Technology, Griffith University, Parklands Drive, Southport, Queensland 4222, Australia, Institute for Integrated and Intelligent Systems, Griffith University, 170 Kessels Road, Brisbane, Queensland 4111, Australia, Queensland Research Laboratory, NICTA - National ICT Australia, 70-72 Bowen Street, Spring Hill, Queensland 4000, Australia, Institute for Glycomics, Griffith University, Parklands Drive, Southport, Queensland 4222, Australia, Institute of Medical Genetics, Cardiff University, Heath Park, Cardif
| | - Bela Stantic
- School of Information and Communication Technology, Griffith University, Parklands Drive, Southport, Queensland 4222, Australia, Institute for Integrated and Intelligent Systems, Griffith University, 170 Kessels Road, Brisbane, Queensland 4111, Australia, Queensland Research Laboratory, NICTA - National ICT Australia, 70-72 Bowen Street, Spring Hill, Queensland 4000, Australia, Institute for Glycomics, Griffith University, Parklands Drive, Southport, Queensland 4222, Australia, Institute of Medical Genetics, Cardiff University, Heath Park, Cardiff CF14 4XN, UK and Department of Medical and Molecular Genetics, Indiana University School of Medicine, 975 West Walnut Street, MRL Bldg IB130, Indianapolis, IN 46202, USA School of Information and Communication Technology, Griffith University, Parklands Drive, Southport, Queensland 4222, Australia, Institute for Integrated and Intelligent Systems, Griffith University, 170 Kessels Road, Brisbane, Queensland 4111, Australia, Queensland Research Laboratory, NICTA - National ICT Australia, 70-72 Bowen Street, Spring Hill, Queensland 4000, Australia, Institute for Glycomics, Griffith University, Parklands Drive, Southport, Queensland 4222, Australia, Institute of Medical Genetics, Cardiff University, Heath Park, Cardiff CF14 4XN, UK and Department of Medical and Molecular Genetics, Indiana University School of Medicine, 975 West Walnut Street, MRL Bldg IB130, Indianapolis, IN 46202, USA
| | - Abdul Sattar
- School of Information and Communication Technology, Griffith University, Parklands Drive, Southport, Queensland 4222, Australia, Institute for Integrated and Intelligent Systems, Griffith University, 170 Kessels Road, Brisbane, Queensland 4111, Australia, Queensland Research Laboratory, NICTA - National ICT Australia, 70-72 Bowen Street, Spring Hill, Queensland 4000, Australia, Institute for Glycomics, Griffith University, Parklands Drive, Southport, Queensland 4222, Australia, Institute of Medical Genetics, Cardiff University, Heath Park, Cardiff CF14 4XN, UK and Department of Medical and Molecular Genetics, Indiana University School of Medicine, 975 West Walnut Street, MRL Bldg IB130, Indianapolis, IN 46202, USA School of Information and Communication Technology, Griffith University, Parklands Drive, Southport, Queensland 4222, Australia, Institute for Integrated and Intelligent Systems, Griffith University, 170 Kessels Road, Brisbane, Queensland 4111, Australia, Queensland Research Laboratory, NICTA - National ICT Australia, 70-72 Bowen Street, Spring Hill, Queensland 4000, Australia, Institute for Glycomics, Griffith University, Parklands Drive, Southport, Queensland 4222, Australia, Institute of Medical Genetics, Cardiff University, Heath Park, Cardiff CF14 4XN, UK and Department of Medical and Molecular Genetics, Indiana University School of Medicine, 975 West Walnut Street, MRL Bldg IB130, Indianapolis, IN 46202, USA School of Information and Communication Technology, Griffith University, Parklands Drive, Southport, Queensland 4222, Australia, Institute for Integrated and Intelligent Systems, Griffith University, 170 Kessels Road, Brisbane, Queensland 4111, Australia, Queensland Research Laboratory, NICTA - National ICT Australia, 70-72 Bowen Street, Spring Hill, Queensland 4000, Australia, Institute for Glycomics, Griffith University, Parklands Drive, Southport, Queensland 4222, Australia, Institute of Medical Genetics, Cardiff University, Heath Park, Cardif
| | - Matthew Mort
- School of Information and Communication Technology, Griffith University, Parklands Drive, Southport, Queensland 4222, Australia, Institute for Integrated and Intelligent Systems, Griffith University, 170 Kessels Road, Brisbane, Queensland 4111, Australia, Queensland Research Laboratory, NICTA - National ICT Australia, 70-72 Bowen Street, Spring Hill, Queensland 4000, Australia, Institute for Glycomics, Griffith University, Parklands Drive, Southport, Queensland 4222, Australia, Institute of Medical Genetics, Cardiff University, Heath Park, Cardiff CF14 4XN, UK and Department of Medical and Molecular Genetics, Indiana University School of Medicine, 975 West Walnut Street, MRL Bldg IB130, Indianapolis, IN 46202, USA
| | - David N Cooper
- School of Information and Communication Technology, Griffith University, Parklands Drive, Southport, Queensland 4222, Australia, Institute for Integrated and Intelligent Systems, Griffith University, 170 Kessels Road, Brisbane, Queensland 4111, Australia, Queensland Research Laboratory, NICTA - National ICT Australia, 70-72 Bowen Street, Spring Hill, Queensland 4000, Australia, Institute for Glycomics, Griffith University, Parklands Drive, Southport, Queensland 4222, Australia, Institute of Medical Genetics, Cardiff University, Heath Park, Cardiff CF14 4XN, UK and Department of Medical and Molecular Genetics, Indiana University School of Medicine, 975 West Walnut Street, MRL Bldg IB130, Indianapolis, IN 46202, USA
| | - Yunlong Liu
- School of Information and Communication Technology, Griffith University, Parklands Drive, Southport, Queensland 4222, Australia, Institute for Integrated and Intelligent Systems, Griffith University, 170 Kessels Road, Brisbane, Queensland 4111, Australia, Queensland Research Laboratory, NICTA - National ICT Australia, 70-72 Bowen Street, Spring Hill, Queensland 4000, Australia, Institute for Glycomics, Griffith University, Parklands Drive, Southport, Queensland 4222, Australia, Institute of Medical Genetics, Cardiff University, Heath Park, Cardiff CF14 4XN, UK and Department of Medical and Molecular Genetics, Indiana University School of Medicine, 975 West Walnut Street, MRL Bldg IB130, Indianapolis, IN 46202, USA
| | - Yaoqi Zhou
- School of Information and Communication Technology, Griffith University, Parklands Drive, Southport, Queensland 4222, Australia, Institute for Integrated and Intelligent Systems, Griffith University, 170 Kessels Road, Brisbane, Queensland 4111, Australia, Queensland Research Laboratory, NICTA - National ICT Australia, 70-72 Bowen Street, Spring Hill, Queensland 4000, Australia, Institute for Glycomics, Griffith University, Parklands Drive, Southport, Queensland 4222, Australia, Institute of Medical Genetics, Cardiff University, Heath Park, Cardiff CF14 4XN, UK and Department of Medical and Molecular Genetics, Indiana University School of Medicine, 975 West Walnut Street, MRL Bldg IB130, Indianapolis, IN 46202, USA School of Information and Communication Technology, Griffith University, Parklands Drive, Southport, Queensland 4222, Australia, Institute for Integrated and Intelligent Systems, Griffith University, 170 Kessels Road, Brisbane, Queensland 4111, Australia, Queensland Research Laboratory, NICTA - National ICT Australia, 70-72 Bowen Street, Spring Hill, Queensland 4000, Australia, Institute for Glycomics, Griffith University, Parklands Drive, Southport, Queensland 4222, Australia, Institute of Medical Genetics, Cardiff University, Heath Park, Cardiff CF14 4XN, UK and Department of Medical and Molecular Genetics, Indiana University School of Medicine, 975 West Walnut Street, MRL Bldg IB130, Indianapolis, IN 46202, USA School of Information and Communication Technology, Griffith University, Parklands Drive, Southport, Queensland 4222, Australia, Institute for Integrated and Intelligent Systems, Griffith University, 170 Kessels Road, Brisbane, Queensland 4111, Australia, Queensland Research Laboratory, NICTA - National ICT Australia, 70-72 Bowen Street, Spring Hill, Queensland 4000, Australia, Institute for Glycomics, Griffith University, Parklands Drive, Southport, Queensland 4222, Australia, Institute of Medical Genetics, Cardiff University, Heath Park, Cardif
| |
Collapse
|
23
|
Brown JC. The role of DNA repair in herpesvirus pathogenesis. Genomics 2014; 104:287-94. [DOI: 10.1016/j.ygeno.2014.08.005] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2014] [Revised: 08/12/2014] [Accepted: 08/19/2014] [Indexed: 10/24/2022]
|
24
|
Yi H, Song H, Hwang J, Kim K, Nierman WC, Kim HS. The tandem repeats enabling reversible switching between the two phases of β-lactamase substrate spectrum. PLoS Genet 2014; 10:e1004640. [PMID: 25233343 PMCID: PMC4169377 DOI: 10.1371/journal.pgen.1004640] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2014] [Accepted: 07/31/2014] [Indexed: 11/25/2022] Open
Abstract
Expansion or shrinkage of existing tandem repeats (TRs) associated with various biological processes has been actively studied in both prokaryotic and eukaryotic genomes, while their origin and biological implications remain mostly unknown. Here we describe various duplications (de novo TRs) that occurred in the coding region of a β-lactamase gene, where a conserved structure called the omega loop is encoded. These duplications that occurred under selection using ceftazidime conferred substrate spectrum extension to include the antibiotic. Under selective pressure with one of the original substrates (amoxicillin), a high level of reversion occurred in the mutant β-lactamase genes completing a cycle back to the original substrate spectrum. The de novo TRs coupled with reversion makes a genetic toggling mechanism enabling reversible switching between the two phases of the substrate spectrum of β-lactamases. This toggle exemplifies the effective adaptation of de novo TRs for enhanced bacterial survival. We found pairs of direct repeats that mediated the DNA duplication (TR formation). In addition, we found different duos of sequences that mediated the DNA duplication. These novel elements—that we named SCSs (same-strand complementary sequences)—were also found associated with β-lactamase TR mutations from clinical isolates. Both direct repeats and SCSs had a high correlation with TRs in diverse bacterial genomes throughout the major phylogenetic lineages, suggesting that they comprise a fundamental mechanism shaping the bacterial evolution. β-lactamases can adapt to new antibiotics by mutations in their genes. The original and the extended substrate spectrums of β-lactamases define two phases of catalytic activity, and the conversion by point mutations is unidirectional from the initial to the new spectrum. We describe duplication mutations that enable reversible switching between the substrate spectrums, increasing the adaptability of the bacterium. We provide evidence supporting that two distinct groups of short sequences mediated the formation of DNA duplications in β-lactamases: direct repeats and novel elements that we named, SCSs (same-strand complementary sequences). Our study suggests that DNA duplication processes mediated by both direct repeats and SCSs are not just limited to the β-lactamase genes but comprise a fundamental mechanism in bacterial genome evolution.
Collapse
Affiliation(s)
- Hyojeong Yi
- Department of Biosystems and Biotechnology, Korea University, Seoul, Korea
- Department of Biomedical Sciences, Korea University, Seoul, Korea
| | - Han Song
- Department of Biomedical Sciences, Korea University, Seoul, Korea
| | - Junghyun Hwang
- Department of Biomedical Sciences, Korea University, Seoul, Korea
| | - Karan Kim
- Department of Biomedical Sciences, Korea University, Seoul, Korea
| | - William C. Nierman
- J. Craig Venter Institute, Rockville, Maryland, United States of America
| | - Heenam Stanley Kim
- Department of Biosystems and Biotechnology, Korea University, Seoul, Korea
- Department of Biomedical Sciences, Korea University, Seoul, Korea
- * E-mail:
| |
Collapse
|
25
|
Zhuang J, Chen X, Tan Z, Zhu Y, Zhao K, Yang J. A novel de novo duplication mutation of PAX6 in a Chinese family with aniridia and other ocular abnormalities. Sci Rep 2014; 4:4836. [PMID: 24787241 PMCID: PMC4007070 DOI: 10.1038/srep04836] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2013] [Accepted: 03/31/2014] [Indexed: 11/13/2022] Open
Abstract
Aniridia is a congenital panocular disorder caused by the mutations of the paired box gene-6 (PAX6). To investigate the clinical characterization and the underlying genetic defect in a Chinese family with aniridia and other ocular abnormalities, we recruited the family members who underwent ophthalmic examination. Two patients in this family, the proband and his affected son, both have bilateral aniridia, foveal hypoplasia and nystagmus. Moreover, the proband also had presenile cataracts, but his affected son did not show cataracts at the time of examination. Sequencing PAX6 revealed that a heterozygous duplication mutation c.95_105dup11, predicted to generate non-functional truncated protein at position Gly36 (p.G36X), was found in the affected individuals but not in any of the unaffected family members including the parents of the proband. Haplotype analysis showed that the proband and his affected son shared a common disease-related haplotype, which was arisen from the proband's unaffected father through crossing-over. In conclusion, we identified a novel de novo duplication mutation of PAX6 in the aniridia and other ocular abnormalities family. This mutation has occurred de novo on a paternal chromosome by direct duplication, which presumably results from replication slippage or unequal non-sister chromatids exchange during spermatogenesis.
Collapse
Affiliation(s)
- Jianfu Zhuang
- 1] Xiamen Eye Center of Xiamen University, Xiamen, Fujian, China [2] Clinical College of Ophthalmology, Tianjin Medical University, Tianjin, China [3]
| | - Xiaole Chen
- 1] Biomedical Engineering Center, Fujian Medical University, Fuzhou, Fujian, China [2]
| | - Zhihua Tan
- Biomedical Engineering Center, Fujian Medical University, Fuzhou, Fujian, China
| | - Yihua Zhu
- Department of Ophthalmology, The First Affiliated Hospital of Fujian Medical University, Fuzhou, Fujian, China
| | - Kanxing Zhao
- Clinical College of Ophthalmology, Tianjin Medical University, Tianjin, China
| | - Juhua Yang
- Biomedical Engineering Center, Fujian Medical University, Fuzhou, Fujian, China
| |
Collapse
|
26
|
Huang S, Li J, Xu A, Huang G, You L. Small Insertions Are More Deleterious than Small Deletions in Human Genomes. Hum Mutat 2013; 34:1642-9. [DOI: 10.1002/humu.22435] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2013] [Accepted: 08/22/2013] [Indexed: 11/09/2022]
Affiliation(s)
- Shengfeng Huang
- State Key Laboratory of Biocontrol; Guangdong Key Laboratory of Pharmaceutical Functional Genes; College of Life Sciences, Sun Yat-sen University; Guangzhou 510275 People's Republic of China
| | - Jie Li
- State Key Laboratory of Biocontrol; Guangdong Key Laboratory of Pharmaceutical Functional Genes; College of Life Sciences, Sun Yat-sen University; Guangzhou 510275 People's Republic of China
| | - Anlong Xu
- State Key Laboratory of Biocontrol; Guangdong Key Laboratory of Pharmaceutical Functional Genes; College of Life Sciences, Sun Yat-sen University; Guangzhou 510275 People's Republic of China
- Beijing University of Chinese Medicine, Chao-yang District; Beijing 100029 People's Republic of China
| | - Guangrui Huang
- State Key Laboratory of Biocontrol; Guangdong Key Laboratory of Pharmaceutical Functional Genes; College of Life Sciences, Sun Yat-sen University; Guangzhou 510275 People's Republic of China
| | - Leiming You
- State Key Laboratory of Biocontrol; Guangdong Key Laboratory of Pharmaceutical Functional Genes; College of Life Sciences, Sun Yat-sen University; Guangzhou 510275 People's Republic of China
| |
Collapse
|
27
|
Molecular genetic epidemiology of human diseases: from patterns to predictions. Hum Genet 2013; 133:425-30. [PMID: 24241280 DOI: 10.1007/s00439-013-1396-y] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2013] [Accepted: 11/07/2013] [Indexed: 10/26/2022]
Abstract
Databases of disease-associated or disease-causing mutations allow the study, not only of the molecular mechanisms underlying the primary lesions at the DNA level, but also of the functional consequences of mutation at the phenotypic level. The Human Gene Mutation Database (HGMD) and the bioinformatics analyses of its content provide an illustrative example of this indirect approach to molecular genetic epidemiology. In fact, the Bayesian type of reasoning underlying previous scientific analyses of HGMD data is also reflected in current software tools used to predict the likely disease relevance of a newly detected genetic variant. After a brief resume of the past scientific utility of HGMD, we, therefore, shortly review three representative and commonly used examples of these tools, namely SIFT, PolyPhen-2 and NNSplice.
Collapse
|
28
|
Meng T, Soliman AT, Shyu ML, Yang Y, Chen SC, Iyengar SS, Yordy JS, Iyengar P. Wavelet analysis in current cancer genome research: a survey. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2013; 10:1442-1459. [PMID: 24407303 DOI: 10.1109/tcbb.2013.134] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
With the rapid development of next generation sequencing technology, the amount of biological sequence data of the cancer genome increases exponentially, which calls for efficient and effective algorithms that may identify patterns hidden underneath the raw data that may distinguish cancer Achilles' heels. From a signal processing point of view, biological units of information, including DNA and protein sequences, have been viewed as one-dimensional signals. Therefore, researchers have been applying signal processing techniques to mine the potentially significant patterns within these sequences. More specifically, in recent years, wavelet transforms have become an important mathematical analysis tool, with a wide and ever increasing range of applications. The versatility of wavelet analytic techniques has forged new interdisciplinary bounds by offering common solutions to apparently diverse problems and providing a new unifying perspective on problems of cancer genome research. In this paper, we provide a survey of how wavelet analysis has been applied to cancer bioinformatics questions. Specifically, we discuss several approaches of representing the biological sequence data numerically and methods of using wavelet analysis on the numerical sequences.
Collapse
Affiliation(s)
- Tao Meng
- University of Miami, Coral Gables
| | | | | | | | | | | | - John S Yordy
- University of Texas Southwestern Medical Center, Dallas
| | | |
Collapse
|
29
|
Genetic analysis of West Nile virus isolates from an outbreak in Idaho, United States, 2006-2007. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2013; 10:4486-506. [PMID: 24065039 PMCID: PMC3799518 DOI: 10.3390/ijerph10094486] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/09/2013] [Revised: 09/12/2013] [Accepted: 09/16/2013] [Indexed: 12/26/2022]
Abstract
West Nile virus (WNV) appeared in the U.S. in 1999 and has since become endemic, with yearly summer epidemics causing tens of thousands of cases of serious disease over the past 14 years. Analysis of WNV strains isolated during the 2006–2007 epidemic seasons demonstrates that a new genetic variant had emerged coincidentally with an intense outbreak in Idaho during 2006. The isolates belonging to the new variant carry a 13 nt deletion, termed ID-Δ13, located at the variable region of the 3′UTR, and are genetically related. The analysis of deletions and insertions in the 3′UTR of two major lineages of WNV revealed the presence of conserved repeats and two indel motifs in the variable region of the 3′UTR. One human and two bird isolates from the Idaho 2006–2007 outbreaks were sequenced using Illumina technology and within-host variability was analyzed. Continued monitoring of new genetic variants is important for public health as WNV continues to evolve.
Collapse
|
30
|
Sankaranarayanan K, Taleei R, Rahmanian S, Nikjoo H. Ionizing radiation and genetic risks. XVII. Formation mechanisms underlying naturally occurring DNA deletions in the human genome and their potential relevance for bridging the gap between induced DNA double-strand breaks and deletions in irradiated germ cells. Mutat Res 2013; 753:114-130. [PMID: 23948232 DOI: 10.1016/j.mrrev.2013.07.003] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2013] [Revised: 06/27/2013] [Accepted: 07/22/2013] [Indexed: 12/21/2022]
Abstract
While much is known about radiation-induced DNA double-strand breaks (DSBs) and their repair, the question of how deletions of different sizes arise as a result of the processing of DSBs by the cell's repair systems has not been fully answered. In order to bridge this gap between DSBs and deletions, we critically reviewed published data on mechanisms pertaining to: (a) repair of DNA DSBs (from basic studies in this area); (b) formation of naturally occurring structural variation (SV) - especially of deletions - in the human genome (from genomic studies) and (c) radiation-induced mutations and structural chromosomal aberrations in mammalian somatic cells (from radiation mutagenesis and radiation cytogenetic studies). The specific aim was to assess the relative importance of the postulated mechanisms in generating deletions in the human genome and examine whether empirical data on radiation-induced deletions in mouse germ cells are consistent with predictions of these mechanisms. The mechanisms include (a) NHEJ, a DSB repair process that does not require any homology and which functions in all stages of the cell cycle (and is of particular relevance in G0/G1); (b) MMEJ, also a DSB repair process but which requires microhomology and which presumably functions in all cell cycle stages; (c) NAHR, a recombination-based DSB repair mechanism which operates in prophase I of meiosis in germ cells; (d) MMBIR, a microhomology-mediated, replication-based mechanism which operates in the S phase of the cell cycle, and (e) strand slippage during replication (involved in the origin of small insertions and deletions (INDELs). Our analysis permits the inference that, between them, these five mechanisms can explain nearly all naturally occurring deletions of different sizes identified in the human genome, NAHR and MMBIR being potentially more versatile in this regard. With respect to radiation-induced deletions, the basic studies suggest that those arising as a result of the operation of NHEJ/MMEJ processes, as currently formulated, are expected to be relatively small. However, data on induced mutations in mouse spermatogonial stem cells (irradiation in G0/G1 phase of the cell cycle and DSB repair presumed to be via NHEJ predominantly) show that most are associated with deletions of different sizes, some in the megabase range. There is thus a 'discrepancy' between what the basic studies suggest and the empirical observations in mutagenesis studies. This discrepancy, however, is only an apparent but not a real one. It can be resolved by considering the issue of deletions in the broader context of and in conjunction with the organization of chromatin in chromosomes and nuclear architecture, the conceptual framework for which already exists in studies carried out during the past fifteen years or so. In this paper, we specifically hypothesize that repair of DSBs induced in chromatin loops may offer a basis to explain the induction of deletions of different sizes and suggest an approach to test the hypothesis. We emphasize that the bridging of the gap between induced DSB and resulting deletions of different sizes is critical for current efforts in computational modeling of genetic risks.
Collapse
Affiliation(s)
- Krishnaswami Sankaranarayanan
- Radiation Biophysics Group, Department of Oncology-Pathology, Karolinska Instituet, Box 260, Stockholm SE 17176, Sweden
| | - Reza Taleei
- Radiation Biophysics Group, Department of Oncology-Pathology, Karolinska Instituet, Box 260, Stockholm SE 17176, Sweden
| | - Shirin Rahmanian
- Radiation Biophysics Group, Department of Oncology-Pathology, Karolinska Instituet, Box 260, Stockholm SE 17176, Sweden
| | - Hooshang Nikjoo
- Radiation Biophysics Group, Department of Oncology-Pathology, Karolinska Instituet, Box 260, Stockholm SE 17176, Sweden.
| |
Collapse
|
31
|
Characterization of bud emergence 46 (BEM46) protein: sequence, structural, phylogenetic and subcellular localization analyses. Biochem Biophys Res Commun 2013; 438:526-32. [PMID: 23916612 DOI: 10.1016/j.bbrc.2013.07.103] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2013] [Accepted: 07/25/2013] [Indexed: 02/04/2023]
Abstract
The bud emergence 46 (BEM46) protein from Neurospora crassa belongs to the α/β-hydrolase superfamily. Recently, we have reported that the BEM46 protein is localized in the perinuclear ER and also forms spots close by the plasma membrane. The protein appears to be required for cell type-specific polarity formation in N. crassa. Furthermore, initial studies suggested that the BEM46 amino acid sequence is conserved in eukaryotes and is considered to be one of the widespread conserved "known unknown" eukaryotic genes. This warrants for a comprehensive phylogenetic analysis of this superfamily to unravel origin and molecular evolution of these genes in different eukaryotes. Herein, we observe that all eukaryotes have at least a single copy of a bem46 ortholog. Upon scanning of these proteins in various genomes, we find that there are expansions leading into several paralogs in vertebrates. Usingcomparative genomic analyses, we identified insertion/deletions (indels) in the conserved domain of BEM46 protein, which allow to differentiate fungal classes such as ascomycetes from basidiomycetes. We also find that exonic indels are able to differentiate BEM46 homologs of different eukaryotic lineage. Furthermore, we unravel that BEM46 protein from N. crassa possess a novel endoplasmic-retention signal (PEKK) using GFP-fusion tagging experiments. We propose that three residues namely a serine 188S, a histidine 292H and an aspartic acid 262D are most critical residues, forming a catalytic triad in BEM46 protein from N. crassa. We carried out a comprehensive study on bem46 genes from a molecular evolution perspective with combination of functional analyses. The evolutionary history of BEM46 proteins is characterized by exonic indels in lineage specific manner.
Collapse
|
32
|
Markowski DN, Nimzyk R, Belge G, Löning T, Helmke BM, Bullerdiek J. Molecular topography of the MED12-deleted region in smooth muscle tumors: a possible link between non-B DNA structures and hypermutability. Mol Cytogenet 2013; 6:23. [PMID: 23738817 PMCID: PMC3712005 DOI: 10.1186/1755-8166-6-23] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2013] [Accepted: 04/24/2013] [Indexed: 01/22/2023] Open
Abstract
Background Deletions of the gene encoding mediator subcomplex 12 (MED12) in human smooth muscle tumors rank among the most frequent genomic alterations in human tumors at all. In a minority of these cases, small deletions are found. In an attempt to delineate key features of the deletions aimed at a better understanding of the molecular pathogenesis of uterine smooth muscle tumors we have analyzed 70 MED12 deletions including 46 cases from the literature and 24 own unpublished cases. Results The average length of the deletions was 18.7 bp ranging between 2 bp and 43 bp. While in general multitudes of 3 clearly dominated leaving the transcript in frame, deletions of 21, 24, 30, and 33 nucleotides were clearly underrepresented. Within the DNA segment affected deletion breakpoints were not randomly distributed. Most breakpoints clustered within the center of the segment where two peaks of breakpoint clusters could be distinguished. Interestingly, one of these clusters coincides with the loop of a putative folded non-B DNA structure whereas a much lower number of breaks noted in the 5′ and 3′ stem of the structure forming an intramolecular B-helix. The second cluster mainly consisting of 3′ breaks was located in a region downstream adjacent to the stem. Conclusion The present study describes for the first time main characteristics of MED12 deletions occurring in smooth muscle tumors. Interestingly, the non-random distribution of breakpoints within the deletion hotspot region may point to a role of non-canonical DNA structures for the occurrence of these mutations and the molecular pathogenesis of uterine smooth muscle tumors, respectively.
Collapse
|
33
|
Sun C, López Arriaza JR, Mueller RL. Slow DNA loss in the gigantic genomes of salamanders. Genome Biol Evol 2013; 4:1340-8. [PMID: 23175715 PMCID: PMC3542557 DOI: 10.1093/gbe/evs103] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
Evolutionary changes in genome size result from the combined effects of mutation, natural
selection, and genetic drift. Insertion and deletion mutations (indels) directly impact
genome size by adding or removing sequences. Most species lose more DNA through small
indels (i.e., ∼1–30 bp) than they gain, which can result in genome reduction
over time. Because this rate of DNA loss varies across species, small indel dynamics have
been suggested to contribute to genome size evolution. Species with extremely large
genomes provide interesting test cases for exploring the link between small indels and
genome size; however, most large genomes remain relatively unexplored. Here, we examine
rates of DNA loss in the tetrapods with the largest genomes—the salamanders. We used
low-coverage genomic shotgun sequence data from four salamander species to examine
patterns of insertion, deletion, and substitution in neutrally evolving non-long terminal
repeat (LTR) retrotransposon sequences. For comparison, we estimated genome-wide DNA loss
rates in non-LTR retrotransposon sequences from five other vertebrate genomes:
Anolis carolinensis, Danio rerio, Gallus
gallus, Homo sapiens, and Xenopus tropicalis.
Our results show that salamanders have significantly lower rates of DNA loss than do other
vertebrates. More specifically, salamanders experience lower numbers of deletions relative
to insertions, and both deletions and insertions are skewed toward smaller sizes. On the
basis of these patterns, we conclude that slow DNA loss contributes to genomic gigantism
in salamanders. We also identify candidate molecular mechanisms underlying these
differences and suggest that natural variation in indel dynamics provides a unique
opportunity to study the basis of genome stability.
Collapse
Affiliation(s)
- Cheng Sun
- Department of Biology, Colorado State University, CO, USA
| | | | | |
Collapse
|
34
|
Zhao H, Yang Y, Lin H, Zhang X, Mort M, Cooper DN, Liu Y, Zhou Y. DDIG-in: discriminating between disease-associated and neutral non-frameshifting micro-indels. Genome Biol 2013; 14:R23. [PMID: 23497682 PMCID: PMC4053752 DOI: 10.1186/gb-2013-14-3-r23] [Citation(s) in RCA: 56] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2012] [Accepted: 03/13/2013] [Indexed: 02/07/2023] Open
Abstract
Micro-indels (insertions or deletions shorter than 21 bps) constitute the second most frequent class of human gene mutation after single nucleotide variants. Despite the relative abundance of non-frameshifting indels, their damaging effect on protein structure and function has gone largely unstudied. We have developed a support vector machine-based method named DDIG-in (Detecting disease-causing genetic variations due to indels) to prioritize non-frameshifting indels by comparing disease-associated mutations with putatively neutral mutations from the 1,000 Genomes Project. The final model gives good discrimination for indels and is robust against annotation errors. A webserver implementing DDIG-in is available at http://sparks-lab.org/ddig.
Collapse
|
35
|
Ferri L, Donati MA, Funghini S, Malvagia S, Catarzi S, Lugli L, Ragni L, Bertini E, Vaz FM, Cooper DN, Guerrini R, Morrone A. New clinical and molecular insights on Barth syndrome. Orphanet J Rare Dis 2013; 8:27. [PMID: 23409742 PMCID: PMC3599367 DOI: 10.1186/1750-1172-8-27] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2012] [Accepted: 01/30/2013] [Indexed: 11/26/2022] Open
Abstract
Background Barth syndrome (BS) is an X-linked infantile-onset cardioskeletal disease characterized by cardiomyopathy, hypotonia, growth delay, neutropenia and 3-methylglutaconic aciduria. It is caused by mutations in the TAZ gene encoding tafazzin, a protein involved in the metabolism of cardiolipin, a mitochondrial-specific phospholipid involved in mitochondrial energy production. Methods Clinical, biochemical and molecular characterization of a group of six male patients suspected of having BS. Three patients presented early with severe metabolic decompensation including respiratory distress, oxygen desaturation and cardiomyopathy and died within the first year of life. The remaining three patients had cardiomyopathy, hypotonia and growth delay and are still alive. Cardiomyopathy was detected during pregnancy through a routine check-up in one patient. All patients exhibited 3-methylglutaconic aciduria and neutropenia, when tested and five of them also had lactic acidosis. Results We confirmed the diagnosis of BS with sequence analysis of the TAZ gene, and found five new mutations, c.641A>G p.His214Arg, c.284dupG (p.Thr96Aspfs*37), c.678_691del14 (p.Tyr227Trpfs*79), g.8009_16445del8437 and g.[9777_9814del38; 9911-?_14402del] and the known nonsense mutation c.367C>T (p.Arg123Term). The two gross rearrangements ablated TAZ exons 6 to 11 and probably originated by non-allelic homologous recombination and by Serial Replication Slippage (SRS), respectively. The identification of the breakpoints boundaries of the gross deletions allowed the direct detection of heterozygosity in carrier females. Conclusions Lactic acidosis associated with 3-methylglutaconic aciduria is highly suggestive of BS, whilst the severity of the metabolic decompensation at disease onset should be considered for prognostic purposes. Mutation analysis of the TAZ gene is necessary for confirming the clinical and biochemical diagnosis in probands in order to identify heterozygous carriers and supporting prenatal diagnosis and genetic counseling.
Collapse
Affiliation(s)
- Lorenzo Ferri
- Department of Neurosciences, Psychology, Pharmacology and Child Health, University of Florence and Paediatric Neurology Unit and Laboratories, Meyer Children's Hospital, Viale Pieraccini n, 24, 50139 Florence, Italy
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
36
|
Albuisson J, Murthy SE, Bandell M, Coste B, Louis-Dit-Picard H, Mathur J, Fénéant-Thibault M, Tertian G, de Jaureguiberry JP, Syfuss PY, Cahalan S, Garçon L, Toutain F, Simon Rohrlich P, Delaunay J, Picard V, Jeunemaitre X, Patapoutian A. Dehydrated hereditary stomatocytosis linked to gain-of-function mutations in mechanically activated PIEZO1 ion channels. Nat Commun 2013; 4:1884. [PMID: 23695678 PMCID: PMC3674779 DOI: 10.1038/ncomms2899] [Citation(s) in RCA: 263] [Impact Index Per Article: 21.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2013] [Accepted: 04/18/2013] [Indexed: 02/06/2023] Open
Abstract
Dehydrated hereditary stomatocytosis is a genetic condition with defective red blood cell membrane properties that causes an imbalance in intracellular cation concentrations. Recently, two missense mutations in the mechanically activated PIEZO1 (FAM38A) ion channel were associated with dehydrated hereditary stomatocytosis. However, it is not known how these mutations affect PIEZO1 function. Here, by combining linkage analysis and whole-exome sequencing in a large pedigree and Sanger sequencing in two additional kindreds and 11 unrelated dehydrated hereditary stomatocytosis cases, we identify three novel missense mutations and one recurrent duplication in PIEZO1, demonstrating that it is the major gene for dehydrated hereditary stomatocytosis. All the dehydrated hereditary stomatocytosis-associated mutations locate at C-terminal half of PIEZO1. Remarkably, we find that all PIEZO1 mutations give rise to mechanically activated currents that inactivate more slowly than wild-type currents. This gain-of-function PIEZO1 phenotype provides insight that helps to explain the increased permeability of cations in red blood cells of dehydrated hereditary stomatocytosis patients. Our findings also suggest a new role for mechanotransduction in red blood cell biology and pathophysiology.
Collapse
|
37
|
Popovska-Jankovic K, Tasic V, Bogdanovic R, Miljkovic P, Golubovic E, Soylu A, Saraga M, Pavicevic S, Baskin E, Akil I, Gregoric A, Lilova M, Topaloglu R, Sukarova Stefanovska E, Plaseska-Karanfilska D. Molecular characterization of cystinuria in south-eastern European countries. Urolithiasis 2012; 41:21-30. [PMID: 23532419 DOI: 10.1007/s00240-012-0531-x] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2012] [Accepted: 12/08/2012] [Indexed: 10/27/2022]
Abstract
Cystinuria is an autosomal recessive disorder caused by defective transport of cystine and dibasic amino acids in the proximal renal tubules and small intestine. So far, more than 128 mutations in SLC3A1 gene, and 93 in SLC7A9 gene have been described as a cause of cystinuria. We present a molecular characterization of the cystinuria in 47 unrelated south-east European families. The molecular methodology included direct sequencing, single strand conformational polymorphism, and restriction fragment length polymorphism. A total of 93 (94.9 %) out of 98 unrelated cystinuric chromosomes have been characterized. Mutations in SLC3A1 gene account for 64.3 % and in SLC7A9 gene for 30.6 % of the cystinuric chromosomes. Ten different mutations in SLC3A1 gene were found, and two of them were novel (C242R and L573X), while in SLC7A9 gene seven mutations were found, of which three were novel (G73R, V375I and c.1048_1051delACTC). The most common mutations in this study were T216M (24.5 %), M467T (16.3 %) and R365L (11.2 %) in SLC3A1 and G105R (21.4 %) in SLC7A9 gene. A population specificity of cystinuria mutations was observed; T216M mutation was the only mutation present among Gypsies, G105R was the most common mutation among Albanians and Macedonians, and R365L among Serbs. The results of this study allowed introduction of rapid, simple and cost-effective genetic diagnosis of cystinuria that enables an early preventive care of affected patients and a prenatal diagnosis in affected families.
Collapse
Affiliation(s)
- Katerina Popovska-Jankovic
- Macedonian Academy of Science and Arts, Research Centre for Genetic Engineering and Biotechnology "Georgi D. Efremov", 1000, Skopje, Republic of Macedonia
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
38
|
Stenson PD, Ball EV, Mort M, Phillips AD, Shaw K, Cooper DN. The Human Gene Mutation Database (HGMD) and its exploitation in the fields of personalized genomics and molecular evolution. ACTA ACUST UNITED AC 2012; Chapter 1:1.13.1-1.13.20. [PMID: 22948725 DOI: 10.1002/0471250953.bi0113s39] [Citation(s) in RCA: 164] [Impact Index Per Article: 12.6] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The Human Gene Mutation Database (HGMD) constitutes a comprehensive core collection of data on germ-line mutations in nuclear genes underlying or associated with human inherited disease (http://www.hgmd.org). Data cataloged include single-base-pair substitutions in coding, regulatory, and splicing-relevant regions, micro-deletions and micro-insertions, indels, and triplet repeat expansions, as well as gross gene deletions, insertions, duplications, and complex rearrangements. Each mutation is entered into HGMD only once, in order to avoid confusion between recurrent and identical-by-descent lesions. By March 2012, the database contained in excess of 123,600 different lesions (HGMD Professional release 2012.1) detected in 4,514 different nuclear genes, with new entries currently accumulating at a rate in excess of 10,000 per annum. ∼6,000 of these entries constitute disease-associated and functional polymorphisms. HGMD also includes cDNA reference sequences for more than 98% of the listed genes.
Collapse
|
39
|
Vogt J, Mussotter T, Bengesser K, Claes K, Högel J, Chuzhanova N, Fu C, van den Ende J, Mautner VF, Cooper DN, Messiaen L, Kehrer-Sawatzki H. Identification of recurrent type-2 NF1 microdeletions reveals a mitotic nonallelic homologous recombination hotspot underlying a human genomic disorder. Hum Mutat 2012; 33:1599-609. [PMID: 22837079 DOI: 10.1002/humu.22171] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2012] [Accepted: 07/11/2012] [Indexed: 01/08/2023]
Abstract
Nonallelic homologous recombination (NAHR) is one of the major mechanisms underlying copy number variation in the human genome. Although several disease-associated meiotic NAHR breakpoints have been analyzed in great detail, hotspots for mitotic NAHR are not well characterized. Type-2 NF1 microdeletions, which are predominantly of postzygotic origin, constitute a highly informative model with which to investigate the features of mitotic NAHR. Here, a custom-designed MLPA- and PCR-based approach was used to identify 23 novel NAHR-mediated type-2 NF1 deletions. Breakpoint analysis of these 23 type-2 deletions, together with 17 NAHR-mediated type-2 deletions identified previously, revealed that the breakpoints are nonuniformly distributed within the paralogous SUZ12 and SUZ12P sequences. Further, the analysis of this large group of type-2 deletions revealed breakpoint recurrence within short segments (ranging in size from 57 to 253-bp) as well as the existence of a novel NAHR hotspot of 1.9-kb (termed PRS4). This hotspot harbored 20% (8/40) of the type-2 deletion breakpoints and contains the 253-bp recurrent breakpoint region BR6 in which four independent type-2 deletion breakpoints were identified. Our findings indicate that a combination of an open chromatin conformation and short non-B DNA-forming repeats may predispose to recurrent mitotic NAHR events between SUZ12 and its pseudogene.
Collapse
Affiliation(s)
- Julia Vogt
- Institute of Human Genetics, University of Ulm, Ulm, Germany
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
40
|
Chen JM, Cooper DN, Férec C. Local sequence determinants of two in-frame triplet deletion/duplication hotspots in the RHD/RHCE genes. Hum Genomics 2012; 6:8. [PMID: 23245726 PMCID: PMC3500233 DOI: 10.1186/1479-7364-6-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2012] [Accepted: 05/14/2012] [Indexed: 01/04/2023] Open
|
41
|
Chen JM, Fichou Y, Jamet D, Dupont I, Cooper DN, Le Maréchal C, Férec C. Small deletions within theRHDcoding sequence: a report of two novel mutational events and a survey of the underlying pathophysiologic mechanisms. Transfusion 2012; 53:206-10. [DOI: 10.1111/j.1537-2995.2012.03713.x] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
|
42
|
Huang S, Yu T, Chen Z, Yuan S, Chen S, Xu A. More single-nucleotide mutations surround small insertions than small deletions in primates. Hum Mutat 2012; 33:1099-106. [PMID: 22461281 DOI: 10.1002/humu.22085] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2011] [Accepted: 03/06/2012] [Indexed: 01/26/2023]
Abstract
Early studies have shown that single-nucleotide mutation rates increase close to insertions and deletions, but it is not fully understood how natural selection shapes genome-wide patterns of indels and their nearby single-nucleotide mutations. In this study, we find that, in primates, more single-nucleotide mutations surround small insertions than small deletions. This pattern affects <150 base pair (bp) sequences close to indels and persists under different genomic properties, such as exon/intron/intergenic contexts, repeated/nonrepeated sequences, replication timing, recombination rates, indel density, and guanine-cytosine (GC) content. We propose two different, but not mutually exclusive, hypothetical mechanisms to explain the pattern. One mechanism is that the sequence context preferring insertion formation may also favor nucleotide substitutions. Another mechanism is related to a hypothesis in which indel heterozygosity tends to increase nearby nucleotide substitution rates. It means that if insertions spend more time in heterozygotes, insertions may accumulate more surrounding single-nucleotide changes. In conclusion, we characterize a special genome-wide evolutionary pattern for indels and nearby single-nucleotide changes. This pattern may be driven by natural selection and bias primates' genome evolution and phenotypic variations.
Collapse
Affiliation(s)
- Shengfeng Huang
- Guangdong Key Laboratory of Pharmaceutical Functional Genes, College of Life Sciences, Sun Yat-Sen University, 135 XinGangXi Road,Guangzhou, People's Republic of China
| | | | | | | | | | | |
Collapse
|
43
|
Kiritsi D, He Y, Pasmooij AMG, Onder M, Happle R, Jonkman MF, Bruckner-Tuderman L, Has C. Revertant mosaicism in a human skin fragility disorder results from slipped mispairing and mitotic recombination. J Clin Invest 2012; 122:1742-6. [PMID: 22466645 DOI: 10.1172/jci61976] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2011] [Accepted: 02/15/2012] [Indexed: 11/17/2022] Open
Abstract
Spontaneous gene repair, also called revertant mosaicism, has been documented in several genetic disorders involving organs that undergo self-regeneration, including the skin. Genetic reversion may occur through different mechanisms, and in a single individual, the mutation can be repaired in various ways. Here we describe a disseminated pattern of revertant mosaicism observed in 6 patients with Kindler syndrome (KS), a genodermatosis caused by loss of kindlin-1 (encoded by FERMT1) and clinically characterized by patchy skin pigmentation and atrophy. All patients presented duplication mutations (c.456dupA and c.676dupC) in FERMT1, and slipped mispairing in direct nucleotide repeats was identified as the reversion mechanism in all investigated revertant skin spots. The sequence around the mutations demonstrated high propensity to mutations, favoring both microinsertions and microdeletions. Additionally, in some revertant patches, mitotic recombination generated areas with homozygous normal keratinocytes. Restoration of kindlin-1 expression led to clinically and structurally normal skin. Since loss of kindlin-1 severely impairs keratinocyte proliferation, we predict that revertant cells have a selective advantage that allows their clonal expansion and, consequently, the improvement of the skin condition.
Collapse
Affiliation(s)
- Dimitra Kiritsi
- Department of Dermatology, University Medical Center Freiburg, Freiburg, Germany
| | | | | | | | | | | | | | | |
Collapse
|
44
|
Casola C, Zekonyte U, Phillips AD, Cooper DN, Hahn MW. Interlocus gene conversion events introduce deleterious mutations into at least 1% of human genes associated with inherited disease. Genome Res 2012; 22:429-35. [PMID: 22090377 PMCID: PMC3290778 DOI: 10.1101/gr.127738.111] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2011] [Accepted: 11/15/2011] [Indexed: 01/06/2023]
Abstract
Establishing the molecular basis of DNA mutations that cause inherited disease is of fundamental importance to understanding the origin, nature, and clinical sequelae of genetic disorders in humans. The majority of disease-associated mutations constitute single-base substitutions and short deletions and/or insertions resulting from DNA replication errors and the repair of damaged bases. However, pathological mutations can also be introduced by nonreciprocal recombination events between paralogous sequences, a phenomenon known as interlocus gene conversion (IGC). IGC events have thus far been linked to pathology in more than 20 human genes. However, the large number of duplicated gene sequences in the human genome implies that many more disease-associated mutations could originate via IGC. Here, we have used a genome-wide computational approach to identify disease-associated mutations derived from IGC events. Our approach revealed hundreds of known pathological mutations that could have been caused by IGC. Further, we identified several dozen high-confidence cases of inherited disease mutations resulting from IGC in ∼1% of all genes analyzed. About half of the donor sequences associated with such mutations are functional paralogous genes, suggesting that epistatic interactions or differential expression patterns will determine the impact upon fitness of specific substitutions between duplicated genes. In addition, we identified thousands of hitherto undescribed and potentially deleterious mutations that could arise via IGC. Our findings reveal the extent of the impact of interlocus gene conversion upon the spectrum of human inherited disease.
Collapse
Affiliation(s)
- Claudio Casola
- Department of Biology, Indiana University, Bloomington, Indiana 47405, USA
| | - Ugne Zekonyte
- Department of Biology, Indiana University, Bloomington, Indiana 47405, USA
| | - Andrew D. Phillips
- Institute of Medical Genetics, School of Medicine, Cardiff University, Heath Park, Cardiff CF14 4XN, United Kingdom
| | - David N. Cooper
- Institute of Medical Genetics, School of Medicine, Cardiff University, Heath Park, Cardiff CF14 4XN, United Kingdom
| | - Matthew W. Hahn
- Department of Biology, Indiana University, Bloomington, Indiana 47405, USA
- School of Informatics and Computing, Indiana University, Bloomington, Indiana 47405, USA
| |
Collapse
|
45
|
Lu JT, Wang Y, Gibbs RA, Yu F. Characterizing linkage disequilibrium and evaluating imputation power of human genomic insertion-deletion polymorphisms. Genome Biol 2012; 13:R15. [PMID: 22377349 PMCID: PMC3334570 DOI: 10.1186/gb-2012-13-2-r15] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2011] [Revised: 02/14/2012] [Accepted: 02/29/2012] [Indexed: 02/07/2023] Open
Abstract
Background Indels are an important cause of human variation and central to the study of human disease. The 1000 Genomes Project Low-Coverage Pilot identified over 1.3 million indels shorter than 50 bp, of which over 890 were identified as potentially disruptive variants. Yet, despite their ubiquity, the local genomic characteristics of indels remain unexplored. Results Herein we describe population- and minor allele frequency-based differences in linkage disequilibrium and imputation characteristics for indels included in the 1000 Genomes Project Low-Coverage Pilot for the CEU, YRI and CHB+JPT populations. Common indels were well tagged by nearby SNPs in all studied populations, and were also tagged at a similar rate to common SNPs. Both neutral and functionally deleterious common indels were imputed with greater than 95% concordance from HapMap Phase 3 and OMNI SNP sites. Further, 38 to 56% of low frequency indels were tagged by low frequency SNPs. We were able to impute heterozygous low frequency indels with over 50% concordance. Lastly, our analysis also revealed evidence of ascertainment bias. This bias prevents us from extending the applicability of our results to highly polymorphic indels that could not be identified in the Low-Coverage Pilot. Conclusions Although further scope exists to improve the imputation of low frequency indels, our study demonstrates that there are already ample opportunities to retrospectively impute indels for prior genome-wide association studies and to incorporate indel imputation into future case/control studies.
Collapse
Affiliation(s)
- James T Lu
- Human Genome Sequencing Center, Baylor College of Medicine, One Baylor Plaza, Houston, Texas 77030, USA
| | | | | | | |
Collapse
|
46
|
Characterization of a large genomic deletion in four Irish families with C7 deficiency. Mol Immunol 2011; 50:57-9. [PMID: 22206826 DOI: 10.1016/j.molimm.2011.12.002] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2011] [Revised: 12/01/2011] [Accepted: 12/07/2011] [Indexed: 11/22/2022]
Abstract
Inherited deficiency of the seventh complement component (C7) is associated with increased susceptibility to Neisseria meningitidis infections. The disease is rare in most Western countries. Here we report new investigations of a large, but incompletely characterized genomic deletion of exons 8 and 9 [c.739-?_1093+?del], previously identified in three unrelated Irish families with C7 deficiency. We have analysed DNA from one individual, who is homozygous for the deletion, by PCR using primers progressively proximal to the deleted exons. Thus we were able to map the deletion boundaries. Amplification across the breakpoint and sequencing revealed an indel mutation that included a 6.4kb deletion together with an insertion of a novel 8bp sequence [c.739+1262_1270-2387delinsGCAGGCCA]. We demonstrated the same defect in the C7 deficient patients from each family and developed a duplex PCR method to enable the detection of alleles containing the deletion in heterozygotes. A member of a fourth family was found to be homozygous for the deletion defect. Thus, the deletion defect may be a more commonly distributed cause of C7 deficiency in Ireland.
Collapse
|
47
|
Zickler AM, Hampp S, Messiaen L, Bengesser K, Mussotter T, Roehl AC, Wimmer K, Mautner VF, Kluwe L, Upadhyaya M, Pasmant E, Chuzhanova N, Kestler HA, Högel J, Legius E, Claes K, Cooper DN, Kehrer-Sawatzki H. Characterization of the nonallelic homologous recombination hotspot PRS3 associated with type-3 NF1 deletions. Hum Mutat 2011; 33:372-83. [PMID: 22045503 DOI: 10.1002/humu.21644] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2011] [Accepted: 10/06/2011] [Indexed: 12/21/2022]
Abstract
Nonallelic homologous recombination (NAHR) is the major mechanism underlying recurrent genomic rearrangements, including the large deletions at 17q11.2 that cause neurofibromatosis type 1 (NF1). Here, we identify a novel NAHR hotspot, responsible for type-3 NF1 deletions that span 1.0 Mb. Breakpoint clustering within this 1-kb hotspot, termed PRS3, was noted in 10 of 11 known type-3 NF1 deletions. PRS3 is located within the LRRC37B pseudogene of the NF1-REPb and NF1-REPc low-copy repeats. In contrast to other previously characterized NAHR hotspots, PRS3 has not developed on a preexisting allelic homologous recombination hotspot. Furthermore, the variation pattern of PRS3 and its flanking regions is unusual since only NF1-REPc (and not NF1-REPb) is characterized by a high single nucleotide polymorphism (SNP) frequency, suggestive of unidirectional sequence transfer via nonallelic homologous gene conversion (NAHGC). By contrast, the previously described intense NAHR hotspots within the CMT1A-REPs, and the PRS1 and PRS2 hotspots underlying type-1 NF1 deletions, experience frequent bidirectional sequence transfer. PRS3 within NF1-REPc was also found to be involved in NAHGC with the LRRC37B gene, the progenitor locus of the LRRC37B-P duplicons, as indicated by the presence of shared SNPs between these loci. PRS3 therefore represents a weak (and probably evolutionarily rather young) NAHR hotspot with unique properties.
Collapse
Affiliation(s)
- Antje M Zickler
- Institute of Human Genetics, University of Ulm, Albert-Einstein-Allee 11, Ulm, Germany
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
48
|
Sironen A, Uimari P, Venhoranta H, Andersson M, Vilkki J. An exonic insertion within Tex14 gene causes spermatogenic arrest in pigs. BMC Genomics 2011; 12:591. [PMID: 22136159 PMCID: PMC3248578 DOI: 10.1186/1471-2164-12-591] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2011] [Accepted: 12/02/2011] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Male infertility is an increasing problem in all domestic species including man. Localization and identification of genes involved in defects causing male infertility provide valuable information of specific events in sperm development. Sperm development is a complex process, where diploid spermatogonia develop into haploid, highly specialized spermatozoa. Correct expression and function of various genes and their protein products are required for production of fertile sperm. We have identified an infertility defect in Finnish Yorkshire boars caused by spermatogenic arrest. The aim of this study was to locate the disease associated region using genome wide screen with the PorcineSNP60 Beadchip and identify the causal mutation by candidate gene approach. RESULTS In the Finnish Yorkshire pig population the spermatogenic arrest (SA) defect appears to be of genetic origin and causes severe degeneration of germ cells and total absence of spermatozoa. Genome wide scan with the PorcineSNP60 Beadchip localized the SA defect to porcine chromosome 12 in a 2 Mbp region. Sequencing of a candidate gene Tex14 revealed a 51 bp insertion within exon 27, which caused differential splicing of the exon and created a premature translation stop codon. The expression of Tex14 was markedly down regulated in the testis of a SA affected boar compared to control boars and no protein product was identified by Western blotting. The SA insertion sequence was also found within intron 27 in all analyzed animals, thus the insertion appears to be a possible duplication event. CONCLUSION In this study we report the identification of a causal mutation for infertility caused by spermatogenic arrest at an early meiotic phase. Our results highlight the role of TEX14 specifically in spermatogenesis and the importance of specific genomic remodeling events as causes for inherited defects.
Collapse
Affiliation(s)
- Anu Sironen
- Agrifood Research Finland, MTT, Biotechnology and Food Research, Genomics, FI-36100 Jokioinen, Finland.
| | | | | | | | | |
Collapse
|
49
|
Pelleymounter LL, Moon I, Johnson JA, Laederach A, Halvorsen M, Eckloff B, Abo R, Rossetti S. A novel application of pattern recognition for accurate SNP and indel discovery from high-throughput data: targeted resequencing of the glucocorticoid receptor co-chaperone FKBP5 in a Caucasian population. Mol Genet Metab 2011; 104:457-69. [PMID: 21917492 PMCID: PMC3224211 DOI: 10.1016/j.ymgme.2011.08.019] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/24/2011] [Revised: 08/18/2011] [Accepted: 08/18/2011] [Indexed: 11/28/2022]
Abstract
The detection of single nucleotide polymorphisms (SNPs) and insertion/deletions (indels) with precision from high-throughput data remains a significant bioinformatics challenge. Accurate detection is necessary before next-generation sequencing can routinely be used in the clinic. In research, scientific advances are inhibited by gaps in data, exemplified by the underrepresented discovery of rare variants, variants in non-coding regions and indels. The continued presence of false positives and false negatives prevents full automation and requires additional manual verification steps. Our methodology presents applications of both pattern recognition and sensitivity analysis to eliminate false positives and aid in the detection of SNP/indel loci and genotypes from high-throughput data. We chose FK506-binding protein 51(FKBP5) (6p21.31) for our clinical target because of its role in modulating pharmacological responses to physiological and synthetic glucocorticoids and because of the complexity of the genomic region. We detected genetic variation across a 160 kb region encompassing FKBP5. 613 SNPs and 57 indels, including a 3.3 kb deletion were discovered. We validated our method using three independent data sets and, with Sanger sequencing and Affymetrix and Illumina microarrays, achieved 99% concordance. Furthermore we were able to detect 267 novel rare variants and assess linkage disequilibrium. Our results showed both a sensitivity and specificity of 98%, indicating near perfect classification between true and false variants. The process is scalable and amenable to automation, with the downstream filters taking only 1.5h to analyze 96 individuals simultaneously. We provide examples of how our level of precision uncovered the interactions of multiple loci, their predicted influences on mRNA stability, perturbations of the hsp90 binding site, and individual variation in FKBP5 expression. Finally we show how our discovery of rare variants may change current conceptions of evolution at this locus.
Collapse
Affiliation(s)
- Linda L Pelleymounter
- Department of Pharmacology, Department of Pharmacology and Experimental Therapeutics, Mayo Clinic, Rochester, MN 55905, USA.
| | | | | | | | | | | | | | | |
Collapse
|
50
|
Laurie S, Toll-Riera M, Radó-Trilla N, Albà MM. Sequence shortening in the rodent ancestor. Genome Res 2011; 22:478-85. [PMID: 22128134 DOI: 10.1101/gr.121897.111] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Insertions and deletions (indels), together with nucleotide substitutions, are major drivers of sequence evolution. An excess of deletions over insertions in genomic sequences-the so-called deletional bias-has been reported in a wide range of species, including mammals. However, this bias has not been found in the coding sequences of some mammalian species, such as human and mouse. To determine the strength of the deletional bias in mammals, and the influence of mutation and selection, we have quantified indels in both neutrally evolving noncoding sequences and protein-coding sequences, in six mammalian branches: human, macaque, ancestral primate, mouse, rat, and ancestral rodent. The results obtained with an improved algorithm for the placement of insertions in multiple alignments, Prank(+F), indicate that contrary to previous results, the only mammalian branch with a strong deletional bias is the rodent ancestral branch. We estimate that such a bias has resulted in an ~2.5% sequence loss of mammalian syntenic region in the ancestor of the mouse and rat. Further, a comparison of coding and noncoding sequences shows that negative selection is acting more strongly against mutations generating amino acid insertions than against mutations resulting in amino acid deletions. The strength of selection against indels is found to be higher in the rodent branches than in the primate branches, consistent with the larger effective population sizes of the rodents.
Collapse
Affiliation(s)
- Steve Laurie
- Evolutionary Genomics Group, Pompeu Fabra University (UPF) and Municipal Institute of Medical Research (FIMIM), Barcelona, Spain
| | | | | | | |
Collapse
|