1
|
Digby B, Finn S, Ó Broin P. Computational approaches and challenges in the analysis of circRNA data. BMC Genomics 2024; 25:527. [PMID: 38807085 DOI: 10.1186/s12864-024-10420-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2024] [Accepted: 05/15/2024] [Indexed: 05/30/2024] Open
Abstract
Circular RNAs (circRNA) are a class of non-coding RNA, forming a single-stranded covalently closed loop structure generated via back-splicing. Advancements in sequencing methods and technologies in conjunction with algorithmic developments of bioinformatics tools have enabled researchers to characterise the origin and function of circRNAs, with practical applications as a biomarker of diseases becoming increasingly relevant. Computational methods developed for circRNA analysis are predicated on detecting the chimeric back-splice junction of circRNAs whilst mitigating false-positive sequencing artefacts. In this review, we discuss in detail the computational strategies developed for circRNA identification, highlighting a selection of tool strengths, weaknesses and assumptions. In addition to circRNA identification tools, we describe methods for characterising the role of circRNAs within the competing endogenous RNA (ceRNA) network, their interactions with RNA-binding proteins, and publicly available databases for rich circRNA annotation.
Collapse
Affiliation(s)
- Barry Digby
- School of Mathematical and Statistical Sciences, University of Galway, Galway, Ireland.
| | - Stephen Finn
- Discipline of Histopathology, School of Medicine, Trinity College Dublin and Cancer Molecular Diagnostic Laboratory, Dublin, Ireland
| | - Pilib Ó Broin
- School of Mathematical and Statistical Sciences, University of Galway, Galway, Ireland
| |
Collapse
|
2
|
Roca-Ayats N, Maceda I, Bruque CD, Martínez-Gil N, Garcia-Giralt N, Cozar M, Mellibovsky L, Van Hul W, Lao O, Grinberg D, Balcells S. Evolutionary and functional analyses of LRP5 in archaic and extant modern humans. Hum Genomics 2024; 18:53. [PMID: 38802968 DOI: 10.1186/s40246-024-00616-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2024] [Accepted: 05/07/2024] [Indexed: 05/29/2024] Open
Abstract
BACKGROUND The human lineage has undergone a postcranial skeleton gracilization (i.e. lower bone mass and strength relative to body size) compared to other primates and archaic populations such as the Neanderthals. This gracilization has been traditionally explained by differences in the mechanical load that our ancestors exercised. However, there is growing evidence that gracilization could also be genetically influenced. RESULTS We have analyzed the LRP5 gene, which is known to be associated with high bone mineral density conditions, from an evolutionary and functional point of view. Taking advantage of the published genomes of archaic Homo populations, our results suggest that this gene has a complex evolutionary history both between archaic and living humans and within living human populations. In particular, we identified the presence of different selective pressures in archaics and extant modern humans, as well as evidence of positive selection in the African and South East Asian populations from the 1000 Genomes Project. Furthermore, we observed a very limited evidence of archaic introgression in this gene (only at three haplotypes of East Asian ancestry out of the 1000 Genomes), compatible with a general erasing of the fingerprint of archaic introgression due to functional differences in archaics compared to extant modern humans. In agreement with this hypothesis, we observed private mutations in the archaic genomes that we experimentally validated as putatively increasing bone mineral density. In particular, four of five archaic missense mutations affecting the first β-propeller of LRP5 displayed enhanced Wnt pathway activation, of which two also displayed reduced negative regulation. CONCLUSIONS In summary, these data suggest a genetic component contributing to the understanding of skeletal differences between extant modern humans and archaic Homo populations.
Collapse
Affiliation(s)
- Neus Roca-Ayats
- Departament de Genètica, Microbiologia i Estadística and IBUB, Universitat de Barcelona, Barcelona, Spain
- Centro de Investigación Biomédica en Red de Enfermedades Raras (CIBERER) ISCIII, Barcelona, Spain
- Institut de Recerca Sant Joan de Déu (IRSJD), Barcelona, Spain
| | - Iago Maceda
- CNAG, Centre Nacional d'Analisi Genòmic, C/ Baldiri I Reixach 4, 08028, Barcelona, Spain
- Barcelona Institute of Science and Technology (BIST), Barcelona, Spain
- Universitat Pompeu Fabra (UPF), Barcelona, Spain
| | - Carlos David Bruque
- Unidad de Conocimiento Traslacional Hospitalaria Patagónica, Hospital de Alta Complejidad El Calafate - S.A.M.I.C., Santa Cruz, Argentina
| | - Núria Martínez-Gil
- Departament de Genètica, Microbiologia i Estadística and IBUB, Universitat de Barcelona, Barcelona, Spain
- Centro de Investigación Biomédica en Red de Enfermedades Raras (CIBERER) ISCIII, Barcelona, Spain
- Institut de Recerca Sant Joan de Déu (IRSJD), Barcelona, Spain
| | - Natàlia Garcia-Giralt
- Musculoskeletal Research Group, IMIM (Hospital del Mar Medical Research Institute), Centro de Investigación Biomédica en Red en Fragilidad y Envejecimiento Saludable (CIBERFES), ISCIII, Departament de Genètica, Microbiologia i Estadística, UB, Barcelona, Spain
| | - Mónica Cozar
- Departament de Genètica, Microbiologia i Estadística and IBUB, Universitat de Barcelona, Barcelona, Spain
- Centro de Investigación Biomédica en Red de Enfermedades Raras (CIBERER) ISCIII, Barcelona, Spain
- Institut de Recerca Sant Joan de Déu (IRSJD), Barcelona, Spain
| | - Leonardo Mellibovsky
- Musculoskeletal Research Group, IMIM (Hospital del Mar Medical Research Institute), Centro de Investigación Biomédica en Red en Fragilidad y Envejecimiento Saludable (CIBERFES), ISCIII, Barcelona, Spain
| | - Wim Van Hul
- Center of Medical Genetics, University of Antwerp, 2650, Antwerp, Belgium
| | - Oscar Lao
- Institute of Evolutionary Biology, CSIC-Universitat Pompeu Fabra, 08003, Barcelona, Spain.
| | - Daniel Grinberg
- Departament de Genètica, Microbiologia i Estadística and IBUB, Universitat de Barcelona, Barcelona, Spain
- Centro de Investigación Biomédica en Red de Enfermedades Raras (CIBERER) ISCIII, Barcelona, Spain
- Institut de Recerca Sant Joan de Déu (IRSJD), Barcelona, Spain
| | - Susanna Balcells
- Departament de Genètica, Microbiologia i Estadística and IBUB, Universitat de Barcelona, Barcelona, Spain
- Centro de Investigación Biomédica en Red de Enfermedades Raras (CIBERER) ISCIII, Barcelona, Spain
- Institut de Recerca Sant Joan de Déu (IRSJD), Barcelona, Spain
| |
Collapse
|
3
|
Lam HYI, Ong XE, Mutwil M. Large language models in plant biology. TRENDS IN PLANT SCIENCE 2024:S1360-1385(24)00118-3. [PMID: 38797656 DOI: 10.1016/j.tplants.2024.04.013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/30/2024] [Revised: 04/29/2024] [Accepted: 04/30/2024] [Indexed: 05/29/2024]
Abstract
Large language models (LLMs), such as ChatGPT, have taken the world by storm. However, LLMs are not limited to human language and can be used to analyze sequential data, such as DNA, protein, and gene expression. The resulting foundation models can be repurposed to identify the complex patterns within the data, resulting in powerful, multipurpose prediction tools able to predict the state of cellular systems. This review outlines the different types of LLMs and showcases their recent uses in biology. Since LLMs have not yet been embraced by the plant community, we also cover how these models can be deployed for the plant kingdom.
Collapse
Affiliation(s)
- Hilbert Yuen In Lam
- School of Biological Sciences, Nanyang Technological University, 60 Nanyang Drive, Singapore, 637551, Singapore
| | - Xing Er Ong
- School of Biological Sciences, Nanyang Technological University, 60 Nanyang Drive, Singapore, 637551, Singapore
| | - Marek Mutwil
- School of Biological Sciences, Nanyang Technological University, 60 Nanyang Drive, Singapore, 637551, Singapore.
| |
Collapse
|
4
|
Delibes C, Ferré M, Rozet M, Desquiret-Dumas V, Descatha A, Gohier B, Gohier P, Amati-Bonneau P, Milea D, Reynier P. Genetic susceptibility to optic neuropathy in patients with alcohol use disorder. J Transl Med 2024; 22:495. [PMID: 38796496 PMCID: PMC11127293 DOI: 10.1186/s12967-024-05334-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2024] [Accepted: 05/20/2024] [Indexed: 05/28/2024] Open
Abstract
BACKGROUND The pathophysiology of toxico-nutritional optic neuropathies remains debated, with no clear understanding of the respective roles played by the direct alcohol toxicity, smoking and the often associated vitamin deficiencies, which are risk factors for optic neuropathy. Our aim was to investigate genetic susceptibility in patients with bilateral infraclinical optic neuropathy associated with chronic alcohol use disorder. METHODS This retrospective cohort study included 102 visually asymptomatic patients with documented alcohol use disorder from a French reference center. Optic neuropathy was identified with optical coherence tomography (OCT), after which genetic susceptibility in the group of affected patients was investigated. Genetic testing was performed using panel sequencing of 87 nuclear genes and complete mitochondrial DNA sequencing. RESULTS Optic neuropathy was detected in 36% (37/102) of the included patients. Genetic testing of affected patients disclosed two patients (2/30, 6.7%) with optic neuropathy associated with pathogenic variants affecting the SPG7 gene and five patients (5/30, 16.7%) who harbored variants of uncertain significance close to probable pathogenicity in the genes WFS1, LOXL1, MMP19, NR2F1 and PMPCA. No pathogenic mitochondrial DNA variants were found in this group. CONCLUSIONS OCT can detect presence of asymptomatic optic neuropathy in patients with chronic alcohol use disorder. Furthermore, genetic susceptibility to optic neuropathy in this setting is found in almost a quarter of affected patients. Further studies may clarify the role of preventative measures in patients who might be predisposed to avoidable visual loss and blindness.
Collapse
Affiliation(s)
- Camille Delibes
- Département d'Ophtalmologie, Centre Hospitalier Universitaire (CHU), 49000, Angers, France
| | - Marc Ferré
- Université d'Angers, Unité Mixte de Recherche (UMR) MITOVASC, Institut National de la Santé et de la Recherche Médicale (INSERM U-1083), Centre National de la Recherche Scientifique (CNRS 6015), 49000, Angers, France
- Département de Biochimie et Biologie Moléculaire, Centre Hospitalier Universitaire, 49000, Angers, France
| | - Marine Rozet
- Département de Psychiatrie et d'Addictologie, Centre Hospitalier Universitaire, 49000, Angers, France
| | - Valérie Desquiret-Dumas
- Université d'Angers, Unité Mixte de Recherche (UMR) MITOVASC, Institut National de la Santé et de la Recherche Médicale (INSERM U-1083), Centre National de la Recherche Scientifique (CNRS 6015), 49000, Angers, France
- Département de Biochimie et Biologie Moléculaire, Centre Hospitalier Universitaire, 49000, Angers, France
| | - Alexis Descatha
- Univ. Angers (University of Angers), CHU Angers, Univ. Rennes, Inserm, EHESP, IRSET (Institut de Recherche en Santé, Environnement et Travail) - UMR_S 1085, IRSET-ESTER, SFR ICAT, CAPTV CDC, 49000, Angers, France
- Department of Occupational Medicine, Epidemiology and Prevention, Donald and Barbara Zucker School of Medicine, Hosftra University Northwell Health, New York, NY, 11021, USA
| | - Bénédicte Gohier
- Département de Psychiatrie et d'Addictologie, Centre Hospitalier Universitaire, 49000, Angers, France
- Univ Angers, Université de Nantes, LPPL, SFR CONFLUENCES, 49000, Angers, France
| | - Philippe Gohier
- Département d'Ophtalmologie, Centre Hospitalier Universitaire (CHU), 49000, Angers, France
| | - Patrizia Amati-Bonneau
- Université d'Angers, Unité Mixte de Recherche (UMR) MITOVASC, Institut National de la Santé et de la Recherche Médicale (INSERM U-1083), Centre National de la Recherche Scientifique (CNRS 6015), 49000, Angers, France
- Département de Biochimie et Biologie Moléculaire, Centre Hospitalier Universitaire, 49000, Angers, France
| | - Dan Milea
- Département d'Ophtalmologie, Centre Hospitalier Universitaire (CHU), 49000, Angers, France
- Singapore National Eye Centre, Singapore Eye Research Institute, Duke-NUS, Singapore, Singapore
- Rothschild Foundation Hospital, Paris, France
| | - Pascal Reynier
- Université d'Angers, Unité Mixte de Recherche (UMR) MITOVASC, Institut National de la Santé et de la Recherche Médicale (INSERM U-1083), Centre National de la Recherche Scientifique (CNRS 6015), 49000, Angers, France.
- Département de Biochimie et Biologie Moléculaire, Centre Hospitalier Universitaire, 49000, Angers, France.
| |
Collapse
|
5
|
Gallon R, Brekelmans C, Martin M, Bours V, Schamschula E, Amberger A, Muleris M, Colas C, Dekervel J, De Hertogh G, Coupier J, Colleye O, Sepulchre E, Burn J, Brems H, Legius E, Wimmer K. Constitutional mismatch repair deficiency mimicking Lynch syndrome is associated with hypomorphic mismatch repair gene variants. NPJ Precis Oncol 2024; 8:119. [PMID: 38789506 PMCID: PMC11126593 DOI: 10.1038/s41698-024-00603-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2023] [Accepted: 05/08/2024] [Indexed: 05/26/2024] Open
Abstract
Lynch syndrome (LS) and constitutional mismatch repair deficiency (CMMRD) are distinct cancer syndromes caused, respectively, by mono- and bi-allelic germline mismatch repair (MMR) variants. LS predisposes to mainly gastrointestinal and genitourinary cancers in adulthood. CMMRD predisposes to brain, haematological, and LS-spectrum cancers from childhood. Two suspected LS patients with first cancer diagnosis aged 27 or 38 years were found to be homozygous for an MMR (likely) pathogenic variant, MSH6 c.3226C>T (p.(Arg1076Cys)), or variant of uncertain significance (VUS), MLH1 c.306G>A (p.(Glu102=)). MLH1 c.306G>A was shown to cause leaky exon 3 skipping. The apparent genotype-phenotype conflict was resolved by detection of constitutional microsatellite instability in both patients, a hallmark feature of CMMRD. A hypomorphic effect of these and other variants found in additional late onset CMMRD cases, identified by literature review, likely explains a LS-like phenotype. CMMRD testing in carriers of compound heterozygous or homozygous MMR VUS may find similar cases and novel hypomorphic variants. Individualised management of mono- and bi-allelic carriers of hypomorphic MMR variants is needed until we better characterise the associated phenotypes.
Collapse
Affiliation(s)
- Richard Gallon
- Translational and Clinical Research Institute, Faculty of Medical Sciences, Newcastle University, Newcastle upon Tyne, UK.
| | | | | | | | - Esther Schamschula
- Institute of Human Genetics, Medical University of Innsbruck, Innsbruck, Austria
| | - Albert Amberger
- Institute of Human Genetics, Medical University of Innsbruck, Innsbruck, Austria
| | - Martine Muleris
- Département de Génétique, AP-HP.Sorbonne Université, Hôpital Pitié-Salpêtrière, Paris, France
- Inserm UMRS_938, Sorbonne Université, Centre de Recherche Saint Antoine, Paris, France
| | - Chrystelle Colas
- Département de Génétique, Institut Curie, Paris, France
- INSERM U830, Université de Paris, Paris, France
| | - Jeroen Dekervel
- Department of Digestive Oncology, University Hospital Leuven, Leuven, Belgium
| | - Gert De Hertogh
- Department of Pathology, University Hospital Leuven, Leuven, Belgium
| | | | | | | | - John Burn
- Translational and Clinical Research Institute, Faculty of Medical Sciences, Newcastle University, Newcastle upon Tyne, UK
| | - Hilde Brems
- Centre for Human Genetics, University Hospital Leuven, Leuven, Belgium
| | - Eric Legius
- Centre for Human Genetics, University Hospital Leuven, Leuven, Belgium
| | - Katharina Wimmer
- Institute of Human Genetics, Medical University of Innsbruck, Innsbruck, Austria.
| |
Collapse
|
6
|
Nakayama D, Makino T. Convergent accelerated evolution of mammal-specific conserved non-coding elements in hibernators. Sci Rep 2024; 14:11754. [PMID: 38782990 PMCID: PMC11116591 DOI: 10.1038/s41598-024-62455-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2023] [Accepted: 05/16/2024] [Indexed: 05/25/2024] Open
Abstract
Mammals maintain their body temperature, yet hibernators can temporarily lower their metabolic rate as an energy-saving strategy. It has been proposed that hibernators evolved independently from homeotherms, and it is possible that the convergent evolution of hibernation involved common genomic changes among hibernator-lineages. Since hibernation is a seasonal trait, the evolution of gene regulatory regions in response to changes in season may have been important for the acquisition of hibernation traits. High-frequency accumulation of mutations in conserved non-coding elements (CNEs) could, in principle, alter the expression of neighboring genes and thereby contribute to the acquisition of new traits. To address this possibility, we performed a comparative genomic analysis of mammals to identify accelerated CNEs commonly associated with hibernation. We found that accelerated CNEs are common to hibernator-lineages and could be involved with hibernation. We also found that common factors of genes that located near accelerated CNEs and are differentially expressed between normal and hibernation periods related to gene regulation and cell-fate determination. It suggests that the molecular mechanisms controlling hibernation have undergone convergent evolution. These results help broaden our understanding of the genetic adaptations that facilitated hibernation in mammals and may offer insights pertaining to stress responses and energy conservation.
Collapse
Affiliation(s)
- Daiki Nakayama
- Department of Biology, Faculty of Science, Tohoku University, 6-3, Aramaki Aza Aoba, Aoba-Ku, Sendai, 980-8578, Japan
| | - Takashi Makino
- Department of Biology, Faculty of Science, Tohoku University, 6-3, Aramaki Aza Aoba, Aoba-Ku, Sendai, 980-8578, Japan.
- Graduate School of Life Sciences, Tohoku University, 6-3, Aramaki Aza Aoba, Aoba-Ku, Sendai, 980-8578, Japan.
| |
Collapse
|
7
|
Li G, Wu J, Wang X. Predicting functional UTR variants by integrating region-specific features. Brief Bioinform 2024; 25:bbae248. [PMID: 38783704 PMCID: PMC11116830 DOI: 10.1093/bib/bbae248] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2024] [Revised: 03/30/2024] [Accepted: 05/08/2024] [Indexed: 05/25/2024] Open
Abstract
The untranslated region (UTR) of messenger ribonucleic acid (mRNA), including the 5'UTR and 3'UTR, plays a critical role in regulating gene expression and translation. Variants within the UTR can lead to changes associated with human traits and diseases; however, computational prediction of UTR variant effect is challenging. Current noncoding variant prediction mainly focuses on the promoters and enhancers, neglecting the unique sequence of the UTR and thereby limiting their predictive accuracy. In this study, using consolidated datasets of UTR variants from disease databases and large-scale experimental data, we systematically analyzed more than 50 region-specific features of UTR, including functional elements, secondary structure, sequence composition and site conservation. Our analysis reveals that certain features, such as C/G-related sequence composition in 5'UTR and A/T-related sequence composition in 3'UTR, effectively differentiate between nonfunctional and functional variant sets, unveiling potential sequence determinants of functional UTR variants. Leveraging these insights, we developed two classification models to predict functional UTR variants using machine learning, achieving an area under the curve (AUC) value of 0.94 for 5'UTR and 0.85 for 3'UTR, outperforming all existing methods. Our models will be valuable for enhancing clinical interpretation of genetic variants, facilitating the prediction and management of disease risk.
Collapse
Affiliation(s)
- Guangyu Li
- State Key Laboratory of Common Mechanism Research for Major Diseases; Center for bioinformatics, National Infrastructures for Translational Medicine, Institute of Clinical Medicine and Peking Union Medical College Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, 1 Shuai Fu Yuan, Dongcheng District, Beijing 100005, China
| | - Jiayu Wu
- State Key Laboratory of Common Mechanism Research for Major Diseases; Center for bioinformatics, National Infrastructures for Translational Medicine, Institute of Clinical Medicine and Peking Union Medical College Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, 1 Shuai Fu Yuan, Dongcheng District, Beijing 100005, China
| | - Xiaoyue Wang
- State Key Laboratory of Common Mechanism Research for Major Diseases; Center for bioinformatics, National Infrastructures for Translational Medicine, Institute of Clinical Medicine and Peking Union Medical College Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, 1 Shuai Fu Yuan, Dongcheng District, Beijing 100005, China
| |
Collapse
|
8
|
Silva DB, Trinidad M, Ljungdahl A, Revalde JL, Berguig GY, Wallace W, Patrick CS, Bomba L, Arkin M, Dong S, Estrada K, Hutchinson K, LeBowitz JH, Schlessinger A, Johannesen KM, Møller RS, Giacomini KM, Froelich S, Sanders SJ, Wuster A. Haploinsufficiency underlies the neurodevelopmental consequences of SLC6A1 variants. Am J Hum Genet 2024:S0002-9297(24)00162-9. [PMID: 38781976 DOI: 10.1016/j.ajhg.2024.04.021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2023] [Revised: 04/26/2024] [Accepted: 04/26/2024] [Indexed: 05/25/2024] Open
Abstract
Heterozygous variants in SLC6A1, encoding the GAT-1 GABA transporter, are associated with seizures, developmental delay, and autism. The majority of affected individuals carry missense variants, many of which are recurrent germline de novo mutations, raising the possibility of gain-of-function or dominant-negative effects. To understand the functional consequences, we performed an in vitro GABA uptake assay for 213 unique variants, including 24 control variants. De novo variants consistently resulted in a decrease in GABA uptake, in keeping with haploinsufficiency underlying all neurodevelopmental phenotypes. Where present, ClinVar pathogenicity reports correlated well with GABA uptake data; the functional data can inform future reports for the remaining 72% of unscored variants. Surface localization was assessed for 86 variants; two-thirds of loss-of-function missense variants prevented GAT-1 from being present on the membrane while GAT-1 was on the surface but with reduced activity for the remaining third. Surprisingly, recurrent de novo missense variants showed moderate loss-of-function effects that reduced GABA uptake with no evidence for dominant-negative or gain-of-function effects. Using linear regression across multiple missense severity scores to extrapolate the functional data to all potential SLC6A1 missense variants, we observe an abundance of GAT-1 residues that are sensitive to substitution. The extent of this missense vulnerability accounts for the clinically observed missense enrichment; overlap with hypermutable CpG sites accounts for the recurrent missense variants. Strategies to increase the expression of the wild-type SLC6A1 allele are likely to be beneficial across neurodevelopmental disorders, though the developmental stage and extent of required rescue remain unknown.
Collapse
Affiliation(s)
- Dina Buitrago Silva
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA, USA
| | - Marena Trinidad
- BioMarin Pharmaceutical Inc., Novato, CA, USA; Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, USA; Innovative Genomics Institute, University of California, Berkeley, Berkeley, CA, USA
| | - Alicia Ljungdahl
- Department of Psychiatry, UCSF Weill Institute for Neurosciences, University of California, San Francisco, San Francisco, CA, USA; Institute of Developmental and Regenerative Medicine, Department of Paediatrics, University of Oxford, Oxford OX3 7TY, UK
| | - Jezrael L Revalde
- Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, CA, USA
| | | | | | - Cory S Patrick
- Department of Psychiatry, UCSF Weill Institute for Neurosciences, University of California, San Francisco, San Francisco, CA, USA
| | | | - Michelle Arkin
- Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, CA, USA
| | - Shan Dong
- Department of Psychiatry, UCSF Weill Institute for Neurosciences, University of California, San Francisco, San Francisco, CA, USA
| | | | - Keino Hutchinson
- Department of Pharmacological Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | | | - Avner Schlessinger
- Department of Pharmacological Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Katrine M Johannesen
- Department of Regional Health Research, Faculty of Health Sciences, University of Southern Denmark, Odense, Denmark
| | - Rikke S Møller
- Department of Regional Health Research, Faculty of Health Sciences, University of Southern Denmark, Odense, Denmark; Department of Epilepsy Genetics and Personalized Medicine, Member of ERN Epicare, Danish Epilepsy Centre, Dianalund, Denmark
| | - Kathleen M Giacomini
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA, USA
| | | | - Stephan J Sanders
- Department of Psychiatry, UCSF Weill Institute for Neurosciences, University of California, San Francisco, San Francisco, CA, USA; Institute of Developmental and Regenerative Medicine, Department of Paediatrics, University of Oxford, Oxford OX3 7TY, UK.
| | | |
Collapse
|
9
|
Brock DC, Wang M, Hussain HMJ, Rauch DE, Marra M, Pennesi ME, Yang P, Everett L, Ajlan RS, Colbert J, Porto FBO, Matynia A, Gorin MB, Koenekoop RK, Lopez I, Sui R, Zou G, Li Y, Chen R. Comparative analysis of in-silico tools in identifying pathogenic variants in dominant inherited retinal diseases. Hum Mol Genet 2024; 33:945-957. [PMID: 38453143 PMCID: PMC11102593 DOI: 10.1093/hmg/ddae028] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2024] [Revised: 02/16/2024] [Accepted: 02/19/2024] [Indexed: 03/09/2024] Open
Abstract
Inherited retinal diseases (IRDs) are a group of rare genetic eye conditions that cause blindness. Despite progress in identifying genes associated with IRDs, improvements are necessary for classifying rare autosomal dominant (AD) disorders. AD diseases are highly heterogenous, with causal variants being restricted to specific amino acid changes within certain protein domains, making AD conditions difficult to classify. Here, we aim to determine the top-performing in-silico tools for predicting the pathogenicity of AD IRD variants. We annotated variants from ClinVar and benchmarked 39 variant classifier tools on IRD genes, split by inheritance pattern. Using area-under-the-curve (AUC) analysis, we determined the top-performing tools and defined thresholds for variant pathogenicity. Top-performing tools were assessed using genome sequencing on a cohort of participants with IRDs of unknown etiology. MutScore achieved the highest accuracy within AD genes, yielding an AUC of 0.969. When filtering for AD gain-of-function and dominant negative variants, BayesDel had the highest accuracy with an AUC of 0.997. Five participants with variants in NR2E3, RHO, GUCA1A, and GUCY2D were confirmed to have dominantly inherited disease based on pedigree, phenotype, and segregation analysis. We identified two uncharacterized variants in GUCA1A (c.428T>A, p.Ile143Thr) and RHO (c.631C>G, p.His211Asp) in three participants. Our findings support using a multi-classifier approach comprised of new missense classifier tools to identify pathogenic variants in participants with AD IRDs. Our results provide a foundation for improved genetic diagnosis for people with IRDs.
Collapse
Affiliation(s)
- Daniel C Brock
- Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, United States
- Medical Scientist Training Program, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, United States
| | - Meng Wang
- Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, United States
| | - Hafiz Muhammad Jafar Hussain
- Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, United States
| | - David E Rauch
- Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, United States
| | - Molly Marra
- Department of Ophthalmology, Casey Eye Institute, Oregon Health & Science University, 515 SW Campus Drive, Portland, OR 97239, United States
| | - Mark E Pennesi
- Department of Ophthalmology, Casey Eye Institute, Oregon Health & Science University, 515 SW Campus Drive, Portland, OR 97239, United States
| | - Paul Yang
- Department of Ophthalmology, Casey Eye Institute, Oregon Health & Science University, 515 SW Campus Drive, Portland, OR 97239, United States
| | - Lesley Everett
- Department of Ophthalmology, Casey Eye Institute, Oregon Health & Science University, 515 SW Campus Drive, Portland, OR 97239, United States
| | - Radwan S Ajlan
- Department of Ophthalmology, University of Kansas School of Medicine, 3901 Rainbow Blvd, Kansas City, KS 66160, United States
| | - Jason Colbert
- Department of Ophthalmology, University of Kansas School of Medicine, 3901 Rainbow Blvd, Kansas City, KS 66160, United States
| | - Fernanda Belga Ottoni Porto
- INRET Clínica e Centro de Pesquisa, Rua dos Otoni, 735/507 - Santa Efigênia, Belo Horizonte, MG 30150270, Brazil
- Department of Ophthalmology, Santa Casa de Misericórdia de Belo Horizonte, Av. Francisco Sales, 1111 - Santa Efigênia, Belo Horizonte, MG 30150221, Brazil
- Centro Oftalmológico de Minas Gerais, R. Santa Catarina, 941 - Lourdes, Belo Horizonte, MG 30180070, Brazil
| | - Anna Matynia
- College of Optometry, University of Houston, 4401 Martin Luther King Boulevard, Houston, TX 77004, United States
| | - Michael B Gorin
- Jules Stein Eye Institute, University of California Los Angeles, 100 Stein Plaza, Los Angeles, CA 90095, United States
- Department of Ophthalmology, University of California Los Angeles David Geffen School of Medicine, 10833 Le Conte Ave, Los Angeles, CA 90095, United States
| | - Robert K Koenekoop
- McGill Ocular Genetics Laboratory and Centre, Department of Paediatric Surgery, Human Genetics, and Ophthalmology, McGill University Health Centre, 5252 Boul de Maisonneuve ouest, Montreal, QC H4A 3S5, Canada
| | - Irma Lopez
- McGill Ocular Genetics Laboratory and Centre, Department of Paediatric Surgery, Human Genetics, and Ophthalmology, McGill University Health Centre, 5252 Boul de Maisonneuve ouest, Montreal, QC H4A 3S5, Canada
| | - Ruifang Sui
- Department of Ophthalmology, Peking Union Medical College Hospital, Peking Union Medical College, Chinese Academy of Medical Sciences, WC67+HW Dongcheng, Beijing 100005, China
| | - Gang Zou
- Department of Ophthalmology, Ningxia Eye Hospital, People's Hospital of Ningxia Hui Autonomous Region, First Affiliated Hospital of Northwest University for Nationalities, Ningxia Clinical Research Center on Diseases of Blindness in Eye, F4RJ+43 Xixia District, Yinchuan, Ningxia, China
| | - Yumei Li
- Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, United States
- Human Genome Sequencing Center, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, United States
| | - Rui Chen
- Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, United States
- Human Genome Sequencing Center, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, United States
| |
Collapse
|
10
|
Xiang G, He X, Giardine BM, Isaac KJ, Taylor DJ, McCoy RC, Jansen C, Keller CA, Wixom AQ, Cockburn A, Miller A, Qi Q, He Y, Li Y, Lichtenberg J, Heuston EF, Anderson SM, Luan J, Vermunt MW, Yue F, Sauria MEG, Schatz MC, Taylor J, Gottgens B, Hughes JR, Higgs DR, Weiss MJ, Cheng Y, Blobel GA, Bodine DM, Zhang Y, Li Q, Mahony S, Hardison RC. Interspecies regulatory landscapes and elements revealed by novel joint systematic integration of human and mouse blood cell epigenomes. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.04.02.535219. [PMID: 37066352 PMCID: PMC10103973 DOI: 10.1101/2023.04.02.535219] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
Knowledge of locations and activities of cis-regulatory elements (CREs) is needed to decipher basic mechanisms of gene regulation and to understand the impact of genetic variants on complex traits. Previous studies identified candidate CREs (cCREs) using epigenetic features in one species, making comparisons difficult between species. In contrast, we conducted an interspecies study defining epigenetic states and identifying cCREs in blood cell types to generate regulatory maps that are comparable between species, using integrative modeling of eight epigenetic features jointly in human and mouse in our Validated Systematic Integration (VISION) Project. The resulting catalogs of cCREs are useful resources for further studies of gene regulation in blood cells, indicated by high overlap with known functional elements and strong enrichment for human genetic variants associated with blood cell phenotypes. The contribution of each epigenetic state in cCREs to gene regulation, inferred from a multivariate regression, was used to estimate epigenetic state Regulatory Potential (esRP) scores for each cCRE in each cell type, which were used to categorize dynamic changes in cCREs. Groups of cCREs displaying similar patterns of regulatory activity in human and mouse cell types, obtained by joint clustering on esRP scores, harbored distinctive transcription factor binding motifs that were similar between species. An interspecies comparison of cCREs revealed both conserved and species-specific patterns of epigenetic evolution. Finally, we showed that comparisons of the epigenetic landscape between species can reveal elements with similar roles in regulation, even in the absence of genomic sequence alignment.
Collapse
|
11
|
Rossen J, Shi H, Strober BJ, Zhang MJ, Kanai M, McCaw ZR, Liang L, Weissbrod O, Price AL. MultiSuSiE improves multi-ancestry fine-mapping in All of Us whole-genome sequencing data. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.05.13.24307291. [PMID: 38798542 PMCID: PMC11118590 DOI: 10.1101/2024.05.13.24307291] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/29/2024]
Abstract
Leveraging data from multiple ancestries can greatly improve fine-mapping power due to differences in linkage disequilibrium and allele frequencies. We propose MultiSuSiE, an extension of the sum of single effects model (SuSiE) to multiple ancestries that allows causal effect sizes to vary across ancestries based on a multivariate normal prior informed by empirical data. We evaluated MultiSuSiE via simulations and analyses of 14 quantitative traits leveraging whole-genome sequencing data in 47k African-ancestry and 94k European-ancestry individuals from All of Us. In simulations, MultiSuSiE applied to Afr47k+Eur47k was well-calibrated and attained higher power than SuSiE applied to Eur94k; interestingly, higher causal variant PIPs in Afr47k compared to Eur47k were entirely explained by differences in the extent of LD quantified by LD 4th moments. Compared to very recently proposed multi-ancestry fine-mapping methods, MultiSuSiE attained higher power and/or much lower computational costs, making the analysis of large-scale All of Us data feasible. In real trait analyses, MultiSuSiE applied to Afr47k+Eur94k identified 579 fine-mapped variants with PIP > 0.5, and MultiSuSiE applied to Afr47k+Eur47k identified 44% more fine-mapped variants with PIP > 0.5 than SuSiE applied to Eur94k. We validated MultiSuSiE results for real traits via functional enrichment of fine-mapped variants. We highlight several examples where MultiSuSiE implicates well-studied or biologically plausible fine-mapped variants that were not implicated by other methods.
Collapse
|
12
|
Copeland I, Wonkam-Tingang E, Gupta-Malhotra M, Hashmi SS, Han Y, Jajoo A, Hall NJ, Hernandez PP, Lie N, Liu D, Xu J, Rosenfeld J, Haldipur A, Desire Z, Coban-Akdemir ZH, Scott DA, Li Q, Chao HT, Zaske AM, Lupski JR, Milewicz DM, Shete S, Posey JE, Hanchard NA. Exome sequencing implicates ancestry-related Mendelian variation at SYNE1 in childhood-onset essential hypertension. JCI Insight 2024; 9:e172152. [PMID: 38716726 DOI: 10.1172/jci.insight.172152] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2023] [Accepted: 03/19/2024] [Indexed: 05/12/2024] Open
Abstract
Childhood-onset essential hypertension (COEH) is an uncommon form of hypertension that manifests in childhood or adolescence and, in the United States, disproportionately affects children of African ancestry. The etiology of COEH is unknown, but its childhood onset, low prevalence, high heritability, and skewed ancestral demography suggest the potential to identify rare genetic variation segregating in a Mendelian manner among affected individuals and thereby implicate genes important to disease pathogenesis. However, no COEH genes have been reported to date. Here, we identify recessive segregation of rare and putatively damaging missense variation in the spectrin domain of spectrin repeat containing nuclear envelope protein 1 (SYNE1), a cardiovascular candidate gene, in 3 of 16 families with early-onset COEH without an antecedent family history. By leveraging exome sequence data from an additional 48 COEH families, 1,700 in-house trios, and publicly available data sets, we demonstrate that compound heterozygous SYNE1 variation in these COEH individuals occurred more often than expected by chance and that this class of biallelic rare variation was significantly enriched among individuals of African genetic ancestry. Using in vitro shRNA knockdown of SYNE1, we show that reduced SYNE1 expression resulted in a substantial decrease in the elasticity of smooth muscle vascular cells that could be rescued by pharmacological inhibition of the downstream RhoA/Rho-associated protein kinase pathway. These results provide insights into the molecular genetics and underlying pathophysiology of COEH and suggest a role for precision therapeutics in the future.
Collapse
Affiliation(s)
- Ian Copeland
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA
| | - Edmond Wonkam-Tingang
- Childhood Complex Disease Genomics Section, National Human Genome Research Institute, NIH, Bethesda, USA
| | | | - S Shahrukh Hashmi
- Department of Pediatrics, McGovern Medical School, The University of Texas Health Science Center at Houston, Houston, Texas, USA
| | - Yixing Han
- Childhood Complex Disease Genomics Section, National Human Genome Research Institute, NIH, Bethesda, USA
| | - Aarti Jajoo
- Childhood Complex Disease Genomics Section, National Human Genome Research Institute, NIH, Bethesda, USA
| | - Nancy J Hall
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA
- US Department of Agriculture Agricultural Research Service Children's Nutrition Research Center, Baylor College of Medicine, Houston, Texas, USA
| | - Paula P Hernandez
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA
- US Department of Agriculture Agricultural Research Service Children's Nutrition Research Center, Baylor College of Medicine, Houston, Texas, USA
| | - Natasha Lie
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA
- Childhood Complex Disease Genomics Section, National Human Genome Research Institute, NIH, Bethesda, USA
- US Department of Agriculture Agricultural Research Service Children's Nutrition Research Center, Baylor College of Medicine, Houston, Texas, USA
| | - Dan Liu
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA
| | - Jun Xu
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA
| | - Jill Rosenfeld
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA
- Baylor Genetics, Houston, Texas, USA
| | - Aparna Haldipur
- Childhood Complex Disease Genomics Section, National Human Genome Research Institute, NIH, Bethesda, USA
| | - Zelene Desire
- Childhood Complex Disease Genomics Section, National Human Genome Research Institute, NIH, Bethesda, USA
| | - Zeynep H Coban-Akdemir
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA
- Human Genetics Center, The University of Texas Health Science Center at Houston, Houston, Texas, USA
| | - Daryl A Scott
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA
- Texas Children's Hospital, Houston, Texas, USA
- Department of Molecular Physiology and Biophysics
| | - Qing Li
- Childhood Complex Disease Genomics Section, National Human Genome Research Institute, NIH, Bethesda, USA
| | - Hsiao-Tuan Chao
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA
- Division of Neurology and Developmental Neuroscience, Department of Pediatrics; and
- Department of Neuroscience, Baylor College of Medicine, Houston, Texas, USA
- Cain Pediatric Neurology Research Foundation Laboratories, Jan and Dan Duncan Neurological Research Institute, Texas Children's Hospital and Baylor College of Medicine, Houston, Texas, USA
- McNair Medical Institute, The Robert and Janice McNair Foundation, Houston, Texas, USA
| | - Ana M Zaske
- Department of Pediatrics, McGovern Medical School, The University of Texas Health Science Center at Houston, Houston, Texas, USA
| | - James R Lupski
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA
- Texas Children's Hospital, Houston, Texas, USA
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, USA
| | - Dianna M Milewicz
- Department of Internal Medicine, McGovern Medical School, The University of Texas Health Science Center at Houston, Houston, Texas, USA
| | - Sanjay Shete
- The University of Texas MD Anderson Cancer Center, Houston, Texas, USA
| | - Jennifer E Posey
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA
- McNair Medical Institute, The Robert and Janice McNair Foundation, Houston, Texas, USA
| | - Neil A Hanchard
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA
- Childhood Complex Disease Genomics Section, National Human Genome Research Institute, NIH, Bethesda, USA
| |
Collapse
|
13
|
Zhang L, Lee M, Hao X, Ehlert J, Chi Z, Jin B, Maslov AY, Barabási AL, Hoeijmakers JHJ, Edelmann W, Vijg J, Dong X. Negative Selection Allows DNA Mismatch Repair-Deficient Mouse Fibroblasts In Vitro to Tolerate High Levels of Somatic Mutations. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.05.04.592535. [PMID: 38766154 PMCID: PMC11100588 DOI: 10.1101/2024.05.04.592535] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/22/2024]
Abstract
Substantial numbers of somatic mutations have been found to accumulate with age in different human tissues. Clonal cellular amplification of some of these mutations can cause cancer and other diseases. However, it is as yet unclear if and to what extent an increased burden of random mutations can affect cellular function without clonal amplification. We tested this in cell culture, which avoids the limitation that an increased mutation burden in vivo typically leads to cancer. We performed single-cell whole-genome sequencing of primary fibroblasts from DNA mismatch repair (MMR) deficient Msh2 -/- mice and littermate control animals after long-term passaging. Apart from analyzing somatic mutation burden we analyzed clonality, mutational signatures, and hotspots in the genome, characterizing the complete landscape of somatic mutagenesis in normal and MMR-deficient mouse primary fibroblasts during passaging. While growth rate of Msh2 -/- fibroblasts was not significantly different from the controls, the number of de novo single-nucleotide variants (SNVs) increased linearly up until at least 30,000 SNVs per cell, with the frequency of small insertions and deletions (INDELs) plateauing in the Msh2 -/- fibroblasts to about 10,000 INDELS per cell. We provide evidence for negative selection and large-scale mutation-driven population changes, including significant clonal expansion of preexisting mutations and widespread cell-strain-specific hotspots. Overall, our results provide evidence that increased somatic mutation burden drives significant cell evolutionary changes in a dynamic cell culture system without significant effects on growth. Since similar selection processes against mutations preventing organ and tissue dysfunction during aging are difficult to envision, these results suggest that increased somatic mutation burden can play a causal role in aging and diseases other than cancer.
Collapse
|
14
|
Hayden AN, Brandel KL, Merlau PR, Vijayakumar P, Leptich EJ, Pietryk EW, Gaytan ES, Ni CW, Chao HT, Rosenfeld JA, Arey RN. Behavioral screening of conserved RNA-binding proteins reveals CEY-1/YBX RNA-binding protein dysfunction leads to impairments in memory and cognition. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.01.05.574402. [PMID: 38260399 PMCID: PMC10802296 DOI: 10.1101/2024.01.05.574402] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/24/2024]
Abstract
RNA-binding proteins (RBPs) regulate translation and plasticity which are required for memory. RBP dysfunction has been linked to a range of neurological disorders where cognitive impairments are a key symptom. However, of the 2,000 RBPs in the human genome, many are uncharacterized with regards to neurological phenotypes. To address this, we used the model organism C. elegans to assess the role of 20 conserved RBPs in memory. We identified eight previously uncharacterized memory regulators, three of which are in the C. elegans Y-Box (CEY) RBP family. Of these, we determined that cey-1 is the closest ortholog to the mammalian Y-Box (YBX) RBPs. We found that CEY-1 is both necessary in the nervous system for memory ability and sufficient to increase memory. Leveraging human datasets, we found both copy number variation losses and single nucleotide variants in YBX1 and YBX3 in individuals with neurological symptoms. We identified one predicted deleterious YBX3 variant of unknown significance, p.Asn127Tyr, in two individuals with neurological symptoms. Introducing this variant into endogenous cey-1 locus caused memory deficits in the worm. We further generated two humanized worm lines expressing human YBX3 or YBX1 at the cey-1 locus to test evolutionary conservation of YBXs in memory and the potential functional significance of the p.Asn127Tyr variant. Both YBX1/3 can functionally replace cey-1, and introduction of p.Asn127Tyr into the humanized YBX3 locus caused memory deficits. Our study highlights the worm as a model to reveal memory regulators and identifies YBX dysfunction as a potential new source of rare neurological disease.
Collapse
Affiliation(s)
- Ashley N Hayden
- Department of Neuroscience, Baylor College of Medicine, Houston, TX 77030
- Center for Precision Environmental Health, Baylor College of Medicine, Houston, TX, 77030
| | - Katie L Brandel
- Department of Neuroscience, Baylor College of Medicine, Houston, TX 77030
- Center for Precision Environmental Health, Baylor College of Medicine, Houston, TX, 77030
| | - Paul R Merlau
- Center for Precision Environmental Health, Baylor College of Medicine, Houston, TX, 77030
| | | | - Emily J Leptich
- Department of Neuroscience, Baylor College of Medicine, Houston, TX 77030
- Center for Precision Environmental Health, Baylor College of Medicine, Houston, TX, 77030
| | - Edward W Pietryk
- Center for Precision Environmental Health, Baylor College of Medicine, Houston, TX, 77030
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030
| | - Elizabeth S Gaytan
- Center for Precision Environmental Health, Baylor College of Medicine, Houston, TX, 77030
- Postbaccalaureate Research Education Program, Baylor College of Medicine, Houston, TX, 77030
| | - Connie W Ni
- Center for Precision Environmental Health, Baylor College of Medicine, Houston, TX, 77030
- Department of Neuroscience, Rice University, Houston, TX 77005
| | - Hsiao-Tuan Chao
- Department of Neuroscience, Baylor College of Medicine, Houston, TX 77030
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030
- Department of Pediatrics, Division of Neurology and Developmental Neuroscience, Baylor College of Medicine, Houston, TX, 77030
- Cain Pediatric Neurology Research Foundation Laboratories, Jan and Dan Duncan Neurological Research Institute, Texas Children’s Hospital, Houston, TX, 77030
- McNair Medical Institute, The Robert and Janice McNair Foundation, Houston, TX, 77030
| | - Jill A Rosenfeld
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030
- Baylor Genetics Laboratories, Houston, TX 77021
| | - Rachel N Arey
- Center for Precision Environmental Health, Baylor College of Medicine, Houston, TX, 77030
- Department of Molecular and Cellular Biology, Baylor College of Medicine, Houston, TX, 77030
| |
Collapse
|
15
|
Hadar N, Dolgin V, Oustinov K, Yogev Y, Poleg T, Safran A, Freund O, Agam N, Jean MM, Proskorovski-Ohayon R, Wormser O, Drabkin M, Halperin D, Eskin-Schwartz M, Narkis G, Sued-Hendrickson S, Aminov I, Gombosh M, Aharoni S, Birk OS. VARista: a free web platform for streamlined whole-genome variant analysis across T2T, hg38, and hg19. Hum Genet 2024; 143:695-701. [PMID: 38607411 DOI: 10.1007/s00439-024-02671-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2023] [Accepted: 03/24/2024] [Indexed: 04/13/2024]
Abstract
With the increasing importance of genomic data in understanding genetic diseases, there is an essential need for efficient and user-friendly tools that simplify variant analysis. Although multiple tools exist, many present barriers such as steep learning curves, limited reference genome compatibility, or costs. We developed VARista, a free web-based tool, to address these challenges and provide a streamlined solution for researchers, particularly those focusing on rare monogenic diseases. VARista offers a user-centric interface that eliminates much of the technical complexity typically associated with variant analysis. The tool directly supports VCF files generated using reference genomes hg19, hg38, and the emerging T2T, with seamless remapping capabilities between them. Features such as gene summaries and links, tissue and cell-specific gene expression data for both adults and fetuses, as well as automated PCR design and integration with tools such as SpliceAI and AlphaMissense, enable users to focus on the biology and the case itself. As we demonstrate, VARista proved effective in narrowing down potential disease-causing variants, prioritizing them effectively, and providing meaningful biological context, facilitating rapid decision-making. VARista stands out as a freely available and comprehensive tool that consolidates various aspects of variant analysis into a single platform that embraces the forefront of genomic advancements. Its design inherently supports a shift in focus from technicalities to critical thinking, thereby promoting better-informed decisions in genetic disease research. Given its unique capabilities and user-centric design, VARista has the potential to become an essential asset for the genomic research community. https://VARista.link.
Collapse
Affiliation(s)
- Noam Hadar
- The Morris Kahn Laboratory of Human Genetics at the National Institute of Biotechnology in the Negev and Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva, Israel
| | - Vadim Dolgin
- The Morris Kahn Laboratory of Human Genetics at the National Institute of Biotechnology in the Negev and Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva, Israel
| | - Katya Oustinov
- The Morris Kahn Laboratory of Human Genetics at the National Institute of Biotechnology in the Negev and Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva, Israel
| | - Yuval Yogev
- The Morris Kahn Laboratory of Human Genetics at the National Institute of Biotechnology in the Negev and Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva, Israel
| | - Tomer Poleg
- The Morris Kahn Laboratory of Human Genetics at the National Institute of Biotechnology in the Negev and Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva, Israel
| | - Amit Safran
- The Morris Kahn Laboratory of Human Genetics at the National Institute of Biotechnology in the Negev and Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva, Israel
| | - Ofek Freund
- The Morris Kahn Laboratory of Human Genetics at the National Institute of Biotechnology in the Negev and Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva, Israel
| | - Nadav Agam
- The Morris Kahn Laboratory of Human Genetics at the National Institute of Biotechnology in the Negev and Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva, Israel
| | - Matan M Jean
- The Morris Kahn Laboratory of Human Genetics at the National Institute of Biotechnology in the Negev and Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva, Israel
| | - Regina Proskorovski-Ohayon
- The Morris Kahn Laboratory of Human Genetics at the National Institute of Biotechnology in the Negev and Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva, Israel
| | - Ohad Wormser
- The Morris Kahn Laboratory of Human Genetics at the National Institute of Biotechnology in the Negev and Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva, Israel
| | - Max Drabkin
- The Morris Kahn Laboratory of Human Genetics at the National Institute of Biotechnology in the Negev and Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva, Israel
| | - Daniel Halperin
- The Morris Kahn Laboratory of Human Genetics at the National Institute of Biotechnology in the Negev and Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva, Israel
| | - Marina Eskin-Schwartz
- The Morris Kahn Laboratory of Human Genetics at the National Institute of Biotechnology in the Negev and Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva, Israel
- Genetics Institute, Soroka University Medical Center, Beer-Sheva, Israel
| | - Ginat Narkis
- The Morris Kahn Laboratory of Human Genetics at the National Institute of Biotechnology in the Negev and Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva, Israel
- Genetics Institute, Soroka University Medical Center, Beer-Sheva, Israel
| | - Sufa Sued-Hendrickson
- The Morris Kahn Laboratory of Human Genetics at the National Institute of Biotechnology in the Negev and Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva, Israel
| | - Ilana Aminov
- The Morris Kahn Laboratory of Human Genetics at the National Institute of Biotechnology in the Negev and Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva, Israel
| | - Maya Gombosh
- The Morris Kahn Laboratory of Human Genetics at the National Institute of Biotechnology in the Negev and Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva, Israel
| | - Sarit Aharoni
- The Morris Kahn Laboratory of Human Genetics at the National Institute of Biotechnology in the Negev and Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva, Israel
| | - Ohad S Birk
- The Morris Kahn Laboratory of Human Genetics at the National Institute of Biotechnology in the Negev and Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva, Israel.
- Genetics Institute, Soroka University Medical Center, Beer-Sheva, Israel.
| |
Collapse
|
16
|
Hadar N, Porgador O, Cohen I, Levi H, Dolgin V, Yogev Y, Sued-Hendrickson S, Shelef I, Didkovsky E, Eskin-Schwartz M, Birk OS. Heterozygous THBS2 pathogenic variant causes Ehlers-Danlos syndrome with prominent vascular features in humans and mice. Eur J Hum Genet 2024; 32:550-557. [PMID: 38433265 PMCID: PMC11061164 DOI: 10.1038/s41431-024-01559-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2023] [Revised: 11/17/2023] [Accepted: 01/30/2024] [Indexed: 03/05/2024] Open
Abstract
Ehlers-Danlos syndromes (EDS) are a group of connective tissue disorders caused by mutations in collagen and collagen-interacting genes. We delineate a novel form of EDS with vascular features through clinical and histopathological phenotyping and genetic studies of a three-generation pedigree, displaying an apparently autosomal dominant phenotype of joint hypermobility and frequent joint dislocations, atrophic scarring, prolonged bleeding time and age-related aortic dilatation and rupture. Coagulation tests as well as platelet counts and function were normal. Reticular dermis displayed highly disorganized collagen fibers and transmission electron microscopy (TEM) revealed abnormally shaped fibroblasts and endothelial cells, with high amount and irregular shape of extracellular matrix (ECM) substance, especially near blood vessels. Genetic analysis unraveled a heterozygous mutation in THBS2 (NM_003247.5:c.2686T>C, p.Cys896Arg). We generated CRISPR/Cas9 knock-in (KI) mice, bearing the heterozygous human mutation in the mouse ortholog. The KI mice demonstrated phenotypic traits correlating with those observed in the human subjects, as evidenced by morphologic, histologic, and TEM analyses, in conjunction with bleeding time assays. Our findings delineate a novel form of human EDS with classical-like elements combined with vascular features, caused by a heterozygous THBS2 missense mutation. We further demonstrate a similar phenotype in heterozygous THBS2Cys896Arg KI mice, in line with previous studies in Thbs2 homozygous null-mutant mice. Notably, THBS2 encodes Thrombospondin-2, a secreted homotrimeric matricellular protein that directly binds the ECM-shaping Matrix Metalloproteinase 2 (MMP2), mediating its clearance. THBS2 loss-of-function attenuates MMP2 clearance, enhancing MMP2-mediated proteoglycan cleavage, causing ECM abnormalities similar to those seen in the human and mouse disease we describe.
Collapse
Affiliation(s)
- Noam Hadar
- The Morris Kahn Laboratory of Human Genetics at the National Institute of Biotechnology in the Negev and Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva, Israel
- The Shraga Segal Department of Microbiology, Immunology, and Genetics, Faculty of Health Science, Ben-Gurion University of the Negev, Beer Sheva, Israel
| | - Omri Porgador
- The Morris Kahn Laboratory of Human Genetics at the National Institute of Biotechnology in the Negev and Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva, Israel
- The Shraga Segal Department of Microbiology, Immunology, and Genetics, Faculty of Health Science, Ben-Gurion University of the Negev, Beer Sheva, Israel
| | - Idan Cohen
- The Shraga Segal Department of Microbiology, Immunology, and Genetics, Faculty of Health Science, Ben-Gurion University of the Negev, Beer Sheva, Israel
| | - Hilla Levi
- The Shraga Segal Department of Microbiology, Immunology, and Genetics, Faculty of Health Science, Ben-Gurion University of the Negev, Beer Sheva, Israel
| | - Vadim Dolgin
- The Morris Kahn Laboratory of Human Genetics at the National Institute of Biotechnology in the Negev and Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva, Israel
- The Shraga Segal Department of Microbiology, Immunology, and Genetics, Faculty of Health Science, Ben-Gurion University of the Negev, Beer Sheva, Israel
| | - Yuval Yogev
- The Morris Kahn Laboratory of Human Genetics at the National Institute of Biotechnology in the Negev and Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva, Israel
- The Shraga Segal Department of Microbiology, Immunology, and Genetics, Faculty of Health Science, Ben-Gurion University of the Negev, Beer Sheva, Israel
| | - Sufa Sued-Hendrickson
- The Morris Kahn Laboratory of Human Genetics at the National Institute of Biotechnology in the Negev and Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva, Israel
- The Shraga Segal Department of Microbiology, Immunology, and Genetics, Faculty of Health Science, Ben-Gurion University of the Negev, Beer Sheva, Israel
| | - Ilan Shelef
- Department of Radiology, Soroka Medical Center, and Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer-Sheva, Israel
| | - Elena Didkovsky
- Department of Pathology, Rabin Medical Center, Petah-Tikva, and Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
| | - Marina Eskin-Schwartz
- The Morris Kahn Laboratory of Human Genetics at the National Institute of Biotechnology in the Negev and Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva, Israel.
- Genetics Institute, Soroka University Medical Center, Beer-Sheva, Israel.
- Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer-Sheva, Israel.
| | - Ohad S Birk
- The Morris Kahn Laboratory of Human Genetics at the National Institute of Biotechnology in the Negev and Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva, Israel.
- The Shraga Segal Department of Microbiology, Immunology, and Genetics, Faculty of Health Science, Ben-Gurion University of the Negev, Beer Sheva, Israel.
- Genetics Institute, Soroka University Medical Center, Beer-Sheva, Israel.
| |
Collapse
|
17
|
Moreno JA, Dudchenko O, Feigin CY, Mereby SA, Chen Z, Ramos R, Almet AA, Sen H, Brack BJ, Johnson MR, Li S, Wang W, Gaska JM, Ploss A, Weisz D, Omer AD, Yao W, Colaric Z, Kaur P, Leger JS, Nie Q, Mena A, Flanagan JP, Keller G, Sanger T, Ostrow B, Plikus MV, Kvon EZ, Aiden EL, Mallarino R. Emx2 underlies the development and evolution of marsupial gliding membranes. Nature 2024; 629:127-135. [PMID: 38658750 PMCID: PMC11062917 DOI: 10.1038/s41586-024-07305-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2023] [Accepted: 03/13/2024] [Indexed: 04/26/2024]
Abstract
Phenotypic variation among species is a product of evolutionary changes to developmental programs1,2. However, how these changes generate novel morphological traits remains largely unclear. Here we studied the genomic and developmental basis of the mammalian gliding membrane, or patagium-an adaptative trait that has repeatedly evolved in different lineages, including in closely related marsupial species. Through comparative genomic analysis of 15 marsupial genomes, both from gliding and non-gliding species, we find that the Emx2 locus experienced lineage-specific patterns of accelerated cis-regulatory evolution in gliding species. By combining epigenomics, transcriptomics and in-pouch marsupial transgenics, we show that Emx2 is a critical upstream regulator of patagium development. Moreover, we identify different cis-regulatory elements that may be responsible for driving increased Emx2 expression levels in gliding species. Lastly, using mouse functional experiments, we find evidence that Emx2 expression patterns in gliders may have been modified from a pre-existing program found in all mammals. Together, our results suggest that patagia repeatedly originated through a process of convergent genomic evolution, whereby regulation of Emx2 was altered by distinct cis-regulatory elements in independently evolved species. Thus, different regulatory elements targeting the same key developmental gene may constitute an effective strategy by which natural selection has harnessed regulatory evolution in marsupial genomes to generate phenotypic novelty.
Collapse
Affiliation(s)
- Jorge A Moreno
- Department of Molecular Biology, Princeton University, Princeton, NJ, USA
- Stowers Institute for Medical Research, Kansas City, MO, USA
| | - Olga Dudchenko
- The Center for Genome Architecture, Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
- The Center for Theoretical Biological Physics, Rice University, Houston, TX, USA
| | - Charles Y Feigin
- Department of Molecular Biology, Princeton University, Princeton, NJ, USA
- School of BioSciences, The University of Melbourne, Parkville, Victoria, Australia
- Department of Environment and Genetics, La Trobe University, Bundoora, Victoria, Australia
| | - Sarah A Mereby
- Department of Molecular Biology, Princeton University, Princeton, NJ, USA
| | - Zhuoxin Chen
- Department of Developmental and Cell Biology, University of California, Irvine, Irvine, CA, USA
| | - Raul Ramos
- Department of Developmental and Cell Biology, University of California, Irvine, Irvine, CA, USA
| | - Axel A Almet
- Department of Mathematics, University of California, Irvine, Irvine, CA, USA
- NSF-Simons Center for Multiscale Cell Fate Research, University of California, Irvine, Irvine, CA, USA
| | - Harsha Sen
- Department of Molecular Biology, Princeton University, Princeton, NJ, USA
| | - Benjamin J Brack
- Department of Molecular Biology, Princeton University, Princeton, NJ, USA
| | - Matthew R Johnson
- Department of Molecular Biology, Princeton University, Princeton, NJ, USA
| | - Sha Li
- Department of Molecular Biology, Princeton University, Princeton, NJ, USA
| | - Wei Wang
- Lewis Sigler Center for Integrative Genomics, Princeton University, Princeton, NJ, USA
| | - Jenna M Gaska
- Department of Molecular Biology, Princeton University, Princeton, NJ, USA
| | - Alexander Ploss
- Department of Molecular Biology, Princeton University, Princeton, NJ, USA
| | - David Weisz
- The Center for Genome Architecture, Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
| | - Arina D Omer
- The Center for Genome Architecture, Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
| | - Weijie Yao
- The Center for Genome Architecture, Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
| | - Zane Colaric
- The Center for Genome Architecture, Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
| | - Parwinder Kaur
- The University of Western Australia, Crawley, Western Australia, Australia
| | - Judy St Leger
- Cornell University College of Veterinary Medicine, Ithaca, NY, USA
| | - Qing Nie
- Department of Developmental and Cell Biology, University of California, Irvine, Irvine, CA, USA
- Department of Mathematics, University of California, Irvine, Irvine, CA, USA
- NSF-Simons Center for Multiscale Cell Fate Research, University of California, Irvine, Irvine, CA, USA
- Center for Complex Biological Systems, University of California, Irvine, Irvine, CA, USA
| | | | | | - Greta Keller
- Department of Biology, Loyola University, Chicago, IL, USA
| | - Thomas Sanger
- Department of Biology, Loyola University, Chicago, IL, USA
| | - Bruce Ostrow
- Department of Biology, Grand Valley State University, Allendale, MI, USA
| | - Maksim V Plikus
- Department of Developmental and Cell Biology, University of California, Irvine, Irvine, CA, USA
| | - Evgeny Z Kvon
- Department of Developmental and Cell Biology, University of California, Irvine, Irvine, CA, USA
| | - Erez Lieberman Aiden
- The Center for Genome Architecture, Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA.
- The Center for Theoretical Biological Physics, Rice University, Houston, TX, USA.
| | - Ricardo Mallarino
- Department of Molecular Biology, Princeton University, Princeton, NJ, USA.
| |
Collapse
|
18
|
Zhu Z, Bo-Ran Ho B, Chen A, Amrhein J, Apetrei A, Carpenter TO, Lazaretti-Castro M, Colazo JM, McCrystal Dahir K, Geßner M, Gurevich E, Heier CA, Simmons JH, Hunley TE, Hoppe B, Jacobsen C, Kouri A, Ma N, Majumdar S, Molin A, Nokoff N, Ott SM, Peña HG, Santos F, Tebben P, Topor LS, Deng Y, Bergwitz C. An update on clinical presentation and responses to therapy of patients with hereditary hypophosphatemic rickets with hypercalciuria (HHRH). Kidney Int 2024; 105:1058-1076. [PMID: 38364990 PMCID: PMC11106756 DOI: 10.1016/j.kint.2024.01.031] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2023] [Revised: 12/23/2023] [Accepted: 01/08/2024] [Indexed: 02/18/2024]
Abstract
Pathogenic variants in solute carrier family 34, member 3 (SLC34A3), the gene encoding the sodium-dependent phosphate cotransporter 2c (NPT2c), cause hereditary hypophosphatemic rickets with hypercalciuria (HHRH). Here, we report a pooled analysis of clinical and laboratory records of 304 individuals from 145 kindreds, including 20 previously unreported HHRH kindreds, in which two novel SLC34A3 pathogenic variants were identified. Compound heterozygous/homozygous carriers show above 90% penetrance for kidney and bone phenotypes. The biochemical phenotype for heterozygous carriers is intermediate with decreased serum phosphate, tubular reabsorption of phosphate (TRP (%)), fibroblast growth factor 23, and intact parathyroid hormone, but increased serum 1,25-dihydroxy vitamin D, and urine calcium excretion causing idiopathic hypercalciuria in 38%, with bone phenotypes still observed in 23% of patients. Oral phosphate supplementation is the current standard of care, which typically normalizes serum phosphate. However, although in more than half of individuals this therapy achieves correction of hypophosphatemia it fails to resolve the other outcomes. The American College of Medical Genetics and Genomics score correlated with functional analysis of frequent SLC34A3 pathogenic variants in vitro and baseline disease severity. The number of mutant alleles and baseline TRP (%) were identified as predictors for kidney and bone phenotypes, baseline TRP (%) furthermore predicted response to therapy. Certain SLC34A3/NPT2c pathogenic variants can be identified with partial responses to therapy, whereas with some overlap, others present only with kidney phenotypes and a third group present only with bone phenotypes. Thus, our report highlights important novel clinical aspects of HHRH and heterozygous carriers, raises awareness to this rare group of disorders and can be a foundation for future studies urgently needed to guide therapy of HHRH.
Collapse
Affiliation(s)
- Zewu Zhu
- Department of Internal Medicine, Section of Endocrinology, Yale University School of Medicine, New Haven, Connecticut, USA; Department of Urology, Xiangya Hospital, Central South University, Changsha, Hunan, China
| | - Bryan Bo-Ran Ho
- Department of Internal Medicine, Section of Endocrinology, Yale University School of Medicine, New Haven, Connecticut, USA
| | - Alyssa Chen
- Department of Internal Medicine, Section of Endocrinology, Yale University School of Medicine, New Haven, Connecticut, USA; Department of Otolaryngology, Harvard Medical School, Boston, Massachusetts, USA
| | - James Amrhein
- Pediatric Endocrinology and Diabetes, School of Medicine Greenville Campus, University of South Carolina, Greenville, South Carolina, USA
| | - Andreea Apetrei
- Caen University Hospital, Department of Genetics, UR7450 Biotargen, Reference Center for Rare Diseases of Calcium and Phosphate Metabolism, OSCAR Network, Caen, France
| | - Thomas Oliver Carpenter
- Department of Internal Medicine, Section of Endocrinology, Yale University School of Medicine, New Haven, Connecticut, USA
| | - Marise Lazaretti-Castro
- Division of Endocrinology, Escola Paulista de Medicina-Universidade Federal de Sao Paulo (EPM-UNIFESP), Sao Paulo, Brazil
| | - Juan Manuel Colazo
- Department of Biomedical Engineering, Vanderbilt University, Nashville, Tennessee, USA
| | - Kathryn McCrystal Dahir
- Division of Endocrinology, Program for Metabolic Bone Disorders, Department of Medicine, Vanderbilt University Medical Center, Nashville, Tennessee, USA
| | - Michaela Geßner
- Pediatric Nephrology, Children's and Adolescents' Hospital, Faculty of Medicine and University Hospital Cologne, University of Cologne, Cologne, Germany
| | - Evgenia Gurevich
- Schneider Children's Medical Center of Israel, Pediatric Nephrology Institute, Petach Tikva, Israel; Faculty of Health Sciences, Ben Gurion University of the Negev, Beer Sheva, Israel
| | | | - Jill Hickman Simmons
- Department of Pediatrics, Division of Endocrinology and Diabetes, Vanderbilt University School of Medicine, Vanderbilt University, Nashville, Tennessee, USA
| | - Tracy Earl Hunley
- Division of Pediatric Nephrology, Vanderbilt University Medical Center, Monroe Carell Jr Children's Hospital at Vanderbilt, Nashville, Tennessee, USA
| | - Bernd Hoppe
- Division of Pediatric Nephrology, Department of Pediatrics, University of Bonn, Bonn, Germany
| | - Christina Jacobsen
- Division of Endocrinology, Harvard Medical School, Boston, Massachusetts, USA
| | - Anne Kouri
- Pediatric Nephrology, University of Minnesota, Minneapolis, Minnesota, USA
| | - Nina Ma
- Section of Pediatric Endocrinology, Children's Hospital Colorado, Aurora, Colorado, USA; Department of Pediatrics, University of Colorado School of Medicine, Aurora, Colorado, USA
| | - Sachin Majumdar
- Department of Internal Medicine, Section of Endocrinology, Yale University School of Medicine, New Haven, Connecticut, USA
| | - Arnaud Molin
- Caen University Hospital, Department of Genetics, UR7450 Biotargen, Reference Center for Rare Diseases of Calcium and Phosphate Metabolism, OSCAR Network, Caen, France
| | - Natalie Nokoff
- Department of Pediatrics, Section of Endocrinology, University of Colorado Anschutz Medical Campus, Aurora, Colorado, USA
| | - Susan M Ott
- Department of Medicine, University of Washington, Seattle, Washington, USA
| | - Helena Gil Peña
- Department of Pediatrics, Hospital Universitario Central de Asturias (HUCA), Oviedo, Spain
| | - Fernando Santos
- Department of Pediatrics, Hospital Universitario Central de Asturias (HUCA), Oviedo, Spain
| | - Peter Tebben
- Division of Endocrinology, Diabetes, Metabolism, and Nutrition, Mayo Clinic, Rochester, Minnesota, USA; Division of Pediatric Endocrinology, Mayo Clinic, Rochester, Minnesota, USA
| | - Lisa Swartz Topor
- Division of Pediatric Endocrinology, Hasbro Children's Hospital, Warren Alpert Medical School of Brown University, Providence, Rhode Island, USA
| | - Yanhong Deng
- Yale School of Public Health, New Haven, Connecticut, USA
| | - Clemens Bergwitz
- Department of Internal Medicine, Section of Endocrinology, Yale University School of Medicine, New Haven, Connecticut, USA.
| |
Collapse
|
19
|
Bulduk BK, Tortajada J, Valiente-Pallejà A, Callado LF, Torrell H, Vilella E, Meana JJ, Muntané G, Martorell L. High number of mitochondrial DNA alterations in postmortem brain tissue of patients with schizophrenia compared to healthy controls. Psychiatry Res 2024; 337:115928. [PMID: 38759415 DOI: 10.1016/j.psychres.2024.115928] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/20/2024] [Revised: 04/12/2024] [Accepted: 04/26/2024] [Indexed: 05/19/2024]
Abstract
Previous studies have shown mitochondrial dysfunction in schizophrenia (SZ) patients, which may be caused by mitochondrial DNA (mtDNA) alterations. However, there are few studies in SZ that have analyzed mtDNA in brain samples by next-generation sequencing (NGS). To address this gap, we used mtDNA-targeted NGS and qPCR to characterize mtDNA alterations in brain samples from patients with SZ (n = 40) and healthy controls (HC) (n = 40). 35 % of SZ patients showed mtDNA alterations, a significantly higher prevalence compared to 10 % of HC. Specifically, SZ patients had a significantly higher frequency of deletions (35 vs. 5 in HC), with a mean number of deletions of 3.8 in SZ vs. 1.0 in HC. Likely pathogenic missense variants were also significantly more frequent in patients with SZ than in HC (10 vs. three HC), encompassing 14 variants in patients and three in HC. The pathogenic tRNA variant m.3243A>G was identified in one SZ patient with a high heteroplasmy level of 32.2 %. While no significant differences in mtDNA copy number (mtDNA-CN) were observed between SZ and HC, antipsychotic users had significantly higher mtDNA-CN than non-users. These findings suggest a potential role for mtDNA alterations in the pathophysiology of SZ that require further validation and functional studies.
Collapse
Affiliation(s)
- Bengisu K Bulduk
- Hospital Universitari Institut Pere Mata (HUIPM), Reus, Catalonia, Spain; Institut d'Investigació Sanitària Pere Virgili (IISPV-CERCA), Universitat Rovira i Virgili (URV), Reus, Catalonia, Spain
| | - Juan Tortajada
- Hospital Universitari Institut Pere Mata (HUIPM), Reus, Catalonia, Spain; Institut d'Investigació Sanitària Pere Virgili (IISPV-CERCA), Universitat Rovira i Virgili (URV), Reus, Catalonia, Spain
| | - Alba Valiente-Pallejà
- Hospital Universitari Institut Pere Mata (HUIPM), Reus, Catalonia, Spain; Institut d'Investigació Sanitària Pere Virgili (IISPV-CERCA), Universitat Rovira i Virgili (URV), Reus, Catalonia, Spain; Centro de Investigación Biomédica en Red de Salud Mental (CIBERSAM), Instituto de Salud Carlos III, Madrid, Spain
| | - Luís F Callado
- Centro de Investigación Biomédica en Red de Salud Mental (CIBERSAM), Instituto de Salud Carlos III, Madrid, Spain; Department of Pharmacology, University of the Basque Country, UPV/EHU, Leioa, and BioBizkaia Health Research Institute, Barakaldo, Bizkaia, Spain
| | - Helena Torrell
- Centre for Omic Sciences (COS), Joint Unit URV-EURECAT Technology Centre of Catalonia, Unique Scientific and Technical Infrastructures, Reus, Catalonia, Spain
| | - Elisabet Vilella
- Hospital Universitari Institut Pere Mata (HUIPM), Reus, Catalonia, Spain; Institut d'Investigació Sanitària Pere Virgili (IISPV-CERCA), Universitat Rovira i Virgili (URV), Reus, Catalonia, Spain; Centro de Investigación Biomédica en Red de Salud Mental (CIBERSAM), Instituto de Salud Carlos III, Madrid, Spain
| | - J Javier Meana
- Centro de Investigación Biomédica en Red de Salud Mental (CIBERSAM), Instituto de Salud Carlos III, Madrid, Spain; Department of Pharmacology, University of the Basque Country, UPV/EHU, Leioa, and BioBizkaia Health Research Institute, Barakaldo, Bizkaia, Spain
| | - Gerard Muntané
- Hospital Universitari Institut Pere Mata (HUIPM), Reus, Catalonia, Spain; Institut d'Investigació Sanitària Pere Virgili (IISPV-CERCA), Universitat Rovira i Virgili (URV), Reus, Catalonia, Spain; Centro de Investigación Biomédica en Red de Salud Mental (CIBERSAM), Instituto de Salud Carlos III, Madrid, Spain; Institut de Biologia Evolutiva (UPF-CSIC), Department of Medicine and Life Sciences, Universitat Pompeu Fabra, Parc de Recerca Biomèdica de Barcelona, Barcelona, Catalonia, Spain.
| | - Lourdes Martorell
- Hospital Universitari Institut Pere Mata (HUIPM), Reus, Catalonia, Spain; Institut d'Investigació Sanitària Pere Virgili (IISPV-CERCA), Universitat Rovira i Virgili (URV), Reus, Catalonia, Spain; Centro de Investigación Biomédica en Red de Salud Mental (CIBERSAM), Instituto de Salud Carlos III, Madrid, Spain.
| |
Collapse
|
20
|
Mohar NP, Cox EM, Adelizzi E, Moore SA, Mathews KD, Darbro BW, Wallrath LL. The Influence of a Genetic Variant in CCDC78 on LMNA-Associated Skeletal Muscle Disease. Int J Mol Sci 2024; 25:4930. [PMID: 38732148 PMCID: PMC11084688 DOI: 10.3390/ijms25094930] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2024] [Revised: 04/12/2024] [Accepted: 04/24/2024] [Indexed: 05/13/2024] Open
Abstract
Mutations in the LMNA gene-encoding A-type lamins can cause Limb-Girdle muscular dystrophy Type 1B (LGMD1B). This disease presents with weakness and wasting of the proximal skeletal muscles and has a variable age of onset and disease severity. This variability has been attributed to genetic background differences among individuals; however, such variants have not been well characterized. To identify such variants, we investigated a multigeneration family in which affected individuals are diagnosed with LGMD1B. The primary genetic cause of LGMD1B in this family is a dominant mutation that activates a cryptic splice site, leading to a five-nucleotide deletion in the mature mRNA. This results in a frame shift and a premature stop in translation. Skeletal muscle biopsies from the family members showed dystrophic features of variable severity, with the muscle fibers of some family members possessing cores, regions of sarcomeric disruption, and a paucity of mitochondria, not commonly associated with LGMD1B. Using whole genome sequencing (WGS), we identified 21 DNA sequence variants that segregate with the family members possessing more profound dystrophic features and muscle cores. These include a relatively common variant in coiled-coil domain containing protein 78 (CCDC78). This variant was given priority because another mutation in CCDC78 causes autosomal dominant centronuclear myopathy-4, which causes cores in addition to centrally positioned nuclei. Therefore, we analyzed muscle biopsies from family members and discovered that those with both the LMNA mutation and the CCDC78 variant contain muscle cores that accumulated both CCDC78 and RyR1. Muscle cores containing mislocalized CCDC78 and RyR1 were absent in the less profoundly affected family members possessing only the LMNA mutation. Taken together, our findings suggest that a relatively common variant in CCDC78 can impart profound muscle pathology in combination with a LMNA mutation and accounts for variability in skeletal muscle disease phenotypes.
Collapse
Affiliation(s)
- Nathaniel P. Mohar
- Interdisciplinary Graduate Program in Genetics, University of Iowa, Iowa City, IA 52242, USA; (N.P.M.); (E.A.)
- Department of Biochemistry and Molecular Biology, Carver College of Medicine, University of Iowa, Iowa City, IA 52242, USA
| | - Efrem M. Cox
- Department of Pathology, Carver College of Medicine, University of Iowa, Iowa City, IA 52242, USA (S.A.M.)
- Department of Neurosurgery, UNLV School of Medicine, Las Vegas, NV 89106, USA
| | - Emily Adelizzi
- Interdisciplinary Graduate Program in Genetics, University of Iowa, Iowa City, IA 52242, USA; (N.P.M.); (E.A.)
- Department of Anatomy and Cell Biology, Carver College of Medicine, University of Iowa, Iowa City, IA 52242, USA
| | - Steven A. Moore
- Department of Pathology, Carver College of Medicine, University of Iowa, Iowa City, IA 52242, USA (S.A.M.)
| | - Katherine D. Mathews
- Department of Pediatrics, Carver College of Medicine, University of Iowa, Iowa City, IA 52242, USA;
| | - Benjamin W. Darbro
- Interdisciplinary Graduate Program in Genetics, University of Iowa, Iowa City, IA 52242, USA; (N.P.M.); (E.A.)
- Department of Pediatrics, Carver College of Medicine, University of Iowa, Iowa City, IA 52242, USA;
| | - Lori L. Wallrath
- Interdisciplinary Graduate Program in Genetics, University of Iowa, Iowa City, IA 52242, USA; (N.P.M.); (E.A.)
- Department of Biochemistry and Molecular Biology, Carver College of Medicine, University of Iowa, Iowa City, IA 52242, USA
| |
Collapse
|
21
|
Wieder N, D'Souza EN, Martin-Geary AC, Lassen FH, Talbot-Martin J, Fernandes M, Chothani SP, Rackham OJL, Schafer S, Aspden JL, MacArthur DG, Davies RW, Whiffin N. Differences in 5'untranslated regions highlight the importance of translational regulation of dosage sensitive genes. Genome Biol 2024; 25:111. [PMID: 38685090 PMCID: PMC11057154 DOI: 10.1186/s13059-024-03248-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2023] [Accepted: 04/15/2024] [Indexed: 05/02/2024] Open
Abstract
BACKGROUND Untranslated regions (UTRs) are important mediators of post-transcriptional regulation. The length of UTRs and the composition of regulatory elements within them are known to vary substantially across genes, but little is known about the reasons for this variation in humans. Here, we set out to determine whether this variation, specifically in 5'UTRs, correlates with gene dosage sensitivity. RESULTS We investigate 5'UTR length, the number of alternative transcription start sites, the potential for alternative splicing, the number and type of upstream open reading frames (uORFs) and the propensity of 5'UTRs to form secondary structures. We explore how these elements vary by gene tolerance to loss-of-function (LoF; using the LOEUF metric), and in genes where changes in dosage are known to cause disease. We show that LOEUF correlates with 5'UTR length and complexity. Genes that are most intolerant to LoF have longer 5'UTRs, greater TSS diversity, and more upstream regulatory elements than their LoF tolerant counterparts. We show that these differences are evident in disease gene-sets, but not in recessive developmental disorder genes where LoF of a single allele is tolerated. CONCLUSIONS Our results confirm the importance of post-transcriptional regulation through 5'UTRs in tight regulation of mRNA and protein levels, particularly for genes where changes in dosage are deleterious and lead to disease. Finally, to support gene-based investigation we release a web-based browser tool, VuTR, that supports exploration of the composition of individual 5'UTRs and the impact of genetic variation within them.
Collapse
Affiliation(s)
- Nechama Wieder
- Big Data Institute, University of Oxford, Oxford, UK
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Elston N D'Souza
- Big Data Institute, University of Oxford, Oxford, UK
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Alexandra C Martin-Geary
- Big Data Institute, University of Oxford, Oxford, UK
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Frederik H Lassen
- Big Data Institute, University of Oxford, Oxford, UK
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
| | | | - Maria Fernandes
- Big Data Institute, University of Oxford, Oxford, UK
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Sonia P Chothani
- Program in Cardiovascular and Metabolic Disorders, Duke-National University of Singapore, Singapore, 169857, Singapore
| | - Owen J L Rackham
- Program in Cardiovascular and Metabolic Disorders, Duke-National University of Singapore, Singapore, 169857, Singapore
- School of Biological Sciences, University of Southampton, Southampton, UK
| | - Sebastian Schafer
- Program in Cardiovascular and Metabolic Disorders, Duke-National University of Singapore, Singapore, 169857, Singapore
| | - Julie L Aspden
- School of Molecular and Cellular Biology, Faculty of Biological Sciences, University of Leeds, Leeds, LS2 9JT, United Kingdom
- LeedsOmics, University of Leeds, Leeds, LS2 9JT, United Kingdom
- Astbury Centre of Structural Molecular Biology, University of Leeds, Leeds, LS2 9JT, United Kingdom
| | - Daniel G MacArthur
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Centre for Population Genomics, Garvan Institute of Medical Research, and UNSW Sydney, Sydney, NSW, Australia
- Centre for Population Genomics, Murdoch Children's Research Institute, Melbourne, VIC, Australia
| | - Robert W Davies
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
- Department of Statistics, University of Oxford, Oxford, UK
| | - Nicola Whiffin
- Big Data Institute, University of Oxford, Oxford, UK.
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK.
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
| |
Collapse
|
22
|
Kaplan SJ, Wong W, Yan J, Pulecio J, Cho HS, Li Q, Zhao J, Leslie-Iyer J, Kazakov J, Murphy D, Luo R, Dey KK, Apostolou E, Leslie CS, Huangfu D. CRISPR Screening Uncovers a Long-Range Enhancer for ONECUT1 in Pancreatic Differentiation and Links a Diabetes Risk Variant. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.04.26.591412. [PMID: 38746154 PMCID: PMC11092487 DOI: 10.1101/2024.04.26.591412] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/16/2024]
Abstract
Functional enhancer annotation is a valuable first step for understanding tissue-specific transcriptional regulation and prioritizing disease-associated non-coding variants for investigation. However, unbiased enhancer discovery in physiologically relevant contexts remains a major challenge. To discover regulatory elements pertinent to diabetes, we conducted a CRISPR interference (CRISPRi) screen in the human pluripotent stem cell (hPSC) pancreatic differentiation system. Among the enhancers uncovered, we focused on a long-range enhancer ∼664 kb from the ONECUT1 promoter, as coding mutations in ONECUT1 cause pancreatic hypoplasia and neonatal diabetes. Homozygous enhancer deletion in hPSCs was associated with a near-complete loss of ONECUT1 gene expression and compromised pancreatic differentiation. This enhancer contains a confidently fine-mapped type 2 diabetes (T2D) associated variant (rs528350911) which disrupts a GATA motif. Introduction of the risk variant into hPSCs revealed substantially reduced binding of key pancreatic transcription factors (GATA4, GATA6 and FOXA2) on the edited allele, accompanied by a slight reduction of ONECUT1 transcription, supporting a causal role for this risk variant in metabolic disease. This work expands our knowledge about transcriptional regulation in pancreatic development through the characterization of a long-range enhancer and highlights the utility of enhancer discovery in disease-relevant settings for understanding monogenic and complex disease.
Collapse
|
23
|
Xiong Z, Thach TQ, Zhang YD, Sham PC. Improved estimation of functional enrichment in SNP heritability using feasible generalized least squares. HGG ADVANCES 2024; 5:100272. [PMID: 38327050 PMCID: PMC10901842 DOI: 10.1016/j.xhgg.2024.100272] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2023] [Revised: 01/23/2024] [Accepted: 01/23/2024] [Indexed: 02/09/2024] Open
Abstract
Functional enrichment results typically implicate tissue or cell-type-specific biological pathways in disease pathogenesis and as therapeutic targets. We propose generalized linkage disequilibrium score regression (g-LDSC) that requires only genome-wide association studies (GWASs) summary-level data to estimate functional enrichment. The method adopts the same assumptions and regression model formulation as stratified linkage disequilibrium score regression (s-LDSC). Although s-LDSC only partially uses LD information, our method uses the whole LD matrix, which accounts for possible correlated error structure via a feasible generalized least-squares estimation. We demonstrate through simulation studies under various scenarios that g-LDSC provides more precise estimates of functional enrichment than s-LDSC, regardless of model misspecification. In an application to GWAS summary statistics of 15 traits from the UK Biobank, estimates of functional enrichment using g-LDSC were lower and more realistic than those obtained from s-LDSC. In addition, g-LDSC detected more significantly enriched functional annotations among 24 functional annotations for the 15 traits than s-LDSC (118 vs. 51).
Collapse
Affiliation(s)
- Zewei Xiong
- Department of Psychiatry, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China
| | - Thuan-Quoc Thach
- Department of Psychiatry, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China
| | - Yan Dora Zhang
- Department of Statistics and Actuarial Science, The University of Hong Kong, Hong Kong SAR, China; Centre for PanorOmic Sciences, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China.
| | - Pak Chung Sham
- Department of Psychiatry, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China; Centre for PanorOmic Sciences, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China; State Key Laboratory of Brain and Cognitive Sciences, The University of Hong Kong, Hong Kong SAR, China.
| |
Collapse
|
24
|
Hansen TJ, Fong SL, Day JK, Capra JA, Hodges E. Human gene regulatory evolution is driven by the divergence of regulatory element function in both cis and trans. CELL GENOMICS 2024; 4:100536. [PMID: 38604126 PMCID: PMC11019363 DOI: 10.1016/j.xgen.2024.100536] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/04/2023] [Revised: 01/03/2024] [Accepted: 03/10/2024] [Indexed: 04/13/2024]
Abstract
Gene regulatory divergence between species can result from cis-acting local changes to regulatory element DNA sequences or global trans-acting changes to the regulatory environment. Understanding how these mechanisms drive regulatory evolution has been limited by challenges in identifying trans-acting changes. We present a comprehensive approach to directly identify cis- and trans-divergent regulatory elements between human and rhesus macaque lymphoblastoid cells using assay for transposase-accessible chromatin coupled to self-transcribing active regulatory region (ATAC-STARR) sequencing. In addition to thousands of cis changes, we discover an unexpected number (∼10,000) of trans changes and show that cis and trans elements exhibit distinct patterns of sequence divergence and function. We further identify differentially expressed transcription factors that underlie ∼37% of trans differences and trace how cis changes can produce cascades of trans changes. Overall, we find that most divergent elements (67%) experienced changes in both cis and trans, revealing a substantial role for trans divergence-alone and together with cis changes-in regulatory differences between species.
Collapse
Affiliation(s)
- Tyler J Hansen
- Department of Biochemistry, Vanderbilt University School of Medicine, Nashville, TN 37232, USA
| | - Sarah L Fong
- Vanderbilt Genetics Institute, Vanderbilt University School of Medicine, Nashville, TN 37232, USA; Bakar Computational Health Sciences Institute, University of California, San Francisco, San Francisco, CA 94143, USA
| | - Jessica K Day
- Department of Biochemistry, Vanderbilt University School of Medicine, Nashville, TN 37232, USA
| | - John A Capra
- Bakar Computational Health Sciences Institute, University of California, San Francisco, San Francisco, CA 94143, USA; Department of Epidemiology and Biostatistics, University of California, San Francisco, CA 94143, USA.
| | - Emily Hodges
- Department of Biochemistry, Vanderbilt University School of Medicine, Nashville, TN 37232, USA; Vanderbilt Genetics Institute, Vanderbilt University School of Medicine, Nashville, TN 37232, USA; Vanderbilt Ingram Cancer Center, Nashville, TN 37232, USA.
| |
Collapse
|
25
|
Zeng T, Spence JP, Mostafavi H, Pritchard JK. Bayesian estimation of gene constraint from an evolutionary model with gene features. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.05.19.541520. [PMID: 37292653 PMCID: PMC10245655 DOI: 10.1101/2023.05.19.541520] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Measures of selective constraint on genes have been used for many applications including clinical interpretation of rare coding variants, disease gene discovery, and studies of genome evolution. However, widely-used metrics are severely underpowered at detecting constraint for the shortest ∼25% of genes, potentially causing important pathogenic mutations to be overlooked. We developed a framework combining a population genetics model with machine learning on gene features to enable accurate inference of an interpretable constraint metric, shet. Our estimates outperform existing metrics for prioritizing genes important for cell essentiality, human disease, and other phenotypes, especially for short genes. Our new estimates of selective constraint should have wide utility for characterizing genes relevant to human disease. Finally, our inference framework, GeneBayes, provides a flexible platform that can improve estimation of many gene-level properties, such as rare variant burden or gene expression differences.
Collapse
Affiliation(s)
- Tony Zeng
- Department of Genetics, Stanford University, Stanford CA
| | | | | | - Jonathan K. Pritchard
- Department of Genetics, Stanford University, Stanford CA
- Department of Biology, Stanford University, Stanford CA
| |
Collapse
|
26
|
Benegas G, Albors C, Aw AJ, Ye C, Song YS. GPN-MSA: an alignment-based DNA language model for genome-wide variant effect prediction. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.10.10.561776. [PMID: 37873118 PMCID: PMC10592768 DOI: 10.1101/2023.10.10.561776] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/25/2023]
Abstract
Whereas protein language models have demonstrated remarkable efficacy in predicting the effects of missense variants, DNA counterparts have not yet achieved a similar competitive edge for genome-wide variant effect predictions, especially in complex genomes such as that of humans. To address this challenge, we here introduce GPN-MSA, a novel framework for DNA language models that leverages whole-genome sequence alignments across multiple species and takes only a few hours to train. Across several benchmarks on clinical databases (ClinVar, COSMIC, OMIM), experimental functional assays (DMS, DepMap), and population genomic data (gnomAD), our model for the human genome achieves outstanding performance on deleteriousness prediction for both coding and non-coding variants.
Collapse
Affiliation(s)
- Gonzalo Benegas
- Graduate Group in Computational Biology, University of California, Berkeley
| | - Carlos Albors
- Computer Science Division, University of California, Berkeley
| | - Alan J. Aw
- Department of Statistics, University of California, Berkeley
| | - Chengzhong Ye
- Department of Statistics, University of California, Berkeley
| | - Yun S. Song
- Computer Science Division, University of California, Berkeley
- Department of Statistics, University of California, Berkeley
- Center for Computational Biology, University of California, Berkeley
| |
Collapse
|
27
|
Ding M, Chen K, Yang Y, Zhao H. Prioritizing genomic variants pathogenicity via DNA, RNA, and protein-level features based on extreme gradient boosting. Hum Genet 2024:10.1007/s00439-024-02667-0. [PMID: 38575818 DOI: 10.1007/s00439-024-02667-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2023] [Accepted: 03/05/2024] [Indexed: 04/06/2024]
Abstract
Genetic diseases are mostly implicated with genetic variants, including missense, synonymous, non-sense, and copy number variants. These different kinds of variants are indicated to affect phenotypes in various ways from previous studies. It remains essential but challenging to understand the functional consequences of these genetic variants, especially the noncoding ones, due to the lack of corresponding annotations. While many computational methods have been proposed to identify the risk variants. Most of them have only curated DNA-level and protein-level annotations to predict the pathogenicity of the variants, and others have been restricted to missense variants exclusively. In this study, we have curated DNA-, RNA-, and protein-level features to discriminate disease-causing variants in both coding and noncoding regions, where the features of protein sequences and protein structures have been shown essential for analyzing missense variants in coding regions while the features related to RNA-splicing and RBP binding are significant for variants in noncoding regions and synonymous variants in coding regions. Through the integration of these features, we have formulated the Multi-level feature Genomic Variants Predictor (ML-GVP) using the gradient boosting tree. The method has been trained on more than 400,000 variants in the Sherloc-training set from the 6th critical assessment of genome interpretation with superior performance. The method is one of the two best-performing predictors on the blind test in the Sherloc assessment, and is further confirmed by another independent test dataset of de novo variants.
Collapse
Affiliation(s)
- Maolin Ding
- School of Data and Computer Science, Sun Yat-Sen University, Guangzhou, 510000, China
| | - Ken Chen
- School of Data and Computer Science, Sun Yat-Sen University, Guangzhou, 510000, China
| | - Yuedong Yang
- School of Data and Computer Science, Sun Yat-Sen University, Guangzhou, 510000, China.
- Key Laboratory of Machine Intelligence and Advanced Computing (Sun Yat-Sen University), Ministry of Education, Guangzhou, China.
| | - Huiying Zhao
- Sun Yat-Sen Memorial Hospital, Sun Yat-Sen University, Guangzhou, 510000, China.
| |
Collapse
|
28
|
Iliopoulou E, Papadogiannis V, Tsigenopoulos CS, Manousaki T. Extensive Loss and Gain of Conserved Noncoding Elements During Early Teleost Evolution. Genome Biol Evol 2024; 16:evae061. [PMID: 38648507 PMCID: PMC11034925 DOI: 10.1093/gbe/evae061] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/19/2024] [Indexed: 04/25/2024] Open
Abstract
Conserved noncoding elements in vertebrates are enriched around transcription factor loci associated with development. However, loss and rapid divergence of conserved noncoding elements has been reported in teleost fish, albeit taking only few genomes into consideration. Taking advantage of the recent increase in high-quality teleost genomes, we focus on studying the evolution of teleost conserved noncoding elements, carrying out targeted genomic alignments and comparisons within the teleost phylogeny to detect conserved noncoding elements and reconstruct the ancestral teleost conserved noncoding elements repertoire. This teleost-centric approach confirms previous observations of extensive vertebrate conserved noncoding elements loss early in teleost evolution, but also reveals massive conserved noncoding elements gain in the teleost stem-group over 300 million years ago. Using synteny-based association to link conserved noncoding elements to their putatively regulated target genes, we show the most teleost gained conserved noncoding elements are found in the vicinity of orthologous loci involved in transcriptional regulation and embryonic development that are also associated with conserved noncoding elements in other vertebrates. Moreover, teleost and vertebrate conserved noncoding elements share a highly similar motif and transcription factor binding site vocabulary. We suggest that early teleost conserved noncoding element gains reflect a restructuring of the ancestral conserved noncoding element repertoire through both extreme divergence and de novo emergence. Finally, we support newly identified pan-teleost conserved noncoding elements have potential for accurate resolution of teleost phylogenetic placements in par with coding sequences, unlike ancestral only elements shared with spotted gar. This work provides new insight into conserved noncoding element evolution with great value for follow-up work on phylogenomics, comparative genomics, and the study of gene regulation evolution in teleosts.
Collapse
Affiliation(s)
- Elisavet Iliopoulou
- Hellenic Centre for Marine Research (HCMR), Institute of Marine Biology, Biotechnology & Aquaculture (IMBBC), Heraklion, Greece
- Present Address: Université Paris Cité, CNRS, Institut Jacques Monod, F-75013 Paris, France
| | - Vasileios Papadogiannis
- Hellenic Centre for Marine Research (HCMR), Institute of Marine Biology, Biotechnology & Aquaculture (IMBBC), Heraklion, Greece
- Present Address: Center for Genomic Regulation, Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Costas S Tsigenopoulos
- Hellenic Centre for Marine Research (HCMR), Institute of Marine Biology, Biotechnology & Aquaculture (IMBBC), Heraklion, Greece
| | - Tereza Manousaki
- Hellenic Centre for Marine Research (HCMR), Institute of Marine Biology, Biotechnology & Aquaculture (IMBBC), Heraklion, Greece
| |
Collapse
|
29
|
Gemmell P, Sackton TB, Edwards SV, Liu JS. A phylogenetic method linking nucleotide substitution rates to rates of continuous trait evolution. PLoS Comput Biol 2024; 20:e1011995. [PMID: 38656999 PMCID: PMC11078400 DOI: 10.1371/journal.pcbi.1011995] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2023] [Revised: 05/08/2024] [Accepted: 03/13/2024] [Indexed: 04/26/2024] Open
Abstract
Genomes contain conserved non-coding sequences that perform important biological functions, such as gene regulation. We present a phylogenetic method, PhyloAcc-C, that associates nucleotide substitution rates with changes in a continuous trait of interest. The method takes as input a multiple sequence alignment of conserved elements, continuous trait data observed in extant species, and a background phylogeny and substitution process. Gibbs sampling is used to assign rate categories (background, conserved, accelerated) to lineages and explore whether the assigned rate categories are associated with increases or decreases in the rate of trait evolution. We test our method using simulations and then illustrate its application using mammalian body size and lifespan data previously analyzed with respect to protein coding genes. Like other studies, we find processes such as tumor suppression, telomere maintenance, and p53 regulation to be related to changes in longevity and body size. In addition, we also find that skeletal genes, and developmental processes, such as sprouting angiogenesis, are relevant.
Collapse
Affiliation(s)
- Patrick Gemmell
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts, United States of America
- Department of Statistics, Harvard University, Cambridge, Massachusetts, United States of America
| | - Timothy B. Sackton
- FAS Informatics Group, Harvard University, Cambridge, Massachusetts, United States of America
| | - Scott V. Edwards
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts, United States of America
| | - Jun S. Liu
- Department of Statistics, Harvard University, Cambridge, Massachusetts, United States of America
| |
Collapse
|
30
|
Sepulveda‐Falla D, Vélez JI, Acosta‐Baena N, Baena A, Moreno S, Krasemann S, Lopera F, Mastronardi CA, Arcos‐Burgos M. Genetic modifiers of cognitive decline in PSEN1 E280A Alzheimer's disease. Alzheimers Dement 2024; 20:2873-2885. [PMID: 38450831 PMCID: PMC11032577 DOI: 10.1002/alz.13754] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2023] [Revised: 01/22/2024] [Accepted: 01/29/2024] [Indexed: 03/08/2024]
Abstract
INTRODUCTION Rate of cognitive decline (RCD) in Alzheimer's disease (AD) determines the degree of impairment for patients and of burden for caretakers. We studied the association of RCD with genetic variants in AD. METHODS RCD was evaluated in 62 familial AD (FAD) and 53 sporadic AD (SAD) cases, and analyzed by whole-exome sequencing for association with common exonic functional variants. Findings were validated in post mortem brain tissue. RESULTS One hundred seventy-two gene variants in FAD, and 227 gene variants in SAD associated with RCD. In FAD, performance decline of the immediate recall of the Rey-Osterrieth figure test associated with 122 genetic variants. Olfactory receptor OR51B6 showed the highest number of associated variants. Its expression was detected in temporal cortex neurons. DISCUSSION Impaired olfactory function has been associated with cognitive impairment in AD. Genetic variants in these or other genes could help to identify risk of faster memory decline in FAD and SAD patients.
Collapse
Affiliation(s)
- Diego Sepulveda‐Falla
- Institute of NeuropathologyUniversity Medical Center Hamburg‐EppendorfHamburgGermany
- Grupo de Neurociencias de AntioquiaUniversidad de AntioquiaMedellínColombia
| | - Jorge I. Vélez
- Grupo de Neurociencias de AntioquiaUniversidad de AntioquiaMedellínColombia
- Universidad del NorteBarranquillaColombia
| | | | - Ana Baena
- Grupo de Neurociencias de AntioquiaUniversidad de AntioquiaMedellínColombia
| | - Sonia Moreno
- Grupo de Neurociencias de AntioquiaUniversidad de AntioquiaMedellínColombia
| | - Susanne Krasemann
- Institute of NeuropathologyUniversity Medical Center Hamburg‐EppendorfHamburgGermany
| | - Francisco Lopera
- Grupo de Neurociencias de AntioquiaUniversidad de AntioquiaMedellínColombia
| | - Claudio A. Mastronardi
- Genomics and Predictive Medicine GroupDepartment of Genome SciencesJohn Curtin School of Medical ResearchThe Australian National UniversityCanberraAustralia
- INPAC Research Group, Fundación Universitaria SanitasBogotáColombia
| | - Mauricio Arcos‐Burgos
- Grupo de Investigación en Psiquiatría (GIPSI)Departamento de PsiquiatríaFacultad de MedicinaInstituto de Investigaciones MédicasUniversidad de AntioquiaMedellínColombia
| |
Collapse
|
31
|
Zhang Y, Zhao Y, Dai L, Liu Y, Shi Z. Auriculocondylar syndrome 2 caused by a novel PLCB4 variant in a male Chinese neonate: A case report and review of the literature. Mol Genet Genomic Med 2024; 12:e2441. [PMID: 38618928 PMCID: PMC11017300 DOI: 10.1002/mgg3.2441] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2023] [Revised: 03/19/2024] [Accepted: 03/27/2024] [Indexed: 04/16/2024] Open
Abstract
BACKGROUND Auriculocondylar syndrome (ARCND) is a rare congenital craniofacial developmental malformation syndrome of the first and second pharyngeal arches with external ear malformation at the junction between the lobe and helix, micromaxillary malformation, and mandibular condylar hypoplasia. Four subtypes of ARCND have been described so far, that is, ARCND1 (OMIM # 602483), ARCND2 (ARCND2A, OMIM # 614669; ARCND2B, OMIM # 620458), ARCND3 (OMIM # 615706), and ARCND4 (OMIM # 620457). METHODS This study reports a case of ARCND2 resulting from a novel pathogenic variant in the PLCB4 gene, and summarizes PLCB4 gene mutation sites and phenotypes of ARCND2. RESULTS The proband, a 5-day-old male neonate, was referred to our hospital for respiratory distress. Micrognathia, microstomia, distinctive question mark ears, as well as mandibular condyle hypoplasia were identified. Trio-based whole-exome sequencing identified a novel missense variant of NM_001377142.1:c.1928C>T (NP_001364071.1:p.Ser643Phe) in the PLCB4 gene, which was predicted to impair the local structural stability with a result that the protein function might be affected. From a review of the literature, only 36 patients with PLCB4 gene mutations were retrieved. CONCLUSION As with other studies examining familial cases of ARCND2, incomplete penetrance and variable expressivity were observed within different families' heterozygous mutations in PLCB4 gene. Although, motor and intellectual development are in the normal range in the vast majority of patients with ARCND2, long-term follow-up and assessment are still required.
Collapse
Affiliation(s)
- Yongli Zhang
- Department of NeonatologyAnhui Provincial Children's Hospital/Children's Hospital of Fudan University (Affiliated Anhui Branch)HefeiAnhuiChina
| | - Yuwei Zhao
- Department of NeonatologyAnhui Provincial Children's Hospital/Children's Hospital of Fudan University (Affiliated Anhui Branch)HefeiAnhuiChina
| | - Liying Dai
- Department of NeonatologyAnhui Provincial Children's Hospital/Children's Hospital of Fudan University (Affiliated Anhui Branch)HefeiAnhuiChina
| | - Yu Liu
- Department of NeonatologyAnhui Provincial Children's Hospital/Children's Hospital of Fudan University (Affiliated Anhui Branch)HefeiAnhuiChina
| | - Zifeng Shi
- Radiology Department, Center of Imaging DiagnosisAnhui Provincial Children's Hospital/Children's Hospital of Fudan University (Affiliated Anhui Branch)HefeiAnhuiChina
| |
Collapse
|
32
|
Wirthlin ME, Schmid TA, Elie JE, Zhang X, Kowalczyk A, Redlich R, Shvareva VA, Rakuljic A, Ji MB, Bhat NS, Kaplow IM, Schäffer DE, Lawler AJ, Wang AZ, Phan BN, Annaldasula S, Brown AR, Lu T, Lim BK, Azim E, Clark NL, Meyer WK, Pond SLK, Chikina M, Yartsev MM, Pfenning AR, Andrews G, Armstrong JC, Bianchi M, Birren BW, Bredemeyer KR, Breit AM, Christmas MJ, Clawson H, Damas J, Di Palma F, Diekhans M, Dong MX, Eizirik E, Fan K, Fanter C, Foley NM, Forsberg-Nilsson K, Garcia CJ, Gatesy J, Gazal S, Genereux DP, Goodman L, Grimshaw J, Halsey MK, Harris AJ, Hickey G, Hiller M, Hindle AG, Hubley RM, Hughes GM, Johnson J, Juan D, Kaplow IM, Karlsson EK, Keough KC, Kirilenko B, Koepfli KP, Korstian JM, Kowalczyk A, Kozyrev SV, Lawler AJ, Lawless C, Lehmann T, Levesque DL, Lewin HA, Li X, Lind A, Lindblad-Toh K, Mackay-Smith A, Marinescu VD, Marques-Bonet T, Mason VC, Meadows JRS, Meyer WK, Moore JE, Moreira LR, Moreno-Santillan DD, Morrill KM, Muntané G, Murphy WJ, Navarro A, Nweeia M, Ortmann S, Osmanski A, Paten B, Paulat NS, Pfenning AR, Phan BN, Pollard KS, Pratt HE, Ray DA, Reilly SK, Rosen JR, Ruf I, Ryan L, Ryder OA, Sabeti PC, Schäffer DE, Serres A, Shapiro B, Smit AFA, Springer M, Srinivasan C, Steiner C, Storer JM, Sullivan KAM, Sullivan PF, Sundström E, Supple MA, Swofford R, Talbot JE, Teeling E, Turner-Maier J, Valenzuela A, Wagner F, Wallerman O, Wang C, Wang J, Weng Z, Wilder AP, Wirthlin ME, Xue JR, Zhang X. Vocal learning-associated convergent evolution in mammalian proteins and regulatory elements. Science 2024; 383:eabn3263. [PMID: 38422184 DOI: 10.1126/science.abn3263] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2021] [Accepted: 02/20/2024] [Indexed: 03/02/2024]
Abstract
Vocal production learning ("vocal learning") is a convergently evolved trait in vertebrates. To identify brain genomic elements associated with mammalian vocal learning, we integrated genomic, anatomical, and neurophysiological data from the Egyptian fruit bat (Rousettus aegyptiacus) with analyses of the genomes of 215 placental mammals. First, we identified a set of proteins evolving more slowly in vocal learners. Then, we discovered a vocal motor cortical region in the Egyptian fruit bat, an emergent vocal learner, and leveraged that knowledge to identify active cis-regulatory elements in the motor cortex of vocal learners. Machine learning methods applied to motor cortex open chromatin revealed 50 enhancers robustly associated with vocal learning whose activity tended to be lower in vocal learners. Our research implicates convergent losses of motor cortex regulatory elements in mammalian vocal learning evolution.
Collapse
Affiliation(s)
- Morgan E Wirthlin
- Department of Computational Biology, Carnegie Mellon University, Pittsburgh, PA 15213, USA
- Neuroscience Institute, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Tobias A Schmid
- Helen Wills Neuroscience Institute, University of California, Berkeley, Berkeley, CA 94708, USA
| | - Julie E Elie
- Helen Wills Neuroscience Institute, University of California, Berkeley, Berkeley, CA 94708, USA
- Department of Bioengineering, University of California, Berkeley, Berkeley, CA 94708, USA
| | - Xiaomeng Zhang
- Department of Computational Biology, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Amanda Kowalczyk
- Department of Computational Biology, Carnegie Mellon University, Pittsburgh, PA 15213, USA
- Neuroscience Institute, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Ruby Redlich
- Department of Computational Biology, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Varvara A Shvareva
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA 94708, USA
| | - Ashley Rakuljic
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA 94708, USA
| | - Maria B Ji
- Department of Psychology, University of California, Berkeley, Berkeley, CA 94708, USA
| | - Ninad S Bhat
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA 94708, USA
| | - Irene M Kaplow
- Department of Computational Biology, Carnegie Mellon University, Pittsburgh, PA 15213, USA
- Department of Computational and Systems Biology, University of Pittsburgh School of Medicine, Pittsburgh, PA 15213, USA
| | - Daniel E Schäffer
- Department of Computational Biology, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Alyssa J Lawler
- Neuroscience Institute, Carnegie Mellon University, Pittsburgh, PA 15213, USA
- Department of Biological Sciences, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Andrew Z Wang
- Department of Computational Biology, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - BaDoi N Phan
- Department of Computational Biology, Carnegie Mellon University, Pittsburgh, PA 15213, USA
- Neuroscience Institute, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Siddharth Annaldasula
- Department of Computational Biology, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Ashley R Brown
- Department of Computational Biology, Carnegie Mellon University, Pittsburgh, PA 15213, USA
- Neuroscience Institute, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Tianyu Lu
- Department of Computational Biology, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Byung Kook Lim
- Neurobiology section, Division of Biological Science, University of California, San Diego, La Jolla, CA 92093, USA
| | - Eiman Azim
- Molecular Neurobiology Laboratory, Salk Institute for Biological Studies, La Jolla, CA 92037, USA
| | - Nathan L Clark
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, PA 15213, USA
| | - Wynn K Meyer
- Department of Biological Sciences, Lehigh University, Bethlehem, PA 18015, USA
| | | | - Maria Chikina
- Department of Computational and Systems Biology, University of Pittsburgh School of Medicine, Pittsburgh, PA 15213, USA
| | - Michael M Yartsev
- Helen Wills Neuroscience Institute, University of California, Berkeley, Berkeley, CA 94708, USA
- Department of Bioengineering, University of California, Berkeley, Berkeley, CA 94708, USA
| | - Andreas R Pfenning
- Department of Computational Biology, Carnegie Mellon University, Pittsburgh, PA 15213, USA
- Neuroscience Institute, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
33
|
Haghshenas S, Bout HJ, Schijns JM, Levy MA, Kerkhof J, Bhai P, McConkey H, Jenkins ZA, Williams EM, Halliday BJ, Huisman SA, Lauffer P, de Waard V, Witteveen L, Banka S, Brady AF, Galazzi E, van Gils J, Hurst ACE, Kaiser FJ, Lacombe D, Martinez-Monseny AF, Fergelot P, Monteiro FP, Parenti I, Persani L, Santos-Simarro F, Simpson BN, Alders M, Robertson SP, Sadikovic B, Menke LA. Menke-Hennekam syndrome; delineation of domain-specific subtypes with distinct clinical and DNA methylation profiles. HGG ADVANCES 2024; 5:100287. [PMID: 38553851 PMCID: PMC11040166 DOI: 10.1016/j.xhgg.2024.100287] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2023] [Revised: 03/26/2024] [Accepted: 03/26/2024] [Indexed: 04/18/2024] Open
Abstract
CREB-binding protein (CBP, encoded by CREBBP) and its paralog E1A-associated protein (p300, encoded by EP300) are involved in histone acetylation and transcriptional regulation. Variants that produce a null allele or disrupt the catalytic domain of either protein cause Rubinstein-Taybi syndrome (RSTS), while pathogenic missense and in-frame indel variants in parts of exons 30 and 31 cause phenotypes recently described as Menke-Hennekam syndrome (MKHK). To distinguish MKHK subtypes and define their characteristics, molecular and extended clinical data on 82 individuals (54 unpublished) with variants affecting CBP (n = 71) or p300 (n = 11) (NP_004371.2 residues 1,705-1,875 and NP_001420.2 residues 1,668-1,833, respectively) were summarized. Additionally, genome-wide DNA methylation profiles were assessed in DNA extracted from whole peripheral blood from 54 individuals. Most variants clustered closely around the zinc-binding residues of two zinc-finger domains (ZZ and TAZ2) and within the first α helix of the fourth intrinsically disordered linker (ID4) of CBP/p300. Domain-specific methylation profiles were discerned for the ZZ domain in CBP/p300 (found in nine out of 10 tested individuals) and TAZ2 domain in CBP (in 14 out of 20), while a domain-specific diagnostic episignature was refined for the ID4 domain in CBP/p300 (in 21 out of 21). Phenotypes including intellectual disability of varying degree and distinct physical features were defined for each of the regions. These findings demonstrate existence of at least three MKHK subtypes, which are domain specific (MKHK-ZZ, MKHK-TAZ2, and MKHK-ID4) rather than gene specific (CREBBP/EP300). DNA methylation episignatures enable stratification of molecular pathophysiologic entities within a gene or across a family of paralogous genes.
Collapse
Affiliation(s)
- Sadegheh Haghshenas
- Verspeeten Clinical Genome Centre, London Health Sciences Centre, London ON N6A 5W9, Canada
| | - Hidde J Bout
- Department of Pediatrics, Emma Children's Hospital, Amsterdam UMC, University of Amsterdam, Amsterdam Reproduction and Development Research Institute, 1105 Amsterdam, AZ, the Netherlands
| | - Josephine M Schijns
- Department of Pediatrics, Emma Children's Hospital, Amsterdam UMC, University of Amsterdam, Amsterdam Reproduction and Development Research Institute, 1105 Amsterdam, AZ, the Netherlands
| | - Michael A Levy
- Verspeeten Clinical Genome Centre, London Health Sciences Centre, London ON N6A 5W9, Canada
| | - Jennifer Kerkhof
- Verspeeten Clinical Genome Centre, London Health Sciences Centre, London ON N6A 5W9, Canada
| | - Pratibha Bhai
- Verspeeten Clinical Genome Centre, London Health Sciences Centre, London ON N6A 5W9, Canada
| | - Haley McConkey
- Verspeeten Clinical Genome Centre, London Health Sciences Centre, London ON N6A 5W9, Canada
| | - Zandra A Jenkins
- Department of Women's and Children's Health, Dunedin School of Medicine, University of Otago, Dunedin 9016, New Zealand
| | - Ella M Williams
- Department of Women's and Children's Health, Dunedin School of Medicine, University of Otago, Dunedin 9016, New Zealand
| | - Benjamin J Halliday
- Department of Women's and Children's Health, Dunedin School of Medicine, University of Otago, Dunedin 9016, New Zealand
| | - Sylvia A Huisman
- Department of Pediatrics, Emma Children's Hospital, Amsterdam UMC, University of Amsterdam, Amsterdam Reproduction and Development Research Institute, 1105 Amsterdam, AZ, the Netherlands; Zodiak, Prinsenstichting, Purmerend, JE 1444, the Netherlands
| | - Peter Lauffer
- Department of Human Genetics, Amsterdam UMC, University of Amsterdam, Amsterdam Reproduction and Development Research Institute, Amsterdam 1105 AZ, the Netherlands
| | - Vivian de Waard
- Department of Medical Biochemistry, Amsterdam UMC, University of Amsterdam, Amsterdam Cardiovascular Sciences, Amsterdam, AZ 1105, the Netherlands
| | - Laura Witteveen
- Department of Pediatrics, Emma Children's Hospital, Amsterdam UMC, University of Amsterdam, Amsterdam Reproduction and Development Research Institute, 1105 Amsterdam, AZ, the Netherlands
| | - Siddharth Banka
- Division of Evolution, Infection and Genomics, School of Biological Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester M13 9WL, UK; Manchester Centre for Genomic Medicine, Saint Mary's Hospital, Manchester University NHS Foundation Trust, Manchester M13 9WL, UK
| | - Angela F Brady
- North West Thames Regional Genetics Service, Northwick Park Hospital, Harrow HA1 3UJ, UK
| | - Elena Galazzi
- Department of Endocrine & Metabolic Diseases, San Luca Hospital, IRCCS Istituto Auxologico Italiano, 20100 Milan, Italy
| | - Julien van Gils
- Centre Hospitalier Universitaire Bordeaux, 33404 Bordeaux, France
| | - Anna C E Hurst
- Department of Genetics, University of Alabama, Birmingham, AL 35294-0024, USA
| | - Frank J Kaiser
- Institute of Human Genetics, University of Duisburg-Essen, 45122 Essen, Germany; Center for Rare Diseases, University Hospital Essen, 45122 Essen, Germany
| | - Didier Lacombe
- Centre Hospitalier Universitaire Bordeaux, 33404 Bordeaux, France
| | - Antonio F Martinez-Monseny
- Genètica Clínica, Servei de Medicina Genètica i Molecular, Hospital Sant Joan de Déu, 08950 Barcelona, Spain
| | | | | | - Ilaria Parenti
- Institute of Human Genetics, University of Duisburg-Essen, 45122 Essen, Germany
| | - Luca Persani
- Department of Endocrine & Metabolic Diseases, San Luca Hospital, IRCCS Istituto Auxologico Italiano, 20100 Milan, Italy; Department of Medical Biotechnology and Translational Medicine, University of Milan, 20100 Milan, Italy
| | - Fernando Santos-Simarro
- Institute of Medical and Molecular Genetics (INGEMM), Hospital Universitario La Paz, IdiPAZ, CIBERER, ISCIII, 28029 Madrid, Spain; Unit of Molecular Diagnostics and Clinical Genetics, Hospital Universitari Son Espases, Health Research Institute of the Balearic Islands (IdISBa), 07120 Palma, Spain
| | - Brittany N Simpson
- Department of Pediatrics, Division of Human Genetics, Cincinnati Children's Hospital Medical Center, University of Cincinnati School of Medicine, Cincinnati, OH 45206, USA
| | - Mariëlle Alders
- Department of Human Genetics, Amsterdam UMC, University of Amsterdam, Amsterdam Reproduction and Development Research Institute, Amsterdam 1105 AZ, the Netherlands
| | - Stephen P Robertson
- Department of Women's and Children's Health, Dunedin School of Medicine, University of Otago, Dunedin 9016, New Zealand
| | - Bekim Sadikovic
- Verspeeten Clinical Genome Centre, London Health Sciences Centre, London ON N6A 5W9, Canada; Department of Pathology and Laboratory Medicine, Western University, London, ON N6A3K7, Canada.
| | - Leonie A Menke
- Department of Pediatrics, Emma Children's Hospital, Amsterdam UMC, University of Amsterdam, Amsterdam Reproduction and Development Research Institute, 1105 Amsterdam, AZ, the Netherlands.
| |
Collapse
|
34
|
Francis A, Campbell C, Gaunt TR. DrivR-Base: a feature extraction toolkit for variant effect prediction model construction. Bioinformatics 2024; 40:btae197. [PMID: 38603611 PMCID: PMC11057939 DOI: 10.1093/bioinformatics/btae197] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2023] [Revised: 03/01/2024] [Accepted: 04/09/2024] [Indexed: 04/13/2024] Open
Abstract
MOTIVATION Recent advancements in sequencing technologies have led to the discovery of numerous variants in the human genome. However, understanding their precise roles in diseases remains challenging due to their complex functional mechanisms. Various methodologies have emerged to predict the pathogenic significance of these genetic variants. Typically, these methods employ an integrative approach, leveraging diverse data sources that provide important insights into genomic function. Despite the abundance of publicly available data sources and databases, the process of navigating, extracting, and pre-processing features for machine learning models can be highly challenging and time-consuming. Furthermore, researchers often invest substantial effort in feature extraction, only to later discover that these features lack informativeness. RESULTS In this article, we introduce DrivR-Base, an innovative resource that efficiently extracts and integrates molecular information (features) related to single nucleotide variants. These features encompass information about the genomic positions and the associated protein positions of a variant. They are derived from a wide array of databases and tools, including structural properties obtained from AlphaFold, regulatory information sourced from ENCODE, and predicted variant consequences from Variant Effect Predictor. DrivR-Base is easily deployable via a Docker container to ensure reproducibility and ease of access across diverse computational environments. The resulting features can be used as input for machine learning models designed to predict the pathogenic impact of human genome variants in disease. Moreover, these feature sets have applications beyond this, including haploinsufficiency prediction and the development of drug repurposing tools. We describe the resource's development, practical applications, and potential for future expansion and enhancement. AVAILABILITY AND IMPLEMENTATION DrivR-Base source code is available at https://github.com/amyfrancis97/DrivR-Base.
Collapse
Affiliation(s)
- Amy Francis
- MRC Integrative Epidemiology Unit, Bristol Medical School (PHS), University of Bristol, Bristol BS8 2BN, United Kingdom
| | - Colin Campbell
- Intelligent Systems Laboratory, University of Bristol, Bristol BS1 5DD, United Kingdom
| | - Tom R Gaunt
- MRC Integrative Epidemiology Unit, Bristol Medical School (PHS), University of Bristol, Bristol BS8 2BN, United Kingdom
| |
Collapse
|
35
|
Hou L, Liu W, Zhang H, Li R, Liu M, Shi H, Wu L. Divergent composition and transposon-silencing activity of small RNAs in mammalian oocytes. Genome Biol 2024; 25:80. [PMID: 38532500 DOI: 10.1186/s13059-024-03214-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2022] [Accepted: 03/11/2024] [Indexed: 03/28/2024] Open
Abstract
BACKGROUND Small RNAs are essential for germ cell development and fertilization. However, fundamental questions remain, such as the level of conservation in small RNA composition between species and whether small RNAs control transposable elements in mammalian oocytes. RESULTS Here, we use high-throughput sequencing to profile small RNAs and poly(A)-bearing long RNAs in oocytes of 12 representative vertebrate species (including 11 mammals). The results show that miRNAs are generally expressed in the oocytes of each representative species (although at low levels), whereas endo-siRNAs are specific to mice. Notably, piRNAs are predominant in oocytes of all species (except mice) and vary widely in length. We find PIWIL3-associated piRNAs are widespread in mammals and generally lack 3'-2'-O-methylation. Additionally, sequence identity is low between homologous piRNAs in different species, even among those present in syntenic piRNA clusters. Despite the species-specific divergence, piRNAs retain the capacity to silence younger TE subfamilies in oocytes. CONCLUSIONS Collectively, our findings illustrate a high level of diversity in the small RNA populations of mammalian oocytes. Furthermore, we identify sequence features related to conserved roles of small RNAs in silencing TEs, providing a large-scale reference for future in-depth study of small RNA functions in oocytes.
Collapse
Affiliation(s)
- Li Hou
- Key Laboratory of RNA Science and Engineering, Shanghai Institute of Biochemistry and Cell Biology, Center for Excellence in Molecular Cell Science Chinese Academy of Sciences, University of Chinese Academy of Sciences, Shanghai, 200031, China
| | - Wei Liu
- Key Laboratory of RNA Science and Engineering, Shanghai Institute of Biochemistry and Cell Biology, Center for Excellence in Molecular Cell Science Chinese Academy of Sciences, University of Chinese Academy of Sciences, Shanghai, 200031, China
| | - Hongdao Zhang
- Key Laboratory of RNA Science and Engineering, Shanghai Institute of Biochemistry and Cell Biology, Center for Excellence in Molecular Cell Science Chinese Academy of Sciences, University of Chinese Academy of Sciences, Shanghai, 200031, China
| | - Ronghong Li
- Key Laboratory of RNA Science and Engineering, Shanghai Institute of Biochemistry and Cell Biology, Center for Excellence in Molecular Cell Science Chinese Academy of Sciences, University of Chinese Academy of Sciences, Shanghai, 200031, China
| | - Miao Liu
- Shanghai-MOST Key Laboratory of Health and Disease Genomics, NHC Key Lab of Reproduction Regulation, Shanghai Institute for Biomedical and Pharmaceutical Technologies, Shanghai, 200032, China
| | - Huijuan Shi
- Shanghai-MOST Key Laboratory of Health and Disease Genomics, NHC Key Lab of Reproduction Regulation, Shanghai Institute for Biomedical and Pharmaceutical Technologies, Shanghai, 200032, China
| | - Ligang Wu
- Key Laboratory of RNA Science and Engineering, Shanghai Institute of Biochemistry and Cell Biology, Center for Excellence in Molecular Cell Science Chinese Academy of Sciences, University of Chinese Academy of Sciences, Shanghai, 200031, China.
| |
Collapse
|
36
|
Zhang G, Fu Y, Yang L, Ye F, Zhang P, Zhang S, Ma L, Li J, Wu H, Han X, Wang J, Guo G. Construction of single-cell cross-species chromatin accessibility landscapes with combinatorial-hybridization-based ATAC-seq. Dev Cell 2024; 59:793-811.e8. [PMID: 38330939 DOI: 10.1016/j.devcel.2024.01.015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2023] [Revised: 11/03/2023] [Accepted: 01/18/2024] [Indexed: 02/10/2024]
Abstract
Despite recent advances in single-cell genomics, the lack of maps for single-cell candidate cis-regulatory elements (cCREs) in non-mammal species has limited our exploration of conserved regulatory programs across vertebrates and invertebrates. Here, we developed a combinatorial-hybridization-based method for single-cell assay for transposase-accessible chromatin using sequencing (scATAC-seq) named CH-ATAC-seq, enabling the construction of single-cell accessible chromatin landscapes for zebrafish, Drosophila, and earthworms (Eisenia andrei). By integrating scATAC censuses of humans, monkeys, and mice, we systematically identified 152 distinct main cell types and around 0.8 million cell-type-specific cCREs. Our analysis provided insights into the conservation of neural, muscle, and immune lineages across species, while epithelial cells exhibited a higher organ-origin heterogeneity. Additionally, a large-scale gene regulatory network (GRN) was constructed in four vertebrates by integrating scRNA-seq censuses. Overall, our study provides a valuable resource for comparative epigenomics, identifying the evolutionary conservation and divergence of gene regulation across different species.
Collapse
Affiliation(s)
- Guodong Zhang
- Bone Marrow Transplantation Center of the First Affiliated Hospital, and Center for Stem Cell and Regenerative Medicine, Zhejiang University School of Medicine, Hangzhou 310000, China; Liangzhu Laboratory, Zhejiang University, Hangzhou 311121, China
| | - Yuting Fu
- Bone Marrow Transplantation Center of the First Affiliated Hospital, and Center for Stem Cell and Regenerative Medicine, Zhejiang University School of Medicine, Hangzhou 310000, China
| | - Lei Yang
- Bone Marrow Transplantation Center of the First Affiliated Hospital, and Center for Stem Cell and Regenerative Medicine, Zhejiang University School of Medicine, Hangzhou 310000, China
| | - Fang Ye
- Bone Marrow Transplantation Center of the First Affiliated Hospital, and Center for Stem Cell and Regenerative Medicine, Zhejiang University School of Medicine, Hangzhou 310000, China; Liangzhu Laboratory, Zhejiang University, Hangzhou 311121, China
| | - Peijing Zhang
- Bone Marrow Transplantation Center of the First Affiliated Hospital, and Center for Stem Cell and Regenerative Medicine, Zhejiang University School of Medicine, Hangzhou 310000, China
| | - Shuang Zhang
- Bone Marrow Transplantation Center of the First Affiliated Hospital, and Center for Stem Cell and Regenerative Medicine, Zhejiang University School of Medicine, Hangzhou 310000, China
| | - Lifeng Ma
- Bone Marrow Transplantation Center of the First Affiliated Hospital, and Center for Stem Cell and Regenerative Medicine, Zhejiang University School of Medicine, Hangzhou 310000, China
| | - Jiaqi Li
- Bone Marrow Transplantation Center of the First Affiliated Hospital, and Center for Stem Cell and Regenerative Medicine, Zhejiang University School of Medicine, Hangzhou 310000, China
| | - Hanyu Wu
- Bone Marrow Transplantation Center of the First Affiliated Hospital, and Center for Stem Cell and Regenerative Medicine, Zhejiang University School of Medicine, Hangzhou 310000, China
| | - Xiaoping Han
- Bone Marrow Transplantation Center of the First Affiliated Hospital, and Center for Stem Cell and Regenerative Medicine, Zhejiang University School of Medicine, Hangzhou 310000, China; Zhejiang Provincial Key Laboratory for Tissue Engineering and Regenerative Medicine, Dr. Li Dak Sum & Yip Yio Chin Center for Stem Cell and Regenerative Medicine, Hangzhou 310058, China.
| | - Jingjing Wang
- Bone Marrow Transplantation Center of the First Affiliated Hospital, and Center for Stem Cell and Regenerative Medicine, Zhejiang University School of Medicine, Hangzhou 310000, China; Liangzhu Laboratory, Zhejiang University, Hangzhou 311121, China.
| | - Guoji Guo
- Bone Marrow Transplantation Center of the First Affiliated Hospital, and Center for Stem Cell and Regenerative Medicine, Zhejiang University School of Medicine, Hangzhou 310000, China; Liangzhu Laboratory, Zhejiang University, Hangzhou 311121, China; Zhejiang Provincial Key Laboratory for Tissue Engineering and Regenerative Medicine, Dr. Li Dak Sum & Yip Yio Chin Center for Stem Cell and Regenerative Medicine, Hangzhou 310058, China; Institute of Hematology, Zhejiang University, Hangzhou, China.
| |
Collapse
|
37
|
Lavillaureix A, Rollier P, Kim A, Panasenkava V, De Tayrac M, Carré W, Guyodo H, Faoucher M, Poirel E, Akloul L, Quélin C, Whalen S, Bos J, Broekema M, van Hagen JM, Grand K, Allen-Sharpley M, Magness E, McLean SD, Kayserili H, Altunoglu U, En Qi Chong A, Xue S, Jeanne M, Almontashiri N, Habhab W, Vanlerberghe C, Faivre L, Viora-Dupont E, Philippe C, Safraou H, Laffargue F, Mittendorf L, Abou Jamra R, Patil SJ, Dalal A, Sarma AS, Keren B, Reversade B, Dubourg C, Odent S, Dupé V. DISP1 deficiency: Monoallelic and biallelic variants cause a spectrum of midline craniofacial malformations. Genet Med 2024; 26:101126. [PMID: 38529886 DOI: 10.1016/j.gim.2024.101126] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2023] [Revised: 03/21/2024] [Accepted: 03/21/2024] [Indexed: 03/27/2024] Open
Abstract
PURPOSE DISP1 encodes a transmembrane protein that regulates the secretion of the morphogen, Sonic hedgehog, a deficiency of which is a major cause of holoprosencephaly (HPE). This disorder covers a spectrum of brain and midline craniofacial malformations. The objective of the present study was to better delineate the clinical phenotypes associated with division transporter dispatched-1 (DISP1) variants. METHODS This study was based on the identification of at least 1 pathogenic variant of the DISP1 gene in individuals for whom detailed clinical data were available. RESULTS A total of 23 DISP1 variants were identified in heterozygous, compound heterozygous or homozygous states in 25 individuals with midline craniofacial defects. Most cases were minor forms of HPE, with craniofacial features such as orofacial cleft, solitary median maxillary central incisor, and congenital nasal pyriform aperture stenosis. These individuals had either monoallelic loss-of-function variants or biallelic missense variants in DISP1. In individuals with severe HPE, the DISP1 variants were commonly found associated with a variant in another HPE-linked gene (ie, oligogenic inheritance). CONCLUSION The genetic findings we have acquired demonstrate a significant involvement of DISP1 variants in the phenotypic spectrum of midline defects. This underlines its importance as a crucial element in the efficient secretion of Sonic hedgehog. We also demonstrated that the very rare solitary median maxillary central incisor and congenital nasal pyriform aperture stenosis combination is part of the DISP1-related phenotype. The present study highlights the clinical risks to be flagged up during genetic counseling after the discovery of a pathogenic DISP1 variant.
Collapse
Affiliation(s)
- Alinoë Lavillaureix
- Génétique Clinique, Centre de Référence Maladies Rares CLAD-Ouest, ERN-ITHACA, FHU GenOMedS, CHU de Rennes, Rennes, France; Univ Rennes, CNRS, INSERM, IGDR (Institut de Génétique et Développement de Rennes)-UMR 6290, Rennes, France
| | - Paul Rollier
- Génétique Clinique, Centre de Référence Maladies Rares CLAD-Ouest, ERN-ITHACA, FHU GenOMedS, CHU de Rennes, Rennes, France; Univ Rennes, CNRS, INSERM, IGDR (Institut de Génétique et Développement de Rennes)-UMR 6290, Rennes, France
| | - Artem Kim
- Univ Rennes, CNRS, INSERM, IGDR (Institut de Génétique et Développement de Rennes)-UMR 6290, Rennes, France; Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA
| | - Veranika Panasenkava
- Univ Rennes, CNRS, INSERM, IGDR (Institut de Génétique et Développement de Rennes)-UMR 6290, Rennes, France
| | - Marie De Tayrac
- Univ Rennes, CNRS, INSERM, IGDR (Institut de Génétique et Développement de Rennes)-UMR 6290, Rennes, France; Génétique Moléculaire et Génomique, FHU GenOMedS, CHU de Rennes, Rennes, France
| | - Wilfrid Carré
- Univ Rennes, CNRS, INSERM, IGDR (Institut de Génétique et Développement de Rennes)-UMR 6290, Rennes, France; Génétique Moléculaire et Génomique, FHU GenOMedS, CHU de Rennes, Rennes, France
| | - Hélène Guyodo
- Univ Rennes, CNRS, INSERM, IGDR (Institut de Génétique et Développement de Rennes)-UMR 6290, Rennes, France
| | - Marie Faoucher
- Univ Rennes, CNRS, INSERM, IGDR (Institut de Génétique et Développement de Rennes)-UMR 6290, Rennes, France; Génétique Moléculaire et Génomique, FHU GenOMedS, CHU de Rennes, Rennes, France
| | - Elisabeth Poirel
- Génétique Clinique, Centre de Référence Maladies Rares CLAD-Ouest, ERN-ITHACA, FHU GenOMedS, CHU de Rennes, Rennes, France
| | - Linda Akloul
- Génétique Clinique, Centre de Référence Maladies Rares CLAD-Ouest, ERN-ITHACA, FHU GenOMedS, CHU de Rennes, Rennes, France
| | - Chloé Quélin
- Génétique Clinique, Centre de Référence Maladies Rares CLAD-Ouest, ERN-ITHACA, FHU GenOMedS, CHU de Rennes, Rennes, France
| | - Sandra Whalen
- APHP, Sorbonne Université, Département de Génétique, Centre de Référence Maladies Rares des Anomalies du Développement et Syndromes Malformatifs, Hôpital Trousseau & Groupe Hospitalier Pitié-Salpêtrière, Paris, France
| | - Jessica Bos
- Department of Human Genetics, Section Clinical Genetic, Amsterdam University Medical Centers, Amsterdam, The Netherlands
| | - Marjoleine Broekema
- Department of Human Genetics, Section Clinical Genetic, Amsterdam University Medical Centers, Amsterdam, The Netherlands
| | - Johanna M van Hagen
- Department of Human Genetics, Section Clinical Genetic, Amsterdam University Medical Centers, Amsterdam, The Netherlands
| | - Katheryn Grand
- Department of Pediatrics, Cedars-Sinai Medical Center, Los Angeles, CA
| | | | - Emily Magness
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX
| | - Scott D McLean
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX; Division of Clinical Genetics, Christus Children's, San Antonio, TX
| | - Hülya Kayserili
- Department of Medical Genetics, Koç University School of Medicine, Istanbul, Turkey
| | - Umut Altunoglu
- Department of Medical Genetics, Koç University School of Medicine, Istanbul, Turkey
| | - Angie En Qi Chong
- Department of Biological Sciences, National University of Singapore, Singapore, Singapore
| | - Shifeng Xue
- Department of Biological Sciences, National University of Singapore, Singapore, Singapore
| | - Médéric Jeanne
- Service de Génétique, FHU GenOMedS, CHRU de Tours, Tours, France; UMR1253, iBrain, Inserm, University of Tours, Tours, France
| | - Naif Almontashiri
- Center for Genetics and Inherited Diseases (CGID), Taibah University, Madinah, Saudi Arabia
| | - Wisam Habhab
- Department of Genetic Medicine, Faculty of Medicine, Princess Al-Jawhara Al-Brahim Center of Excellence in Research of Hereditary Disorders, King Abdulaziz University, Jeddah, Saudi Arabia
| | | | - Laurence Faivre
- Centre de Référence Anomalies du Développement et Syndromes Malformatifs, FHU TRANSLAD, Centre Hospitalier Universitaire, Dijon, France; Genetics of Developmental Disorders, INSERM UMR1231, Université de Bourgogne, Dijon, France
| | - Eléonore Viora-Dupont
- Genetics of Developmental Disorders, INSERM UMR1231, Université de Bourgogne, Dijon, France; Centre de Référence Déficiences Intellectuelles de Causes Rares, FHU TRANSLAD, Centre Hospitalier Universitaire, Dijon, France
| | - Christophe Philippe
- Centre de Référence Déficiences Intellectuelles de Causes Rares, FHU TRANSLAD, Centre Hospitalier Universitaire, Dijon, France; Unité Fonctionnelle Innovation en Diagnostic Génomique des Maladies Rares, CHU Dijon, Dijon, France
| | - Hana Safraou
- Centre de Référence Déficiences Intellectuelles de Causes Rares, FHU TRANSLAD, Centre Hospitalier Universitaire, Dijon, France; Unité Fonctionnelle Innovation en Diagnostic Génomique des Maladies Rares, CHU Dijon, Dijon, France
| | - Fanny Laffargue
- CHU Clermont Ferrand, Service de Génétique Clinique, Clermont Ferrand, France
| | - Luisa Mittendorf
- Department for Children and Adolescents, University Hospital Leipzig, Leipzig, Germany
| | | | | | - Ashwin Dalal
- Diagnostics Division, Centre for DNA Fingerprinting and Diagnostics, Hyderabad, Telangana, India
| | - Asodu Sandeep Sarma
- Diagnostics Division, Centre for DNA Fingerprinting and Diagnostics, Hyderabad, Telangana, India
| | - Boris Keren
- APHP, Sorbonne Université, Département de Génétique Médicale, GH Pitié Salpêtrière, Paris, France
| | - Bruno Reversade
- Laboratory of Human Genetics and Therapeutics, Genome Institute of Singapore (GIS), A∗STAR, Department of Physiology, Cardiovascular Disease, Translational Research Programme, Yong Loo Lin School of Medicine, National University of Singapore, Singapore; Department of Medical Genetics, Koç University School of Medicine, Istanbul, Turkey; Laboratory of Human Genetics and Therapeutics Smart-Health Initiative, BESE, KAUST, Thuwal, Kingdom of Saudi Arabia
| | - Christèle Dubourg
- Univ Rennes, CNRS, INSERM, IGDR (Institut de Génétique et Développement de Rennes)-UMR 6290, Rennes, France; Génétique Moléculaire et Génomique, FHU GenOMedS, CHU de Rennes, Rennes, France
| | - Sylvie Odent
- Génétique Clinique, Centre de Référence Maladies Rares CLAD-Ouest, ERN-ITHACA, FHU GenOMedS, CHU de Rennes, Rennes, France; Univ Rennes, CNRS, INSERM, IGDR (Institut de Génétique et Développement de Rennes)-UMR 6290, Rennes, France
| | - Valérie Dupé
- Univ Rennes, CNRS, INSERM, IGDR (Institut de Génétique et Développement de Rennes)-UMR 6290, Rennes, France.
| |
Collapse
|
38
|
Bosman W, Franken GAC, de Las Heras J, Madariaga L, Barakat TS, Oostenbrink R, van Slegtenhorst M, Perdomo-Ramírez A, Claverie-Martín F, van Eerde AM, Vargas-Poussou R, Dubourg LD, González-Recio I, Martínez-Cruz LA, de Baaij JHF, Hoenderop JGJ. Hypomagnesaemia with varying degrees of extrarenal symptoms as a consequence of heterozygous CNNM2 variants. Sci Rep 2024; 14:6917. [PMID: 38519529 PMCID: PMC10959950 DOI: 10.1038/s41598-024-57061-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2023] [Accepted: 03/14/2024] [Indexed: 03/25/2024] Open
Abstract
Variants in the CNNM2 gene are causative for hypomagnesaemia, seizures and intellectual disability, although the phenotypes can be variable. This study aims to understand the genotype-phenotype relationship in affected individuals with CNNM2 variants by phenotypic, functional and structural analysis of new as well as previously reported variants. This results in the identification of seven variants that significantly affect CNNM2-mediated Mg2+ transport. Pathogenicity of these variants is further supported by structural modelling, which predicts CNNM2 structure to be affected by all of them. Strikingly, seizures and intellectual disability are absent in 4 out of 7 cases, indicating these phenotypes are caused either by specific CNNM2 variant only or by additional risk factors. Moreover, in line with sporadic observations from previous reports, CNNM2 variants might be associated with disturbances in parathyroid hormone and Ca2+ homeostasis.
Collapse
Affiliation(s)
- Willem Bosman
- Department of Medical BioSciences, Radboudumc, Nijmegen, The Netherlands
| | - Gijs A C Franken
- Department of Medical BioSciences, Radboudumc, Nijmegen, The Netherlands
| | - Javier de Las Heras
- Division of Pediatric Metabolism, Cruces University Hospital, CIBER-ER, Metab-ERN, University of the Basque Country (UPV/EHU), Biobizkaia Health Research Institute, Barakaldo, Spain
| | - Leire Madariaga
- Pediatric Nephrology Department, Cruces University Hospital, CIBERDEM, CIBER-ER, Endo-ERN, Biocruces Bizkaia Health Research Institute and University of the Basque Country (UPV/EHU), Barakaldo, Spain
| | - Tahsin Stefan Barakat
- Deparment of Clinical Genetics, Erasmus MC, Rotterdam, The Netherlands
- Discovery Unit, Department of Clinical Genetics, Erasmus MC, Rotterdam, The Netherlands
- ENCORE Expertise Center for Neurodevelopmental Disorders, Erasmus MC, Rotterdam, The Netherlands
| | - Rianne Oostenbrink
- ENCORE Expertise Center for Neurodevelopmental Disorders, Erasmus MC, Rotterdam, The Netherlands
- Department of General Pediatrics, Erasmus Medical Center Sophia Children's Hospital, Rotterdam, The Netherlands
| | | | - Ana Perdomo-Ramírez
- Unidad de Investigación, Renal Tube Group, Hospital Universitario Nuestra Señora de Candelaria, Santa Cruz de Tenerife, Spain
| | - Félix Claverie-Martín
- Unidad de Investigación, Renal Tube Group, Hospital Universitario Nuestra Señora de Candelaria, Santa Cruz de Tenerife, Spain
| | | | - Rosa Vargas-Poussou
- Service de medecine genomique des maladies rares, AP-HP, universite Paris Cité, Paris, France
- Centre de reference des maladies renales hereditaires de l'enfant et de l'adulte MARHEA, hopital Européen Georges Pompidou, Paris, France
- CNRS, centre de recherche des Cordeliers, Inserm UMRS 1138, Sorbonne universite, universite Paris Cité, Paris, France
| | - Laurence Derain Dubourg
- Hôpital Édouard Herriot, Hospices civils de Lyon, service de nephrologie, dialyse, hypertension et exploration fonctionnelle renale, Lyon, France
- Centre de reference des maladies renales rares et phosphocalciques, Nephrogones, Hôpital Femme-Mère-Enfant Bron, Bron, France
- Faculté de medecine Lyon est, Université Claude Bernard Lyon 1, Villeurbanne, France
| | - Irene González-Recio
- Center for Cooperative Research in Biosciences (CIC bioGUNE), Bizkaia Science and Technology Park, Derio, Bizkaia, Spain
| | - Luis Alfonso Martínez-Cruz
- Center for Cooperative Research in Biosciences (CIC bioGUNE), Bizkaia Science and Technology Park, Derio, Bizkaia, Spain
| | | | | |
Collapse
|
39
|
Baek SC, Kim B, Jang H, Kim K, Park IS, Min DH, Kim VN. Structural atlas of human primary microRNAs generated by SHAPE-MaP. Mol Cell 2024; 84:1158-1172.e6. [PMID: 38447581 DOI: 10.1016/j.molcel.2024.02.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2023] [Revised: 12/01/2023] [Accepted: 02/06/2024] [Indexed: 03/08/2024]
Abstract
MicroRNA (miRNA) maturation is critically dependent on structural features of primary transcripts (pri-miRNAs). However, the scarcity of determined pri-miRNA structures has limited our understanding of miRNA maturation. Here, we employed selective 2'-hydroxyl acylation analyzed by primer extension and mutational profiling (SHAPE-MaP), a high-throughput RNA structure probing method, to unravel the secondary structures of 476 high-confidence human pri-miRNAs. Our SHAPE-based structures diverge substantially from those inferred solely from computation, particularly in the apical loop and basal segments, underlining the need for experimental data in RNA structure prediction. By comparing the structures with high-throughput processing data, we determined the optimal structural features of pri-miRNAs. The sequence determinants are influenced substantially by their structural contexts. Moreover, we identified an element termed the bulged GWG motif (bGWG) with a 3' bulge in the lower stem, which promotes processing. Our structure-function mapping better annotates the determinants of pri-miRNA processing and offers practical implications for designing small hairpin RNAs and predicting the impacts of miRNA mutations.
Collapse
Affiliation(s)
- S Chan Baek
- Center for RNA Research, Institute for Basic Science, Seoul 08826, South Korea; School of Biological Science, Seoul National University, Seoul 08826, South Korea
| | - Boseon Kim
- Center for RNA Research, Institute for Basic Science, Seoul 08826, South Korea; School of Biological Science, Seoul National University, Seoul 08826, South Korea
| | - Harim Jang
- Center for RNA Research, Institute for Basic Science, Seoul 08826, South Korea; School of Biological Science, Seoul National University, Seoul 08826, South Korea
| | - Kijun Kim
- Center for RNA Research, Institute for Basic Science, Seoul 08826, South Korea; School of Biological Science, Seoul National University, Seoul 08826, South Korea
| | - Il-Soo Park
- Center for RNA Research, Institute for Basic Science, Seoul 08826, South Korea; Department of Chemistry, Seoul National University, Seoul 08826, South Korea
| | - Dal-Hee Min
- Center for RNA Research, Institute for Basic Science, Seoul 08826, South Korea; Department of Chemistry, Seoul National University, Seoul 08826, South Korea
| | - V Narry Kim
- Center for RNA Research, Institute for Basic Science, Seoul 08826, South Korea; School of Biological Science, Seoul National University, Seoul 08826, South Korea.
| |
Collapse
|
40
|
Lozovska A, Korovesi AG, Dias A, Lopes A, Fowler DA, Martins GG, Nóvoa A, Mallo M. Tgfbr1 controls developmental plasticity between the hindlimb and external genitalia by remodeling their regulatory landscape. Nat Commun 2024; 15:2509. [PMID: 38509075 PMCID: PMC10954616 DOI: 10.1038/s41467-024-46870-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2023] [Accepted: 03/13/2024] [Indexed: 03/22/2024] Open
Abstract
The hindlimb and external genitalia of present-day tetrapods are thought to derive from an ancestral common primordium that evolved to generate a wide diversity of structures adapted for efficient locomotion and mating in the ecological niche occupied by the species. We show that despite long evolutionary distance from the ancestral condition, the early primordium of the mouse external genitalia preserved the capacity to take hindlimb fates. In the absence of Tgfbr1, the pericloacal mesoderm generates an extra pair of hindlimbs at the expense of the external genitalia. It has been shown that the hindlimb and the genital primordia share many of their key regulatory factors. Tgfbr1 controls the response to those factors by modulating the accessibility status of regulatory elements that control the gene regulatory networks leading to the formation of genital or hindlimb structures. Our work uncovers a remarkable tissue plasticity with potential implications in the evolution of the hindlimb/genital area of tetrapods, and identifies an additional mechanism for Tgfbr1 activity that might also contribute to the control of other physiological or pathological processes.
Collapse
Affiliation(s)
- Anastasiia Lozovska
- Instituto Gulbenkian de Ciência, Rua da Quinta Grande 6, 2780-156, Oeiras, Portugal
| | - Artemis G Korovesi
- Instituto Gulbenkian de Ciência, Rua da Quinta Grande 6, 2780-156, Oeiras, Portugal
| | - André Dias
- Instituto Gulbenkian de Ciência, Rua da Quinta Grande 6, 2780-156, Oeiras, Portugal
- Department of Experimental and Health Sciences, Universitat Pompeu Fabra, Barcelona, Spain
| | - Alexandre Lopes
- Instituto Gulbenkian de Ciência, Rua da Quinta Grande 6, 2780-156, Oeiras, Portugal
| | - Donald A Fowler
- Instituto Gulbenkian de Ciência, Rua da Quinta Grande 6, 2780-156, Oeiras, Portugal
| | - Gabriel G Martins
- Instituto Gulbenkian de Ciência, Rua da Quinta Grande 6, 2780-156, Oeiras, Portugal
| | - Ana Nóvoa
- Instituto Gulbenkian de Ciência, Rua da Quinta Grande 6, 2780-156, Oeiras, Portugal
| | - Moisés Mallo
- Instituto Gulbenkian de Ciência, Rua da Quinta Grande 6, 2780-156, Oeiras, Portugal.
| |
Collapse
|
41
|
Haque B, Guirguis G, Curtis M, Mohsin H, Walker S, Morrow MM, Costain G. A comparative medical genomics approach may facilitate the interpretation of rare missense variation. J Med Genet 2024:jmg-2023-109760. [PMID: 38508706 DOI: 10.1136/jmg-2023-109760] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2023] [Accepted: 03/12/2024] [Indexed: 03/22/2024]
Abstract
PURPOSE To determine the degree to which likely causal missense variants of single-locus traits in domesticated species have features suggestive of pathogenicity in a human genomic context. METHODS We extracted missense variants from the Online Mendelian Inheritance in Animals database for nine animals (cat, cattle, chicken, dog, goat, horse, pig, rabbit and sheep), mapped coordinates to the human reference genome and annotated variants using genome analysis tools. We also searched a private commercial laboratory database of genetic testing results from >400 000 individuals with suspected rare disorders. RESULTS Of 339 variants that were mappable to the same residue and gene in the human genome, 56 had been previously classified with respect to pathogenicity: 31 (55.4%) pathogenic/likely pathogenic, 1 (1.8%) benign/likely benign and 24 (42.9%) uncertain/other. The odds ratio for a pathogenic/likely pathogenic classification in ClinVar was 7.0 (95% CI 4.1 to 12.0, p<0.0001), compared with all other germline missense variants in these same 220 genes. The remaining 283 variants disproportionately had allele frequencies and REVEL scores that supported pathogenicity. CONCLUSION Cross-species comparisons could facilitate the interpretation of rare missense variation. These results provide further support for comparative medical genomics approaches that connect big data initiatives in human and veterinary genetics.
Collapse
Affiliation(s)
- Bushra Haque
- Program in Genetics and Genome Biology, SickKids Research Institute, Toronto, Ontario, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
| | - George Guirguis
- Program in Genetics and Genome Biology, SickKids Research Institute, Toronto, Ontario, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
| | - Meredith Curtis
- Program in Genetics and Genome Biology, SickKids Research Institute, Toronto, Ontario, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
| | - Hera Mohsin
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
| | - Susan Walker
- The Centre for Applied Genomics, The Hospital for Sick Children, Toronto, Ontario, Canada
| | | | - Gregory Costain
- Program in Genetics and Genome Biology, SickKids Research Institute, Toronto, Ontario, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
- The Centre for Applied Genomics, The Hospital for Sick Children, Toronto, Ontario, Canada
- Division of Clinical and Metabolic Genetics, The Hospital for Sick Children, Toronto, Ontario, Canada
- Department of Paediatrics, University of Toronto, Toronto, Ontario, Canada
| |
Collapse
|
42
|
Schoch K, Ruegg MSG, Fellows BJ, Cao J, Uhrig S, Einsele-Scholz S, Biskup S, Hawarden SRA, Salpietro V, Capra V, Brown CM, Accogli A, Shashi V, Bicknell LS. A second hotspot for pathogenic exon-skipping variants in CDC45. Eur J Hum Genet 2024:10.1038/s41431-024-01583-1. [PMID: 38467731 DOI: 10.1038/s41431-024-01583-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2023] [Revised: 02/13/2024] [Accepted: 02/26/2024] [Indexed: 03/13/2024] Open
Abstract
Biallelic pathogenic variants in CDC45 are associated with Meier-Gorlin syndrome with craniosynostosis (MGORS type 7), which also includes short stature and absent/hypoplastic patellae. Identified variants act through a hypomorphic loss of function mechanism, to reduce CDC45 activity and impact DNA replication initiation. In addition to missense and premature termination variants, several pathogenic synonymous variants have been identified, most of which cause increased exon skipping of exon 4, which encodes an essential part of the RecJ-orthologue's DHH domain. Here we have identified a second cohort of families segregating CDC45 variants, where patients have craniosynostosis and a reduction in height, alongside common facial dysmorphisms, including thin eyebrows, consistent with MGORS7. Skipping of exon 15 is a consequence of two different variants, including a shared synonymous variant that is enriched in individuals of East Asian ancestry, while other variants in trans are predicted to alter key intramolecular interactions in α/β domain II, or cause retention of an intron within the 3'UTR. Our cohort and functional data confirm exon skipping is a relatively common pathogenic mechanism in CDC45, and highlights the need for alternative splicing events, such as exon skipping, to be especially considered for variants initially predicted to be less likely to cause the phenotype, particularly synonymous variants.
Collapse
Affiliation(s)
- Kelly Schoch
- Division of Medical Genetics, Department of Pediatrics, Duke University School of Medicine, Durham, NC, USA
| | - Mischa S G Ruegg
- Department of Biochemistry, University of Otago, Dunedin, New Zealand
| | - Bridget J Fellows
- Department of Biochemistry, University of Otago, Dunedin, New Zealand
| | - Joseph Cao
- Division of Pediatric Radiology, Department of Radiology Duke University School of Medicine, Durham, NC, USA
| | - Sabine Uhrig
- Institute of Clinical Genetics, Klinikum Stuttgart, Stuttgart, Germany
| | | | - Saskia Biskup
- Center for Human Genetics Tuebingen and CeGaT GmbH, Tuebingen, Germany
| | - Samuel R A Hawarden
- Department of Pathology, Dunedin School of Medicine, University of Otago, Dunedin, New Zealand
| | - Vincenzo Salpietro
- Department of Biotechnological and Applied Clinical Sciences, University of L'Aquila, L'Aquila, Italy
| | - Valeria Capra
- Genomics and Clinical Genetics, IRCCS Istituto Giannina Gaslini, Genoa, Italy
| | - Chris M Brown
- Department of Biochemistry, University of Otago, Dunedin, New Zealand
| | - Andrea Accogli
- Department of Specialized Medicine, Division of Medical Genetics, McGill University Health Centre, Montreal, QC, Canada
- Department of Human Genetics, Faculty of Medicine, McGill University, Montreal, QC, Canada
| | - Vandana Shashi
- Division of Medical Genetics, Department of Pediatrics, Duke University School of Medicine, Durham, NC, USA
| | - Louise S Bicknell
- Department of Biochemistry, University of Otago, Dunedin, New Zealand.
| |
Collapse
|
43
|
Schraiber JG, Edge MD, Pennell M. Unifying approaches from statistical genetics and phylogenetics for mapping phenotypes in structured populations. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.02.10.579721. [PMID: 38496530 PMCID: PMC10942266 DOI: 10.1101/2024.02.10.579721] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/19/2024]
Abstract
In both statistical genetics and phylogenetics, a major goal is to identify correlations between genetic loci or other aspects of the phenotype or environment and a focal trait. In these two fields, there are sophisticated but disparate statistical traditions aimed at these tasks. The disconnect between their respective approaches is becoming untenable as questions in medicine, conservation biology, and evolutionary biology increasingly rely on integrating data from within and among species, and once-clear conceptual divisions are becoming increasingly blurred. To help bridge this divide, we derive a general model describing the covariance between the genetic contributions to the quantitative phenotypes of different individuals. Taking this approach shows that standard models in both statistical genetics (e.g., Genome-Wide Association Studies; GWAS) and phylogenetic comparative biology (e.g., phylogenetic regression) can be interpreted as special cases of this more general quantitative-genetic model. The fact that these models share the same core architecture means that we can build a unified understanding of the strengths and limitations of different methods for controlling for genetic structure when testing for associations. We develop intuition for why and when spurious correlations may occur using analytical theory and conduct population-genetic and phylogenetic simulations of quantitative traits. The structural similarity of problems in statistical genetics and phylogenetics enables us to take methodological advances from one field and apply them in the other. We demonstrate this by showing how a standard GWAS technique-including both the genetic relatedness matrix (GRM) as well as its leading eigenvectors, corresponding to the principal components of the genotype matrix, in a regression model-can mitigate spurious correlations in phylogenetic analyses. As a case study of this, we re-examine an analysis testing for co-evolution of expression levels between genes across a fungal phylogeny, and show that including covariance matrix eigenvectors as covariates decreases the false positive rate while simultaneously increasing the true positive rate. More generally, this work provides a foundation for more integrative approaches for understanding the genetic architecture of phenotypes and how evolutionary processes shape it.
Collapse
|
44
|
Robson ES, Ioannidis NM. GUANinE v1.0: Benchmark Datasets for Genomic AI Sequence-to-Function Models. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.10.12.562113. [PMID: 37904945 PMCID: PMC10614795 DOI: 10.1101/2023.10.12.562113] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/01/2023]
Abstract
Computational genomics increasingly relies on machine learning methods for genome interpretation, and the recent adoption of neural sequence-to-function models highlights the need for rigorous model specification and controlled evaluation, problems familiar to other fields of AI. Research strategies that have greatly benefited other fields - including benchmarking, auditing, and algorithmic fairness - are also needed to advance the field of genomic AI and to facilitate model development. Here we propose a genomic AI benchmark, GUANinE, for evaluating model generalization across a number of distinct genomic tasks. Compared to existing task formulations in computational genomics, GUANinE is large-scale, de-noised, and suitable for evaluating pretrained models. GUANinE v1.0 primarily focuses on functional genomics tasks such as functional element annotation and gene expression prediction, and it also draws upon connections to evolutionary biology through sequence conservation tasks. The current GUANinE tasks provide insight into the performance of existing genomic AI models and non-neural baselines, with opportunities to be refined, revisited, and broadened as the field matures. Finally, the GUANinE benchmark allows us to evaluate new self-supervised T5 models and explore the tradeoffs between tokenization and model performance, while showcasing the potential for self-supervision to complement existing pretraining procedures.
Collapse
Affiliation(s)
- Eyes S Robson
- Center for Computational Biology, UC Berkeley, Berkeley, CA 94720
| | - Nilah M Ioannidis
- Department of Electrical Engineering and Computer Sciences, UC Berkeley, Berkeley, CA 94720
| |
Collapse
|
45
|
Wang D, Li J, Wang E, Wang Y. DVA: predicting the functional impact of single nucleotide missense variants. BMC Bioinformatics 2024; 25:100. [PMID: 38448823 PMCID: PMC10916336 DOI: 10.1186/s12859-024-05709-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2022] [Accepted: 02/16/2024] [Indexed: 03/08/2024] Open
Abstract
BACKGROUND In the past decade, single nucleotide variants (SNVs) have been identified as having a significant relationship with the development and treatment of diseases. Among them, prioritizing missense variants for further functional impact investigation is an essential challenge in the study of common disease and cancer. Although several computational methods have been developed to predict the functional impacts of variants, the predictive ability of these methods is still insufficient in the Mendelian and cancer missense variants. RESULTS We present a novel prediction method called the disease-related variant annotation (DVA) method that predicts the effect of missense variants based on a comprehensive feature set of variants, notably, the allele frequency and protein-protein interaction network feature based on graph embedding. Benchmarked against datasets of single nucleotide missense variants, the DVA method outperforms the state-of-the-art methods by up to 0.473 in the area under receiver operating characteristic curve. The results demonstrate that the proposed method can accurately predict the functional impact of single nucleotide missense variants and substantially outperforms existing methods. CONCLUSIONS DVA is an effective framework for identifying the functional impact of disease missense variants based on a comprehensive feature set. Based on different datasets, DVA shows its generalization ability and robustness, and it also provides innovative ideas for the study of the functional mechanism and impact of SNVs.
Collapse
Affiliation(s)
- Dong Wang
- School of Computer Science and Technology, Harbin Institute of Technology Harbin, Harbin, Heilongjiang, China
| | - Jie Li
- School of Computer Science and Technology, Harbin Institute of Technology Harbin, Harbin, Heilongjiang, China.
| | - Edwin Wang
- Cumming School of Medicine, University of Calgary, Calgary, Canada
| | - Yadong Wang
- School of Computer Science and Technology, Harbin Institute of Technology Harbin, Harbin, Heilongjiang, China
| |
Collapse
|
46
|
Zhu H, Choi J, Kui N, Yang T, Wei P, Li D, Sun R. Identification of Pancreatic Cancer Germline Risk Variants With Effects That Are Modified by Smoking. JCO Precis Oncol 2024; 8:e2300355. [PMID: 38564682 DOI: 10.1200/po.23.00355] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2023] [Revised: 12/08/2023] [Accepted: 02/08/2024] [Indexed: 04/04/2024] Open
Abstract
PURPOSE Pancreatic cancer (PC) is a deadly disease most often diagnosed in late stages. Identification of high-risk subjects could both contribute to preventative measures and help diagnose the disease at earlier timepoints. However, known risk factors, assessed independently, are currently insufficient for accurately stratifying patients. We use large-scale data from the UK Biobank (UKB) to identify genetic variant-smoking interaction effects and show their importance in risk assessment. METHODS We draw data from 15,086,830 genetic variants and 315,512 individuals in the UKB. There are 765 cases of PC. Crucially, robust resampling corrections are used to overcome well-known challenges in hypothesis testing for interactions. Replication analysis is conducted in two independent cohorts totaling 793 cases and 570 controls. Integration of functional annotation data and construction of polygenic risk scores (PRS) demonstrate the additional insight provided by interaction effects. RESULTS We identify the genome-wide significant variant rs77196339 on chromosome 2 (per minor allele odds ratio in never-smokers, 2.31 [95% CI, 1.69 to 3.15]; per minor allele odds ratio in ever-smokers, 0.53 [95% CI, 0.30 to 0.91]; P = 3.54 × 10-8) as well as eight other loci with suggestive evidence of interaction effects (P < 5 × 10-6). The rs77196339 region association is validated (P < .05) in the replication sample. PRS incorporating interaction effects show improved discriminatory ability over PRS of main effects alone. CONCLUSION This study of genome-wide germline variants identified smoking to modify the effect of rs77196339 on PC risk. Interactions between known risk factors can provide critical information for identifying high-risk subjects, given the relative inadequacy of models considering only main effects, as demonstrated in PRS. Further studies are necessary to advance toward comprehensive risk prediction approaches for PC.
Collapse
Affiliation(s)
- Huili Zhu
- Section of Hematology and Oncology, Department of Medicine, Baylor College of Medicine, Houston, TX
| | - Jaihee Choi
- Department of Statistics, Rice University, Houston, TX
| | - Naishu Kui
- Department of Biostatistics, University of Texas School of Public Health, Houston, TX
| | - Tianzhong Yang
- Division of Biostatistics and Health Data Science, School of Public Health, University of Minnesota, Minneapolis, MN
| | - Peng Wei
- Department of Biostatistics, Division of Basic Science, The University of Texas MD Anderson Cancer Center, Houston, TX
| | - Donghui Li
- Department of Gastrointestinal Medical Oncology, Division of Cancer Medicine, The University of Texas MD Anderson Cancer Center, Houston, TX
| | - Ryan Sun
- Department of Biostatistics, Division of Basic Science, The University of Texas MD Anderson Cancer Center, Houston, TX
| |
Collapse
|
47
|
Sakamoto F, Kanamori S, Díaz LM, Cádiz A, Ishii Y, Yamaguchi K, Shigenobu S, Nakayama T, Makino T, Kawata M. Detection of evolutionary conserved and accelerated genomic regions related to adaptation to thermal niches in Anolis lizards. Ecol Evol 2024; 14:e11117. [PMID: 38455144 PMCID: PMC10920033 DOI: 10.1002/ece3.11117] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2023] [Revised: 02/18/2024] [Accepted: 02/22/2024] [Indexed: 03/09/2024] Open
Abstract
Understanding the genetic basis for adapting to thermal environments is important due to serious effects of global warming on ectothermic species. Various genes associated with thermal adaptation in lizards have been identified mainly focusing on changes in gene expression or the detection of positively selected genes using coding regions. Only a few comprehensive genome-wide analyses have included noncoding regions. This study aimed to identify evolutionarily conserved and accelerated genomic regions using whole genomes of eight Anolis lizard species that have repeatedly adapted to similar thermal environments in multiple lineages. Evolutionarily conserved genomic regions were extracted as regions with overall sequence conservation (regions with fewer base substitutions) across all lineages compared with the neutral model. Genomic regions that underwent accelerated evolution in the lineage of interest were identified as those with more base substitutions in the target branch than in the entire background branch. Conserved elements across all branches were relatively abundant in "intergenic" genomic regions among noncoding regions. Accelerated regions (ARs) of each lineage contained a significantly greater proportion of noncoding RNA genes than the entire multiple alignment. Common genes containing ARs within 5 kb of their vicinity in lineages with similar thermal habitats were identified. Many genes associated with circadian rhythms and behavior were found in hot-open and cool-shaded habitat lineages. These genes might play a role in contributing to thermal adaptation and assist future studies examining the function of genes involved in thermal adaptation via genome editing.
Collapse
Affiliation(s)
- Fuku Sakamoto
- Graduate School of Life SciencesTohoku UniversitySendaiJapan
| | | | - Luis M. Díaz
- National Museum of Natural History of CubaHavanaCuba
| | - Antonio Cádiz
- Faculty of BiologyUniversity of HavanaHavanaCuba
- Present address:
Department of BiologyUniversity of MiamiCoral GablesFloridaUSA
| | - Yuu Ishii
- Graduate School of Life SciencesTohoku UniversitySendaiJapan
| | | | - Shuji Shigenobu
- Trans‐Omics FacilityNational Institute for Basic BiologyOkazakiJapan
- Department of Basic Biology, School of Life ScienceThe Graduate University for Advanced Studies, SOKENDAIOkazakiJapan
| | - Takuro Nakayama
- Division of Life Sciences, Center for Computational SciencesUniversity of TsukubaTsukubaJapan
| | - Takashi Makino
- Graduate School of Life SciencesTohoku UniversitySendaiJapan
| | - Masakado Kawata
- Graduate School of Life SciencesTohoku UniversitySendaiJapan
| |
Collapse
|
48
|
Wang S, Jiang T, Yuan C, Wu L, Zhen X, Lei Y, Xie B, Tao R, Li C. An mRNA profiling assay incorporating coding region InDels for body fluid identification and the inference of the donor in mixed samples. Forensic Sci Int Genet 2024; 69:102979. [PMID: 38043150 DOI: 10.1016/j.fsigen.2023.102979] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2023] [Revised: 11/23/2023] [Accepted: 11/23/2023] [Indexed: 12/05/2023]
Abstract
Biological traces discovered at crime scenes hold significant significance in forensic investigations. In cases involving mixed body fluid stains, the evidentiary value of DNA profiles depends on the type of body fluid from which the DNA was obtained. Recently, coding region polymorphism analysis has proved to be a promising method for directly linking specific body fluids to their respective DNA contributors in mixtures, which may help to avoid "association fallacy" between separate DNA and RNA evidence. In this study, we present an update on previously reported coding region Single Nucleotide Polymorphisms (cSNPs) by exploring the potential application of coding region Insertion/Deletion polymorphisms (cInDels). Nine promising cInDels, selected from 70 mRNA markers based on stringent screening criteria, were integrated into an existing mRNA profiling assay. Subsequently, the body fluid specificity of our cInDel assay and the genotyping consistency between complementary DNA (cDNA) and genomic DNA (gDNA) were examined. Our study demonstrates that cInDels can function as important multifunctional genetic markers, as they provide not only the ability to confirm the presence of forensically relevant body fluids, but also the ability to associate/dissociate specific body fluids with particular donors.
Collapse
Affiliation(s)
- Shouyu Wang
- Department of Forensic Medicine, Shanghai Medical College, Fudan University, Shanghai 200032, China
| | - Tingting Jiang
- Department of Forensic Genetics, School of Forensic Medicine, Southern Medical University, Guangzhou 510515, China
| | - Chunyan Yuan
- Shanghai Key Laboratory of Forensic Medicine, Shanghai Forensic Service Platform, Academy of Forensic Sciences, Ministry of Justice, PR China, Shanghai 200063, China
| | - Liming Wu
- Shanghai Key Laboratory of Forensic Medicine, Shanghai Forensic Service Platform, Academy of Forensic Sciences, Ministry of Justice, PR China, Shanghai 200063, China
| | - Xiaoyuan Zhen
- Shanghai Key Laboratory of Forensic Medicine, Shanghai Forensic Service Platform, Academy of Forensic Sciences, Ministry of Justice, PR China, Shanghai 200063, China
| | - Yinlei Lei
- Shanghai Key Laboratory of Forensic Medicine, Shanghai Forensic Service Platform, Academy of Forensic Sciences, Ministry of Justice, PR China, Shanghai 200063, China
| | - Baoyan Xie
- Shanghai Key Laboratory of Forensic Medicine, Shanghai Forensic Service Platform, Academy of Forensic Sciences, Ministry of Justice, PR China, Shanghai 200063, China
| | - Ruiyang Tao
- Shanghai Key Laboratory of Forensic Medicine, Shanghai Forensic Service Platform, Academy of Forensic Sciences, Ministry of Justice, PR China, Shanghai 200063, China.
| | - Chengtao Li
- Shanghai Key Laboratory of Forensic Medicine, Shanghai Forensic Service Platform, Academy of Forensic Sciences, Ministry of Justice, PR China, Shanghai 200063, China; Shanghai Medical College, Fudan University, Shanghai 200032, China; Department of Forensic Genetics, School of Forensic Medicine, Southern Medical University, Guangzhou 510515, China.
| |
Collapse
|
49
|
Wollenberg Valero KC. Brief Communication: The Predictable Network Topology of Evolutionary Genomic Constraint. Mol Biol Evol 2024; 41:msae033. [PMID: 38366776 PMCID: PMC10906983 DOI: 10.1093/molbev/msae033] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2023] [Revised: 01/03/2024] [Accepted: 02/09/2024] [Indexed: 02/18/2024] Open
Abstract
Large-scale comparative genomics studies offer valuable resources for understanding both functional and evolutionary rate constraints. It is suggested that constraint aligns with the topology of genomic networks, increasing toward the center, with intermediate nodes combining relaxed constraint with higher contributions to the phenotype due to pleiotropy. However, this pattern has yet to be demonstrated in vertebrates. This study shows that constraint intensifies toward the network's center in placental mammals. Genes with rate changes associated with emergence of hibernation cluster mostly toward intermediate positions, with higher constraint in faster-evolving genes, which is indicative of a "sweet spot" for adaptation. If this trend holds universally, network node metrics could predict high-constraint regions even in clades lacking empirical constraint data.
Collapse
|
50
|
Lim D, Baek C, Blanchette M. Graphylo: A deep learning approach for predicting regulatory DNA and RNA sites from whole-genome multiple alignments. iScience 2024; 27:109002. [PMID: 38362268 PMCID: PMC10867641 DOI: 10.1016/j.isci.2024.109002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2023] [Revised: 12/17/2023] [Accepted: 01/19/2024] [Indexed: 02/17/2024] Open
Abstract
This study focuses on enhancing the prediction of regulatory functional sites in DNA and RNA sequences, a crucial aspect of gene regulation. Current methods, such as motif overrepresentation and machine learning, often lack specificity. To address this issue, the study leverages evolutionary information and introduces Graphylo, a deep-learning approach for predicting transcription factor binding sites in the human genome. Graphylo combines Convolutional Neural Networks for DNA sequences with Graph Convolutional Networks on phylogenetic trees, using information from placental mammals' genomes and evolutionary history. The research demonstrates that Graphylo consistently outperforms both single-species deep learning techniques and methods that incorporate inter-species conservation scores on a wide range of datasets. It achieves this by utilizing a species-based attention model for evolutionary insights and an integrated gradient approach for nucleotide-level model interpretability. This innovative approach offers a promising avenue for improving the accuracy of regulatory site prediction in genomics.
Collapse
|