1
|
Rynard KM, Han K, Wainberg M, Calarco JA, Lee HO, Lipshitz HD, Smibert CA, Tripathy SJ. ASiDentify (ASiD): a machine learning model to predict new autism spectrum disorder risk genes. Genetics 2025; 230:iyaf040. [PMID: 40088463 DOI: 10.1093/genetics/iyaf040] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2024] [Accepted: 02/26/2025] [Indexed: 03/17/2025] Open
Abstract
Autism spectrum disorder (ASD) is a neurodevelopmental disorder that affects nearly 3% of children and has a strong genetic component. While hundreds of ASD risk genes have been identified through sequencing studies, the genetic heterogeneity of ASD makes identifying additional risk genes using these methods challenging. To predict candidate ASD risk genes, we developed a simple machine learning model, ASiDentify (ASiD), using human genomic, RNA- and protein-based features. ASiD identified over 1,300 candidate ASD risk genes, over 300 of which have not been previously predicted. ASiD made accurate predictions of ASD risk genes using 6 features predictive of ASD risk gene status, including mutational constraint, synapse localization and gene expression in neurons, astrocytes and non-brain tissues. Particular functional groups of proteins found to be strongly implicated in ASD include RNA-binding proteins (RBPs) and chromatin regulators. We constructed additional logistic regression models to make predictions and assess informative features specific to RBPs, including mutational constraint, or chromatin regulators, for which both expression level in excitatory neurons and mutational constraint were informative. The fact that RBPs and chromatin regulators had informative features distinct from all protein-coding genes suggests that specific biological pathways connect risk genes with different molecular functions to ASD.
Collapse
Affiliation(s)
- Katherine M Rynard
- Department of Molecular Genetics, University of Toronto, Toronto, ON M5S 1A8, Canada
| | - Kara Han
- Campbell Family Mental Health Research Institute, Centre for Addiction and Mental Health, Toronto, ON M5T 1R8, Canada
- Krembil Institute for Neuroinformatics, Centre for Addiction and Mental Health, Toronto, ON M5T 1R8, Canada
| | - Michael Wainberg
- Campbell Family Mental Health Research Institute, Centre for Addiction and Mental Health, Toronto, ON M5T 1R8, Canada
- Krembil Institute for Neuroinformatics, Centre for Addiction and Mental Health, Toronto, ON M5T 1R8, Canada
| | - John A Calarco
- Department of Cell and Systems Biology, University of Toronto, Toronto, ON M5S 3G5, Canada
| | - Hyun O Lee
- Department of Biochemistry, University of Toronto, Toronto, ON M5S 1A8, Canada
| | - Howard D Lipshitz
- Department of Molecular Genetics, University of Toronto, Toronto, ON M5S 1A8, Canada
| | - Craig A Smibert
- Department of Molecular Genetics, University of Toronto, Toronto, ON M5S 1A8, Canada
- Department of Biochemistry, University of Toronto, Toronto, ON M5S 1A8, Canada
| | - Shreejoy J Tripathy
- Campbell Family Mental Health Research Institute, Centre for Addiction and Mental Health, Toronto, ON M5T 1R8, Canada
- Krembil Institute for Neuroinformatics, Centre for Addiction and Mental Health, Toronto, ON M5T 1R8, Canada
- Institute of Medical Sciences, University of Toronto, Toronto, ON M5S 1A8, Canada
- Department of Psychiatry, University of Toronto, Toronto, ON M5T 1R8, Canada
- Department of Physiology, University of Toronto, Toronto, ON M5S 1A8, Canada
| |
Collapse
|
2
|
Poquérusse J, Whitford W, Taylor J, Gregersen N, Love DR, Tsang B, Drake KM, Snell RG, Lehnert K, Jacobsen JC. Germline mosaicism in TCF20-associated neurodevelopmental disorders: a case study and literature review. J Hum Genet 2025; 70:215-222. [PMID: 40011607 PMCID: PMC11882450 DOI: 10.1038/s10038-025-01323-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2024] [Revised: 02/05/2025] [Accepted: 02/06/2025] [Indexed: 02/28/2025]
Abstract
Autosomal dominant variants in transcription factor 20 (TCF20) can result in TCF20-associated neurodevelopmental disorder (TAND), a condition characterized by developmental delay and intellectual disability, autism, dysmorphisms, dystonia, and variable other neurological features. To date, a total of 91 individuals with TAND have been reported; ~67% of cases arose de novo, while ~10% were inherited, and, intriguingly, ~8% were either confirmed or suspected to have arisen via germline mosaicism. Here, we describe two siblings with a developmental condition characterized by intellectual disability, autism, a circadian rhythm sleep disorder, and attention deficit hyperactivity disorder (ADHD) caused by a novel heterozygous single nucleotide deletion in the TCF20 gene, NM_001378418.1:c.4737del; NP_001365347.1:p.Lys1579Asnfs*36 (GRCh38/hg38). The variant was not detected in DNA extracted from peripheral blood in either parent by Sanger sequencing of PCR-generated amplicons, or by deep sequencing of PCR amplicons using MiSeq and MinION. However, droplet digital PCR (ddPCR) of DNA derived from early morning urine detected the variation in 3.2% of the father's urothelial cells, confirming germline mosaicism. This report is only the second to confirm with physical evidence TCF20 germline mosaicism and discusses germline mosaicism as a likely under-detected mode of inheritance in neurodevelopmental conditions.
Collapse
Affiliation(s)
- Jessie Poquérusse
- School of Biological Sciences, The University of Auckland, Auckland, New Zealand
- Centre for Brain Research, The University of Auckland, Auckland, New Zealand
| | - Whitney Whitford
- School of Biological Sciences, The University of Auckland, Auckland, New Zealand
- Centre for Brain Research, The University of Auckland, Auckland, New Zealand
| | - Juliet Taylor
- Genetic Health Service New Zealand, Auckland City Hospital, Auckland, New Zealand
| | - Nerine Gregersen
- Genetic Health Service New Zealand, Auckland City Hospital, Auckland, New Zealand
| | - Donald R Love
- Diagnostic Genetics, LabPLUS, Auckland City Hospital, Auckland, New Zealand
- Genetic Pathology, Sidra Medicine, Doha, Qatar
| | - Bobby Tsang
- Pediatrics and Newborn Services, Waitakere Hospital, Auckland, New Zealand
| | - Kylie M Drake
- Canterbury Health Laboratories, Christchurch, New Zealand
| | - Russell G Snell
- School of Biological Sciences, The University of Auckland, Auckland, New Zealand
- Centre for Brain Research, The University of Auckland, Auckland, New Zealand
| | - Klaus Lehnert
- School of Biological Sciences, The University of Auckland, Auckland, New Zealand
- Centre for Brain Research, The University of Auckland, Auckland, New Zealand
| | - Jessie C Jacobsen
- School of Biological Sciences, The University of Auckland, Auckland, New Zealand.
- Centre for Brain Research, The University of Auckland, Auckland, New Zealand.
| |
Collapse
|
3
|
Zerafati-Jahromi G, Oxman E, Hoang HD, Charng WL, Kotla T, Yuan W, Ishibashi K, Sebaoui S, Luedtke K, Winrow B, Ganetzky RD, Ruiz A, Manso-Basúz C, Spataro N, Kannu P, Athey T, Peroutka C, Barnes C, Sidlow R, Anadiotis G, Magnussen K, Valenzuela I, Moles-Fernandez A, Berger S, Grant CL, Vilain E, Arnadottir GA, Sulem P, Sulem TS, Stefansson K, Massey S, Ginn N, Poduri A, D'Gama AM, Valentine R, Trowbridge SK, Murali CN, Franciskovich R, Tran Y, Webb BD, Keppler-Noreuil KM, Hall AL, McGivern B, Monaghan KG, Guillen Sacoto MJ, Baldridge D, Silverman GA, Dahiya S, Turner TN, Schedl T, Corbin JG, Pak SC, Zohn IE, Gurnett CA. Sequence variants in HECTD1 result in a variable neurodevelopmental disorder. Am J Hum Genet 2025; 112:537-553. [PMID: 39879987 PMCID: PMC11947180 DOI: 10.1016/j.ajhg.2025.01.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2024] [Revised: 12/18/2024] [Accepted: 01/02/2025] [Indexed: 01/31/2025] Open
Abstract
Dysregulation of genes encoding the homologous to E6AP C-terminus (HECT) E3 ubiquitin ligases has been linked to cancer and structural birth defects. One member of this family, the HECT-domain-containing protein 1 (HECTD1), mediates developmental pathways, including cell signaling, gene expression, and embryogenesis. Through GeneMatcher, we identified 14 unrelated individuals with 15 different variants in HECTD1 (10 missense, 3 frameshift, 1 nonsense, and 1 splicing variant) with neurodevelopmental disorders (NDDs), including autism, attention-deficit/hyperactivity disorder, and epilepsy. Of these 15 HECTD1 variants, 10 occurred de novo, 3 had unknown inheritance, and 2 were compound heterozygous. While all individuals in this cohort displayed NDDs, no genotype-phenotype correlation was apparent. Conditional knockout of Hectd1 in the neural lineage in mice resulted in microcephaly, severe hippocampal malformations, and complete agenesis of the corpus callosum, supporting a role for Hectd1 in embryonic brain development. Functional studies of select variants in C. elegans revealed dominant effects, including either change-of-function or loss-of-function/haploinsufficient mechanisms, which may explain phenotypic heterogeneity. Significant enrichment of de novo variants in HECTD1 was also shown in an independent cohort of 53,305 published trios with NDDs or congenital heart disease. Thus, our clinical and functional data support a critical requirement of HECTD1 for human brain development.
Collapse
Affiliation(s)
| | - Elias Oxman
- Center for Genetic Medicine Research, Children's National Hospital, Washington, DC, USA
| | - Hieu D Hoang
- Department of Pediatrics, Washington University in St. Louis, St. Louis, MO, USA
| | - Wu-Lin Charng
- Department of Neurology, Washington University in St. Louis, St. Louis, MO, USA
| | - Tanvitha Kotla
- Department of Pediatrics, Washington University in St. Louis, St. Louis, MO, USA
| | - Weimin Yuan
- Department of Pediatrics, Washington University in St. Louis, St. Louis, MO, USA
| | - Keito Ishibashi
- Center for Genetic Medicine Research, Children's National Hospital, Washington, DC, USA
| | - Sonia Sebaoui
- Center for Neuroscience Research, Children's National Hospital, Washington, DC, USA
| | - Kathryn Luedtke
- Center for Genetic Medicine Research, Children's National Hospital, Washington, DC, USA
| | - Bryce Winrow
- Center for Genetic Medicine Research, Children's National Hospital, Washington, DC, USA
| | - Rebecca D Ganetzky
- Mitochondrial Medicine Frontier Program, Division of Human Genetics, Children's Hospital of Philadelphia, Philadelphia, PA, USA; Department of Pediatrics, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA; Center for Computational Genomics Medicine, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Anna Ruiz
- Center for Genomic Medicine, Parc Taulí Hospital University, Parc Taulí Institute of Research and Innovation (I3PT-CERCA), Autonomous University of Barcelona, Sabadell, Spain
| | - Carmen Manso-Basúz
- Center for Genomic Medicine, Parc Taulí Hospital University, Parc Taulí Institute of Research and Innovation (I3PT-CERCA), Autonomous University of Barcelona, Sabadell, Spain
| | - Nino Spataro
- Center for Genomic Medicine, Parc Taulí Hospital University, Parc Taulí Institute of Research and Innovation (I3PT-CERCA), Autonomous University of Barcelona, Sabadell, Spain
| | - Peter Kannu
- Department of Medical Genetics, Alberta Health Services, Edmonton, AB, Canada
| | - Taryn Athey
- Department of Medical Genetics, Alberta Health Services, Edmonton, AB, Canada
| | - Christina Peroutka
- Department of Pediatrics, University of Virginia, Charlottesville, VA, USA
| | - Caitlin Barnes
- Department of Pediatrics, University of Virginia, Charlottesville, VA, USA
| | - Richard Sidlow
- Department of Medical Genetics and Metabolism, Valley Children's Hospital, Madera, CA, USA
| | - George Anadiotis
- Department of Genetics and Metabolism, Randall Children's Hospital at Legacy Emanuel, Portland, OR, USA
| | - Kari Magnussen
- Department of Genetics and Metabolism, Randall Children's Hospital at Legacy Emanuel, Portland, OR, USA
| | - Irene Valenzuela
- Department of Clinical and Molecular Genetics, University Hospital Vall d'Hebron and Medicine Genetics Group, Valle Hebron Research Institute, Barcelona, Spain
| | - Alejandro Moles-Fernandez
- Department of Clinical and Molecular Genetics, University Hospital Vall d'Hebron and Medicine Genetics Group, Valle Hebron Research Institute, Barcelona, Spain
| | - Seth Berger
- Rare Disease Institute, Children's National Hospital, Washington, DC, USA
| | - Christina L Grant
- Rare Disease Institute, Children's National Hospital, Washington, DC, USA
| | - Eric Vilain
- Institute for Clinical and Translational Science, University of California, Irvine, Irvine, CA, USA
| | | | | | | | | | - Shavonne Massey
- Division of Neurology, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Natalie Ginn
- Division of Neurology, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Annapurna Poduri
- Department of Neurology, Boston Children's Hospital, Boston, MA, USA; Epilepsy Genetics Program, Department of Neurology, Boston Children's Hospital, Boston, MA, USA
| | - Alissa M D'Gama
- Division of Newborn Medicine, Boston Children's Hospital, Boston, MA, USA; Epilepsy Genetics Program, Department of Neurology, Boston Children's Hospital, Boston, MA, USA; Department of Pediatrics, Harvard Medical School, Boston, MA, USA
| | - Rozalia Valentine
- Epilepsy Genetics Program, Department of Neurology, Boston Children's Hospital, Boston, MA, USA
| | - Sara K Trowbridge
- Department of Neurology, Boston Children's Hospital, Boston, MA, USA; Department of Neurology, Harvard Medical School, Boston, MA, USA
| | - Chaya N Murali
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
| | - Rachel Franciskovich
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
| | - Yen Tran
- Department of Neurology, Baylor College of Medicine, Houston, TX, USA
| | - Bryn D Webb
- Department of Pediatrics, University of Wisconsin School of Medicine and Public Health, Madison, WI, USA
| | - Kim M Keppler-Noreuil
- Department of Pediatrics, University of Wisconsin School of Medicine and Public Health, Madison, WI, USA
| | - April L Hall
- Department of Pediatrics, University of Wisconsin School of Medicine and Public Health, Madison, WI, USA
| | | | | | | | - Dustin Baldridge
- Department of Pediatrics, Washington University in St. Louis, St. Louis, MO, USA
| | - Gary A Silverman
- Department of Pediatrics, Washington University in St. Louis, St. Louis, MO, USA
| | - Sonika Dahiya
- Department of Pathology, Washington University in St. Louis, St. Louis, MO, USA
| | - Tychele N Turner
- Department of Genetics, Washington University in St. Louis, St. Louis, MO, USA
| | - Tim Schedl
- Department of Genetics, Washington University in St. Louis, St. Louis, MO, USA
| | - Joshua G Corbin
- Center for Neuroscience Research, Children's National Hospital, Washington, DC, USA
| | - Stephen C Pak
- Department of Pediatrics, Washington University in St. Louis, St. Louis, MO, USA
| | - Irene E Zohn
- Center for Genetic Medicine Research, Children's National Hospital, Washington, DC, USA.
| | - Christina A Gurnett
- Department of Neurology, Washington University in St. Louis, St. Louis, MO, USA.
| |
Collapse
|
4
|
Gao Y, Shonai D, Trn M, Zhao J, Soderblom EJ, Garcia-Moreno SA, Gersbach CA, Wetsel WC, Dawson G, Velmeshev D, Jiang YH, Sloofman LG, Buxbaum JD, Soderling SH. Proximity analysis of native proteomes reveals phenotypic modifiers in a mouse model of autism and related neurodevelopmental conditions. Nat Commun 2024; 15:6801. [PMID: 39122707 PMCID: PMC11316102 DOI: 10.1038/s41467-024-51037-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2024] [Accepted: 07/23/2024] [Indexed: 08/12/2024] Open
Abstract
One of the main drivers of autism spectrum disorder is risk alleles within hundreds of genes, which may interact within shared but unknown protein complexes. Here we develop a scalable genome-editing-mediated approach to target 14 high-confidence autism risk genes within the mouse brain for proximity-based endogenous proteomics, achieving the identification of high-specificity spatial proteomes. The resulting native proximity proteomes are enriched for human genes dysregulated in the brain of autistic individuals, and reveal proximity interactions between proteins from high-confidence risk genes with those of lower-confidence that may provide new avenues to prioritize genetic risk. Importantly, the datasets are enriched for shared cellular functions and genetic interactions that may underlie the condition. We test this notion by spatial proteomics and CRISPR-based regulation of expression in two autism models, demonstrating functional interactions that modulate mechanisms of their dysregulation. Together, these results reveal native proteome networks in vivo relevant to autism, providing new inroads for understanding and manipulating the cellular drivers underpinning its etiology.
Collapse
Affiliation(s)
- Yudong Gao
- Department of Cell Biology, Duke University School of Medicine, Durham, NC, USA
- Department of Neuroscience, Baylor College of Medicine, Houston, TX, USA
| | - Daichi Shonai
- Department of Molecular Genetics and Microbiology, Duke University School of Medicine, Durham, NC, USA
| | - Matthew Trn
- Department of Cell Biology, Duke University School of Medicine, Durham, NC, USA
| | - Jieqing Zhao
- Department of Biology, Duke University, Durham, NC, USA
| | - Erik J Soderblom
- Department of Cell Biology, Duke University School of Medicine, Durham, NC, USA
- Proteomics and Metabolomics Shared Resource, Duke University School of Medicine, Durham, NC, USA
| | | | - Charles A Gersbach
- Department of Cell Biology, Duke University School of Medicine, Durham, NC, USA
- Department of Biomedical Engineering, Duke University, Durham, NC, USA
- Center for Advanced Genomic Technologies, Duke University, Durham, NC, USA
- Department of Surgery, Duke University Medical Center, Durham, NC, USA
| | - William C Wetsel
- Department of Cell Biology, Duke University School of Medicine, Durham, NC, USA
- Department of Psychiatry and Behavioral Sciences, Duke University School of Medicine, Durham, NC, USA
- Department of Neurobiology, Duke University School of Medicine, Durham, NC, USA
- Mouse Behavioral and Neuroendocrine Analysis Core Facility, Duke University School of Medicine, Durham, NC, USA
| | - Geraldine Dawson
- Department of Psychiatry and Behavioral Sciences, Duke University School of Medicine, Durham, NC, USA
| | - Dmitry Velmeshev
- Department of Neurobiology, Duke University School of Medicine, Durham, NC, USA
| | - Yong-Hui Jiang
- Department of Genetics, Yale University School of Medicine, New Haven, Connecticut, USA
- Department of Neuroscience, Yale University School of Medicine, New Haven, Connecticut, USA
- Department of Pediatrics, Yale University School of Medicine, New Haven, Connecticut, USA
| | - Laura G Sloofman
- Seaver Autism Center for Research and Treatment, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Joseph D Buxbaum
- Seaver Autism Center for Research and Treatment, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- The Mindich Child Health and Development Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Scott H Soderling
- Department of Cell Biology, Duke University School of Medicine, Durham, NC, USA.
- Department of Neurobiology, Duke University School of Medicine, Durham, NC, USA.
| |
Collapse
|
5
|
Nóbrega IDS, Teles e Silva AL, Yokota-Moreno BY, Sertié AL. The Importance of Large-Scale Genomic Studies to Unravel Genetic Risk Factors for Autism. Int J Mol Sci 2024; 25:5816. [PMID: 38892002 PMCID: PMC11172008 DOI: 10.3390/ijms25115816] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2024] [Revised: 05/17/2024] [Accepted: 05/21/2024] [Indexed: 06/21/2024] Open
Abstract
Autism spectrum disorder (ASD) is a common and highly heritable neurodevelopmental disorder. During the last 15 years, advances in genomic technologies and the availability of increasingly large patient cohorts have greatly expanded our knowledge of the genetic architecture of ASD and its neurobiological mechanisms. Over two hundred risk regions and genes carrying rare de novo and transmitted high-impact variants have been identified. Additionally, common variants with small individual effect size are also important, and a number of loci are now being uncovered. At the same time, these new insights have highlighted ongoing challenges. In this perspective article, we summarize developments in ASD genetic research and address the enormous impact of large-scale genomic initiatives on ASD gene discovery.
Collapse
Affiliation(s)
| | | | | | - Andréa Laurato Sertié
- Faculdade Israelita de Ciências da Saúde Albert Einstein, Hospital Israelita Albert Einstein, Rua Comendador Elias Jafet, 755. Morumbi, São Paulo 05653-000, Brazil; (I.d.S.N.); (A.L.T.e.S.); (B.Y.Y.-M.)
| |
Collapse
|
6
|
Zadok N, Ast G, Sharan R. A network-based method for associating genes with autism spectrum disorder. FRONTIERS IN BIOINFORMATICS 2024; 4:1295600. [PMID: 38525240 PMCID: PMC10960359 DOI: 10.3389/fbinf.2024.1295600] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2023] [Accepted: 02/26/2024] [Indexed: 03/26/2024] Open
Abstract
Autism spectrum disorder (ASD) is a highly heritable complex disease that affects 1% of the population, yet its underlying molecular mechanisms are largely unknown. Here we study the problem of predicting causal genes for ASD by combining genome-scale data with a network propagation approach. We construct a predictor that integrates multiple omic data sets that assess genomic, transcriptomic, proteomic, and phosphoproteomic associations with ASD. In cross validation our predictor yields mean area under the ROC curve of 0.87 and area under the precision-recall curve of 0.89. We further show that it outperforms previous gene-level predictors of autism association. Finally, we show that we can use the model to predict genes associated with Schizophrenia which is known to share genetic components with ASD.
Collapse
Affiliation(s)
- Neta Zadok
- Blavatnik School of Computer Science, Tel Aviv University, Tel Aviv, Israel
| | - Gil Ast
- Department of Human Molecular Genetics and Biochemistry, Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
| | - Roded Sharan
- Blavatnik School of Computer Science, Tel Aviv University, Tel Aviv, Israel
| |
Collapse
|
7
|
Jia Q, Wang X, Zhou R, Ma B, Fei F, Han H. Systematic bibliometric and visualized analysis of research hotspots and trends in artificial intelligence in autism spectrum disorder. Front Neuroinform 2023; 17:1310400. [PMID: 38125308 PMCID: PMC10731312 DOI: 10.3389/fninf.2023.1310400] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2023] [Accepted: 11/20/2023] [Indexed: 12/23/2023] Open
Abstract
Background Artificial intelligence (AI) has been the subject of studies in autism spectrum disorder (ASD) and may affect its identification, diagnosis, intervention, and other medical practices in the future. Although previous studies have used bibliometric techniques to analyze and investigate AI, there has been little research on the adoption of AI in ASD. This study aimed to explore the broad applications and research frontiers of AI used in ASD. Methods Citation data were retrieved from the Web of Science Core Collection (WoSCC) database to assess the extent to which AI is used in ASD. CiteSpace.5.8. R3 and VOSviewer, two online tools for literature metrology analysis, were used to analyze the data. Results A total of 776 publications from 291 countries and regions were analyzed; of these, 256 publications were from the United States and 173 publications were from China, and England had the largest centrality of 0.33; Stanford University had the highest H-index of 17; and the largest cluster label of co-cited references was machine learning. In addition, keywords with a high number of occurrences in this field were autism spectrum disorder (295), children (255), classification (156) and diagnosis (77). The burst keywords from 2021 to 2023 were infants and feature selection, and from 2022 to 2023, the burst keyword was corpus callosum. Conclusion This research provides a systematic analysis of the literature concerning AI used in ASD, presenting an overall demonstration in this field. In this area, the United States and China have the largest number of publications, England has the greatest influence, and Stanford University is the most influential. In addition, the research on AI used in ASD mostly focuses on classification and diagnosis, and "infants, feature selection, and corpus callosum are at the forefront, providing directions for future research. However, the use of AI technologies to identify ASD will require further research.
Collapse
Affiliation(s)
- Qianfang Jia
- Hebei University of Chinese Medicine, Shijiazhuang, China
| | - Xiaofang Wang
- Hebei University of Chinese Medicine, Shijiazhuang, China
| | - Rongyi Zhou
- Children’s Brain Disease Diagnosis, Treatment and Rehabilitation Center of the First Affiliated Hospital of Henan University of Chinese Medicine, Zhengzhou, China
- School of Pediatric Medicine, Henan University of Chinese Medicine, Zhengzhou, China
| | - Bingxiang Ma
- Children’s Brain Disease Diagnosis, Treatment and Rehabilitation Center of the First Affiliated Hospital of Henan University of Chinese Medicine, Zhengzhou, China
- School of Pediatric Medicine, Henan University of Chinese Medicine, Zhengzhou, China
| | - Fangqin Fei
- Department of Nursing, the First People’s Hospital of Huzhou, Huzhou University, Huzhou, China
| | - Hui Han
- Department of Nursing, the First People’s Hospital of Huzhou, Huzhou University, Huzhou, China
| |
Collapse
|
8
|
Nisar S, Haris M. Neuroimaging genetics approaches to identify new biomarkers for the early diagnosis of autism spectrum disorder. Mol Psychiatry 2023; 28:4995-5008. [PMID: 37069342 PMCID: PMC11041805 DOI: 10.1038/s41380-023-02060-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/01/2022] [Revised: 03/23/2023] [Accepted: 03/28/2023] [Indexed: 04/19/2023]
Abstract
Autism-spectrum disorders (ASDs) are developmental disabilities that manifest in early childhood and are characterized by qualitative abnormalities in social behaviors, communication skills, and restrictive or repetitive behaviors. To explore the neurobiological mechanisms in ASD, extensive research has been done to identify potential diagnostic biomarkers through a neuroimaging genetics approach. Neuroimaging genetics helps to identify ASD-risk genes that contribute to structural and functional variations in brain circuitry and validate biological changes by elucidating the mechanisms and pathways that confer genetic risk. Integrating artificial intelligence models with neuroimaging data lays the groundwork for accurate diagnosis and facilitates the identification of early diagnostic biomarkers for ASD. This review discusses the significance of neuroimaging genetics approaches to gaining a better understanding of the perturbed neurochemical system and molecular pathways in ASD and how these approaches can detect structural, functional, and metabolic changes and lead to the discovery of novel biomarkers for the early diagnosis of ASD.
Collapse
Affiliation(s)
- Sabah Nisar
- Laboratory of Molecular and Metabolic Imaging, Sidra Medicine, Doha, Qatar
- Department of Diagnostic Imaging, St Jude Children's Research Hospital, Memphis, TN, USA
| | - Mohammad Haris
- Laboratory of Molecular and Metabolic Imaging, Sidra Medicine, Doha, Qatar.
- Center for Advanced Metabolic Imaging in Precision Medicine, Department of Radiology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA.
- Laboratory Animal Research Center, Qatar University, Doha, Qatar.
| |
Collapse
|
9
|
Neff RA, Bosch-Gutierrez A, Sun Y, Katsyv I, Song WM, Wang M, Walsh MJ, Zhang B. Dysfunction of ubiquitin protein ligase MYCBP2 leads to cell resilience in human breast cancers. NAR Cancer 2023; 5:zcad036. [PMID: 37435531 PMCID: PMC10331931 DOI: 10.1093/narcan/zcad036] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2023] [Accepted: 06/26/2023] [Indexed: 07/13/2023] Open
Abstract
Breast cancer is the most common type of cancer among women worldwide, and it is estimated that 294 000 new diagnoses and 37 000 deaths will occur each year in the United States alone by 2030. Large-scale genomic studies have identified a number of genetic loci with alterations in breast cancer. However, identification of the genes that are critical for tumorgenicity still remains a challenge. Here, we perform a comprehensive functional multi-omics analysis of somatic mutations in breast cancer and identify previously unknown key regulators of breast cancer tumorgenicity. We identify dysregulation of MYCBP2, an E3 ubiquitin ligase and an upstream regulator of mTOR signaling, is accompanied with decreased disease-free survival. We validate MYCBP2 as a key target through depletion siRNA using in vitro apoptosis assays in MCF10A, MCF7 and T47D cells. We demonstrate that MYCBP2 loss is associated with resistance to apoptosis from cisplatin-induced DNA damage and cell cycle changes, and that CHEK1 inhibition can modulate MYCBP2 activity and caspase cleavage. Furthermore, we show that MYCBP2 knockdown is associated with transcriptomic responses in TSC2 and in apoptosis genes and interleukins. Therefore, we show that MYCBP2 is an important genetic target that represents a key node regulating multiple molecular pathways in breast cancer corresponding with apparent drug resistance in our study.
Collapse
Affiliation(s)
- Ryan A Neff
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, One Gustave L. Levy Place, New York, NY 10029, USA
- Mount Sinai Center for Transformative Disease Modeling, Icahn School of Medicine at Mount Sinai, One Gustave L. Levy Place, New York, NY 10029, USA
- Icahn Institute for Data Science and Genomic Technology, Icahn School of Medicine at Mount Sinai, One Gustave L. Levy Place, New York, NY 10029, USA
- Medical Scientist Training Program, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Almudena Bosch-Gutierrez
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, One Gustave L. Levy Place, New York, NY 10029, USA
- Department of Pharmacological Sciences, Icahn School of Medicine at Mount Sinai, One Gustave L. Levy Place, New York, NY 10029, USA
- The Mount Sinai Center for RNA Biology and Medicine, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Yifei Sun
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, One Gustave L. Levy Place, New York, NY 10029, USA
- Department of Pharmacological Sciences, Icahn School of Medicine at Mount Sinai, One Gustave L. Levy Place, New York, NY 10029, USA
- The Mount Sinai Center for RNA Biology and Medicine, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Igor Katsyv
- Department of Pathology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA 02115, USA
| | - Won-min Song
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, One Gustave L. Levy Place, New York, NY 10029, USA
- Mount Sinai Center for Transformative Disease Modeling, Icahn School of Medicine at Mount Sinai, One Gustave L. Levy Place, New York, NY 10029, USA
- Icahn Institute for Data Science and Genomic Technology, Icahn School of Medicine at Mount Sinai, One Gustave L. Levy Place, New York, NY 10029, USA
| | - Minghui Wang
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, One Gustave L. Levy Place, New York, NY 10029, USA
- Mount Sinai Center for Transformative Disease Modeling, Icahn School of Medicine at Mount Sinai, One Gustave L. Levy Place, New York, NY 10029, USA
- Icahn Institute for Data Science and Genomic Technology, Icahn School of Medicine at Mount Sinai, One Gustave L. Levy Place, New York, NY 10029, USA
| | - Martin J Walsh
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, One Gustave L. Levy Place, New York, NY 10029, USA
- Department of Pharmacological Sciences, Icahn School of Medicine at Mount Sinai, One Gustave L. Levy Place, New York, NY 10029, USA
- The Mount Sinai Center for RNA Biology and Medicine, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Bin Zhang
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, One Gustave L. Levy Place, New York, NY 10029, USA
- Mount Sinai Center for Transformative Disease Modeling, Icahn School of Medicine at Mount Sinai, One Gustave L. Levy Place, New York, NY 10029, USA
- Icahn Institute for Data Science and Genomic Technology, Icahn School of Medicine at Mount Sinai, One Gustave L. Levy Place, New York, NY 10029, USA
- Department of Pharmacological Sciences, Icahn School of Medicine at Mount Sinai, One Gustave L. Levy Place, New York, NY 10029, USA
| |
Collapse
|
10
|
Córdoba-Jover B, Ribera J, Portolés I, Lecue E, Rodriguez-Vita J, Pérez-Sisqués L, Mannara F, Solsona-Vilarrasa E, García-Ruiz C, Fernández-Checa JC, Casals G, Rodríguez-Revenga L, Álvarez-Mora MI, Arteche-López A, Díaz de Bustamante A, Calvo R, Pujol A, Azkargorta M, Elortza F, Malagelada C, Pinyol R, Huguet-Pradell J, Melgar-Lesmes P, Jiménez W, Morales-Ruiz M. Tcf20 deficiency is associated with increased liver fibrogenesis and alterations in mitochondrial metabolism in mice and humans. Liver Int 2023; 43:1822-1836. [PMID: 37312667 DOI: 10.1111/liv.15640] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/23/2022] [Revised: 05/23/2023] [Accepted: 05/28/2023] [Indexed: 06/15/2023]
Abstract
BACKGROUND & AIMS Transcription co-activator factor 20 (TCF20) is a regulator of transcription factors involved in extracellular matrix remodelling. In addition, TCF20 genomic variants in humans have been associated with impaired intellectual disability. Therefore, we hypothesized that TCF20 has several functions beyond those described in neurogenesis, including the regulation of fibrogenesis. METHODS Tcf20 knock-out (Tcf20-/- ) and Tcf20 heterozygous mice were generated by homologous recombination. TCF20 gene genotyping and expression was assessed in patients with pathogenic variants in the TCF20 gene. Neural development was investigated by immufluorescense. Mitochondrial metabolic activity was evaluated with the Seahorse analyser. The proteome analysis was carried out by gas chromatography mass-spectrometry. RESULTS Characterization of Tcf20-/- newborn mice showed impaired neural development and death after birth. In contrast, heterozygous mice were viable but showed higher CCl4 -induced liver fibrosis and a differential expression of genes involved in extracellular matrix homeostasis compared to wild-type mice, along with abnormal behavioural patterns compatible with autism-like phenotypes. Tcf20-/- embryonic livers and mouse embryonic fibroblast (MEF) cells revealed differential expression of structural proteins involved in the mitochondrial oxidative phosphorylation chain, increased rates of mitochondrial metabolic activity and alterations in metabolites of the citric acid cycle. These results parallel to those found in patients with TCF20 pathogenic variants, including alterations of the fibrosis scores (ELF and APRI) and the elevation of succinate concentration in plasma. CONCLUSIONS We demonstrated a new role of Tcf20 in fibrogenesis and mitochondria metabolism in mice and showed the association of TCF20 deficiency with fibrosis and metabolic biomarkers in humans.
Collapse
Affiliation(s)
- Bernat Córdoba-Jover
- Biochemistry and Molecular Genetics Department, Hospital Clinic of Barcelona, Institut d'Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS), Barcelona, Spain
| | - Jordi Ribera
- Biochemistry and Molecular Genetics Department, Hospital Clinic of Barcelona, Institut d'Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS), Barcelona, Spain
- CIBERehd, Instituto de Salud Carlos III, Madrid, Spain
| | - Irene Portolés
- Biochemistry and Molecular Genetics Department, Hospital Clinic of Barcelona, Institut d'Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS), Barcelona, Spain
| | - Elena Lecue
- Biochemistry and Molecular Genetics Department, Hospital Clinic of Barcelona, Institut d'Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS), Barcelona, Spain
| | - Juan Rodriguez-Vita
- Tumour-Stroma Communication Laboratory, Centro de Investigación Príncipe Felipe, Valencia, Spain
| | - Leticia Pérez-Sisqués
- Department of Biomedicine-Biochemistry Unit, School of Medicine University of Barcelona, Barcelona, Spain
| | - Francesco Mannara
- Institut d'Investigacions Biomèdiques August Pi iSunyer (IDIBAPS), Barcelona, Spain
| | - Estel Solsona-Vilarrasa
- CIBERehd, Instituto de Salud Carlos III, Madrid, Spain
- Cell Death and Proliferation, Institute of Biomedical Research of Barcelona (IIBB), Consejo Superior Investigaciones Científicas (CSIC), Liver Unit, Hospital Clínic, IDIBAPS, Universitat de Barcelona, Barcelona, Spain
| | - Carmen García-Ruiz
- CIBERehd, Instituto de Salud Carlos III, Madrid, Spain
- Cell Death and Proliferation, Institute of Biomedical Research of Barcelona (IIBB), Consejo Superior Investigaciones Científicas (CSIC), Liver Unit, Hospital Clínic, IDIBAPS, Universitat de Barcelona, Barcelona, Spain
- USC Research Center for ALPD, Keck School of Medicine, Los Angeles, California, USA
| | - José C Fernández-Checa
- CIBERehd, Instituto de Salud Carlos III, Madrid, Spain
- Cell Death and Proliferation, Institute of Biomedical Research of Barcelona (IIBB), Consejo Superior Investigaciones Científicas (CSIC), Liver Unit, Hospital Clínic, IDIBAPS, Universitat de Barcelona, Barcelona, Spain
- USC Research Center for ALPD, Keck School of Medicine, Los Angeles, California, USA
| | - Gregori Casals
- Biochemistry and Molecular Genetics Department, Hospital Clinic of Barcelona, Institut d'Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS), Barcelona, Spain
- CIBERehd, Instituto de Salud Carlos III, Madrid, Spain
- Department of Biomedicine-Biochemistry Unit, School of Medicine University of Barcelona, Barcelona, Spain
| | - Laia Rodríguez-Revenga
- Biochemistry and Molecular Genetics Department, Hospital Clinic of Barcelona, Institut d'Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS), Barcelona, Spain
- CIBER of Rare Diseases (CIBERER), Instituto de Salud Carlos III, Madrid, Spain
| | - María Isabel Álvarez-Mora
- Biochemistry and Molecular Genetics Department, Hospital Clinic of Barcelona, Institut d'Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS), Barcelona, Spain
- CIBER of Rare Diseases (CIBERER), Instituto de Salud Carlos III, Madrid, Spain
- Genetics Department, 12 de Octubre University Hospital, Madrid, Spain
| | - Ana Arteche-López
- Genetics Department, 12 de Octubre University Hospital, Madrid, Spain
- UDISGEN (Unidad de Dismorfología y Genética), 12 de Octubre University Hospital, Madrid, Spain
| | | | - Rosa Calvo
- Institut d'Investigacions Biomèdiques August Pi iSunyer (IDIBAPS), Barcelona, Spain
- Department of Child and Adolescent Psychiatry and Psychology, Hospital Clinic of Barcelona. School of Medicine, University of Barcelona, Centro de Investigación Biomédica en Red Salud Mental (CIBERSAM), Madrid, Spain
| | - Anna Pujol
- Unidad de Animales Transgénicos UAT-CBATEG, Universitat Autònoma de Barcelona, Cerdanyola del Valles, Spain
| | - Mikel Azkargorta
- Proteomics Platform, CIC bioGUNE, Basque Research and Technology Alliance (BRTA), CIBERehd, Derio, Spain
| | - Felix Elortza
- Proteomics Platform, CIC bioGUNE, Basque Research and Technology Alliance (BRTA), CIBERehd, Derio, Spain
| | - Cristina Malagelada
- Department of Biomedicine-Biochemistry Unit, School of Medicine University of Barcelona, Barcelona, Spain
| | - Roser Pinyol
- Translational Research in Hepatic Oncology Group, Liver Unit, IDIBAPS, Barcelona Clínic Hospital, University of Barcelona, Barcelona, Spain
| | - Júlia Huguet-Pradell
- Translational Research in Hepatic Oncology Group, Liver Unit, IDIBAPS, Barcelona Clínic Hospital, University of Barcelona, Barcelona, Spain
| | - Pedro Melgar-Lesmes
- Biochemistry and Molecular Genetics Department, Hospital Clinic of Barcelona, Institut d'Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS), Barcelona, Spain
- CIBERehd, Instituto de Salud Carlos III, Madrid, Spain
- Department of Biomedicine-Biochemistry Unit, School of Medicine University of Barcelona, Barcelona, Spain
| | - Wladimiro Jiménez
- Biochemistry and Molecular Genetics Department, Hospital Clinic of Barcelona, Institut d'Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS), Barcelona, Spain
- CIBERehd, Instituto de Salud Carlos III, Madrid, Spain
- Department of Biomedicine-Biochemistry Unit, School of Medicine University of Barcelona, Barcelona, Spain
| | - Manuel Morales-Ruiz
- Biochemistry and Molecular Genetics Department, Hospital Clinic of Barcelona, Institut d'Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS), Barcelona, Spain
- CIBERehd, Instituto de Salud Carlos III, Madrid, Spain
- Department of Biomedicine-Biochemistry Unit, School of Medicine University of Barcelona, Barcelona, Spain
| |
Collapse
|
11
|
Fakhar AZ, Liu J, Pajerowska-Mukhtar KM, Mukhtar MS. The Lost and Found: Unraveling the Functions of Orphan Genes. J Dev Biol 2023; 11:27. [PMID: 37367481 PMCID: PMC10299390 DOI: 10.3390/jdb11020027] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2023] [Revised: 05/19/2023] [Accepted: 05/26/2023] [Indexed: 06/28/2023] Open
Abstract
Orphan Genes (OGs) are a mysterious class of genes that have recently gained significant attention. Despite lacking a clear evolutionary history, they are found in nearly all living organisms, from bacteria to humans, and they play important roles in diverse biological processes. The discovery of OGs was first made through comparative genomics followed by the identification of unique genes across different species. OGs tend to be more prevalent in species with larger genomes, such as plants and animals, and their evolutionary origins remain unclear but potentially arise from gene duplication, horizontal gene transfer (HGT), or de novo origination. Although their precise function is not well understood, OGs have been implicated in crucial biological processes such as development, metabolism, and stress responses. To better understand their significance, researchers are using a variety of approaches, including transcriptomics, functional genomics, and molecular biology. This review offers a comprehensive overview of the current knowledge of OGs in all domains of life, highlighting the possible role of dark transcriptomics in their evolution. More research is needed to fully comprehend the role of OGs in biology and their impact on various biological processes.
Collapse
Affiliation(s)
| | | | | | - M. Shahid Mukhtar
- Department of Biology, University of Alabama at Birmingham, 1300 University Blvd., Birmingham, AL 35294, USA
| |
Collapse
|
12
|
Kosolwattana T, Liu C, Hu R, Han S, Chen H, Lin Y. A self-inspected adaptive SMOTE algorithm (SASMOTE) for highly imbalanced data classification in healthcare. BioData Min 2023; 16:15. [PMID: 37098549 PMCID: PMC10131309 DOI: 10.1186/s13040-023-00330-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2022] [Accepted: 03/09/2023] [Indexed: 04/27/2023] Open
Abstract
In many healthcare applications, datasets for classification may be highly imbalanced due to the rare occurrence of target events such as disease onset. The SMOTE (Synthetic Minority Over-sampling Technique) algorithm has been developed as an effective resampling method for imbalanced data classification by oversampling samples from the minority class. However, samples generated by SMOTE may be ambiguous, low-quality and non-separable with the majority class. To enhance the quality of generated samples, we proposed a novel self-inspected adaptive SMOTE (SASMOTE) model that leverages an adaptive nearest neighborhood selection algorithm to identify the "visible" nearest neighbors, which are used to generate samples likely to fall into the minority class. To further enhance the quality of the generated samples, an uncertainty elimination via self-inspection approach is introduced in the proposed SASMOTE model. Its objective is to filter out the generated samples that are highly uncertain and inseparable with the majority class. The effectiveness of the proposed algorithm is compared with existing SMOTE-based algorithms and demonstrated through two real-world case studies in healthcare, including risk gene discovery and fatal congenital heart disease prediction. By generating the higher quality synthetic samples, the proposed algorithm is able to help achieve better prediction performance (in terms of F1 score) on average compared to the other methods, which is promising to enhance the usability of machine learning models on highly imbalanced healthcare data.
Collapse
Affiliation(s)
| | - Chenang Liu
- School of Industrial Engineering & Management, Oklahoma State University, Stillwater, USA
| | - Renjie Hu
- Department of Information and Logistics Technology, University of Houston, Houston, USA
| | - Shizhong Han
- Department of Psychiatry and Behavioral Sciences, Johns Hopkins School of Medicine, Baltimore, USA
- Lieber Institute for Brain Development, Baltimore, USA
| | - Hua Chen
- Department of Pharmaceutical Health Outcomes and Policy, University of Houston, Houston, USA
| | - Ying Lin
- Department of Industrial Engineering, University of Houston, Houston, USA.
| |
Collapse
|
13
|
Sala-Gaston J, Costa-Sastre L, Pedrazza L, Martinez-Martinez A, Ventura F, Rosa JL. Regulation of MAPK Signaling Pathways by the Large HERC Ubiquitin Ligases. Int J Mol Sci 2023; 24:ijms24054906. [PMID: 36902336 PMCID: PMC10003351 DOI: 10.3390/ijms24054906] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2023] [Revised: 02/27/2023] [Accepted: 03/01/2023] [Indexed: 03/06/2023] Open
Abstract
Protein ubiquitylation acts as a complex cell signaling mechanism since the formation of different mono- and polyubiquitin chains determines the substrate's fate in the cell. E3 ligases define the specificity of this reaction by catalyzing the attachment of ubiquitin to the substrate protein. Thus, they represent an important regulatory component of this process. Large HERC ubiquitin ligases belong to the HECT E3 protein family and comprise HERC1 and HERC2 proteins. The physiological relevance of the Large HERCs is illustrated by their involvement in different pathologies, with a notable implication in cancer and neurological diseases. Understanding how cell signaling is altered in these different pathologies is important for uncovering novel therapeutic targets. To this end, this review summarizes the recent advances in how the Large HERCs regulate the MAPK signaling pathways. In addition, we emphasize the potential therapeutic strategies that could be followed to ameliorate the alterations in MAPK signaling caused by Large HERC deficiencies, focusing on the use of specific inhibitors and proteolysis-targeting chimeras.
Collapse
|
14
|
Machine Learning-Based Blood RNA Signature for Diagnosis of Autism Spectrum Disorder. Int J Mol Sci 2023; 24:ijms24032082. [PMID: 36768401 PMCID: PMC9916487 DOI: 10.3390/ijms24032082] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2022] [Revised: 01/15/2023] [Accepted: 01/17/2023] [Indexed: 01/21/2023] Open
Abstract
Early diagnosis of autism spectrum disorder (ASD) is crucial for providing appropriate treatments and parental guidance from an early age. Yet, ASD diagnosis is a lengthy process, in part due to the lack of reliable biomarkers. We recently applied RNA-sequencing of peripheral blood samples from 73 American and Israeli children with ASD and 26 neurotypically developing (NT) children to identify 10 genes with dysregulated blood expression levels in children with ASD. Machine learning (ML) analyzes data by computerized analytical model building and may be applied to building diagnostic tools based on the optimization of large datasets. Here, we present several ML-generated models, based on RNA expression datasets collected during our recently published RNA-seq study, as tentative tools for ASD diagnosis. Using the random forest classifier, two of our proposed models yield an accuracy of 82% in distinguishing children with ASD and NT children. Our proof-of-concept study requires refinement and independent validation by studies with far larger cohorts of children with ASD and NT children and should thus be perceived as starting point for building more accurate ML-based tools. Eventually, such tools may potentially provide an unbiased means to support the early diagnosis of ASD.
Collapse
|
15
|
Pedrazza L, Martinez-Martinez A, Sánchez-de-Diego C, Valer JA, Pimenta-Lopes C, Sala-Gaston J, Szpak M, Tyler-Smith C, Ventura F, Rosa JL. HERC1 deficiency causes osteopenia through transcriptional program dysregulation during bone remodeling. Cell Death Dis 2023; 14:17. [PMID: 36635269 PMCID: PMC9837143 DOI: 10.1038/s41419-023-05549-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2022] [Revised: 12/19/2022] [Accepted: 01/03/2023] [Indexed: 01/13/2023]
Abstract
Bone remodeling is a continuous process between bone-forming osteoblasts and bone-resorbing osteoclasts, with any imbalance resulting in metabolic bone disease, including osteopenia. The HERC1 gene encodes an E3 ubiquitin ligase that affects cellular processes by regulating the ubiquitination of target proteins, such as C-RAF. Of interest, an association exists between biallelic pathogenic sequence variants in the HERC1 gene and the neurodevelopmental disorder MDFPMR syndrome (macrocephaly, dysmorphic facies, and psychomotor retardation). Most pathogenic variants cause loss of HERC1 function, and the affected individuals present with features related to altered bone homeostasis. Herc1-knockout mice offer an excellent model in which to study the role of HERC1 in bone remodeling and to understand its role in disease. In this study, we show that HERC1 regulates osteoblastogenesis and osteoclastogenesis, proving that its depletion increases gene expression of osteoblastic makers during the osteogenic differentiation of mesenchymal stem cells. During this process, HERC1 deficiency increases the levels of C-RAF and of phosphorylated ERK and p38. The Herc1-knockout adult mice developed imbalanced bone homeostasis that presented as osteopenia in both sexes of the adult mice. By contrast, only young female knockout mice had osteopenia and increased number of osteoclasts, with the changes associated with reductions in testosterone and dihydrotestosterone levels. Finally, osteocytes isolated from knockout mice showed a higher expression of osteocytic genes and an increase in the Rankl/Opg ratio, indicating a relevant cell-autonomous role of HERC1 when regulating the transcriptional program of bone formation. Overall, these findings present HERC1 as a modulator of bone homeostasis and highlight potential therapeutic targets for individuals affected by pathological HERC1 variants.
Collapse
Affiliation(s)
- Leonardo Pedrazza
- Departament de Ciències Fisiològiques, Universitat de Barcelona, IDIBELL, L'Hospitalet de Llobregat, Spain
| | - Arturo Martinez-Martinez
- Departament de Ciències Fisiològiques, Universitat de Barcelona, IDIBELL, L'Hospitalet de Llobregat, Spain
| | - Cristina Sánchez-de-Diego
- Departament de Ciències Fisiològiques, Universitat de Barcelona, IDIBELL, L'Hospitalet de Llobregat, Spain
| | - José Antonio Valer
- Departament de Ciències Fisiològiques, Universitat de Barcelona, IDIBELL, L'Hospitalet de Llobregat, Spain
| | - Carolina Pimenta-Lopes
- Departament de Ciències Fisiològiques, Universitat de Barcelona, IDIBELL, L'Hospitalet de Llobregat, Spain
| | - Joan Sala-Gaston
- Departament de Ciències Fisiològiques, Universitat de Barcelona, IDIBELL, L'Hospitalet de Llobregat, Spain
| | - Michal Szpak
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, UK
| | | | - Francesc Ventura
- Departament de Ciències Fisiològiques, Universitat de Barcelona, IDIBELL, L'Hospitalet de Llobregat, Spain
| | - Jose Luis Rosa
- Departament de Ciències Fisiològiques, Universitat de Barcelona, IDIBELL, L'Hospitalet de Llobregat, Spain.
| |
Collapse
|
16
|
Pérez-Villegas EM, Ruiz R, Bachiller S, Ventura F, Armengol JA, Rosa JL. The HERC proteins and the nervous system. Semin Cell Dev Biol 2022; 132:5-15. [PMID: 34848147 DOI: 10.1016/j.semcdb.2021.11.017] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2021] [Revised: 11/15/2021] [Accepted: 11/16/2021] [Indexed: 12/14/2022]
Abstract
The HERC protein family is one of three subfamilies of Homologous to E6AP C-terminus (HECT) E3 ubiquitin ligases. Six HERC genes have been described in humans, two of which encode Large HERC proteins -HERC1 and HERC2- with molecular weights above 520 kDa that are constitutively expressed in the brain. There is a large body of evidence that mutations in these Large HERC genes produce clinical syndromes in which key neurodevelopmental events are altered, resulting in intellectual disability and other neurological disorders like epileptic seizures, dementia and/or signs of autism. In line with these consequences in humans, two mice carrying mutations in the Large HERC genes have been studied quite intensely: the tambaleante mutant for Herc1 and the Herc2+/530 mutant for Herc2. In both these mutant mice there are clear signs that autophagy is dysregulated, eliciting cerebellar Purkinje cell death and impairing motor control. The tambaleante mouse was the first of these mice to appear and is the best studied, in which the Herc1 mutation elicits: (i) delayed neural transmission in the peripheral nervous system; (ii) impaired learning, memory and motor control; and (iii) altered presynaptic membrane dynamics. In this review, we discuss the information currently available on HERC proteins in the nervous system and their biological activity, the dysregulation of which could explain certain neurodevelopmental syndromes and/or neurodegenerative diseases.
Collapse
Affiliation(s)
- Eva M Pérez-Villegas
- Department of Physiology, Anatomy and Cell Biology, University Pablo de Olavide, Seville, Spain
| | - Rocío Ruiz
- Department of Biochemistry and Molecular Biology, School of Pharmacy, University of Seville, Seville, Spain
| | - Sara Bachiller
- Clinical Unit of Infectious Diseases, Microbiology and Preventive Medicine, Institute of Biomedicine of Sevilla, Virgen del Rocío University Hospital, CSIC, University of Sevilla, Sevilla, Spain
| | - Francesc Ventura
- Departament de Ciències Fisiològiques, IBIDELL, Universitat de Barcelona, L'Hospitalet de Llobregat, Barcelona, Spain
| | - Jose A Armengol
- Department of Physiology, Anatomy and Cell Biology, University Pablo de Olavide, Seville, Spain.
| | - Jose Luis Rosa
- Departament de Ciències Fisiològiques, IBIDELL, Universitat de Barcelona, L'Hospitalet de Llobregat, Barcelona, Spain.
| |
Collapse
|
17
|
Chen WX, Liu B, Zhou L, Xiong X, Fu J, Huang ZF, Tan T, Tang M, Wang J, Tang YP. De novo mutations within metabolism networks of amino acid/protein/energy in Chinese autistic children with intellectual disability. Hum Genomics 2022; 16:52. [PMID: 36320054 PMCID: PMC9623983 DOI: 10.1186/s40246-022-00427-7] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2022] [Accepted: 10/19/2022] [Indexed: 11/05/2022] Open
Abstract
BACKGROUND Autism spectrum disorder (ASD) is often accompanied by intellectual disability (ID). Despite extensive studies, however, the genetic basis for this comorbidity is still not clear. In this study, we tried to develop an analyzing pipeline for de novo mutations and possible pathways related to ID phenotype in ASD. Whole-exome sequencing (WES) was performed to screen de novo mutations and candidate genes in 79 ASD children together with their parents (trios). The de novo altering genes and relative pathways which were associated with ID phenotype were analyzed. The connection nodes (genes) of above pathways were selected, and the diagnostic value of these selected genes for ID phenotype in the study population was also evaluated. RESULTS We identified 89 de novo mutant genes, of which 34 genes were previously reported to be associated with ASD, including double hits in the EGF repeats of NOTCH1 gene (p.V999M and p.S1027L). Interestingly, of these 34 genes, 22 may directly affect intelligence quotient (IQ). Further analyses revealed that these IQ-related genes were enriched in protein synthesis, energy metabolism, and amino acid metabolism, and at least 9 genes (CACNA1A, ALG9, PALM2, MGAT4A, PCK2, PLEKHA1, PSME3, ADI1, and TLE3) were involved in all these three pathways. Seven patients who harbored these gene mutations showed a high prevalence of a low IQ score (< 70), a non-verbal language, and an early diagnostic age (< 4 years). Furthermore, our panel of these 9 genes reached a 10.2% diagnostic rate (5/49) in early diagnostic patients with a low IQ score and also reached a 10% diagnostic yield in those with both a low IQ score and non-verbal language (4/40). CONCLUSION We found some new genetic disposition for ASD accompanied with intellectual disability in this study. Our results may be helpful for etiologic research and early diagnoses of intellectual disability in ASD. Larger population studies and further mechanism studies are warranted.
Collapse
Affiliation(s)
- Wen-Xiong Chen
- grid.410737.60000 0000 8653 1072The Assessment and Intervention Center for Autistic Children, Department of Neurology, Guangzhou Women and Children’s Medical Center, Guangzhou Medical University, Guangzhou, 510623 Guangdong China
| | - Bin Liu
- grid.410737.60000 0000 8653 1072Guangzhou Institute of Pediatrics, Guangzhou Women and Children’s Medical Center, Guangzhou Medical University, Guangzhou, 510623 China ,grid.258164.c0000 0004 1790 3548Department of Biobank, Shenzhen Baoan Women’s and Children’s Hospital, Jinan University, Shenzhen, 518102 Guangdong China
| | - Lijie Zhou
- grid.412719.8Department of Pediatric Rehabilitation, Third Affiliated Hospital of Zhengzhou University, Zhengzhou, 450052 China
| | - Xiaoli Xiong
- grid.410737.60000 0000 8653 1072Guangzhou Institute of Pediatrics, Guangzhou Women and Children’s Medical Center, Guangzhou Medical University, Guangzhou, 510623 China
| | - Jie Fu
- grid.410737.60000 0000 8653 1072Guangzhou Institute of Pediatrics, Guangzhou Women and Children’s Medical Center, Guangzhou Medical University, Guangzhou, 510623 China
| | - Zhi-Fang Huang
- grid.410737.60000 0000 8653 1072The Assessment and Intervention Center for Autistic Children, Department of Neurology, Guangzhou Women and Children’s Medical Center, Guangzhou Medical University, Guangzhou, 510623 Guangdong China
| | - Ting Tan
- grid.410737.60000 0000 8653 1072Guangzhou Institute of Pediatrics, Guangzhou Women and Children’s Medical Center, Guangzhou Medical University, Guangzhou, 510623 China
| | - Mingxi Tang
- grid.488387.8Department of Pathology, Affiliated Hospital of Southwest Medical University, Luzhou, 646000 Sichuan China
| | - Jun Wang
- grid.412719.8Department of Pediatric Rehabilitation, Third Affiliated Hospital of Zhengzhou University, Zhengzhou, 450052 China
| | - Ya-Ping Tang
- grid.410737.60000 0000 8653 1072Guangzhou Institute of Pediatrics, Guangzhou Women and Children’s Medical Center, Guangzhou Medical University, Guangzhou, 510623 China ,grid.412719.8Department of Pediatric Rehabilitation, Third Affiliated Hospital of Zhengzhou University, Zhengzhou, 450052 China ,grid.12981.330000 0001 2360 039XGuangdong Provincial Key Laboratory of Brain Function and Disease, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, 510080 Guangdong China
| |
Collapse
|
18
|
Joudar SS, Albahri AS, Hamid RA. Triage and priority-based healthcare diagnosis using artificial intelligence for autism spectrum disorder and gene contribution: A systematic review. Comput Biol Med 2022; 146:105553. [PMID: 35561591 DOI: 10.1016/j.compbiomed.2022.105553] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2022] [Revised: 04/03/2022] [Accepted: 04/20/2022] [Indexed: 11/03/2022]
Abstract
The exact nature, harmful effects and aetiology of autism spectrum disorder (ASD) have caused widespread confusion. Artificial intelligence (AI) science helps solve challenging diagnostic problems in the medical field through extensive experiments. Disease severity is closely related to triage decisions and prioritisation contexts in medicine because both have been widely used to diagnose various diseases via AI, machine learning and automated decision-making techniques. Recently, taking advantage of high-performance AI algorithms has achieved accessible success in diagnosing and predicting risks from clinical and biological data. In contrast, less progress has been made with ASD because of obscure reasons. According to academic literature, ASD diagnosis works from a specific perspective, and much of the confusion arises from the fact that how AI techniques are currently integrated with the diagnosis of ASD concerning the triage and priority strategies and gene contributions. To this end, this study sought to describe a systematic review of the literature to assess the respective AI methods using the available datasets, highlight the tools and strategies used for diagnosing ASD and investigate how AI trends contribute in distinguishing triage and priority for ASD and gene contributions. Accordingly, this study checked the Science Direct, IEEE Xplore Digital Library, Web of Science (WoS), PubMed, and Scopus databases. A set of 363 articles from 2017 to 2022 is collected to reveal a clear picture and a better understanding of all the academic literature through a final set of 18 articles. The retrieved articles were filtered according to the defined inclusion and exclusion criteria and classified into three categories. The first category includes 'Triage patients based on diagnosis methods' which accounts for 16.66% (n = 3/18). The second category includes 'Prioritisation for Risky Genes' which accounts for 66.6% (n = 12/18) and is classified into two subcategories: 'Mutations observation based', 'Biomarkers and toxic chemical observations'. The third category includes 'E-triage using telehealth' which accounts for 16.66% (n = 3/18). This multidisciplinary systematic review revealed the taxonomy, motivations, recommendations and challenges of ASD research that need synergistic attention. Thus, this systematic review performs a comprehensive science mapping analysis and discusses the open issues that help perform and improve the recommended solution of ASD research direction. In addition, this study critically reviews the literature and attempts to address the current research gaps in knowledge and highlights weaknesses that require further research. Finally, a new developed methodology has been suggested as future work for triaging and prioritising ASD patients according to their severity levels by using decision-making techniques.
Collapse
Affiliation(s)
- Shahad Sabbar Joudar
- Informatics Institute for Postgraduate Studies (IIPS), Iraqi Commission for Computers and Informatics (ICCI), Baghdad, Iraq; University of Technology, Baghdad, Iraq
| | - A S Albahri
- Informatics Institute for Postgraduate Studies (IIPS), Iraqi Commission for Computers and Informatics (ICCI), Baghdad, Iraq.
| | - Rula A Hamid
- Informatics Institute for Postgraduate Studies (IIPS), Iraqi Commission for Computers and Informatics (ICCI), Baghdad, Iraq; College of Business Informatics, University of Information Technology and Communications (UOITC), Baghdad, Iraq
| |
Collapse
|
19
|
Arpi MNT, Simpson TI. SFARI genes and where to find them; modelling Autism Spectrum Disorder specific gene expression dysregulation with RNA-seq data. Sci Rep 2022; 12:10158. [PMID: 35710789 PMCID: PMC9203566 DOI: 10.1038/s41598-022-14077-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2021] [Accepted: 06/01/2022] [Indexed: 11/09/2022] Open
Abstract
Autism Spectrum Disorders (ASD) have a strong, yet heterogeneous, genetic component. Among the various methods that are being developed to help reveal the underlying molecular aetiology of the disease one approach that is gaining popularity is the combination of gene expression and clinical genetic data, often using the SFARI-gene database, which comprises lists of curated genes considered to have causative roles in ASD when mutated in patients. We build a gene co-expression network to study the relationship between ASD-specific transcriptomic data and SFARI genes and then analyse it at different levels of granularity. No significant evidence is found of association between SFARI genes and differential gene expression patterns when comparing ASD samples to a control group, nor statistical enrichment of SFARI genes in gene co-expression network modules that have a strong correlation with ASD diagnosis. However, classification models that incorporate topological information from the whole ASD-specific gene co-expression network can predict novel SFARI candidate genes that share features of existing SFARI genes and have support for roles in ASD in the literature. A statistically significant association is also found between the absolute level of gene expression and SFARI's genes and Scores, which can confound the analysis if uncorrected. We propose a novel approach to correct for this that is general enough to be applied to other problems affected by continuous sources of bias. It was found that only co-expression network analyses that integrate information from the whole network are able to reveal signatures linked to ASD diagnosis and novel candidate genes for the study of ASD, which individual gene or module analyses fail to do. It was also found that the influence of SFARI genes permeates not only other ASD scoring systems, but also lists of genes believed to be involved in other neurodevelopmental disorders.
Collapse
Affiliation(s)
| | - T Ian Simpson
- School of Informatics, University of Edinburgh, 10 Crichton Street, Edinburgh, EH8 9AB, UK. .,Simons Initiative for the Developing Brain (SIDB), Centre for Brain Discovery Sciences, University of Edinburgh, Edinburgh, UK.
| |
Collapse
|
20
|
Koçoğlu C, Van Broeckhoven C, van der Zee J. How network-based approaches can complement gene identification studies in frontotemporal dementia. Trends Genet 2022; 38:944-955. [DOI: 10.1016/j.tig.2022.05.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2021] [Revised: 05/04/2022] [Accepted: 05/04/2022] [Indexed: 11/17/2022]
|
21
|
Wu N, Wang Y, Jia JY, Pan YH, Yuan XB. Association of CDH11 with Autism Spectrum Disorder Revealed by Matched-gene Co-expression Analysis and Mouse Behavioral Studies. Neurosci Bull 2021; 38:29-46. [PMID: 34523068 PMCID: PMC8783018 DOI: 10.1007/s12264-021-00770-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2021] [Accepted: 05/25/2021] [Indexed: 11/25/2022] Open
Abstract
A large number of putative risk genes for autism spectrum disorder (ASD) have been reported. The functions of most of these susceptibility genes in developing brains remain unknown, and causal relationships between their variation and autism traits have not been established. The aim of this study was to predict putative risk genes at the whole-genome level based on the analysis of gene co-expression with a group of high-confidence ASD risk genes (hcASDs). The results showed that three gene features - gene size, mRNA abundance, and guanine-cytosine content - affect the genome-wide co-expression profiles of hcASDs. To circumvent the interference of these features in gene co-expression analysis, we developed a method to determine whether a gene is significantly co-expressed with hcASDs by statistically comparing the co-expression profile of this gene with hcASDs to that of this gene with permuted gene sets of feature-matched genes. This method is referred to as "matched-gene co-expression analysis" (MGCA). With MGCA, we demonstrated the convergence in developmental expression profiles of hcASDs and improved the efficacy of risk gene prediction. The results of analysis of two recently-reported ASD candidate genes, CDH11 and CDH9, suggested the involvement of CDH11, but not CDH9, in ASD. Consistent with this prediction, behavioral studies showed that Cdh11-null mice, but not Cdh9-null mice, have multiple autism-like behavioral alterations. This study highlights the power of MGCA in revealing ASD-associated genes and the potential role of CDH11 in ASD.
Collapse
Affiliation(s)
- Nan Wu
- Key Laboratory of Brain Functional Genomics of Shanghai and the Ministry of Education, Institute of Brain Functional Genomics, School of Life Science and the Collaborative Innovation Center for Brain Science, East China Normal University, Shanghai, 200062, China
| | - Yue Wang
- Hussman Institute for Autism, Baltimore, 21201, USA
| | - Jing-Yan Jia
- Key Laboratory of Brain Functional Genomics of Shanghai and the Ministry of Education, Institute of Brain Functional Genomics, School of Life Science and the Collaborative Innovation Center for Brain Science, East China Normal University, Shanghai, 200062, China
| | - Yi-Hsuan Pan
- Key Laboratory of Brain Functional Genomics of Shanghai and the Ministry of Education, Institute of Brain Functional Genomics, School of Life Science and the Collaborative Innovation Center for Brain Science, East China Normal University, Shanghai, 200062, China.
| | - Xiao-Bing Yuan
- Key Laboratory of Brain Functional Genomics of Shanghai and the Ministry of Education, Institute of Brain Functional Genomics, School of Life Science and the Collaborative Innovation Center for Brain Science, East China Normal University, Shanghai, 200062, China. .,Department of Anatomy and Neurobiology, University of Maryland School of Medicine, Baltimore, 21201, USA.
| |
Collapse
|
22
|
Gunning M, Pavlidis P. "Guilt by association" is not competitive with genetic association for identifying autism risk genes. Sci Rep 2021; 11:15950. [PMID: 34354131 PMCID: PMC8342445 DOI: 10.1038/s41598-021-95321-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2021] [Accepted: 07/16/2021] [Indexed: 12/25/2022] Open
Abstract
Discovering genes involved in complex human genetic disorders is a major challenge. Many have suggested that machine learning (ML) algorithms using gene networks can be used to supplement traditional genetic association-based approaches to predict or prioritize disease genes. However, questions have been raised about the utility of ML methods for this type of task due to biases within the data, and poor real-world performance. Using autism spectrum disorder (ASD) as a test case, we sought to investigate the question: can machine learning aid in the discovery of disease genes? We collected 13 published ASD gene prioritization studies and evaluated their performance using known and novel high-confidence ASD genes. We also investigated their biases towards generic gene annotations, like number of association publications. We found that ML methods which do not incorporate genetics information have limited utility for prioritization of ASD risk genes. These studies perform at a comparable level to generic measures of likelihood for the involvement of genes in any condition, and do not out-perform genetic association studies. Future efforts to discover disease genes should be focused on developing and validating statistical models for genetic association, specifically for association between rare variants and disease, rather than developing complex machine learning methods using complex heterogeneous biological data with unknown reliability.
Collapse
Affiliation(s)
- Margot Gunning
- Michael Smith Laboratories, University of British Columbia, Vancouver, BC, V6T 1Z4, Canada
- Department of Psychiatry, University of British Columbia, Vancouver, BC, V6T 1Z4, Canada
- Graduate Program in Bioinformatics, University of British Columbia, Vancouver, BC, V6T 1Z4, Canada
| | - Paul Pavlidis
- Michael Smith Laboratories, University of British Columbia, Vancouver, BC, V6T 1Z4, Canada.
- Department of Psychiatry, University of British Columbia, Vancouver, BC, V6T 1Z4, Canada.
- Djavad Mowafaghian Centre for Brain Health, University of British Columbia, Vancouver, BC, V6T 1Z4, Canada.
| |
Collapse
|
23
|
Rahman MM, Usman OL, Muniyandi RC, Sahran S, Mohamed S, Razak RA. A Review of Machine Learning Methods of Feature Selection and Classification for Autism Spectrum Disorder. Brain Sci 2020; 10:brainsci10120949. [PMID: 33297436 PMCID: PMC7762227 DOI: 10.3390/brainsci10120949] [Citation(s) in RCA: 43] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2020] [Revised: 12/02/2020] [Accepted: 12/05/2020] [Indexed: 12/27/2022] Open
Abstract
Autism Spectrum Disorder (ASD), according to DSM-5 in the American Psychiatric Association, is a neurodevelopmental disorder that includes deficits of social communication and social interaction with the presence of restricted and repetitive behaviors. Children with ASD have difficulties in joint attention and social reciprocity, using non-verbal and verbal behavior for communication. Due to these deficits, children with autism are often socially isolated. Researchers have emphasized the importance of early identification and early intervention to improve the level of functioning in language, communication, and well-being of children with autism. However, due to limited local assessment tools to diagnose these children, limited speech-language therapy services in rural areas, etc., these children do not get the rehabilitation they need until they get into compulsory schooling at the age of seven years old. Hence, efficient approaches towards early identification and intervention through speedy diagnostic procedures for ASD are required. In recent years, advanced technologies like machine learning have been used to analyze and investigate ASD to improve diagnostic accuracy, time, and quality without complexity. These machine learning methods include artificial neural networks, support vector machines, a priori algorithms, and decision trees, most of which have been applied to datasets connected with autism to construct predictive models. Meanwhile, the selection of features remains an essential task before developing a predictive model for ASD classification. This review mainly investigates and analyzes up-to-date studies on machine learning methods for feature selection and classification of ASD. We recommend methods to enhance machine learning's speedy execution for processing complex data for conceptualization and implementation in ASD diagnostic research. This study can significantly benefit future research in autism using a machine learning approach for feature selection, classification, and processing imbalanced data.
Collapse
Affiliation(s)
- Md. Mokhlesur Rahman
- Center for Cyber Security, Faculty of Information Science and Technology, Universiti Kebangsaan Malaysia, Bangi 43600 UKM, Selangor, Malaysia; (M.M.R.); (O.L.U.)
| | - Opeyemi Lateef Usman
- Center for Cyber Security, Faculty of Information Science and Technology, Universiti Kebangsaan Malaysia, Bangi 43600 UKM, Selangor, Malaysia; (M.M.R.); (O.L.U.)
| | - Ravie Chandren Muniyandi
- Center for Cyber Security, Faculty of Information Science and Technology, Universiti Kebangsaan Malaysia, Bangi 43600 UKM, Selangor, Malaysia; (M.M.R.); (O.L.U.)
- Correspondence: ; Tel.: +60-123249577
| | - Shahnorbanun Sahran
- Center for Artificial Intelligence Technology, Faculty of Information Science and Technology, Universiti Kebangsaan Malaysia, Bangi 43600 UKM, Selangor, Malaysia;
| | - Suziyani Mohamed
- Centre of Community Education and Wellbeing, Faculty of Education, Universiti Kebangsaan Malaysia, Bangi 43600 UKM, Selangor, Malaysia;
| | - Rogayah A Razak
- Speech Science Programme, Center for Rehabilitation and Special Needs Studies, Faculty of Health Sciences, Universiti Kebangsaan Malaysia, Jalan Raja Muda Abdul Aziz, Kuala Lumpur 50300, Malaysia;
| |
Collapse
|