1
|
Phan MHQ, Zehnder T, Puntieri F, Magg A, Majchrzycka B, Antonović M, Wieler H, Lo BW, Baranasic D, Lenhard B, Müller F, Vingron M, Ibrahim DM. Conservation of regulatory elements with highly diverged sequences across large evolutionary distances. Nat Genet 2025:10.1038/s41588-025-02202-5. [PMID: 40425826 DOI: 10.1038/s41588-025-02202-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2024] [Accepted: 04/22/2025] [Indexed: 05/29/2025]
Abstract
Developmental gene expression is a remarkably conserved process, yet most cis-regulatory elements (CREs) lack sequence conservation, especially at larger evolutionary distances. Some evidence suggests that CREs at the same genomic position remain functionally conserved independent of sequence conservation. However, the extent of such positional conservation remains unclear. Here, we profiled the regulatory genome in mouse and chicken embryonic hearts at equivalent developmental stages and found that most CREs lack sequence conservation. To identify positionally conserved CREs, we introduced the synteny-based algorithm interspecies point projection, which identifies up to fivefold more orthologs than alignment-based approaches. We termed positionally conserved orthologs 'indirectly conserved' and showed that they exhibited chromatin signatures and sequence composition similar to sequence-conserved CREs but greater shuffling of transcription factor binding sites between orthologs. Finally, we validated indirectly conserved chicken enhancers using in vivo reporter assays in mouse. By overcoming alignment-based limitations, we revealed widespread functional conservation of sequence-divergent CREs.
Collapse
Affiliation(s)
- Mai H Q Phan
- Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Center for Regenerative Therapies, Berlin, Germany
- Max Planck Institute for Molecular Genetics, Berlin, Germany
| | - Tobias Zehnder
- Max Planck Institute for Molecular Genetics, Berlin, Germany
| | - Fiona Puntieri
- Max Planck Institute for Molecular Genetics, Berlin, Germany
| | - Andreas Magg
- Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Center for Regenerative Therapies, Berlin, Germany
- Max Planck Institute for Molecular Genetics, Berlin, Germany
| | - Blanka Majchrzycka
- Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Center for Regenerative Therapies, Berlin, Germany
- Max Planck Institute for Molecular Genetics, Berlin, Germany
| | - Milan Antonović
- Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Center for Regenerative Therapies, Berlin, Germany
- Max Planck Institute for Molecular Genetics, Berlin, Germany
- Institute of Chemistry and Biochemistry, Freie Universität Berlin, Berlin, Germany
| | - Hannah Wieler
- Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Center for Regenerative Therapies, Berlin, Germany
- Max Planck Institute for Molecular Genetics, Berlin, Germany
- Institute of Chemistry and Biochemistry, Freie Universität Berlin, Berlin, Germany
| | - Bai-Wei Lo
- Max Planck Institute for Molecular Genetics, Berlin, Germany
| | - Damir Baranasic
- Division of Electronics, Ruder Boskovic Institute, Zagreb, Croatia
- MRC Laboratoy of Medical Sciences, London, UK
- Institute of Clinical Sciences, Faculty of Medicine, Imperial College London, Hammersmith Hospital Campus, London, UK
| | - Boris Lenhard
- MRC Laboratoy of Medical Sciences, London, UK
- Institute of Clinical Sciences, Faculty of Medicine, Imperial College London, Hammersmith Hospital Campus, London, UK
| | - Ferenc Müller
- Department of Cancer and Genomic Sciences, Birmingham Centre for Genome Biology, School of Medical Sciences, College of Medicine and Health, University of Birmingham, Birmingham, UK
| | - Martin Vingron
- Max Planck Institute for Molecular Genetics, Berlin, Germany
| | - Daniel M Ibrahim
- Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Center for Regenerative Therapies, Berlin, Germany.
- Max Planck Institute for Molecular Genetics, Berlin, Germany.
| |
Collapse
|
2
|
Zhu Y, Watson C, Safonova Y, Pennell M, Bankevich A. CloseRead: a tool for assessing assembly errors in immunoglobulin loci applied to vertebrate long-read genome assemblies. Genome Biol 2025; 26:131. [PMID: 40394681 PMCID: PMC12090573 DOI: 10.1186/s13059-025-03594-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2024] [Accepted: 04/28/2025] [Indexed: 05/22/2025] Open
Abstract
Despite tremendous advances in long-read sequencing, some structurally complex and repeat-rich genomic regions remain challenging to assemble. Furthermore, we lack tools to assess local assembly quality, making it hard to identify problems and assess progress. Here we develop a new approach "CloseRead" for visualizing local assembly quality and diagnosing errors using multiple metrics. We apply CloseRead to evaluate how well immunoglobulin loci, paradigmatic cases of structurally complex regions, are assembled in 74 state-of-the-art vertebrate genomes. We then show that targeted, local re-assembly can correct the specific errors identified by CloseRead, highlighting the value of an iterative approach to genome assembly.
Collapse
Affiliation(s)
- Yixin Zhu
- Department of Computational Biology, Cornell University, Ithaca, NY, USA
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA
| | - Corey Watson
- Department of Biochemistry and Molecular Biology, University of Louisville School of Medicine, Louisville, KY, USA
| | - Yana Safonova
- Department of Computer Science and Engineering, Pennsylvania State University, University Park, PA, USA.
- Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA, USA.
| | - Matt Pennell
- Department of Computational Biology, Cornell University, Ithaca, NY, USA.
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA.
| | - Anton Bankevich
- Department of Computer Science and Engineering, Pennsylvania State University, University Park, PA, USA.
- Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA, USA.
| |
Collapse
|
3
|
Barbadilla-Martínez L, Klaassen N, van Steensel B, de Ridder J. Predicting gene expression from DNA sequence using deep learning models. Nat Rev Genet 2025:10.1038/s41576-025-00841-2. [PMID: 40360798 DOI: 10.1038/s41576-025-00841-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/01/2025] [Indexed: 05/15/2025]
Abstract
Transcription of genes is regulated by DNA elements such as promoters and enhancers, the activity of which are in turn controlled by many transcription factors. Owing to the highly complex combinatorial logic involved, it has been difficult to construct computational models that predict gene activity from DNA sequence. Recent advances in deep learning techniques applied to data from epigenome mapping and high-throughput reporter assays have made substantial progress towards addressing this complexity. Such models can capture the regulatory grammar with remarkable accuracy and show great promise in predicting the effects of non-coding variants, uncovering detailed molecular mechanisms of gene regulation and designing synthetic regulatory elements for biotechnology. Here, we discuss the principles of these approaches, the types of training data sets that are available and the strengths and limitations of different approaches.
Collapse
Affiliation(s)
- Lucía Barbadilla-Martínez
- Oncode Institute, Utrecht, The Netherlands
- Center for Molecular Medicine, UMC Utrecht, Utrecht, The Netherlands
| | - Noud Klaassen
- Oncode Institute, Utrecht, The Netherlands
- Division of Molecular Genetics, Netherlands Cancer Institute, Amsterdam, The Netherlands
| | - Bas van Steensel
- Oncode Institute, Utrecht, The Netherlands.
- Division of Molecular Genetics, Netherlands Cancer Institute, Amsterdam, The Netherlands.
| | - Jeroen de Ridder
- Oncode Institute, Utrecht, The Netherlands.
- Center for Molecular Medicine, UMC Utrecht, Utrecht, The Netherlands.
| |
Collapse
|
4
|
Sugo N, Atsumi Y, Yamamoto N. Transcription and epigenetic factor dynamics in neuronal activity-dependent gene regulation. Trends Genet 2025; 41:425-436. [PMID: 39875312 DOI: 10.1016/j.tig.2024.12.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2024] [Revised: 12/20/2024] [Accepted: 12/20/2024] [Indexed: 01/30/2025]
Abstract
Neuronal activity, including sensory-evoked and spontaneous firing, regulates the expression of a subset of genes known as activity-dependent genes. A key issue in this process is the activation and accumulation of transcription factors (TFs), which bind to cis-elements at specific enhancers and promoters, ultimately driving RNA synthesis through transcription machinery. Epigenetic factors such as histone modifiers also play a crucial role in facilitating the specific binding of TFs. Recent evidence from epigenome analyses and imaging studies have revealed intriguing mechanisms: the default chromatin structure at activity-dependent genes is formed independently of neuronal activity, while neuronal activity modulates spatiotemporal dynamics of TFs and their interactions with epigenetic factors (EFs). In this article we review new insights into activity-dependent gene regulation that affects brain development and plasticity.
Collapse
Affiliation(s)
- Noriyuki Sugo
- Graduate School of Frontier Biosciences, Osaka University, Suita, Osaka 565-0871, Japan.
| | - Yuri Atsumi
- Graduate School of Frontier Biosciences, Osaka University, Suita, Osaka 565-0871, Japan
| | - Nobuhiko Yamamoto
- Graduate School of Frontier Biosciences, Osaka University, Suita, Osaka 565-0871, Japan; Institute of Neurological and Psychiatric Disorders, Shenzhen Bay Laboratory, Shenzhen, Guangdong 518132, China.
| |
Collapse
|
5
|
Brown AR, Fox GA, Kaplow IM, Lawler AJ, Phan BN, Gadey L, Wirthlin ME, Ramamurthy E, May GE, Chen Z, Su Q, McManus CJ, van de Weerd R, Pfenning AR. An in vivo systemic massively parallel platform for deciphering animal tissue-specific regulatory function. Front Genet 2025; 16:1533900. [PMID: 40270544 PMCID: PMC12016043 DOI: 10.3389/fgene.2025.1533900] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2024] [Accepted: 03/13/2025] [Indexed: 04/25/2025] Open
Abstract
Introduction: Transcriptional regulation is an important process wherein non-protein coding enhancer sequences play a key role in determining cell type identity and phenotypic diversity. In neural tissue, these gene regulatory processes are crucial for coordinating a plethora of interconnected and regionally specialized cell types, ensuring their synchronized activity in generating behavior. Recognizing the intricate interplay of gene regulatory processes in the brain is imperative, as mounting evidence links neurodevelopment and neurological disorders to non-coding genome regions. While genome-wide association studies are swiftly identifying non-coding human disease-associated loci, decoding regulatory mechanisms is challenging due to causal variant ambiguity and their specific tissue impacts. Methods: Massively parallel reporter assays (MPRAs) are widely used in cell culture to study the non-coding enhancer regions, linking genome sequence differences to tissue-specific regulatory function. However, widespread use in animals encounters significant challenges, including insufficient viral library delivery and library quantification, irregular viral transduction rates, and injection site inflammation disrupting gene expression. Here, we introduce a systemic MPRA (sysMPRA) to address these challenges through systemic intravenous AAV viral delivery. Results: We demonstrate successful transduction of the MPRA library into diverse mouse tissues, efficiently identifying tissue specificity in candidate enhancers and aligning well with predictions from machine learning models. We highlight that sysMPRA effectively uncovers regulatory effects stemming from the disruption of MEF2C transcription factor binding sites, single-nucleotide polymorphisms, and the consequences of genetic variations associated with late-onset Alzheimer's disease. Conclusion: SysMPRA is an effective library delivering method that simultaneously determines the transcriptional functions of hundreds of enhancers in vivo across multiple tissues.
Collapse
Affiliation(s)
- Ashley R. Brown
- Ray and Stephanie Lane Department of Computational Biology, Carnegie Mellon University, Pittsburgh, PA, United States
- Neuroscience Institute, Carnegie Mellon University, Pittsburgh, PA, United States
| | - Grant A. Fox
- Ray and Stephanie Lane Department of Computational Biology, Carnegie Mellon University, Pittsburgh, PA, United States
- Neuroscience Institute, Carnegie Mellon University, Pittsburgh, PA, United States
| | - Irene M. Kaplow
- Ray and Stephanie Lane Department of Computational Biology, Carnegie Mellon University, Pittsburgh, PA, United States
- Neuroscience Institute, Carnegie Mellon University, Pittsburgh, PA, United States
- Department of Biological Sciences, Carnegie Mellon University, Pittsburgh, PA, United States
| | - Alyssa J. Lawler
- Neuroscience Institute, Carnegie Mellon University, Pittsburgh, PA, United States
- Department of Biological Sciences, Carnegie Mellon University, Pittsburgh, PA, United States
| | - BaDoi N. Phan
- Ray and Stephanie Lane Department of Computational Biology, Carnegie Mellon University, Pittsburgh, PA, United States
- Neuroscience Institute, Carnegie Mellon University, Pittsburgh, PA, United States
- Medical Scientist Training Program, University of Pittsburgh School of Medicine, Pittsburgh, PA, United States
| | - Lahari Gadey
- Ray and Stephanie Lane Department of Computational Biology, Carnegie Mellon University, Pittsburgh, PA, United States
- Neuroscience Institute, Carnegie Mellon University, Pittsburgh, PA, United States
| | - Morgan E. Wirthlin
- Ray and Stephanie Lane Department of Computational Biology, Carnegie Mellon University, Pittsburgh, PA, United States
- Neuroscience Institute, Carnegie Mellon University, Pittsburgh, PA, United States
| | - Easwaran Ramamurthy
- Ray and Stephanie Lane Department of Computational Biology, Carnegie Mellon University, Pittsburgh, PA, United States
- Neuroscience Institute, Carnegie Mellon University, Pittsburgh, PA, United States
| | - Gemma E. May
- Department of Biological Sciences, Carnegie Mellon University, Pittsburgh, PA, United States
| | - Ziheng Chen
- Department of Biological Sciences, Carnegie Mellon University, Pittsburgh, PA, United States
| | - Qiao Su
- Ray and Stephanie Lane Department of Computational Biology, Carnegie Mellon University, Pittsburgh, PA, United States
- Neuroscience Institute, Carnegie Mellon University, Pittsburgh, PA, United States
| | - C. Joel McManus
- Department of Biological Sciences, Carnegie Mellon University, Pittsburgh, PA, United States
| | - Robert van de Weerd
- Ray and Stephanie Lane Department of Computational Biology, Carnegie Mellon University, Pittsburgh, PA, United States
- Neuroscience Institute, Carnegie Mellon University, Pittsburgh, PA, United States
| | - Andreas R. Pfenning
- Ray and Stephanie Lane Department of Computational Biology, Carnegie Mellon University, Pittsburgh, PA, United States
- Neuroscience Institute, Carnegie Mellon University, Pittsburgh, PA, United States
- Department of Biological Sciences, Carnegie Mellon University, Pittsburgh, PA, United States
| |
Collapse
|
6
|
Linder J, Srivastava D, Yuan H, Agarwal V, Kelley DR. Predicting RNA-seq coverage from DNA sequence as a unifying model of gene regulation. Nat Genet 2025; 57:949-961. [PMID: 39779956 PMCID: PMC11985352 DOI: 10.1038/s41588-024-02053-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2023] [Accepted: 12/04/2024] [Indexed: 01/11/2025]
Abstract
Sequence-based machine-learning models trained on genomics data improve genetic variant interpretation by providing functional predictions describing their impact on the cis-regulatory code. However, current tools do not predict RNA-seq expression profiles because of modeling challenges. Here, we introduce Borzoi, a model that learns to predict cell-type-specific and tissue-specific RNA-seq coverage from DNA sequence. Using statistics derived from Borzoi's predicted coverage, we isolate and accurately score DNA variant effects across multiple layers of regulation, including transcription, splicing and polyadenylation. Evaluated on quantitative trait loci, Borzoi is competitive with and often outperforms state-of-the-art models trained on individual regulatory functions. By applying attribution methods to the derived statistics, we extract cis-regulatory motifs driving RNA expression and post-transcriptional regulation in normal tissues. The wide availability of RNA-seq data across species, conditions and assays profiling specific aspects of regulation emphasizes the potential of this approach to decipher the mapping from DNA sequence to regulatory function.
Collapse
Affiliation(s)
| | | | - Han Yuan
- Calico Life Sciences LLC, South San Francisco, CA, USA
| | - Vikram Agarwal
- mRNA Center of Excellence, Sanofi Pasteur Inc., Cambridge, MA, USA
| | | |
Collapse
|
7
|
Tan F, Dong Y, Qi J, Yu W, Chai R. Artificial Intelligence-Based Approaches for AAV Vector Engineering. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2025; 12:e2411062. [PMID: 39932449 PMCID: PMC11884542 DOI: 10.1002/advs.202411062] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/10/2024] [Revised: 12/31/2024] [Indexed: 03/08/2025]
Abstract
Adeno-associated virus (AAV) has emerged as a leading vector for gene therapy due to its broad host range, low pathogenicity, and ability to facilitate long-term gene expression. However, AAV vectors face limitations, including immunogenicity and insufficient targeting specificity. To enhance the efficacy of gene therapy, researchers have been modifying the AAV vector using various methods. Traditional experimental approaches for optimizing AAV vector are often time-consuming, resource-intensive, and difficult to replicate. The advancement of artificial intelligence (AI), particularly machine learning, offers significant potential to accelerate capsid optimization while reducing development time and manufacturing costs. This review compares traditional and AI-based methods of AAV vector engineering and highlights recent research in AAV engineering using AI algorithms.
Collapse
Affiliation(s)
- Fangzhi Tan
- State Key Laboratory of Digital Medical EngineeringDepartment of Otolaryngology Head and Neck SurgeryZhongda HospitalSchool of Life Sciences and TechnologySchool of MedicineAdvanced Institute for Life and HealthJiangsu Province High‐Tech Key Laboratory for Bio‐Medical ResearchSoutheast UniversityNanjing210096China
| | - Yue Dong
- Immunowake, Inc.Shanghai201210China
| | - Jieyu Qi
- Department of NeurologyAerospace Center HospitalSchool of Life ScienceBeijing Institute of TechnologyBeijing100081China
- State Key Laboratory of Hearing and Balance ScienceBeijing Institute of TechnologyBeijing100081China
- School of Medical EngineeringAffiliated Zhuhai People's HospitalBeijing Institute of TechnologyZhuhai519088China
- Advanced Technology Research InstituteBeijing Institute of TechnologyJinan250300China
| | - Wenwu Yu
- School of MathematicsSoutheast UniversityNanjing210096China
| | - Renjie Chai
- State Key Laboratory of Digital Medical EngineeringDepartment of Otolaryngology Head and Neck SurgeryZhongda HospitalSchool of Life Sciences and TechnologySchool of MedicineAdvanced Institute for Life and HealthJiangsu Province High‐Tech Key Laboratory for Bio‐Medical ResearchSoutheast UniversityNanjing210096China
- Department of NeurologyAerospace Center HospitalSchool of Life ScienceBeijing Institute of TechnologyBeijing100081China
- Co‐Innovation Center of NeuroregenerationNantong UniversityNantong226001China
- Department of Otolaryngology Head and Neck SurgerySichuan Provincial People's HospitalSchool of MedicineUniversity of Electronic Science and Technology of ChinaChengdu610072China
- Southeast University Shenzhen Research InstituteShenzhen518063China
| |
Collapse
|
8
|
Zhou H, Clark E, Guan D, Lagarrigue S, Fang L, Cheng H, Tuggle CK, Kapoor M, Wang Y, Giuffra E, Egidy G. Comparative Genomics and Epigenomics of Transcriptional Regulation. Annu Rev Anim Biosci 2025; 13:73-98. [PMID: 39565835 DOI: 10.1146/annurev-animal-111523-102217] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2024]
Abstract
Transcriptional regulation in response to diverse physiological cues involves complicated biological processes. Recent initiatives that leverage whole genome sequencing and annotation of regulatory elements significantly contribute to our understanding of transcriptional gene regulation. Advances in the data sets available for comparative genomics and epigenomics can identify evolutionarily constrained regulatory variants and shed light on noncoding elements that influence transcription in different tissues and developmental stages across species. Most epigenomic data, however, are generated from healthy subjects at specific developmental stages. To bridge the genotype-phenotype gap, future research should focus on generating multidimensional epigenomic data under diverse physiological conditions. Farm animal species offer advantages in terms of feasibility, cost, and experimental design for such integrative analyses in comparison to humans. Deep learning modeling and cutting-edge technologies in sequencing and functional screening and validation also provide great promise for better understanding transcriptional regulation in this dynamic field.
Collapse
Affiliation(s)
- Huaijun Zhou
- Department of Animal Science, University of California, Davis, California, USA; , , ,
| | - Emily Clark
- The Roslin Institute, University of Edinburgh, Edinburgh, Midlothian, United Kingdom;
| | - Dailu Guan
- Department of Animal Science, University of California, Davis, California, USA; , , ,
| | | | - Lingzhao Fang
- Center for Quantitative Genetics and Genomics, Aarhus University, Aarhus, Denmark;
| | - Hao Cheng
- Department of Animal Science, University of California, Davis, California, USA; , , ,
| | | | - Muskan Kapoor
- Department of Animal Science, Iowa State University, Ames, Iowa, USA; ,
| | - Ying Wang
- Department of Animal Science, University of California, Davis, California, USA; , , ,
| | | | - Giorgia Egidy
- GABI, AgroParisTech, INRAE, Jouy-en-Josas, France; ,
| |
Collapse
|
9
|
Katikaneni A, Lowe CB. Novelty versus innovation of gene regulatory elements in human evolution and disease. Curr Opin Genet Dev 2025; 90:102279. [PMID: 39591813 PMCID: PMC11769741 DOI: 10.1016/j.gde.2024.102279] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2024] [Revised: 10/10/2024] [Accepted: 10/22/2024] [Indexed: 11/28/2024]
Abstract
It is not currently understood how much of human evolution is due to modifying existing functional elements in the genome versus forging novel elements from nonfunctional DNA. Many early experiments that aimed to assign genetic changes on the human lineage to their resulting phenotypic change have focused on mutations that modify existing elements. However, a number of recent studies have highlighted the potential ease and importance of forging novel gene regulatory elements from nonfunctional sequences on the human lineage. In this review, we distinguish gene regulatory element novelty from innovation. We propose definitions for these terms and emphasize their importance in studying the genetic basis of human uniqueness. We discuss why the forging of novel regulatory elements may have been less emphasized during the previous decades, and why novel regulatory elements are likely to play a significant role in both human adaptation and disease.
Collapse
Affiliation(s)
- Anushka Katikaneni
- Department of Molecular Genetics and Microbiology, Duke University Medical Center, Durham, NC 27710, USA; University Program in Genetics and Genomics, Duke University, Durham, NC 27708, USA
| | - Craig B Lowe
- Department of Molecular Genetics and Microbiology, Duke University Medical Center, Durham, NC 27710, USA; University Program in Genetics and Genomics, Duke University, Durham, NC 27708, USA.
| |
Collapse
|
10
|
Hecker N, Kempynck N, Mauduit D, Abaffyová D, Vandepoel R, Dieltiens S, Borm L, Sarropoulos I, González-Blas CB, De Man J, Davie K, Leysen E, Vandensteen J, Moors R, Hulselmans G, Lim L, De Wit J, Christiaens V, Poovathingal S, Aerts S. Enhancer-driven cell type comparison reveals similarities between the mammalian and bird pallium. Science 2025; 387:eadp3957. [PMID: 39946451 DOI: 10.1126/science.adp3957] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2024] [Accepted: 11/26/2024] [Indexed: 04/23/2025]
Abstract
Combinations of transcription factors govern the identity of cell types, which is reflected by genomic enhancer codes. We used deep learning to characterize these enhancer codes and devised three metrics to compare cell types in the telencephalon across amniotes. To this end, we generated single-cell multiome and spatially resolved transcriptomics data of the chicken telencephalon. Enhancer codes of orthologous nonneuronal and γ-aminobutyric acid-mediated (GABAergic) cell types show a high degree of similarity across amniotes, whereas excitatory neurons of the mammalian neocortex and avian pallium exhibit varying degrees of similarity. Enhancer codes of avian mesopallial neurons are most similar to those of mammalian deep-layer neurons. With this study, we present generally applicable deep learning approaches to characterize and compare cell types on the basis of genomic regulatory sequences.
Collapse
Affiliation(s)
- Nikolai Hecker
- Laboratory of Computational Biology, VIB Center for AI & Computational Biology, Leuven, Belgium
- VIB-KU Leuven Center for Brain & Disease Research, Leuven, Belgium
- Department of Human Genetics, KU Leuven, Leuven, Belgium
| | - Niklas Kempynck
- Laboratory of Computational Biology, VIB Center for AI & Computational Biology, Leuven, Belgium
- VIB-KU Leuven Center for Brain & Disease Research, Leuven, Belgium
- Department of Human Genetics, KU Leuven, Leuven, Belgium
| | - David Mauduit
- Laboratory of Computational Biology, VIB Center for AI & Computational Biology, Leuven, Belgium
- VIB-KU Leuven Center for Brain & Disease Research, Leuven, Belgium
- Department of Human Genetics, KU Leuven, Leuven, Belgium
| | - Darina Abaffyová
- Laboratory of Computational Biology, VIB Center for AI & Computational Biology, Leuven, Belgium
- VIB-KU Leuven Center for Brain & Disease Research, Leuven, Belgium
- Department of Human Genetics, KU Leuven, Leuven, Belgium
| | - Roel Vandepoel
- Laboratory of Computational Biology, VIB Center for AI & Computational Biology, Leuven, Belgium
- VIB-KU Leuven Center for Brain & Disease Research, Leuven, Belgium
- Department of Human Genetics, KU Leuven, Leuven, Belgium
| | - Sam Dieltiens
- Laboratory of Computational Biology, VIB Center for AI & Computational Biology, Leuven, Belgium
- VIB-KU Leuven Center for Brain & Disease Research, Leuven, Belgium
- Department of Human Genetics, KU Leuven, Leuven, Belgium
| | - Lars Borm
- Laboratory of Computational Biology, VIB Center for AI & Computational Biology, Leuven, Belgium
- VIB-KU Leuven Center for Brain & Disease Research, Leuven, Belgium
- Department of Human Genetics, KU Leuven, Leuven, Belgium
| | - Ioannis Sarropoulos
- Center for Molecular Biology of Heidelberg University, Heidelberg University, Heidelberg, Germany
| | - Carmen Bravo González-Blas
- VIB-KU Leuven Center for Brain & Disease Research, Leuven, Belgium
- Department of Human Genetics, KU Leuven, Leuven, Belgium
| | - Julie De Man
- Laboratory of Computational Biology, VIB Center for AI & Computational Biology, Leuven, Belgium
- VIB-KU Leuven Center for Brain & Disease Research, Leuven, Belgium
- Department of Human Genetics, KU Leuven, Leuven, Belgium
| | - Kristofer Davie
- VIB-KU Leuven Center for Brain & Disease Research, Leuven, Belgium
- Department of Neurosciences, KU Leuven, Leuven, Belgium
| | - Elke Leysen
- VIB-KU Leuven Center for Brain & Disease Research, Leuven, Belgium
- Department of Neurosciences, KU Leuven, Leuven, Belgium
| | - Jeroen Vandensteen
- VIB-KU Leuven Center for Brain & Disease Research, Leuven, Belgium
- Department of Neurosciences, KU Leuven, Leuven, Belgium
| | - Rani Moors
- VIB-KU Leuven Center for Brain & Disease Research, Leuven, Belgium
- Department of Neurosciences, KU Leuven, Leuven, Belgium
| | - Gert Hulselmans
- Laboratory of Computational Biology, VIB Center for AI & Computational Biology, Leuven, Belgium
- VIB-KU Leuven Center for Brain & Disease Research, Leuven, Belgium
- Department of Human Genetics, KU Leuven, Leuven, Belgium
| | - Lynette Lim
- VIB-KU Leuven Center for Brain & Disease Research, Leuven, Belgium
- Department of Neurosciences, KU Leuven, Leuven, Belgium
| | - Joris De Wit
- VIB-KU Leuven Center for Brain & Disease Research, Leuven, Belgium
- Department of Neurosciences, KU Leuven, Leuven, Belgium
| | - Valerie Christiaens
- Laboratory of Computational Biology, VIB Center for AI & Computational Biology, Leuven, Belgium
- VIB-KU Leuven Center for Brain & Disease Research, Leuven, Belgium
- Department of Human Genetics, KU Leuven, Leuven, Belgium
| | | | - Stein Aerts
- Laboratory of Computational Biology, VIB Center for AI & Computational Biology, Leuven, Belgium
- VIB-KU Leuven Center for Brain & Disease Research, Leuven, Belgium
- Department of Human Genetics, KU Leuven, Leuven, Belgium
| |
Collapse
|
11
|
Fan Z, Zhao H, Zhou J, Li D, Fan Y, Bi Y, Ji S. A versatile attention-based neural network for chemical perturbation analysis and its potential to aid surgical treatment: an experimental study. Int J Surg 2024; 110:7671-7686. [PMID: 39017949 PMCID: PMC11634177 DOI: 10.1097/js9.0000000000001781] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2024] [Accepted: 05/30/2024] [Indexed: 07/18/2024]
Abstract
Deep learning models have emerged as rapid, accurate, and effective approaches for clinical decisions. Through a combination of drug screening and deep learning models, drugs that may benefit patients before and after surgery can be discovered to reduce the risk of complications or speed recovery. However, most existing drug prediction methods have high data requirements and lack interpretability, which has a limited role in adjuvant surgical treatment. To address these limitations, the authors propose the attention-based convolution transpositional interfusion network (ACTIN) for flexible and efficient drug discovery. ACTIN leverages the graph convolution and the transformer mechanism, utilizing drug and transcriptome data to assess the impact of chemical pharmacophores containing certain elements on gene expression. Remarkably, just with only 393 training instances, only one-tenth of the other models, ACTIN achieves state-of-the-art performance, demonstrating its effectiveness even with limited data. By incorporating chemical element embedding disparity and attention mechanism-based parameter analysis, it identifies the possible pharmacophore containing certain elements that could interfere with specific cell lines, which is particularly valuable for screening useful pharmacophores for new drugs tailored to adjuvant surgical treatment. To validate its reliability, the authors conducted comprehensive examinations by utilizing transcriptome data from the lung tissue of fatal COVID-19 patients as additional input for ACTIN, the authors generated novel lead chemicals that align with clinical evidence. In summary, ACTIN offers insights into the perturbation biases of elements within pharmacophore on gene expression, which holds the potential for guiding the development of new drugs that benefit surgical treatment.
Collapse
Affiliation(s)
- Zheqi Fan
- Department of Orthopaedics, The First Medical Centre, Chinese PLA General Hospital, Beijing
| | - Houming Zhao
- Department of Urology, The Third Medical Center, Chinese PLA General Hospital, Beijing
| | - Jingcheng Zhou
- Senior Department of Otolaryngology-Head and Neck Surgery, The Sixth Medical Center, Chinese PLA General Hospital, Beijing
| | - Dingchang Li
- Department of General Surgery, The First Medical Centre, Chinese PLA General Hospital, Beijing
| | - Yunlong Fan
- Department of Dermatology, The Seventh Medical Center, Chinese PLA General Hospital, Beijing
| | - Yiming Bi
- Graduate School of PLA Medical College, Chinese PLA General Hospital, Beijing, People’s Republic of China
| | - Shuaifei Ji
- Graduate School of PLA Medical College, Chinese PLA General Hospital, Beijing, People’s Republic of China
| |
Collapse
|
12
|
Arokiaraj CM, Leone MJ, Kleyman M, Chamessian A, Noh MC, Phan BN, Lopes BC, Corrigan KA, Cherupally VK, Yeramosu D, Franusich ME, Podder R, Lele S, Shiers S, Kang B, Kennedy MM, Chen V, Chen Z, Mathys H, Dum RP, Lewis DA, Qadri Y, Price TJ, Pfenning AR, Seal RP. Spatial, transcriptomic, and epigenomic analyses link dorsal horn neurons to chronic pain genetic predisposition. Cell Rep 2024; 43:114876. [PMID: 39453813 PMCID: PMC11801220 DOI: 10.1016/j.celrep.2024.114876] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2024] [Revised: 05/07/2024] [Accepted: 09/30/2024] [Indexed: 10/27/2024] Open
Abstract
Key mechanisms underlying chronic pain occur within the dorsal horn. Genome-wide association studies (GWASs) have identified genetic variants predisposed to chronic pain. However, most of these variants lie within regulatory non-coding regions that have not been linked to spinal cord biology. Here, we take a multi-species approach to determine whether chronic pain variants impact the regulatory genomics of dorsal horn neurons. First, we generate a large rhesus macaque single-nucleus RNA sequencing (snRNA-seq) atlas and integrate it with available human and mouse datasets to produce a single unified, species-conserved atlas of neuron subtypes. Cellular-resolution spatial transcriptomics in mouse shows the precise laminar location of these neuron subtypes, consistent with our analysis of neuron-subtype-selective markers in macaque. Using this cross-species framework, we generate a mouse single-nucleus open chromatin atlas of regulatory elements that shows strong and selective relationships between the neuron-subtype-specific chromatin regions and variants from major chronic pain GWASs.
Collapse
Affiliation(s)
- Cynthia M Arokiaraj
- Department of Neurobiology, University of Pittsburgh School of Medicine, Pittsburgh, PA 15213, USA; Pittsburgh Center for Pain Research, University of Pittsburgh School of Medicine, Pittsburgh, PA 15213, USA
| | - Michael J Leone
- Department of Computational Biology, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA; Medical Scientist Training Program, University of Pittsburgh School of Medicine, Pittsburgh, PA 15213, USA
| | - Michael Kleyman
- Department of Computational Biology, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Alexander Chamessian
- Department of Anesthesiology, Duke University Medical Center, Durham, NC 27708, USA; Department of Anesthesiology, Washington University Pain Center, Washington University School of Medicine, St. Louis, MO 63110, USA
| | - Myung-Chul Noh
- Department of Neurobiology, University of Pittsburgh School of Medicine, Pittsburgh, PA 15213, USA; Pittsburgh Center for Pain Research, University of Pittsburgh School of Medicine, Pittsburgh, PA 15213, USA
| | - BaDoi N Phan
- Department of Computational Biology, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA; Medical Scientist Training Program, University of Pittsburgh School of Medicine, Pittsburgh, PA 15213, USA
| | - Bettega C Lopes
- Department of Neurobiology, University of Pittsburgh School of Medicine, Pittsburgh, PA 15213, USA; Pittsburgh Center for Pain Research, University of Pittsburgh School of Medicine, Pittsburgh, PA 15213, USA
| | - Kelly A Corrigan
- Department of Neurobiology, University of Pittsburgh School of Medicine, Pittsburgh, PA 15213, USA; Pittsburgh Center for Pain Research, University of Pittsburgh School of Medicine, Pittsburgh, PA 15213, USA
| | - Vijay Kiran Cherupally
- Department of Computational Biology, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Deepika Yeramosu
- Department of Computational Biology, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Michael E Franusich
- Department of Computational Biology, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Riya Podder
- Department of Computational Biology, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Sumitra Lele
- Department of Computational Biology, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Stephanie Shiers
- Department of Neuroscience and Center for Advanced Pain Studies, University of Texas at Dallas, Richardson, TX 75080, USA
| | - Byungsoo Kang
- Department of Computational Biology, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Meaghan M Kennedy
- Department of Computational Biology, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Viola Chen
- Department of Computational Biology, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Ziheng Chen
- Department of Computational Biology, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA; Department of Biological Sciences, Mellon College of Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Hansruedi Mathys
- Department of Neurobiology, University of Pittsburgh School of Medicine, Pittsburgh, PA 15213, USA
| | - Richard P Dum
- Department of Neurobiology, University of Pittsburgh School of Medicine, Pittsburgh, PA 15213, USA
| | - David A Lewis
- Department of Psychiatry, University of Pittsburgh School of Medicine, Pittsburgh, PA 15261, USA
| | - Yawar Qadri
- Department of Anesthesiology, Emory University, Atlanta, GA 30038, USA
| | - Theodore J Price
- Department of Neuroscience and Center for Advanced Pain Studies, University of Texas at Dallas, Richardson, TX 75080, USA
| | - Andreas R Pfenning
- Department of Computational Biology, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA; Neuroscience Institute, Carnegie Mellon University, Pittsburgh, PA 15213, USA.
| | - Rebecca P Seal
- Department of Neurobiology, University of Pittsburgh School of Medicine, Pittsburgh, PA 15213, USA; Pittsburgh Center for Pain Research, University of Pittsburgh School of Medicine, Pittsburgh, PA 15213, USA; Department of Otolaryngology, University of Pittsburgh School of Medicine, Pittsburgh, PA 15213, USA.
| |
Collapse
|
13
|
Redlich R, Kowalczyk A, Tene M, Sestili HH, Foley K, Saputra E, Clark N, Chikina M, Meyer WK, Pfenning AR. RERconverge Expansion: Using Relative Evolutionary Rates to Study Complex Categorical Trait Evolution. Mol Biol Evol 2024; 41:msae210. [PMID: 39404101 PMCID: PMC11529301 DOI: 10.1093/molbev/msae210] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2024] [Revised: 10/01/2024] [Accepted: 10/08/2024] [Indexed: 10/23/2024] Open
Abstract
Comparative genomics approaches seek to associate molecular evolution with the evolution of phenotypes across a phylogeny. Many of these methods lack the ability to analyze non-ordinal categorical traits with more than two categories. To address this limitation, we introduce an expansion to RERconverge that associates shifts in evolutionary rates with the convergent evolution of categorical traits. The categorical RERconverge expansion includes methods for performing categorical ancestral state reconstruction, statistical tests for associating relative evolutionary rates with categorical variables, and a new method for performing phylogeny-aware permutations, "permulations", on categorical traits. We demonstrate our new method on a three-category diet phenotype, and we compare its performance to binary RERconverge analyses and two existing methods for comparative genomic analyses of categorical traits: phylogenetic simulations and a phylogenetic signal based method. We present an analysis of how the categorical permulations scale with the number of species and the number of categories included in the analysis. Our results show that our new categorical method outperforms phylogenetic simulations at identifying genes and enriched pathways significantly associated with the diet phenotypes and that the categorical ancestral state reconstruction drives an improvement in our ability to capture diet-related enriched pathways compared to binary RERconverge when implemented without user input on phenotype evolution. The categorical expansion to RERconverge will provide a strong foundation for applying the comparative method to categorical traits on larger data sets with more species and more complex trait evolution than have previously been analyzed.
Collapse
Affiliation(s)
- Ruby Redlich
- Department of Computational Biology, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Amanda Kowalczyk
- Department of Computational Biology, Carnegie Mellon University, Pittsburgh, PA 15213, USA
- Neuroscience Institute, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Michael Tene
- Department of Biological Sciences, Lehigh University, Bethlehem, PA 18015, USA
| | - Heather H Sestili
- Department of Computational Biology, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Kathleen Foley
- Department of Biological Sciences, Lehigh University, Bethlehem, PA 18015, USA
| | - Elysia Saputra
- Department of Computational and Systems Biology, University of Pittsburgh, Pittsburgh, PA 15260, USA
| | - Nathan Clark
- Department of Computational and Systems Biology, University of Pittsburgh, Pittsburgh, PA 15260, USA
| | - Maria Chikina
- Department of Computational and Systems Biology, University of Pittsburgh, Pittsburgh, PA 15260, USA
| | - Wynn K Meyer
- Department of Biological Sciences, Lehigh University, Bethlehem, PA 18015, USA
| | - Andreas R Pfenning
- Department of Computational Biology, Carnegie Mellon University, Pittsburgh, PA 15213, USA
- Neuroscience Institute, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| |
Collapse
|
14
|
Uebbing S, Kocher AA, Baumgartner M, Ji Y, Bai S, Xing X, Nottoli T, Noonan JP. Evolutionary Innovations in Conserved Regulatory Elements Associate With Developmental Genes in Mammals. Mol Biol Evol 2024; 41:msae199. [PMID: 39302728 PMCID: PMC11465374 DOI: 10.1093/molbev/msae199] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2024] [Revised: 08/26/2024] [Accepted: 09/17/2024] [Indexed: 09/22/2024] Open
Abstract
Transcriptional enhancers orchestrate cell type- and time point-specific gene expression programs. Genetic variation within enhancer sequences is an important contributor to phenotypic variation including evolutionary adaptations and human disease. Certain genes and pathways may be more prone to regulatory evolution than others, with different patterns across diverse organisms, but whether such patterns exist has not been investigated at a sufficient scale. To address this question, we identified signatures of accelerated sequence evolution in conserved enhancer elements throughout the mammalian phylogeny at an unprecedented scale. While different genes and pathways were enriched for regulatory evolution in different parts of the tree, we found a striking overall pattern of pleiotropic genes involved in gene regulatory and developmental processes being enriched for accelerated enhancer evolution. These genes were connected to more enhancers than other genes, which was the basis for having an increased amount of sequence acceleration over all their enhancers combined. We provide evidence that sequence acceleration is associated with turnover of regulatory function. Detailed study of one acceleration event in an enhancer of HES1 revealed that sequence evolution led to a new activity domain in the developing limb that emerged concurrently with the evolution of digit reduction in hoofed mammals. Our results provide evidence that enhancer evolution has been a frequent contributor to regulatory innovation at conserved developmental signaling genes in mammals.
Collapse
Affiliation(s)
- Severin Uebbing
- Department of Genetics, Yale School of Medicine, New Haven, CT, USA
- Department of Biology, Genome Biology and Epigenetics, Institute of Biodynamics and Biocomplexity, Utrecht University, Utrecht, The Netherlands
| | - Acadia A Kocher
- Department of Genetics, Yale School of Medicine, New Haven, CT, USA
| | | | - Yu Ji
- Department of Genetics, Yale School of Medicine, New Haven, CT, USA
| | - Suxia Bai
- Yale Genome Editing Center, Yale School of Medicine, New Haven, CT, USA
| | - Xiaojun Xing
- Yale Genome Editing Center, Yale School of Medicine, New Haven, CT, USA
| | - Timothy Nottoli
- Yale Genome Editing Center, Yale School of Medicine, New Haven, CT, USA
| | - James P Noonan
- Department of Genetics, Yale School of Medicine, New Haven, CT, USA
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT, USA
- Department of Neuroscience, Yale School of Medicine, New Haven, CT, USA
- Wu Tsai Institute, Yale University, New Haven, CT, USA
| |
Collapse
|
15
|
Caglayan E, Konopka G. Evolutionary neurogenomics at single-cell resolution. Curr Opin Genet Dev 2024; 88:102239. [PMID: 39094380 DOI: 10.1016/j.gde.2024.102239] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2024] [Revised: 06/20/2024] [Accepted: 07/22/2024] [Indexed: 08/04/2024]
Abstract
The human brain is composed of increasingly recognized heterogeneous cell types. Applying single-cell genomics to brain tissue can elucidate relative cell type proportions as well as differential gene expression and regulation among humans and other species. Here, we review recent studies that utilized high-throughput genomics approaches to compare brains among species at single-cell resolution. These studies identified genomic elements that are similar among species as well as evolutionary novelties on the human lineage. We focus on those human-relevant innovations and discuss the biological implications of these modifications. Finally, we discuss areas of comparative single-cell genomics that remain unexplored either due to needed technological advances or due to biological availability at the brain region or species level.
Collapse
Affiliation(s)
- Emre Caglayan
- Department of Neuroscience, Peter O'Donnell Jr. Brain Institute, UT Southwestern Medical Center, Dallas, TX 75390, USA
| | - Genevieve Konopka
- Department of Neuroscience, Peter O'Donnell Jr. Brain Institute, UT Southwestern Medical Center, Dallas, TX 75390, USA.
| |
Collapse
|
16
|
Rong S, Root E, Reilly SK. Massively parallel approaches for characterizing noncoding functional variation in human evolution. Curr Opin Genet Dev 2024; 88:102256. [PMID: 39217658 PMCID: PMC11648527 DOI: 10.1016/j.gde.2024.102256] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2024] [Revised: 08/02/2024] [Accepted: 08/16/2024] [Indexed: 09/04/2024]
Abstract
The genetic differences underlying unique phenotypes in humans compared to our closest primate relatives have long remained a mystery. Similarly, the genetic basis of adaptations between human groups during our expansion across the globe is poorly characterized. Uncovering the downstream phenotypic consequences of these genetic variants has been difficult, as a substantial portion lies in noncoding regions, such as cis-regulatory elements (CREs). Here, we review recent high-throughput approaches to measure the functions of CREs and the impact of variation within them. CRISPR screens can directly perturb CREs in the genome to understand downstream impacts on gene expression and phenotypes, while massively parallel reporter assays can decipher the regulatory impact of sequence variants. Machine learning has begun to be able to predict regulatory function from sequence alone, further scaling our ability to characterize genome function. Applying these tools across diverse phenotypes, model systems, and ancestries is beginning to revolutionize our understanding of noncoding variation underlying human evolution.
Collapse
Affiliation(s)
- Stephen Rong
- Department of Genetics, Yale University, New Haven, CT, USA.
| | - Elise Root
- Department of Genetics, Yale University, New Haven, CT, USA
| | - Steven K Reilly
- Department of Genetics, Yale University, New Haven, CT, USA; Wu Tsai Institute, Yale University, New Haven, CT, USA.
| |
Collapse
|
17
|
Griffith EC, West AE, Greenberg ME. Neuronal enhancers fine-tune adaptive circuit plasticity. Neuron 2024; 112:3043-3057. [PMID: 39208805 PMCID: PMC11550865 DOI: 10.1016/j.neuron.2024.08.002] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2023] [Revised: 07/22/2024] [Accepted: 08/06/2024] [Indexed: 09/04/2024]
Abstract
Neuronal activity-regulated gene expression plays a crucial role in sculpting neural circuits that underpin adaptive brain function. Transcriptional enhancers are now recognized as key components of gene regulation that orchestrate spatiotemporally precise patterns of gene transcription. We propose that the dynamics of enhancer activation uniquely position these genomic elements to finely tune activity-dependent cellular plasticity. Enhancer specificity and modularity can be exploited to gain selective genetic access to specific cell states, and the precise modulation of target gene expression within restricted cellular contexts enabled by targeted enhancer manipulation allows for fine-grained evaluation of gene function. Mounting evidence also suggests that enduring stimulus-induced changes in enhancer states can modify target gene activation upon restimulation, thereby contributing to a form of cell-wide metaplasticity. We advocate for focused exploration of activity-dependent enhancer function to gain new insight into the mechanisms underlying brain plasticity and cognitive dysfunction.
Collapse
Affiliation(s)
- Eric C Griffith
- Department of Neurobiology, Harvard Medical School, Boston, MA, USA
| | - Anne E West
- Department of Neurobiology, Duke University Medical Center, Durham, NC, USA.
| | | |
Collapse
|
18
|
Wang S, Di Y, Yang Y, Salovska B, Li W, Hu L, Yin J, Shao W, Zhou D, Cheng J, Liu D, Yang H, Liu Y. PTMoreR-enabled cross-species PTM mapping and comparative phosphoproteomics across mammals. CELL REPORTS METHODS 2024; 4:100859. [PMID: 39255793 PMCID: PMC11440062 DOI: 10.1016/j.crmeth.2024.100859] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/04/2023] [Revised: 05/13/2024] [Accepted: 08/15/2024] [Indexed: 09/12/2024]
Abstract
To support PTM proteomic analysis and annotation in different species, we developed PTMoreR, a user-friendly tool that considers the surrounding amino acid sequences of PTM sites during BLAST, enabling a motif-centric analysis across species. By controlling sequence window similarity, PTMoreR can map phosphoproteomic results between any two species, perform site-level functional enrichment analysis, and generate kinase-substrate networks. We demonstrate that the majority of real P-sites in mice can be inferred from experimentally derived human P-sites with PTMoreR mapping. Furthermore, the compositions of 129 mammalian phosphoproteomes can also be predicted using PTMoreR. The method also identifies cross-species phosphorylation events that occur on proteins with an increased tendency to respond to the environmental factors. Moreover, the classic kinase motifs can be extracted across mammalian species, offering an evolutionary angle for refining current motifs. PTMoreR supports PTM proteomics in non-human species and facilitates quantitative phosphoproteomic analysis.
Collapse
Affiliation(s)
- Shisheng Wang
- Department of Pulmonary and Critical Care Medicine, Proteomics-Metabolomics Analysis Platform, and NHC Key Lab of Transplant Engineering and Immunology, West China Hospital, Sichuan University, Chengdu 610041, China
| | - Yi Di
- Yale Cancer Biology Institute, Yale University, West Haven, CT 06516, USA
| | - Yin Yang
- Department of Pulmonary and Critical Care Medicine, Proteomics-Metabolomics Analysis Platform, and NHC Key Lab of Transplant Engineering and Immunology, West China Hospital, Sichuan University, Chengdu 610041, China
| | - Barbora Salovska
- Yale Cancer Biology Institute, Yale University, West Haven, CT 06516, USA
| | - Wenxue Li
- Yale Cancer Biology Institute, Yale University, West Haven, CT 06516, USA
| | - Liqiang Hu
- Department of Pulmonary and Critical Care Medicine, Proteomics-Metabolomics Analysis Platform, and NHC Key Lab of Transplant Engineering and Immunology, West China Hospital, Sichuan University, Chengdu 610041, China
| | - Jiahui Yin
- Information Research Institute, Tongji University, Shanghai 200092, China
| | - Wenguang Shao
- State Key Laboratory of Microbial Metabolism, School of Life Science & Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Dong Zhou
- Department of Medicine, Division of Nephrology, University of Connecticut School of Medicine, Farmington, CT 06030, USA
| | - Jingqiu Cheng
- Department of Pulmonary and Critical Care Medicine, Proteomics-Metabolomics Analysis Platform, and NHC Key Lab of Transplant Engineering and Immunology, West China Hospital, Sichuan University, Chengdu 610041, China
| | - Dan Liu
- Department of Pulmonary and Critical Care Medicine, Proteomics-Metabolomics Analysis Platform, and NHC Key Lab of Transplant Engineering and Immunology, West China Hospital, Sichuan University, Chengdu 610041, China; State Key Laboratory of Respiratory Health and Multimorbidity, West China Hospital, Sichuan University, Chengdu 610041, China.
| | - Hao Yang
- Department of Pulmonary and Critical Care Medicine, Proteomics-Metabolomics Analysis Platform, and NHC Key Lab of Transplant Engineering and Immunology, West China Hospital, Sichuan University, Chengdu 610041, China.
| | - Yansheng Liu
- Yale Cancer Biology Institute, Yale University, West Haven, CT 06516, USA; Department of Pharmacology, Yale University School of Medicine, New Haven, CT 06520, USA; Department of Biomedical Informatics & Data Science, Yale Univeristy School of Medicine, New Haven, CT 06510, USA.
| |
Collapse
|
19
|
Liu L, Chen A, Li Y, Mulder J, Heyn H, Xu X. Spatiotemporal omics for biology and medicine. Cell 2024; 187:4488-4519. [PMID: 39178830 DOI: 10.1016/j.cell.2024.07.040] [Citation(s) in RCA: 30] [Impact Index Per Article: 30.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2024] [Revised: 07/05/2024] [Accepted: 07/23/2024] [Indexed: 08/26/2024]
Abstract
The completion of the Human Genome Project has provided a foundational blueprint for understanding human life. Nonetheless, understanding the intricate mechanisms through which our genetic blueprint is involved in disease or orchestrates development across temporal and spatial dimensions remains a profound scientific challenge. Recent breakthroughs in cellular omics technologies have paved new pathways for understanding the regulation of genomic elements and the relationship between gene expression, cellular functions, and cell fate determination. The advent of spatial omics technologies, encompassing both imaging and sequencing-based methodologies, has enabled a comprehensive understanding of biological processes from a cellular ecosystem perspective. This review offers an updated overview of how spatial omics has advanced our understanding of the translation of genetic information into cellular heterogeneity and tissue structural organization and their dynamic changes over time. It emphasizes the discovery of various biological phenomena, related to organ functionality, embryogenesis, species evolution, and the pathogenesis of diseases.
Collapse
Affiliation(s)
| | - Ao Chen
- BGI Research, Shenzhen 518083, China
| | | | - Jan Mulder
- Department of Neuroscience, Karolinska Institute, Stockholm, Sweden
| | - Holger Heyn
- Centro Nacional de Análisis Genómico (CNAG), Barcelona, Spain
| | - Xun Xu
- BGI Research, Hangzhou 310030, China; BGI Research, Shenzhen 518083, China.
| |
Collapse
|
20
|
Moriano J, Leonardi O, Vitriolo A, Testa G, Boeckx C. A multi-layered integrative analysis reveals a cholesterol metabolic program in outer radial glia with implications for human brain evolution. Development 2024; 151:dev202390. [PMID: 39114968 PMCID: PMC11385646 DOI: 10.1242/dev.202390] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2023] [Accepted: 07/18/2024] [Indexed: 08/28/2024]
Abstract
The definition of molecular and cellular mechanisms contributing to brain ontogenetic trajectories is essential to investigate the evolution of our species. Yet their functional dissection at an appropriate level of granularity remains challenging. Capitalizing on recent efforts that have extensively profiled neural stem cells from the developing human cortex, we develop an integrative computational framework to perform trajectory inference and gene regulatory network reconstruction, (pseudo)time-informed non-negative matrix factorization for learning the dynamics of gene expression programs, and paleogenomic analysis for a higher-resolution mapping of derived regulatory variants in our species in comparison with our closest relatives. We provide evidence for cell type-specific regulation of gene expression programs during indirect neurogenesis. In particular, our analysis uncovers a key role for a cholesterol program in outer radial glia, regulated by zinc-finger transcription factor KLF6. A cartography of the regulatory landscape impacted by Homo sapiens-derived variants reveals signals of selection clustering around regulatory regions associated with GLI3, a well-known regulator of radial glial cell cycle, and impacting KLF6 regulation. Our study contributes to the evidence of significant changes in metabolic pathways in recent human brain evolution.
Collapse
Affiliation(s)
- Juan Moriano
- Department of General Linguistics, University of Barcelona, 08007 Barcelona, Spain
- University of Barcelona Institute of Complex Systems, 08007 Barcelona, Spain
| | | | - Alessandro Vitriolo
- Human Technopole, Viale Rita Levi-Montalcini 1, 20157 Milan, Italy
- Department of Oncology and Hemato-Oncology, University of Milan, Via Santa Sofia 9, 20122 Milan, Italy
| | - Giuseppe Testa
- Human Technopole, Viale Rita Levi-Montalcini 1, 20157 Milan, Italy
- Department of Oncology and Hemato-Oncology, University of Milan, Via Santa Sofia 9, 20122 Milan, Italy
| | - Cedric Boeckx
- Department of General Linguistics, University of Barcelona, 08007 Barcelona, Spain
- University of Barcelona Institute of Complex Systems, 08007 Barcelona, Spain
- University of Barcelona Institute of Neurosciences, 08007 Barcelona, Spain
- Catalan Institute for Research and Advanced Studies (ICREA), 08007 Barcelona, Spain
| |
Collapse
|
21
|
Zhu Y, Watson C, Safonova Y, Pennell M, Bankevich A. Assessing Assembly Errors in Immunoglobulin Loci: A Comprehensive Evaluation of Long-read Genome Assemblies Across Vertebrates. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.07.19.604360. [PMID: 39091785 PMCID: PMC11291089 DOI: 10.1101/2024.07.19.604360] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 08/04/2024]
Abstract
Long-read sequencing technologies have revolutionized genome assembly producing near-complete chromosome assemblies for numerous organisms, which are invaluable to research in many fields. However, regions with complex repetitive structure continue to represent a challenge for genome assembly algorithms, particularly in areas with high heterozygosity. Robust and comprehensive solutions for the assessment of assembly accuracy and completeness in these regions do not exist. In this study we focus on the assembly of biomedically important antibody-encoding immunoglobulin (IG) loci, which are characterized by complex duplications and repeat structures. High-quality full-length assemblies for these loci are critical for resolving haplotype-level annotations of IG genes, without which, functional and evolutionary studies of antibody immunity across vertebrates are not tractable. To address these challenges, we developed a pipeline, "CloseRead", that generates multiple assembly verification metrics for analysis and visualization. These metrics expand upon those of existing quality assessment tools and specifically target complex and highly heterozygous regions. Using CloseRead, we systematically assessed the accuracy and completeness of IG loci in publicly available assemblies of 74 vertebrate species, identifying problematic regions. We also demonstrated that inspecting assembly graphs for problematic regions can both identify the root cause of assembly errors and illuminate solutions for improving erroneous assemblies. For a subset of species, we were able to correct assembly errors through targeted reassembly. Together, our analysis demonstrated the utility of assembly assessment in improving the completeness and accuracy of IG loci across species.
Collapse
Affiliation(s)
- Yixin Zhu
- Department of Quantitative and Computational Biology and Biological Sciences, University of Southern California, Los Angeles, CA, United States
| | - Corey Watson
- Department of Biochemistry and Molecular Biology, University of Louisville School of Medicine, Louisville, KY, United States
| | - Yana Safonova
- Department of Computer Science and Engineering, Pennsylvania State University, PA, United States
| | - Matt Pennell
- Department of Quantitative and Computational Biology and Biological Sciences, University of Southern California, Los Angeles, CA, United States
| | - Anton Bankevich
- Department of Computer Science and Engineering, Pennsylvania State University, PA, United States
| |
Collapse
|
22
|
Buckley RM, Ostrander EA. Large-scale genomic analysis of the domestic dog informs biological discovery. Genome Res 2024; 34:811-821. [PMID: 38955465 PMCID: PMC11293549 DOI: 10.1101/gr.278569.123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/04/2024]
Abstract
Recent advances in genomics, coupled with a unique population structure and remarkable levels of variation, have propelled the domestic dog to new levels as a system for understanding fundamental principles in mammalian biology. Central to this advance are more than 350 recognized breeds, each a closed population that has undergone selection for unique features. Genetic variation in the domestic dog is particularly well characterized compared with other domestic mammals, with almost 3000 high-coverage genomes publicly available. Importantly, as the number of sequenced genomes increases, new avenues for analysis are becoming available. Herein, we discuss recent discoveries in canine genomics regarding behavior, morphology, and disease susceptibility. We explore the limitations of current data sets for variant interpretation, tradeoffs between sequencing strategies, and the burgeoning role of long-read genomes for capturing structural variants. In addition, we consider how large-scale collections of whole-genome sequence data drive rare variant discovery and assess the geographic distribution of canine diversity, which identifies Asia as a major source of missing variation. Finally, we review recent comparative genomic analyses that will facilitate annotation of the noncoding genome in dogs.
Collapse
Affiliation(s)
- Reuben M Buckley
- National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Elaine A Ostrander
- National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
| |
Collapse
|
23
|
Edenhofer FC, Térmeg A, Ohnuki M, Jocher J, Kliesmete Z, Briem E, Hellmann I, Enard W. Generation and characterization of inducible KRAB-dCas9 iPSCs from primates for cross-species CRISPRi. iScience 2024; 27:110090. [PMID: 38947524 PMCID: PMC11214527 DOI: 10.1016/j.isci.2024.110090] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Revised: 03/28/2024] [Accepted: 05/21/2024] [Indexed: 07/02/2024] Open
Abstract
Comparisons of molecular phenotypes across primates provide unique information to understand human biology and evolution, and single-cell RNA-seq CRISPR interference (CRISPRi) screens are a powerful approach to analyze them. Here, we generate and validate three human, three gorilla, and two cynomolgus iPS cell lines that carry a dox-inducible KRAB-dCas9 construct at the AAVS1 locus. We show that despite variable expression levels of KRAB-dCas9 among lines, comparable downregulation of target genes and comparable phenotypic effects are observed in a single-cell RNA-seq CRISPRi screen. Hence, we provide valuable resources for performing and further extending CRISPRi in human and non-human primates.
Collapse
Affiliation(s)
- Fiona C. Edenhofer
- Anthropology and Human Genomics, Faculty of Biology, Ludwig-Maximilians-Universität München, 82152 Planegg, Germany
| | - Anita Térmeg
- Anthropology and Human Genomics, Faculty of Biology, Ludwig-Maximilians-Universität München, 82152 Planegg, Germany
| | - Mari Ohnuki
- Anthropology and Human Genomics, Faculty of Biology, Ludwig-Maximilians-Universität München, 82152 Planegg, Germany
- Institute for the Advanced Study of Human Biology, Kyoto University, Kyoto 606-8501, Japan
- Hakubi Center, Kyoto University, Kyoto 606-8501, Japan
| | - Jessica Jocher
- Anthropology and Human Genomics, Faculty of Biology, Ludwig-Maximilians-Universität München, 82152 Planegg, Germany
| | - Zane Kliesmete
- Anthropology and Human Genomics, Faculty of Biology, Ludwig-Maximilians-Universität München, 82152 Planegg, Germany
| | - Eva Briem
- Anthropology and Human Genomics, Faculty of Biology, Ludwig-Maximilians-Universität München, 82152 Planegg, Germany
| | - Ines Hellmann
- Anthropology and Human Genomics, Faculty of Biology, Ludwig-Maximilians-Universität München, 82152 Planegg, Germany
| | - Wolfgang Enard
- Anthropology and Human Genomics, Faculty of Biology, Ludwig-Maximilians-Universität München, 82152 Planegg, Germany
| |
Collapse
|
24
|
Zhao L, Hao R, Chai Z, Fu W, Yang W, Li C, Liu Q, Jiang Y. DeepOCR: A multi-species deep-learning framework for accurate identification of open chromatin regions in livestock. Comput Biol Chem 2024; 110:108077. [PMID: 38691895 DOI: 10.1016/j.compbiolchem.2024.108077] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2024] [Revised: 03/27/2024] [Accepted: 04/16/2024] [Indexed: 05/03/2024]
Abstract
A wealth of experimental evidence has suggested that open chromatin regions (OCRs) are involved in many critical biological activities, such as DNA replication, enhancer activity, and gene transcription. Accurately identifying OCRs in livestock species can provide critical insights into the distribution and characteristics of OCRs for disease treatment in livestock, thereby improving animal welfare. However, most current machine-learning methods for OCR prediction were originally designed for a limited number of model organisms, such as humans and some model organisms, and thus their performance on non-model organisms, specifically livestock, is often unsatisfactory. To bridge this gap, we propose DeepOCR, a lightweight depth-separable residual network model for predicting OCRs in livestock, including chicken, cattle, and sheep. DeepOCR integrates a single convolution layer and two improved residue structure blocks to extract and learn important features from the input DNA sequences. A fully connected layer was also employed to further process the extracted features and improve the robustness of the entire network. Our benchmarking experiments demonstrated superior prediction performance of DeepOCR compared to state-of-the-art approaches on testing datasets of the three species. The source code of DeepOCR is freely available for academic purposes at https://github.com/jasonzhao371/DeepOCR/. We anticipate DeepOCR servers as a practical and reliable computational tool for OCR-related studies in livestock species.
Collapse
Affiliation(s)
- Liangwei Zhao
- College of Information Engineering, Northwest A&F University, Yangling 712100, China
| | - Ran Hao
- College of Information Engineering, Northwest A&F University, Yangling 712100, China
| | - Ziyi Chai
- College of Information Engineering, Northwest A&F University, Yangling 712100, China
| | - Weiwei Fu
- College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou, Gansu 730020, China
| | - Wei Yang
- National Clinical Research Center for Infectious Diseases, Shenzhen Third People's Hospital, Shenzhen 518112, China
| | - Chen Li
- Monash Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia.
| | - Quanzhong Liu
- College of Information Engineering, Northwest A&F University, Yangling 712100, China.
| | - Yu Jiang
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling 712100, China; Key Laboratory of Livestock Biology, Northwest A&F University, Yangling, Shaanxi 712100, China.
| |
Collapse
|
25
|
Wirthlin ME, Schmid TA, Elie JE, Zhang X, Kowalczyk A, Redlich R, Shvareva VA, Rakuljic A, Ji MB, Bhat NS, Kaplow IM, Schäffer DE, Lawler AJ, Wang AZ, Phan BN, Annaldasula S, Brown AR, Lu T, Lim BK, Azim E, Clark NL, Meyer WK, Pond SLK, Chikina M, Yartsev MM, Pfenning AR. Vocal learning-associated convergent evolution in mammalian proteins and regulatory elements. Science 2024; 383:eabn3263. [PMID: 38422184 PMCID: PMC11313673 DOI: 10.1126/science.abn3263] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2021] [Accepted: 02/20/2024] [Indexed: 03/02/2024]
Abstract
Vocal production learning ("vocal learning") is a convergently evolved trait in vertebrates. To identify brain genomic elements associated with mammalian vocal learning, we integrated genomic, anatomical, and neurophysiological data from the Egyptian fruit bat (Rousettus aegyptiacus) with analyses of the genomes of 215 placental mammals. First, we identified a set of proteins evolving more slowly in vocal learners. Then, we discovered a vocal motor cortical region in the Egyptian fruit bat, an emergent vocal learner, and leveraged that knowledge to identify active cis-regulatory elements in the motor cortex of vocal learners. Machine learning methods applied to motor cortex open chromatin revealed 50 enhancers robustly associated with vocal learning whose activity tended to be lower in vocal learners. Our research implicates convergent losses of motor cortex regulatory elements in mammalian vocal learning evolution.
Collapse
Affiliation(s)
- Morgan E. Wirthlin
- Department of Computational Biology, Carnegie Mellon University; Pittsburgh, PA 15213, USA
- Present address: Department of Biomedical Engineering, Duke University; Durham, NC 27705
| | - Tobias A. Schmid
- Helen Wills Neuroscience Institute, University of California, Berkeley; Berkeley, CA 94708, USA
| | - Julie E. Elie
- Helen Wills Neuroscience Institute, University of California, Berkeley; Berkeley, CA 94708, USA
- Department of Bioengineering, University of California, Berkeley; Berkeley, CA 94708, USA
| | - Xiaomeng Zhang
- Department of Computational Biology, Carnegie Mellon University; Pittsburgh, PA 15213, USA
| | - Amanda Kowalczyk
- Department of Computational Biology, Carnegie Mellon University; Pittsburgh, PA 15213, USA
- Present address: Department of Biomedical Engineering, Duke University; Durham, NC 27705
| | - Ruby Redlich
- Department of Computational Biology, Carnegie Mellon University; Pittsburgh, PA 15213, USA
| | - Varvara A. Shvareva
- Department of Molecular and Cell Biology, University of California, Berkeley; Berkeley, CA 94708, USA
| | - Ashley Rakuljic
- Department of Molecular and Cell Biology, University of California, Berkeley; Berkeley, CA 94708, USA
| | - Maria B. Ji
- Department of Psychology, University of California, Berkeley; Berkeley, CA 94708, USA
| | - Ninad S. Bhat
- Department of Molecular and Cell Biology, University of California, Berkeley; Berkeley, CA 94708, USA
| | - Irene M. Kaplow
- Department of Computational Biology, Carnegie Mellon University; Pittsburgh, PA 15213, USA
- Present address: Department of Biomedical Engineering, Duke University; Durham, NC 27705
| | - Daniel E. Schäffer
- Department of Computational Biology, Carnegie Mellon University; Pittsburgh, PA 15213, USA
| | - Alyssa J. Lawler
- Present address: Department of Biomedical Engineering, Duke University; Durham, NC 27705
- Department of Biological Sciences, Carnegie Mellon University; Pittsburgh, PA 15213, USA
| | - Andrew Z. Wang
- Department of Computational Biology, Carnegie Mellon University; Pittsburgh, PA 15213, USA
| | - BaDoi N. Phan
- Department of Computational Biology, Carnegie Mellon University; Pittsburgh, PA 15213, USA
- Present address: Department of Biomedical Engineering, Duke University; Durham, NC 27705
| | - Siddharth Annaldasula
- Department of Computational Biology, Carnegie Mellon University; Pittsburgh, PA 15213, USA
| | - Ashley R. Brown
- Department of Computational Biology, Carnegie Mellon University; Pittsburgh, PA 15213, USA
- Present address: Department of Biomedical Engineering, Duke University; Durham, NC 27705
| | - Tianyu Lu
- Department of Computational Biology, Carnegie Mellon University; Pittsburgh, PA 15213, USA
| | - Byung Kook Lim
- Neurobiology section, Division of Biological Science, University of California, San Diego; La Jolla, CA 92093, USA
| | - Eiman Azim
- Molecular Neurobiology Laboratory, Salk Institute for Biological Studies; La Jolla, CA 92037, USA
| | - Nathan L. Clark
- Department of Biological Sciences, University of Pittsburgh; Pittsburgh, PA 15213, USA
| | - Wynn K. Meyer
- Department of Biological Sciences, Lehigh University; Bethlehem, PA 18015, USA
| | | | - Maria Chikina
- Department of Computational and Systems Biology, University of Pittsburgh; Pittsburgh, PA 15213, USA
| | - Michael M. Yartsev
- Helen Wills Neuroscience Institute, University of California, Berkeley; Berkeley, CA 94708, USA
- Department of Bioengineering, University of California, Berkeley; Berkeley, CA 94708, USA
| | - Andreas R. Pfenning
- Department of Computational Biology, Carnegie Mellon University; Pittsburgh, PA 15213, USA
| |
Collapse
|
26
|
Uebbing S, Kocher AA, Baumgartner M, Ji Y, Bai S, Xing X, Nottoli T, Noonan JP. Evolutionary innovation in conserved regulatory elements across the mammalian tree of life. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.01.31.578197. [PMID: 38352419 PMCID: PMC10862883 DOI: 10.1101/2024.01.31.578197] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/22/2024]
Abstract
Transcriptional enhancers orchestrate cell type- and time point-specific gene expression programs. Evolution of enhancer sequences can alter target gene expression without causing detrimental misexpression in other contexts. It has long been thought that this modularity allows evolutionary changes in enhancers to escape pleiotropic constraints, which is especially important for evolutionary constrained developmental patterning genes. However, there is still little data supporting this hypothesis. Here we identified signatures of accelerated evolution in conserved enhancer elements across the mammalian phylogeny. We found that pleiotropic genes involved in gene regulatory and developmental processes were enriched for accelerated sequence evolution within their enhancer elements. These genes were associated with an excess number of enhancers compared to other genes, and due to this they exhibit a substantial degree of sequence acceleration over all their enhancers combined. We provide evidence that sequence acceleration is associated with turnover of regulatory function. We studied one acceleration event in depth and found that its sequence evolution led to the emergence of a new enhancer activity domain that may be involved in the evolution of digit reduction in hoofed mammals. Our results provide tangible evidence that enhancer evolution has been a frequent contributor to modifications involving constrained developmental signaling genes in mammals.
Collapse
Affiliation(s)
- Severin Uebbing
- Department of Genetics, Yale School of Medicine, New Haven CT, USA
- Genome Biology and Epigenetics, Institute of Biodynamics and Biocomplexity, Department of Biology, Utrecht University, Utrecht, The Netherlands
| | - Acadia A Kocher
- Department of Genetics, Yale School of Medicine, New Haven CT, USA
- Present address: Division of Molecular Genetics, Netherlands Cancer Institute, Amsterdam, The Netherlands
| | | | - Yu Ji
- Department of Genetics, Yale School of Medicine, New Haven CT, USA
| | - Suxia Bai
- Yale Genome Editing Center, Yale School of Medicine, New Haven CT, USA
| | - Xiaojun Xing
- Yale Genome Editing Center, Yale School of Medicine, New Haven CT, USA
| | - Timothy Nottoli
- Yale Genome Editing Center, Yale School of Medicine, New Haven CT, USA
| | - James P Noonan
- Department of Genetics, Yale School of Medicine, New Haven CT, USA
- Department of Ecology and Evolutionary Biology, Yale University, New Haven CT, USA
- Department of Neuroscience, Yale School of Medicine, New Haven CT, USA
- Wu Tsai Institute, Yale University, New Haven CT, USA
| |
Collapse
|
27
|
Redlich R, Kowalczyk A, Tene M, Sestili HH, Foley K, Saputra E, Clark N, Chikina M, Meyer WK, Pfenning A. RERconverge Expansion: Using Relative Evolutionary Rates to Study Complex Categorical Trait Evolution. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.12.06.570425. [PMID: 38106136 PMCID: PMC10723433 DOI: 10.1101/2023.12.06.570425] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/19/2023]
Abstract
Comparative genomics approaches seek to associate evolutionary genetic changes with the evolution of phenotypes across a phylogeny. Many of these methods, including our evolutionary rates based method, RERconverge, lack the capability of analyzing non-ordinal, multicategorical traits. To address this limitation, we introduce an expansion to RERconverge that associates shifts in evolutionary rates with the convergent evolution of multi-categorical traits. The categorical RERconverge expansion includes methods for performing categorical ancestral state reconstruction, statistical tests for associating relative evolutionary rates with categorical variables, and a new method for performing phylogenetic permulations on multi-categorical traits. In addition to demonstrating our new method on a three-category diet phenotype, we compare its performance to naive pairwise binary RERconverge analyses and two existing methods for comparative genomic analyses of categorical traits: phylogenetic simulations and a phylogenetic signal based method. We also present a diagnostic analysis of the new permulations approach demonstrating how the method scales with the number of species and the number of categories included in the analysis. Our results show that our new categorical method outperforms phylogenetic simulations at identifying genes and enriched pathways significantly associated with the diet phenotype and that the new ancestral reconstruction drives an improvement in our ability to capture diet-related enriched pathways. Our categorical permulations were able to account for non-uniform null distributions and correct for non-independence in gene rank during pathway enrichment analysis. The categorical expansion to RERconverge will provide a strong foundation for applying the comparative method to categorical traits on larger data sets with more species and more complex trait evolution.
Collapse
|
28
|
Uhlen M, Quake SR. Sequential sequencing by synthesis and the next-generation sequencing revolution. Trends Biotechnol 2023; 41:1565-1572. [PMID: 37482467 DOI: 10.1016/j.tibtech.2023.06.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2023] [Revised: 06/11/2023] [Accepted: 06/15/2023] [Indexed: 07/25/2023]
Abstract
The impact of next-generation sequencing (NGS) cannot be overestimated. The technology has transformed the field of life science, contributing to a dramatic expansion in our understanding of human health and disease and our understanding of biology and ecology. The vast majority of the major NGS systems today are based on the concept of 'sequencing by synthesis' (SBS) with sequential detection of nucleotide incorporation using an engineered DNA polymerase. Based on this strategy, various alternative platforms have been developed, including the use of either native nucleotides or reversible terminators and different strategies for the attachment of DNA to a solid support. In this review, some of the key concepts leading to this remarkable development are discussed.
Collapse
Affiliation(s)
- Mathias Uhlen
- Science for Life Laboratory, Department of Protein Science, KTH Royal Institute of Technology, Stockholm, Sweden; Department of Neuroscience, Karolinska Institutet, Stockholm, Sweden.
| | - Stephen R Quake
- Departments of Bioengineering and Applied Physics, Stanford University, Stanford, CA, USA; Chan Zuckerberg Initiative, Redwood City, California, USA, Stanford, CA, USA
| |
Collapse
|
29
|
Zemke NR, Armand EJ, Wang W, Lee S, Zhou J, Li YE, Liu H, Tian W, Nery JR, Castanon RG, Bartlett A, Osteen JK, Li D, Zhuo X, Xu V, Chang L, Dong K, Indralingam HS, Rink JA, Xie Y, Miller M, Krienen FM, Zhang Q, Taskin N, Ting J, Feng G, McCarroll SA, Callaway EM, Wang T, Lein ES, Behrens MM, Ecker JR, Ren B. Conserved and divergent gene regulatory programs of the mammalian neocortex. Nature 2023; 624:390-402. [PMID: 38092918 PMCID: PMC10719095 DOI: 10.1038/s41586-023-06819-6] [Citation(s) in RCA: 30] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2023] [Accepted: 11/01/2023] [Indexed: 12/17/2023]
Abstract
Divergence of cis-regulatory elements drives species-specific traits1, but how this manifests in the evolution of the neocortex at the molecular and cellular level remains unclear. Here we investigated the gene regulatory programs in the primary motor cortex of human, macaque, marmoset and mouse using single-cell multiomics assays, generating gene expression, chromatin accessibility, DNA methylome and chromosomal conformation profiles from a total of over 200,000 cells. From these data, we show evidence that divergence of transcription factor expression corresponds to species-specific epigenome landscapes. We find that conserved and divergent gene regulatory features are reflected in the evolution of the three-dimensional genome. Transposable elements contribute to nearly 80% of the human-specific candidate cis-regulatory elements in cortical cells. Through machine learning, we develop sequence-based predictors of candidate cis-regulatory elements in different species and demonstrate that the genomic regulatory syntax is highly preserved from rodents to primates. Finally, we show that epigenetic conservation combined with sequence similarity helps to uncover functional cis-regulatory elements and enhances our ability to interpret genetic variants contributing to neurological disease and traits.
Collapse
Affiliation(s)
- Nathan R Zemke
- Department of Cellular and Molecular Medicine, University of California, San Diego School of Medicine, La Jolla, CA, USA
- Center for Epigenomics, University of California, San Diego School of Medicine, La Jolla, CA, USA
| | - Ethan J Armand
- Department of Cellular and Molecular Medicine, University of California, San Diego School of Medicine, La Jolla, CA, USA
- Bioinformatics and Systems Biology Program, University of California, San Diego, La Jolla, CA, USA
| | - Wenliang Wang
- Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, CA, USA
| | - Seoyeon Lee
- Department of Cellular and Molecular Medicine, University of California, San Diego School of Medicine, La Jolla, CA, USA
| | - Jingtian Zhou
- Bioinformatics and Systems Biology Program, University of California, San Diego, La Jolla, CA, USA
- Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, CA, USA
| | - Yang Eric Li
- Department of Cellular and Molecular Medicine, University of California, San Diego School of Medicine, La Jolla, CA, USA
| | - Hanqing Liu
- Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, CA, USA
- Division of Biological Sciences, University of California, San Diego, La Jolla, CA, USA
| | - Wei Tian
- Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, CA, USA
| | - Joseph R Nery
- Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, CA, USA
| | - Rosa G Castanon
- Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, CA, USA
| | - Anna Bartlett
- Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, CA, USA
| | - Julia K Osteen
- Computational Neurobiology Laboratory, The Salk Institute for Biological Studies, La Jolla, CA, USA
| | - Daofeng Li
- Department of Genetics, The Edison Family Center for Genome Sciences & Systems Biology, Washington University School of Medicine, St Louis, MO, USA
| | - Xiaoyu Zhuo
- Department of Genetics, The Edison Family Center for Genome Sciences & Systems Biology, Washington University School of Medicine, St Louis, MO, USA
| | - Vincent Xu
- Department of Genetics, The Edison Family Center for Genome Sciences & Systems Biology, Washington University School of Medicine, St Louis, MO, USA
| | - Lei Chang
- Department of Cellular and Molecular Medicine, University of California, San Diego School of Medicine, La Jolla, CA, USA
| | - Keyi Dong
- Department of Cellular and Molecular Medicine, University of California, San Diego School of Medicine, La Jolla, CA, USA
- Center for Epigenomics, University of California, San Diego School of Medicine, La Jolla, CA, USA
| | - Hannah S Indralingam
- Department of Cellular and Molecular Medicine, University of California, San Diego School of Medicine, La Jolla, CA, USA
- Center for Epigenomics, University of California, San Diego School of Medicine, La Jolla, CA, USA
| | - Jonathan A Rink
- Computational Neurobiology Laboratory, The Salk Institute for Biological Studies, La Jolla, CA, USA
| | - Yang Xie
- Department of Cellular and Molecular Medicine, University of California, San Diego School of Medicine, La Jolla, CA, USA
| | - Michael Miller
- Department of Cellular and Molecular Medicine, University of California, San Diego School of Medicine, La Jolla, CA, USA
- Center for Epigenomics, University of California, San Diego School of Medicine, La Jolla, CA, USA
| | - Fenna M Krienen
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ, USA
- Department of Genetics, Harvard Medical School, Boston, USA
| | - Qiangge Zhang
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- McGovern Institute for Brain Research, Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Naz Taskin
- Allen Institute for Brain Science, Seattle, WA, USA
| | | | - Guoping Feng
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- McGovern Institute for Brain Research, Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Steven A McCarroll
- Department of Genetics, Harvard Medical School, Boston, USA
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Edward M Callaway
- Systems Neurobiology Laboratories, The Salk Institute for Biological Studies, La Jolla, CA, USA
- Department of Neurosciences, University of California San Diego, La Jolla, CA, USA
| | - Ting Wang
- Department of Genetics, The Edison Family Center for Genome Sciences & Systems Biology, Washington University School of Medicine, St Louis, MO, USA
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO, USA
| | - Ed S Lein
- Allen Institute for Brain Science, Seattle, WA, USA
- Department of Neurological Surgery, University of Washington, Seattle, WA, USA
| | - M Margarita Behrens
- Computational Neurobiology Laboratory, The Salk Institute for Biological Studies, La Jolla, CA, USA
| | - Joseph R Ecker
- Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, CA, USA.
- Howard Hughes Medical Institute, The Salk Institute for Biological Studies, La Jolla, CA, USA.
| | - Bing Ren
- Department of Cellular and Molecular Medicine, University of California, San Diego School of Medicine, La Jolla, CA, USA.
- Center for Epigenomics, University of California, San Diego School of Medicine, La Jolla, CA, USA.
- Institute of Genomic Medicine, Moores Cancer Center, School of Medicine, University of California San Diego, La Jolla, CA, USA.
| |
Collapse
|
30
|
Zhang YJ, Luo Z, Sun Y, Liu J, Chen Z. From beasts to bytes: Revolutionizing zoological research with artificial intelligence. Zool Res 2023; 44:1115-1131. [PMID: 37933101 PMCID: PMC10802096 DOI: 10.24272/j.issn.2095-8137.2023.263] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2023] [Accepted: 10/30/2023] [Indexed: 11/08/2023] Open
Abstract
Since the late 2010s, Artificial Intelligence (AI) including machine learning, boosted through deep learning, has boomed as a vital tool to leverage computer vision, natural language processing and speech recognition in revolutionizing zoological research. This review provides an overview of the primary tasks, core models, datasets, and applications of AI in zoological research, including animal classification, resource conservation, behavior, development, genetics and evolution, breeding and health, disease models, and paleontology. Additionally, we explore the challenges and future directions of integrating AI into this field. Based on numerous case studies, this review outlines various avenues for incorporating AI into zoological research and underscores its potential to enhance our understanding of the intricate relationships that exist within the animal kingdom. As we build a bridge between beast and byte realms, this review serves as a resource for envisioning novel AI applications in zoological research that have not yet been explored.
Collapse
Affiliation(s)
- Yu-Juan Zhang
- Chongqing Key Laboratory of Vector Insects
- Chongqing Key Laboratory of Animal Biology
- College of Life Science, Chongqing Normal University, Chongqing 401331, China
| | - Zeyu Luo
- Chongqing Key Laboratory of Vector Insects
- Chongqing Key Laboratory of Animal Biology
- College of Life Science, Chongqing Normal University, Chongqing 401331, China
| | - Yawen Sun
- Chongqing Key Laboratory of Vector Insects
- Chongqing Key Laboratory of Animal Biology
- College of Life Science, Chongqing Normal University, Chongqing 401331, China
| | - Junhao Liu
- Chongqing Key Laboratory of Vector Insects
- Chongqing Key Laboratory of Animal Biology
- College of Life Science, Chongqing Normal University, Chongqing 401331, China
| | - Zongqing Chen
- School of Mathematical Sciences
- National Center for Applied Mathematics in Chongqing, Chongqing Normal University, Chongqing 401331, China. E-mail:
| |
Collapse
|
31
|
Roach JC, Freidin MB. Editorial: Insights in human and medical genomics: 2022. Front Genet 2023; 14:1287894. [PMID: 37818104 PMCID: PMC10561311 DOI: 10.3389/fgene.2023.1287894] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2023] [Accepted: 09/15/2023] [Indexed: 10/12/2023] Open
Affiliation(s)
- Jared C. Roach
- Institute for Systems Biology, Seattle, WA, United States
| | - Maxim B. Freidin
- Department of Biology, School of Biological and Behavioural Sciences, Queen Mary University of London, London, United Kingdom
| |
Collapse
|
32
|
Cicconardi F, Milanetti E, Pinheiro de Castro EC, Mazo-Vargas A, Van Belleghem SM, Ruggieri AA, Rastas P, Hanly J, Evans E, Jiggins CD, Owen McMillan W, Papa R, Di Marino D, Martin A, Montgomery SH. Evolutionary dynamics of genome size and content during the adaptive radiation of Heliconiini butterflies. Nat Commun 2023; 14:5620. [PMID: 37699868 PMCID: PMC10497600 DOI: 10.1038/s41467-023-41412-5] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2022] [Accepted: 08/30/2023] [Indexed: 09/14/2023] Open
Abstract
Heliconius butterflies, a speciose genus of Müllerian mimics, represent a classic example of an adaptive radiation that includes a range of derived dietary, life history, physiological and neural traits. However, key lineages within the genus, and across the broader Heliconiini tribe, lack genomic resources, limiting our understanding of how adaptive and neutral processes shaped genome evolution during their radiation. Here, we generate highly contiguous genome assemblies for nine Heliconiini, 29 additional reference-assembled genomes, and improve 10 existing assemblies. Altogether, we provide a dataset of annotated genomes for a total of 63 species, including 58 species within the Heliconiini tribe. We use this extensive dataset to generate a robust and dated heliconiine phylogeny, describe major patterns of introgression, explore the evolution of genome architecture, and the genomic basis of key innovations in this enigmatic group, including an assessment of the evolution of putative regulatory regions at the Heliconius stem. Our work illustrates how the increased resolution provided by such dense genomic sampling improves our power to generate and test gene-phenotype hypotheses, and precisely characterize how genomes evolve.
Collapse
Affiliation(s)
- Francesco Cicconardi
- School of Biological Sciences, Bristol University, Bristol, United Kingdom.
- Department of Zoology, University of Cambridge, Cambridge, United Kingdom.
| | - Edoardo Milanetti
- Department of Physics, Sapienza University, Piazzale Aldo Moro 5, 00185, Rome, Italy
- Center for Life Nano- & Neuro-Science, Italian Institute of Technology, Viale Regina Elena 291, 00161, Rome, Italy
| | | | - Anyi Mazo-Vargas
- Department of Ecology and Evolutionary Biology, Cornell University, Ithaca, NY, 14853, USA
| | - Steven M Van Belleghem
- Department of Biology, University of Puerto Rico, Rio Piedras, PR, Puerto Rico
- Ecology, Evolution and Conservation Biology, Biology Department, KU Leuven, Leuven, Belgium
| | | | - Pasi Rastas
- Institute of Biotechnology, University of Helsinki, Helsinki, Finland
| | - Joseph Hanly
- Department of Biological Sciences, The George Washington University, Washington DC, WA, 20052, USA
- Smithsonian Tropical Research Institute, Panama City, Panama
| | - Elizabeth Evans
- Department of Biology, University of Puerto Rico, Rio Piedras, PR, Puerto Rico
| | - Chris D Jiggins
- Department of Zoology, University of Cambridge, Cambridge, United Kingdom
| | - W Owen McMillan
- Smithsonian Tropical Research Institute, Panama City, Panama
| | - Riccardo Papa
- Department of Biology, University of Puerto Rico, Rio Piedras, PR, Puerto Rico
- Molecular Sciences and Research Center, University of Puerto Rico, San Juan, PR, Puerto Rico
- Comprehensive Cancer Center, University of Puerto Rico, San Juan, PR, Puerto Rico
| | - Daniele Di Marino
- Department of Life and Environmental Sciences, New York-Marche Structural Biology Center (NY-MaSBiC), Polytechnic University of Marche, Via Brecce Bianche, 60131, Ancona, Italy
- Neuronal Death and Neuroprotection Unit, Department of Neuroscience, Mario Negri Institute for Pharmacological Research-IRCCS, Via Mario Negri 2, 20156, Milano, Italy
- National Biodiversity Future Center (NBFC), Palermo, Italy
| | - Arnaud Martin
- Department of Biological Sciences, The George Washington University, Washington DC, WA, 20052, USA
| | - Stephen H Montgomery
- School of Biological Sciences, Bristol University, Bristol, United Kingdom.
- Smithsonian Tropical Research Institute, Panama City, Panama.
| |
Collapse
|
33
|
Ye M, Zhang DF. Understanding and modeling human traits and diseases: Insights from the comparative genomics resources of Zoonomia. Innovation (N Y) 2023; 4:100444. [PMID: 37305857 PMCID: PMC10251142 DOI: 10.1016/j.xinn.2023.100444] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2023] [Accepted: 05/17/2023] [Indexed: 06/13/2023] Open
Affiliation(s)
- Maosen Ye
- Key Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences and Yunnan Province, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650201, China
| | - Deng-Feng Zhang
- Key Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences and Yunnan Province, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650201, China
- Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming 650204, China
- Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming 650204, China
- National Research Facility for Phenotypic & Genetic Analysis of Model Animals (Primate Facility), Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650107, China
- KIZ/CUHK Joint Laboratory of Bioresources and Molecular Research in Common Diseases, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650201, China
| |
Collapse
|
34
|
Sullivan PF, Meadows JRS, Gazal S, Phan BN, Li X, Genereux DP, Dong MX, Bianchi M, Andrews G, Sakthikumar S, Nordin J, Roy A, Christmas MJ, Marinescu VD, Wang C, Wallerman O, Xue J, Yao S, Sun Q, Szatkiewicz J, Wen J, Huckins LM, Lawler A, Keough KC, Zheng Z, Zeng J, Wray NR, Li Y, Johnson J, Chen J, Paten B, Reilly SK, Hughes GM, Weng Z, Pollard KS, Pfenning AR, Forsberg-Nilsson K, Karlsson EK, Lindblad-Toh K, Andrews G, Armstrong JC, Bianchi M, Birren BW, Bredemeyer KR, Breit AM, Christmas MJ, Clawson H, Damas J, Di Palma F, Diekhans M, Dong MX, Eizirik E, Fan K, Fanter C, Foley NM, Forsberg-Nilsson K, Garcia CJ, Gatesy J, Gazal S, Genereux DP, Goodman L, Grimshaw J, Halsey MK, Harris AJ, Hickey G, Hiller M, Hindle AG, Hubley RM, Hughes GM, Johnson J, Juan D, Kaplow IM, Karlsson EK, Keough KC, Kirilenko B, Koepfli KP, Korstian JM, Kowalczyk A, Kozyrev SV, Lawler AJ, Lawless C, Lehmann T, Levesque DL, Lewin HA, Li X, Lind A, Lindblad-Toh K, Mackay-Smith A, Marinescu VD, Marques-Bonet T, Mason VC, Meadows JRS, Meyer WK, Moore JE, Moreira LR, Moreno-Santillan DD, Morrill KM, Muntané G, Murphy WJ, Navarro A, et alSullivan PF, Meadows JRS, Gazal S, Phan BN, Li X, Genereux DP, Dong MX, Bianchi M, Andrews G, Sakthikumar S, Nordin J, Roy A, Christmas MJ, Marinescu VD, Wang C, Wallerman O, Xue J, Yao S, Sun Q, Szatkiewicz J, Wen J, Huckins LM, Lawler A, Keough KC, Zheng Z, Zeng J, Wray NR, Li Y, Johnson J, Chen J, Paten B, Reilly SK, Hughes GM, Weng Z, Pollard KS, Pfenning AR, Forsberg-Nilsson K, Karlsson EK, Lindblad-Toh K, Andrews G, Armstrong JC, Bianchi M, Birren BW, Bredemeyer KR, Breit AM, Christmas MJ, Clawson H, Damas J, Di Palma F, Diekhans M, Dong MX, Eizirik E, Fan K, Fanter C, Foley NM, Forsberg-Nilsson K, Garcia CJ, Gatesy J, Gazal S, Genereux DP, Goodman L, Grimshaw J, Halsey MK, Harris AJ, Hickey G, Hiller M, Hindle AG, Hubley RM, Hughes GM, Johnson J, Juan D, Kaplow IM, Karlsson EK, Keough KC, Kirilenko B, Koepfli KP, Korstian JM, Kowalczyk A, Kozyrev SV, Lawler AJ, Lawless C, Lehmann T, Levesque DL, Lewin HA, Li X, Lind A, Lindblad-Toh K, Mackay-Smith A, Marinescu VD, Marques-Bonet T, Mason VC, Meadows JRS, Meyer WK, Moore JE, Moreira LR, Moreno-Santillan DD, Morrill KM, Muntané G, Murphy WJ, Navarro A, Nweeia M, Ortmann S, Osmanski A, Paten B, Paulat NS, Pfenning AR, Phan BN, Pollard KS, Pratt HE, Ray DA, Reilly SK, Rosen JR, Ruf I, Ryan L, Ryder OA, Sabeti PC, Schäffer DE, Serres A, Shapiro B, Smit AFA, Springer M, Srinivasan C, Steiner C, Storer JM, Sullivan KAM, Sullivan PF, Sundström E, Supple MA, Swofford R, Talbot JE, Teeling E, Turner-Maier J, Valenzuela A, Wagner F, Wallerman O, Wang C, Wang J, Weng Z, Wilder AP, Wirthlin ME, Xue JR, Zhang X. Leveraging base-pair mammalian constraint to understand genetic variation and human disease. Science 2023; 380:eabn2937. [PMID: 37104612 PMCID: PMC10259825 DOI: 10.1126/science.abn2937] [Show More Authors] [Citation(s) in RCA: 57] [Impact Index Per Article: 28.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2021] [Accepted: 02/09/2023] [Indexed: 04/29/2023]
Abstract
Thousands of genomic regions have been associated with heritable human diseases, but attempts to elucidate biological mechanisms are impeded by an inability to discern which genomic positions are functionally important. Evolutionary constraint is a powerful predictor of function, agnostic to cell type or disease mechanism. Single-base phyloP scores from 240 mammals identified 3.3% of the human genome as significantly constrained and likely functional. We compared phyloP scores to genome annotation, association studies, copy-number variation, clinical genetics findings, and cancer data. Constrained positions are enriched for variants that explain common disease heritability more than other functional annotations. Our results improve variant annotation but also highlight that the regulatory landscape of the human genome still needs to be further explored and linked to disease.
Collapse
Affiliation(s)
- Patrick F Sullivan
- Department of Genetics, University of North Carolina Medical School, Chapel Hill, NC 27599, USA
- Department of Medical Epidemiology and Biostatistics, Karolinska Institute, 17177 Stockholm, Sweden
| | - Jennifer R S Meadows
- Department of Medical Biochemistry and Microbiology, Science for Life Laboratory, Uppsala University, 75132 Uppsala, Sweden
| | - Steven Gazal
- Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA 90033, USA
- Center for Genetic Epidemiology, Keck School of Medicine, University of Southern California, Los Angeles, CA 90033, USA
| | - BaDoi N Phan
- Department of Computational Biology, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Xue Li
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, MA 01605, USA
- Broad Institute of MIT and Harvard, Cambridge, MA 02139, USA
| | - Diane P Genereux
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, MA 01605, USA
| | - Michael X Dong
- Department of Medical Biochemistry and Microbiology, Science for Life Laboratory, Uppsala University, 75132 Uppsala, Sweden
| | - Matteo Bianchi
- Department of Medical Biochemistry and Microbiology, Science for Life Laboratory, Uppsala University, 75132 Uppsala, Sweden
| | - Gregory Andrews
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, MA 01605, USA
| | - Sharadha Sakthikumar
- Department of Medical Biochemistry and Microbiology, Science for Life Laboratory, Uppsala University, 75132 Uppsala, Sweden
- Broad Institute of MIT and Harvard, Cambridge, MA 02139, USA
| | - Jessika Nordin
- Department of Medical Biochemistry and Microbiology, Science for Life Laboratory, Uppsala University, 75132 Uppsala, Sweden
| | - Ananya Roy
- Department of Immunology, Genetics and Pathology, Science for Life Laboratory, Uppsala University, 75185 Uppsala, Sweden
| | - Matthew J Christmas
- Department of Medical Biochemistry and Microbiology, Science for Life Laboratory, Uppsala University, 75132 Uppsala, Sweden
| | - Voichita D Marinescu
- Department of Medical Biochemistry and Microbiology, Science for Life Laboratory, Uppsala University, 75132 Uppsala, Sweden
| | - Chao Wang
- Department of Medical Biochemistry and Microbiology, Science for Life Laboratory, Uppsala University, 75132 Uppsala, Sweden
| | - Ola Wallerman
- Department of Medical Biochemistry and Microbiology, Science for Life Laboratory, Uppsala University, 75132 Uppsala, Sweden
| | - James Xue
- Broad Institute of MIT and Harvard, Cambridge, MA 02139, USA
- Center for System Biology, Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA
| | - Shuyang Yao
- Department of Medical Epidemiology and Biostatistics, Karolinska Institute, 17177 Stockholm, Sweden
| | - Quan Sun
- Department of Genetics, University of North Carolina Medical School, Chapel Hill, NC 27599, USA
| | - Jin Szatkiewicz
- Department of Genetics, University of North Carolina Medical School, Chapel Hill, NC 27599, USA
| | - Jia Wen
- Department of Genetics, University of North Carolina Medical School, Chapel Hill, NC 27599, USA
| | - Laura M Huckins
- Department of Genetic and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Alyssa Lawler
- Neuroscience Institute, Carnegie Mellon University, Pittsburgh, PA 15213, USA
- Department of Biological Sciences, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Kathleen C Keough
- Gladstone Institutes, San Francisco, CA 94158, USA
- Department of Epidemiology and Biostatistics, University of California, San Francisco, CA 94158, USA
| | - Zhili Zheng
- Institute for Molecular Bioscience, University of Queensland, Brisbane, QLD 4072, Australia
| | - Jian Zeng
- Institute for Molecular Bioscience, University of Queensland, Brisbane, QLD 4072, Australia
| | - Naomi R Wray
- Institute for Molecular Bioscience, University of Queensland, Brisbane, QLD 4072, Australia
| | - Yun Li
- Department of Genetics, University of North Carolina Medical School, Chapel Hill, NC 27599, USA
| | - Jessica Johnson
- Department of Genetic and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Jiawen Chen
- Department of Biostatistics, University of North Carolina Medical School, Chapel Hill, NC 27599, USA
| | - Benedict Paten
- UC Santa Cruz Genomics Institute, Santa Cruz, CA 95064, USA
| | - Steven K Reilly
- Department of Genetics, Yale School of Medicine, New Haven, CT 06510, USA
| | - Graham M Hughes
- School of Biology and Environmental Science, University College Dublin, Belfield, Dublin 4, Ireland
| | - Zhiping Weng
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, MA 01605, USA
| | - Katherine S Pollard
- Gladstone Institutes, San Francisco, CA 94158, USA
- Department of Epidemiology and Biostatistics, University of California, San Francisco, CA 94158, USA
- Chan Zuckerberg Biohub, San Francisco, CA 94158, USA
| | - Andreas R Pfenning
- Department of Computational Biology, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA
- Neuroscience Institute, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Karin Forsberg-Nilsson
- Department of Immunology, Genetics and Pathology, Science for Life Laboratory, Uppsala University, 75185 Uppsala, Sweden
- Biodiscovery Institute, University of Nottingham, Nottingham NG7 2RD, UK
| | - Elinor K Karlsson
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, MA 01605, USA
- Broad Institute of MIT and Harvard, Cambridge, MA 02139, USA
- Program in Molecular Medicine, UMass Chan Medical School, Worcester, MA 01605, USA
| | - Kerstin Lindblad-Toh
- Department of Medical Biochemistry and Microbiology, Science for Life Laboratory, Uppsala University, 75132 Uppsala, Sweden
- Broad Institute of MIT and Harvard, Cambridge, MA 02139, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
35
|
Abstract
Diverse mammal genomes open a new portal to hidden aspects of evolutionary history.
Collapse
Affiliation(s)
- Nathan S Upham
- School of Life Sciences, Arizona State University, Tempe, AZ, USA
| | - Michael J Landis
- Department of Biology, Washington University, St. Louis, MO, USA
| |
Collapse
|