1
|
Gong X, He W, Jin W, Ma H, Wang G, Li J, Xiao Y, Zhao Y, Chen Q, Guo H, Yang J, Qi Y, Dong W, Fu M, Li X, Liu J, Liu X, Yin A, Zhang Y, Wei Y. Disruption of maternal vascular remodeling by a fetal endoretrovirus-derived gene in preeclampsia. Genome Biol 2024; 25:117. [PMID: 38715110 PMCID: PMC11075363 DOI: 10.1186/s13059-024-03265-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2022] [Accepted: 04/30/2024] [Indexed: 05/12/2024] Open
Abstract
BACKGROUND Preeclampsia, one of the most lethal pregnancy-related diseases, is associated with the disruption of uterine spiral artery remodeling during placentation. However, the early molecular events leading to preeclampsia remain unknown. RESULTS By analyzing placentas from preeclampsia, non-preeclampsia, and twin pregnancies with selective intrauterine growth restriction, we show that the pathogenesis of preeclampsia is attributed to immature trophoblast and maldeveloped endothelial cells. Delayed epigenetic reprogramming during early extraembryonic tissue development leads to generation of excessive immature trophoblast cells. We find reduction of de novo DNA methylation in these trophoblast cells results in selective overexpression of maternally imprinted genes, including the endoretrovirus-derived gene PEG10 (paternally expressed gene 10). PEG10 forms virus-like particles, which are transferred from the trophoblast to the closely proximate endothelial cells. In normal pregnancy, only a low amount of PEG10 is transferred to maternal cells; however, in preeclampsia, excessive PEG10 disrupts maternal vascular development by inhibiting TGF-beta signaling. CONCLUSIONS Our study reveals the intricate epigenetic mechanisms that regulate trans-generational genetic conflict and ultimately ensure proper maternal-fetal interface formation.
Collapse
Affiliation(s)
- Xiaoli Gong
- Department of Obstetrics and Gynecology, Peking University Third Hospital, Beijing, China
| | - Wei He
- Medical Genetic Center, Guangdong Women and Children Hospital, Guangzhou, China
| | - Wan Jin
- Euler Technology, Beijing, China
- Department of Biological Repositories, Zhongnan Hospital of Wuhan University, Wuhan, China
| | - Hongwei Ma
- Department of Obstetrics and Gynecology, West China Second University Hospital of Sichuan University, Chengdu, China
- Department Key Laboratory of Birth Defects and Related Diseases of Women and Children (Sichuan University), Ministry of Education, Chengdu, China
| | - Gang Wang
- Department of Urology, Zhongnan Hospital of Wuhan University, Wuhan, China
- Department of Biological Repositories, Zhongnan Hospital of Wuhan University, Wuhan, China
- Human Genetic Resources Preservation Center of Hubei Province, Wuhan, China
- Laboratory of Precision Medicine, Zhongnan Hospital of Wuhan University, Wuhan, China
| | - Jiaxin Li
- Department of Obstetrics and Gynecology, Peking University Third Hospital, Beijing, China
| | - Yu Xiao
- Department of Urology, Zhongnan Hospital of Wuhan University, Wuhan, China
- Department of Biological Repositories, Zhongnan Hospital of Wuhan University, Wuhan, China
- Human Genetic Resources Preservation Center of Hubei Province, Wuhan, China
- Laboratory of Precision Medicine, Zhongnan Hospital of Wuhan University, Wuhan, China
| | - Yangyu Zhao
- Department of Obstetrics and Gynecology, Peking University Third Hospital, Beijing, China
| | | | | | - Jiexia Yang
- Medical Genetic Center, Guangdong Women and Children Hospital, Guangzhou, China
| | - Yiming Qi
- Medical Genetic Center, Guangdong Women and Children Hospital, Guangzhou, China
| | - Wei Dong
- Maternity Ward, Haidian Maternal and Child Health Hospital, Beijing, China
| | - Meng Fu
- Department of Obstetrics and Gynecology, Haidian Maternal and Child Health Hospital, Beijing, China
| | - Xiaojuan Li
- Euler Technology, Beijing, China
- Present Address: International Max Planck Research School for Genome Science, and University of Göttingen, Göttingen Center for Molecular Biosciences, Göttingen, Germany
| | | | - Xinghui Liu
- Department of Obstetrics and Gynecology, West China Second University Hospital of Sichuan University, Chengdu, China.
- Department Key Laboratory of Birth Defects and Related Diseases of Women and Children (Sichuan University), Ministry of Education, Chengdu, China.
| | - Aihua Yin
- Medical Genetic Center, Guangdong Women and Children Hospital, Guangzhou, China.
| | - Yi Zhang
- Euler Technology, Beijing, China.
| | - Yuan Wei
- Department of Obstetrics and Gynecology, Peking University Third Hospital, Beijing, China.
| |
Collapse
|
2
|
Uebbing S, Kocher AA, Baumgartner M, Ji Y, Bai S, Xing X, Nottoli T, Noonan JP. Evolutionary innovation in conserved regulatory elements across the mammalian tree of life. bioRxiv 2024:2024.01.31.578197. [PMID: 38352419 PMCID: PMC10862883 DOI: 10.1101/2024.01.31.578197] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/22/2024]
Abstract
Transcriptional enhancers orchestrate cell type- and time point-specific gene expression programs. Evolution of enhancer sequences can alter target gene expression without causing detrimental misexpression in other contexts. It has long been thought that this modularity allows evolutionary changes in enhancers to escape pleiotropic constraints, which is especially important for evolutionary constrained developmental patterning genes. However, there is still little data supporting this hypothesis. Here we identified signatures of accelerated evolution in conserved enhancer elements across the mammalian phylogeny. We found that pleiotropic genes involved in gene regulatory and developmental processes were enriched for accelerated sequence evolution within their enhancer elements. These genes were associated with an excess number of enhancers compared to other genes, and due to this they exhibit a substantial degree of sequence acceleration over all their enhancers combined. We provide evidence that sequence acceleration is associated with turnover of regulatory function. We studied one acceleration event in depth and found that its sequence evolution led to the emergence of a new enhancer activity domain that may be involved in the evolution of digit reduction in hoofed mammals. Our results provide tangible evidence that enhancer evolution has been a frequent contributor to modifications involving constrained developmental signaling genes in mammals.
Collapse
Affiliation(s)
- Severin Uebbing
- Department of Genetics, Yale School of Medicine, New Haven CT, USA
- Genome Biology and Epigenetics, Institute of Biodynamics and Biocomplexity, Department of Biology, Utrecht University, Utrecht, The Netherlands
| | - Acadia A Kocher
- Department of Genetics, Yale School of Medicine, New Haven CT, USA
- Present address: Division of Molecular Genetics, Netherlands Cancer Institute, Amsterdam, The Netherlands
| | | | - Yu Ji
- Department of Genetics, Yale School of Medicine, New Haven CT, USA
| | - Suxia Bai
- Yale Genome Editing Center, Yale School of Medicine, New Haven CT, USA
| | - Xiaojun Xing
- Yale Genome Editing Center, Yale School of Medicine, New Haven CT, USA
| | - Timothy Nottoli
- Yale Genome Editing Center, Yale School of Medicine, New Haven CT, USA
| | - James P Noonan
- Department of Genetics, Yale School of Medicine, New Haven CT, USA
- Department of Ecology and Evolutionary Biology, Yale University, New Haven CT, USA
- Department of Neuroscience, Yale School of Medicine, New Haven CT, USA
- Wu Tsai Institute, Yale University, New Haven CT, USA
| |
Collapse
|
3
|
Redlich R, Kowalczyk A, Tene M, Sestili HH, Foley K, Saputra E, Clark N, Chikina M, Meyer WK, Pfenning A. RERconverge Expansion: Using Relative Evolutionary Rates to Study Complex Categorical Trait Evolution. bioRxiv 2023:2023.12.06.570425. [PMID: 38106136 PMCID: PMC10723433 DOI: 10.1101/2023.12.06.570425] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/19/2023]
Abstract
Comparative genomics approaches seek to associate evolutionary genetic changes with the evolution of phenotypes across a phylogeny. Many of these methods, including our evolutionary rates based method, RERconverge, lack the capability of analyzing non-ordinal, multicategorical traits. To address this limitation, we introduce an expansion to RERconverge that associates shifts in evolutionary rates with the convergent evolution of multi-categorical traits. The categorical RERconverge expansion includes methods for performing categorical ancestral state reconstruction, statistical tests for associating relative evolutionary rates with categorical variables, and a new method for performing phylogenetic permulations on multi-categorical traits. In addition to demonstrating our new method on a three-category diet phenotype, we compare its performance to naive pairwise binary RERconverge analyses and two existing methods for comparative genomic analyses of categorical traits: phylogenetic simulations and a phylogenetic signal based method. We also present a diagnostic analysis of the new permulations approach demonstrating how the method scales with the number of species and the number of categories included in the analysis. Our results show that our new categorical method outperforms phylogenetic simulations at identifying genes and enriched pathways significantly associated with the diet phenotype and that the new ancestral reconstruction drives an improvement in our ability to capture diet-related enriched pathways. Our categorical permulations were able to account for non-uniform null distributions and correct for non-independence in gene rank during pathway enrichment analysis. The categorical expansion to RERconverge will provide a strong foundation for applying the comparative method to categorical traits on larger data sets with more species and more complex trait evolution.
Collapse
|
4
|
Liu A, Wang N, Xie G, Li Y, Yan X, Li X, Zhu Z, Li Z, Yang J, Meng F, Dou M, Chen W, Ma N, Jiang Y, Gao Y, Wang Y. GC-biased gene conversion drives accelerated evolution of ultraconserved elements in mammalian and avian genomes. Genome Res 2023; 33:1673-1689. [PMID: 37884342 PMCID: PMC10691551 DOI: 10.1101/gr.277784.123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2023] [Accepted: 08/23/2023] [Indexed: 10/28/2023]
Abstract
Ultraconserved elements (UCEs) are the most conserved regions among the genomes of evolutionarily distant species and are thought to play critical biological functions. However, some UCEs rapidly evolved in specific lineages, and whether they contributed to adaptive evolution is still controversial. Here, using an increased number of sequenced genomes with high taxonomic coverage, we identified 2191 mammalian UCEs and 5938 avian UCEs from 95 mammal and 94 bird genomes, respectively. Our results show that these UCEs are functionally constrained and that their adjacent genes are prone to widespread expression with low expression diversity across tissues. Functional enrichment of mammalian and avian UCEs shows different trends indicating that UCEs may contribute to adaptive evolution of taxa. Focusing on lineage-specific accelerated evolution, we discover that the proportion of fast-evolving UCEs in nine mammalian and 10 avian test lineages range from 0.19% to 13.2%. Notably, up to 62.1% of fast-evolving UCEs in test lineages are much more likely to result from GC-biased gene conversion (gBGC). A single cervid-specific gBGC region embracing the uc.359 allele significantly alters the expression of Nova1 and other neural-related genes in the rat brain. Combined with the altered regulatory activity of ancient gBGC-induced fast-evolving UCEs in eutherians, our results provide evidence that synergy between gBGC and selection shaped lineage-specific substitution patterns, even in the most constrained regulatory elements. In summary, our results show that gBGC played an important role in facilitating lineage-specific accelerated evolution of UCEs, and further support the idea that a combination of multiple evolutionary forces shapes adaptive evolution.
Collapse
Affiliation(s)
- Anguo Liu
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China
- Key Laboratory of Livestock Biology, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Nini Wang
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China
- Faculty of Mathematics and Natural Sciences, University of Cologne, and Cologne Excellence Cluster for Cellular Stress Responses in Aging-Associated Diseases (CECAD), University Hospital Cologne, Cologne 50931, Germany
| | - Guoxiang Xie
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China
- Key Laboratory of Livestock Biology, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Yang Li
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China
- Key Laboratory of Livestock Biology, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Xixi Yan
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China
- Key Laboratory of Livestock Biology, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Xinmei Li
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China
- Key Laboratory of Livestock Biology, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Zhenliang Zhu
- Key Laboratory of Livestock Biology, Northwest A&F University, Yangling, Shaanxi 712100, China
- College of Veterinary Medicine, Northwest A&F University, Yangling, Shaanxi 712100, China
- Key Laboratory of Animal Biotechnology, Ministry of Agriculture, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Zhuohui Li
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China
- Key Laboratory of Livestock Biology, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Jing Yang
- Key Laboratory of Livestock Biology, Northwest A&F University, Yangling, Shaanxi 712100, China
- College of Veterinary Medicine, Northwest A&F University, Yangling, Shaanxi 712100, China
- Key Laboratory of Animal Biotechnology, Ministry of Agriculture, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Fanxin Meng
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Mingle Dou
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China
- Key Laboratory of Livestock Biology, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Weihuang Chen
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Nange Ma
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China
- Key Laboratory of Livestock Biology, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Yu Jiang
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China
- Key Laboratory of Livestock Biology, Northwest A&F University, Yangling, Shaanxi 712100, China
- Center for Functional Genomics, Institute of Future Agriculture, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Yuanpeng Gao
- Key Laboratory of Livestock Biology, Northwest A&F University, Yangling, Shaanxi 712100, China;
- College of Veterinary Medicine, Northwest A&F University, Yangling, Shaanxi 712100, China
- Key Laboratory of Animal Biotechnology, Ministry of Agriculture, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Yu Wang
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China;
- Key Laboratory of Livestock Biology, Northwest A&F University, Yangling, Shaanxi 712100, China
| |
Collapse
|
5
|
Umu SU, Paynter VM, Trondsen H, Buschmann T, Rounge TB, Peterson KJ, Fromm B. Accurate microRNA annotation of animal genomes using trained covariance models of curated microRNA complements in MirMachine. Cell Genom 2023; 3:100348. [PMID: 37601971 PMCID: PMC10435380 DOI: 10.1016/j.xgen.2023.100348] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/06/2022] [Revised: 03/15/2023] [Accepted: 05/26/2023] [Indexed: 08/22/2023]
Abstract
The annotation of microRNAs depends on the availability of transcriptomics data and expert knowledge. This has led to a gap between the availability of novel genomes and high-quality microRNA complements. Using >16,000 microRNAs from the manually curated microRNA gene database MirGeneDB, we generated trained covariance models for all conserved microRNA families. These models are available in our tool MirMachine, which annotates conserved microRNAs within genomes. We successfully applied MirMachine to a range of animal species, including those with large genomes and genome duplications and extinct species, where small RNA sequencing is hard to achieve. We further describe a microRNA score of expected microRNAs that can be used to assess the completeness of genome assemblies. MirMachine closes a long-persisting gap in the microRNA field by facilitating automated genome annotation pipelines and deeper studies into the evolution of genome regulation, even in extinct organisms.
Collapse
Affiliation(s)
- Sinan Uğur Umu
- Department of Pathology, Institute of Clinical Medicine, University of Oslo, Oslo, Norway
| | - Vanessa M. Paynter
- The Arctic University Museum of Norway, UiT - The Arctic University of Norway, Tromsø, Norway
| | - Håvard Trondsen
- Department of Pathology, Institute of Clinical Medicine, University of Oslo, Oslo, Norway
| | | | - Trine B. Rounge
- Department of Research, Cancer Registry of Norway, Oslo, Norway
- Centre for Bioinformatics, Department of Pharmacy, University of Oslo, Oslo, Norway
| | - Kevin J. Peterson
- Department of Biological Sciences, Dartmouth College, Hanover, NH, USA
| | - Bastian Fromm
- The Arctic University Museum of Norway, UiT - The Arctic University of Norway, Tromsø, Norway
| |
Collapse
|
6
|
Kocher AA, Dutrow EV, Uebbing S, Yim KM, Larios MFR, Baumgartner M, Nottoli T, Noonan JP. CpG island turnover events predict evolutionary changes in enhancer activity. bioRxiv 2023:2023.05.09.540063. [PMID: 37214934 PMCID: PMC10197647 DOI: 10.1101/2023.05.09.540063] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
Genetic changes that modify the function of transcriptional enhancers have been linked to the evolution of biological diversity across species. Multiple studies have focused on the role of nucleotide substitutions, transposition, and insertions and deletions in altering enhancer function. Here we show that turnover of CpG islands (CGIs), which contribute to enhancer activation, is broadly associated with changes in enhancer activity across mammals, including humans. We integrated maps of CGIs and enhancer activity-associated histone modifications obtained from multiple tissues in nine mammalian species and found that CGI content in enhancers was strongly associated with increased histone modification levels. CGIs showed widespread turnover across species and species-specific CGIs were strongly enriched for enhancers exhibiting species-specific activity across all tissues and species we examined. Genes associated with enhancers with species-specific CGIs showed concordant biases in their expression, supporting that CGI turnover contributes to gene regulatory innovation. Our results also implicate CGI turnover in the evolution of Human Gain Enhancers (HGEs), which show increased activity in human embryonic development and may have contributed to the evolution of uniquely human traits. Using a humanized mouse model, we show that a highly conserved HGE with a large CGI absent from the mouse ortholog shows increased activity at the human CGI in the humanized mouse diencephalon. Collectively, our results point to CGI turnover as a mechanism driving gene regulatory changes potentially underlying trait evolution in mammals.
Collapse
Affiliation(s)
- Acadia A. Kocher
- Department of Genetics, Yale School of Medicine, New Haven CT 06510, USA
| | - Emily V. Dutrow
- Department of Genetics, Yale School of Medicine, New Haven CT 06510, USA
- Present address: Cancer Genetics and Comparative Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892, USA
| | - Severin Uebbing
- Department of Genetics, Yale School of Medicine, New Haven CT 06510, USA
| | - Kristina M. Yim
- Department of Genetics, Yale School of Medicine, New Haven CT 06510, USA
| | | | | | - Timothy Nottoli
- Department of Comparative Medicine, Yale School of Medicine, New Haven, CT 06510, USA
- Yale Genome Editing Center, Yale School of Medicine, New Haven, CT 06510, USA
| | - James P. Noonan
- Department of Genetics, Yale School of Medicine, New Haven CT 06510, USA
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT 06520, USA
- Department of Neuroscience, Yale School of Medicine, New Haven, CT 06510, USA
- Wu Tsai Institute, Yale University, New Haven, CT 06510, USA
| |
Collapse
|
7
|
Wolf M, Zapf K, Gupta DK, Hiller M, Árnason Ú, Janke A. The genome of the pygmy right whale illuminates the evolution of rorquals. BMC Biol 2023; 21:79. [PMID: 37041515 PMCID: PMC10091562 DOI: 10.1186/s12915-023-01579-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2022] [Accepted: 03/27/2023] [Indexed: 04/13/2023] Open
Abstract
BACKGROUND Baleen whales are a clade of gigantic and highly specialized marine mammals. Their genomes have been used to investigate their complex evolutionary history and to decipher the molecular mechanisms that allowed them to reach these dimensions. However, many unanswered questions remain, especially about the early radiation of rorquals and how cancer resistance interplays with their huge number of cells. The pygmy right whale is the smallest and most elusive among the baleen whales. It reaches only a fraction of the body length compared to its relatives and it is the only living member of an otherwise extinct family. This placement makes the pygmy right whale genome an interesting target to update the complex phylogenetic past of baleen whales, because it splits up an otherwise long branch that leads to the radiation of rorquals. Apart from that, genomic data of this species might help to investigate cancer resistance in large whales, since these mechanisms are not as important for the pygmy right whale as in other giant rorquals and right whales. RESULTS Here, we present a first de novo genome of the species and test its potential in phylogenomics and cancer research. To do so, we constructed a multi-species coalescent tree from fragments of a whole-genome alignment and quantified the amount of introgression in the early evolution of rorquals. Furthermore, a genome-wide comparison of selection rates between large and small-bodied baleen whales revealed a small set of conserved candidate genes with potential connections to cancer resistance. CONCLUSIONS Our results suggest that the evolution of rorquals is best described as a hard polytomy with a rapid radiation and high levels of introgression. The lack of shared positive selected genes between different large-bodied whale species supports a previously proposed convergent evolution of gigantism and hence cancer resistance in baleen whales.
Collapse
Affiliation(s)
- Magnus Wolf
- Senckenberg Biodiversity and Climate Research Centre (BiK-F), Georg-Voigt-Strasse 14-16, Frankfurt Am Main, Germany
- Institute for Ecology, Evolution and Diversity, Goethe University, Max-Von-Laue-Strasse. 9, Frankfurt Am Main, Germany
| | - Konstantin Zapf
- Senckenberg Biodiversity and Climate Research Centre (BiK-F), Georg-Voigt-Strasse 14-16, Frankfurt Am Main, Germany
- Institute for Ecology, Evolution and Diversity, Goethe University, Max-Von-Laue-Strasse. 9, Frankfurt Am Main, Germany
| | - Deepak Kumar Gupta
- LOEWE-Centre for Translational Biodiversity Genomics (TBG), Senckenberg Nature Research Society, Georg-Voigt-Straße 14-16, Frankfurt Am Main, Germany
| | - Michael Hiller
- Senckenberg Biodiversity and Climate Research Centre (BiK-F), Georg-Voigt-Strasse 14-16, Frankfurt Am Main, Germany
- LOEWE-Centre for Translational Biodiversity Genomics (TBG), Senckenberg Nature Research Society, Georg-Voigt-Straße 14-16, Frankfurt Am Main, Germany
- Institute of Cell Biology and Neuroscience, Goethe University Frankfurt, Max-Von-Laue-Str. 9, Frankfurt Am Main, Germany
| | - Úlfur Árnason
- Department of Clinical Sciences Lund, Lund University, Lund, Sweden
- Department of Neurosurgery, Skane University Hospital in Lund, Lund, Sweden
| | - Axel Janke
- Senckenberg Biodiversity and Climate Research Centre (BiK-F), Georg-Voigt-Strasse 14-16, Frankfurt Am Main, Germany
- Institute for Ecology, Evolution and Diversity, Goethe University, Max-Von-Laue-Strasse. 9, Frankfurt Am Main, Germany
- LOEWE-Centre for Translational Biodiversity Genomics (TBG), Senckenberg Nature Research Society, Georg-Voigt-Straße 14-16, Frankfurt Am Main, Germany
| |
Collapse
|
8
|
Sandmann CL, Schulz JF, Ruiz-Orera J, Kirchner M, Ziehm M, Adami E, Marczenke M, Christ A, Liebe N, Greiner J, Schoenenberger A, Muecke MB, Liang N, Moritz RL, Sun Z, Deutsch EW, Gotthardt M, Mudge JM, Prensner JR, Willnow TE, Mertins P, van Heesch S, Hubner N. Evolutionary origins and interactomes of human, young microproteins and small peptides translated from short open reading frames. Mol Cell 2023; 83:994-1011.e18. [PMID: 36806354 PMCID: PMC10032668 DOI: 10.1016/j.molcel.2023.01.023] [Citation(s) in RCA: 15] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2022] [Revised: 12/12/2022] [Accepted: 01/25/2023] [Indexed: 02/19/2023]
Abstract
All species continuously evolve short open reading frames (sORFs) that can be templated for protein synthesis and may provide raw materials for evolutionary adaptation. We analyzed the evolutionary origins of 7,264 recently cataloged human sORFs and found that most were evolutionarily young and had emerged de novo. We additionally identified 221 previously missed sORFs potentially translated into peptides of up to 15 amino acids-all of which are smaller than the smallest human microprotein annotated to date. To investigate the bioactivity of sORF-encoded small peptides and young microproteins, we subjected 266 candidates to a mass-spectrometry-based interactome screen with motif resolution. Based on these interactomes and additional cellular assays, we can associate several candidates with mRNA splicing, translational regulation, and endocytosis. Our work provides insights into the evolutionary origins and interaction potential of young and small proteins, thereby helping to elucidate this underexplored territory of the human proteome.
Collapse
Affiliation(s)
- Clara-L Sandmann
- Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany; DZHK (German Centre for Cardiovascular Research), Partner Site Berlin, 13347 Berlin, Germany
| | - Jana F Schulz
- Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany; DZHK (German Centre for Cardiovascular Research), Partner Site Berlin, 13347 Berlin, Germany
| | - Jorge Ruiz-Orera
- Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany
| | - Marieluise Kirchner
- Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany; Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Core Facility Proteomics, 10117 Berlin, Germany
| | - Matthias Ziehm
- Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany; Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Core Facility Proteomics, 10117 Berlin, Germany
| | - Eleonora Adami
- Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany
| | - Maike Marczenke
- Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany
| | - Annabel Christ
- Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany
| | - Nina Liebe
- Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany
| | - Johannes Greiner
- Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany
| | - Aaron Schoenenberger
- Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany
| | - Michael B Muecke
- Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany; DZHK (German Centre for Cardiovascular Research), Partner Site Berlin, 13347 Berlin, Germany; Charité-Universitätsmedizin, 10117 Berlin, Germany
| | - Ning Liang
- Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany
| | | | - Zhi Sun
- Institute for Systems Biology, Seattle, WA 98109, USA
| | | | - Michael Gotthardt
- Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany; DZHK (German Centre for Cardiovascular Research), Partner Site Berlin, 13347 Berlin, Germany; Charité-Universitätsmedizin, 10117 Berlin, Germany
| | - Jonathan M Mudge
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - John R Prensner
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA; Department of Pediatric Oncology, Dana-Farber Cancer Institute, Boston, MA 02215, USA; Division of Pediatric Hematology/Oncology, Boston Children's Hospital, Boston, MA 02115, USA
| | - Thomas E Willnow
- Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany; Department of Biomedicine, Aarhus University, 8000 Aarhus, Denmark
| | - Philipp Mertins
- Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany; Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Core Facility Proteomics, 10117 Berlin, Germany
| | | | - Norbert Hubner
- Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany; DZHK (German Centre for Cardiovascular Research), Partner Site Berlin, 13347 Berlin, Germany; Charité-Universitätsmedizin, 10117 Berlin, Germany.
| |
Collapse
|
9
|
Fedorova AD, Kiniry SJ, Andreev DE, Mudge JM, Baranov PV. Thousands of human non-AUG extended proteoforms lack evidence of evolutionary selection among mammals. Nat Commun 2022; 13:7910. [PMID: 36564405 PMCID: PMC9789052 DOI: 10.1038/s41467-022-35595-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2022] [Accepted: 12/12/2022] [Indexed: 12/24/2022] Open
Abstract
The synthesis of most proteins begins at AUG codons, yet a small number of non-AUG initiated proteoforms are also known. Here we analyse a large number of publicly available Ribo-seq datasets to identify novel, previously uncharacterised non-AUG proteoforms using Trips-Viz implementation of a novel algorithm for detecting translated ORFs. In parallel we analyse genomic alignment of 120 mammals to identify evidence of protein coding evolution in sequences encoding potential extensions. Unexpectedly we find that the number of non-AUG proteoforms identified with ribosome profiling data greatly exceeds those with strong phylogenetic support suggesting their recent evolution. Our study argues that the protein coding potential of human genome greatly exceeds that detectable through comparative genomics and exposes the existence of multiple proteins encoded by the same genomic loci.
Collapse
Affiliation(s)
- Alla D Fedorova
- School of Biochemistry and Cell Biology, University College Cork, Cork, Ireland.
- SFI Centre for Research Training in Genomics Data Science, University College Cork, Cork, Ireland.
| | - Stephen J Kiniry
- School of Biochemistry and Cell Biology, University College Cork, Cork, Ireland
| | - Dmitry E Andreev
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, RAS, Moscow, Russia
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Moscow, Russia
| | - Jonathan M Mudge
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Pavel V Baranov
- School of Biochemistry and Cell Biology, University College Cork, Cork, Ireland.
| |
Collapse
|
10
|
Nino Barreat JG, Katzourakis A. Evolutionary Analysis of Placental Orthologues Reveals Two Ancient DNA Virus Integrations. J Virol 2022. [DOI: 10.1128/jvi.00933-22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The genomes of vertebrates preserve a large diversity of endogenous viral elements (remnants of ancient viruses that accumulate in host genomes over evolutionary time). Although retroviruses account for the vast majority of these elements, diverse DNA viruses have also been found and novel lineages are being described.
Collapse
|
11
|
Zhang Y, Zhang Q, Zhou J, Zou Q. A survey on the algorithm and development of multiple sequence alignment. Brief Bioinform 2022; 23:6546258. [PMID: 35272347 DOI: 10.1093/bib/bbac069] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2021] [Revised: 01/30/2022] [Accepted: 02/09/2022] [Indexed: 12/21/2022] Open
Abstract
Multiple sequence alignment (MSA) is an essential cornerstone in bioinformatics, which can reveal the potential information in biological sequences, such as function, evolution and structure. MSA is widely used in many bioinformatics scenarios, such as phylogenetic analysis, protein analysis and genomic analysis. However, MSA faces new challenges with the gradual increase in sequence scale and the increasing demand for alignment accuracy. Therefore, developing an efficient and accurate strategy for MSA has become one of the research hotspots in bioinformatics. In this work, we mainly summarize the algorithms for MSA and its applications in bioinformatics. To provide a structured and clear perspective, we systematically introduce MSA's knowledge, including background, database, metric and benchmark. Besides, we list the most common applications of MSA in the field of bioinformatics, including database searching, phylogenetic analysis, genomic analysis, metagenomic analysis and protein analysis. Furthermore, we categorize and analyze classical and state-of-the-art algorithms, divided into progressive alignment, iterative algorithm, heuristics, machine learning and divide-and-conquer. Moreover, we also discuss the challenges and opportunities of MSA in bioinformatics. Our work provides a comprehensive survey of MSA applications and their relevant algorithms. It could bring valuable insights for researchers to contribute their knowledge to MSA and relevant studies.
Collapse
Affiliation(s)
- Yongqing Zhang
- School of Computer Science, Chengdu University of Information Technology, 610225, Chengdu, China.,School of Computer Science and Engineering, University of Electronic Science and Technology of China, 611731, Chengdu, China
| | - Qiang Zhang
- School of Computer Science, Chengdu University of Information Technology, 610225, Chengdu, China
| | - Jiliu Zhou
- School of Computer Science, Chengdu University of Information Technology, 610225, Chengdu, China
| | - Quan Zou
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, 610054, Chengdu, China
| |
Collapse
|
12
|
Abstract
Across the human genome, there are nearly 500 'ultraconserved' elements: regions of at least 200 contiguous nucleotides that are perfectly conserved in both the mouse and rat genomes. Remarkably, the majority of these sequences are non-coding, and many can function as enhancers that activate tissue-specific gene expression during embryonic development. From their first description more than 15 years ago, their extreme conservation has both fascinated and perplexed researchers in genomics and evolutionary biology. The intrigue around ultraconserved elements only grew with the observation that they are dispensable for viability. Here, we review recent progress towards understanding the general importance and the specific functions of ultraconserved sequences in mammalian development and human disease and discuss possible explanations for their extreme conservation.
Collapse
Affiliation(s)
- Valentina Snetkova
- Environmental Genomics & Systems Biology Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
| | - Len A. Pennacchio
- Environmental Genomics & Systems Biology Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA,Comparative Biochemistry Program, University of California, Berkeley, CA 94720, USA,U.S. Department of Energy Joint Genome Institute, 1 Cyclotron Road, Berkeley, CA 94720, USA,To whom correspondence should be addressed: L.A.P., ; A.V., ; D.E.D., (lead contact)
| | - Axel Visel
- Environmental Genomics & Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA. .,US Department of Energy Joint Genome Institute, Berkeley, CA, USA. .,School of Natural Sciences, University of California, Merced, Merced, CA, USA.
| | - Diane E. Dickel
- Environmental Genomics & Systems Biology Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA,To whom correspondence should be addressed: L.A.P., ; A.V., ; D.E.D., (lead contact)
| |
Collapse
|
13
|
Cunial F, Denas O, Belazzougui D. Fast and compact matching statistics analytics. Bioinformatics 2022; 38:1838-1845. [PMID: 35134833 PMCID: PMC9665870 DOI: 10.1093/bioinformatics/btac064] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2021] [Revised: 01/08/2022] [Accepted: 01/31/2022] [Indexed: 02/03/2023] Open
Abstract
MOTIVATION Fast, lightweight methods for comparing the sequence of ever larger assembled genomes from ever growing databases are increasingly needed in the era of accurate long reads and pan-genome initiatives. Matching statistics is a popular method for computing whole-genome phylogenies and for detecting structural rearrangements between two genomes, since it is amenable to fast implementations that require a minimal setup of data structures. However, current implementations use a single core, take too much memory to represent the result, and do not provide efficient ways to analyze the output in order to explore local similarities between the sequences. RESULTS We develop practical tools for computing matching statistics between large-scale strings, and for analyzing its values, faster and using less memory than the state-of-the-art. Specifically, we design a parallel algorithm for shared-memory machines that computes matching statistics 30 times faster with 48 cores in the cases that are most difficult to parallelize. We design a lossy compression scheme that shrinks the matching statistics array to a bitvector that takes from 0.8 to 0.2 bits per character, depending on the dataset and on the value of a threshold, and that achieves 0.04 bits per character in some variants. And we provide efficient implementations of range-maximum and range-sum queries that take a few tens of milliseconds while operating on our compact representations, and that allow computing key local statistics about the similarity between two strings. Our toolkit makes construction, storage and analysis of matching statistics arrays practical for multiple pairs of the largest genomes available today, possibly enabling new applications in comparative genomics. AVAILABILITY AND IMPLEMENTATION Our C/C++ code is available at https://github.com/odenas/indexed_ms under GPL-3.0. The data underlying this article are available in NCBI Genome at https://www.ncbi.nlm.nih.gov/genome and in the International Genome Sample Resource (IGSR) at https://www.internationalgenome.org. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Fabio Cunial
- Max Planck Institute for Molecular Cell Biology and Genetics (MPI-CBG and CSBD), Dresden 01307, Germany,To whom correspondence should be addressed.
| | | | - Djamal Belazzougui
- CAPA, DTISI, Centre de Recherche sur l’Information Scientifique et Techique, Algiers, Algeria
| |
Collapse
|
14
|
Chai S, Tian R, Xu S, Ren W, Yang G. Evolution of Fertilization-Related Genes Provides Insights Into Reproductive Health in Natural Ascrotal Mammals. Front Ecol Evol 2022. [DOI: 10.3389/fevo.2021.828325] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023] Open
Abstract
Cryptorchidism is the failure of one or both testes to descend into the bottom of the scrotum. This common congenital malformation in humans and domestic animals is the best characterized risk factor for abnormal sperm functions and infertility. However, current treatment approaches for cryptorchidism do not ensure paternity in all cases. Some lineages of mammals (such as elephants and cetaceans) have natural ascrotal testes (i.e., undescended or incompletely descended testes) and normal sperm motility and fertility, providing an opportunity to understand the genetic basis of cryptorchidism. In this study, we showed that genes associated with sperm motility and competition/fertility in ascrotal mammals experienced frequent, strong selective pressure. The fixation of specific amino acids and positive selection in ascrotal mammals could affect the physicochemical properties and functions of fertilization-related proteins. In a comparison between mammals with undescended testes and incompletely descended testes, discrepancies in genes showing evidence for adaptive evolution and in functional enrichment suggested that multiple molecular mechanisms contribute to the maintenance of fertility in the challenging testicular environment. Our findings revealed substantial heterogeneity in the divergence of fertilization-related genes between natural scrotal and ascrotal mammals and provide insight into molecular mechanisms underlying normal sperm motility and competition in natural ascrotal mammals. We provide a detailed theoretical basis for understanding the pathology of cryptorchidism from a molecular evolutionary perspective. This study may contribute to the establishment of diagnostic and therapeutic targets for sperm motility and fertility disorders due to congenital cryptorchidism in humans and domestic animals.
Collapse
|
15
|
Buggiotti L, Yurchenko AA, Yudin NS, Vander Jagt CJ, Vorobieva NV, Kusliy MA, Vasiliev SK, Rodionov AN, Boronetskaya OI, Zinovieva NA, Graphodatsky AS, Daetwyler HD, Larkin DM. Demographic History, Adaptation, and NRAP Convergent Evolution at Amino Acid Residue 100 in the World Northernmost Cattle from Siberia. Mol Biol Evol 2021; 38:3093-3110. [PMID: 33784744 PMCID: PMC8321547 DOI: 10.1093/molbev/msab078] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
Native cattle breeds represent an important cultural heritage. They are a reservoir of genetic variation useful for properly responding to agriculture needs in the light of ongoing climate changes. Evolutionary processes that occur in response to extreme environmental conditions could also be better understood using adapted local populations. Herein, different evolutionary histories of the world northernmost native cattle breeds from Russia were investigated. They highlighted Kholmogory as a typical taurine cattle, whereas Yakut cattle separated from European taurines approximately 5,000 years ago and contain numerous ancestral and some novel genetic variants allowing their adaptation to harsh conditions of living above the Polar Circle. Scans for selection signatures pointed to several common gene pathways related to adaptation to harsh climates in both breeds. But genes affected by selection from these pathways were mostly different. A Yakut cattle breed-specific missense mutation in a highly conserved NRAP gene represents a unique example of a young amino acid residue convergent change shared with at least 16 species of hibernating/cold-adapted mammals from six distinct phylogenetic orders. This suggests a convergent evolution event along the mammalian phylogenetic tree and fast fixation in a single isolated cattle population exposed to a harsh climate.
Collapse
Affiliation(s)
- Laura Buggiotti
- Royal Veterinary College, University of London, London, United Kingdom
| | - Andrey A Yurchenko
- The Federal Research Center Institute of Cytology and Genetics, Siberian Branch of the Russian Academy of Sciences (ICG SB RAS), Novosibirsk, Russia
- Kurchatov Genomics Center, Institute of Cytology and Genetics, Siberian Branch of the Russian Academy of Science, Novosibirsk, Russia
| | - Nikolay S Yudin
- The Federal Research Center Institute of Cytology and Genetics, Siberian Branch of the Russian Academy of Sciences (ICG SB RAS), Novosibirsk, Russia
- Kurchatov Genomics Center, Institute of Cytology and Genetics, Siberian Branch of the Russian Academy of Science, Novosibirsk, Russia
| | | | - Nadezhda V Vorobieva
- Department of the Diversity and Evolution of Genomes, Institute of Molecular and Cellular Biology SB RAS, Novosibirsk, Russia
| | - Mariya A Kusliy
- Department of the Diversity and Evolution of Genomes, Institute of Molecular and Cellular Biology SB RAS, Novosibirsk, Russia
| | - Sergei K Vasiliev
- Paleometal Archeology Department, Institute of Archaeology and Ethnography SB RAS, Novosibirsk, Russia
| | - Andrey N Rodionov
- L.K. Ernst Federal Research Centre for Animal Husbandry, Podolsk, Russia
| | - Oksana I Boronetskaya
- Moscow Agrarian Academy, Timiryazev Russian State Agrarian University, Moscow, Russia
| | | | - Alexander S Graphodatsky
- Department of the Diversity and Evolution of Genomes, Institute of Molecular and Cellular Biology SB RAS, Novosibirsk, Russia
| | - Hans D Daetwyler
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, Australia
- School of Applied Systems Biology, La Trobe University, Bundoora, VIC, Australia
| | - Denis M Larkin
- Royal Veterinary College, University of London, London, United Kingdom
- The Federal Research Center Institute of Cytology and Genetics, Siberian Branch of the Russian Academy of Sciences (ICG SB RAS), Novosibirsk, Russia
- Kurchatov Genomics Center, Institute of Cytology and Genetics, Siberian Branch of the Russian Academy of Science, Novosibirsk, Russia
| |
Collapse
|
16
|
Snetkova V, Ypsilanti AR, Akiyama JA, Mannion BJ, Plajzer-Frick I, Novak CS, Harrington AN, Pham QT, Kato M, Zhu Y, Godoy J, Meky E, Hunter RD, Shi M, Kvon EZ, Afzal V, Tran S, Rubenstein JLR, Visel A, Pennacchio LA, Dickel DE. Ultraconserved enhancer function does not require perfect sequence conservation. Nat Genet 2021; 53:521-528. [PMID: 33782603 PMCID: PMC8038972 DOI: 10.1038/s41588-021-00812-3] [Citation(s) in RCA: 29] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2020] [Accepted: 02/04/2021] [Indexed: 01/09/2023]
Abstract
Ultraconserved enhancer sequences show perfect conservation between human and rodent genomes, suggesting that their functions are highly sensitive to mutation. However, current models of enhancer function do not sufficiently explain this extreme evolutionary constraint. We subjected 23 ultraconserved enhancers to different levels of mutagenesis, collectively introducing 1,547 mutations, and examined their activities in transgenic mouse reporter assays. Overall, we find that the regulatory properties of ultraconserved enhancers are robust to mutation. Upon mutagenesis, nearly all (19/23, 83%) still functioned as enhancers at one developmental stage, as did most of those tested again later in development (5/9, 56%). Replacement of endogenous enhancers with mutated alleles in mice corroborated results of transgenic assays, including the functional resilience of ultraconserved enhancers to mutation. Our findings show that the currently known activities of ultraconserved enhancers do not necessarily require the perfect conservation observed in evolution and suggest that additional regulatory or other functions contribute to their sequence constraint.
Collapse
Affiliation(s)
- Valentina Snetkova
- Environmental Genomics & System Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Athena R Ypsilanti
- Department of Psychiatry, Neuroscience Program, UCSF Weill Institute for Neurosciences, and the Nina Ireland Laboratory of Developmental Neurobiology, University of California, San Francisco, San Francisco, CA, USA
| | - Jennifer A Akiyama
- Environmental Genomics & System Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Brandon J Mannion
- Environmental Genomics & System Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
- Comparative Biochemistry Program, University of California, Berkeley, Berkeley, CA, USA
| | - Ingrid Plajzer-Frick
- Environmental Genomics & System Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Catherine S Novak
- Environmental Genomics & System Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Anne N Harrington
- Environmental Genomics & System Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Quan T Pham
- Environmental Genomics & System Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Momoe Kato
- Environmental Genomics & System Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Yiwen Zhu
- Environmental Genomics & System Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Janeth Godoy
- Environmental Genomics & System Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Eman Meky
- Environmental Genomics & System Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Riana D Hunter
- Environmental Genomics & System Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Marie Shi
- Environmental Genomics & System Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Evgeny Z Kvon
- Environmental Genomics & System Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
- Department of Developmental & Cell Biology, Department of Ecology & Evolutionary Biology, University of California, Irvine, Irvine, CA, USA
| | - Veena Afzal
- Environmental Genomics & System Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Stella Tran
- Environmental Genomics & System Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - John L R Rubenstein
- Department of Psychiatry, Neuroscience Program, UCSF Weill Institute for Neurosciences, and the Nina Ireland Laboratory of Developmental Neurobiology, University of California, San Francisco, San Francisco, CA, USA
| | - Axel Visel
- Environmental Genomics & System Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA.
- US Department of Energy Joint Genome Institute, Berkeley, CA, USA.
- School of Natural Sciences, University of California, Merced, Merced, CA, USA.
| | - Len A Pennacchio
- Environmental Genomics & System Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA.
- Comparative Biochemistry Program, University of California, Berkeley, Berkeley, CA, USA.
- US Department of Energy Joint Genome Institute, Berkeley, CA, USA.
| | - Diane E Dickel
- Environmental Genomics & System Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA.
| |
Collapse
|
17
|
Sharma V, Hecker N, Walther F, Stuckas H, Hiller M. Convergent Losses of TLR5 Suggest Altered Extracellular Flagellin Detection in Four Mammalian Lineages. Mol Biol Evol 2021; 37:1847-1854. [PMID: 32145026 DOI: 10.1093/molbev/msaa058] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Toll-like receptors (TLRs) play an important role for the innate immune system by detecting pathogen-associated molecular patterns. TLR5 encodes the major extracellular receptor for bacterial flagellin and frequently evolves under positive selection, consistent with coevolutionary arms races between the host and pathogens. Furthermore, TLR5 is inactivated in several vertebrates and a TLR5 stop codon polymorphism is widespread in human populations. Here, we analyzed the genomes of 120 mammals and discovered that TLR5 is convergently lost in four independent lineages, comprising guinea pigs, Yangtze river dolphin, pinnipeds, and pangolins. Validated inactivating mutations, absence of protein-coding transcript expression, and relaxed selection on the TLR5 remnants confirm these losses. PCR analysis further confirmed the loss of TLR5 in the pinniped stem lineage. Finally, we show that TLR11, encoding a second extracellular flagellin receptor, is also absent in these four lineages. Independent losses of TLR5 and TLR11 suggest that a major pathway for detecting flagellated bacteria is not essential for different mammals and predicts an impaired capacity to sense extracellular flagellin.
Collapse
Affiliation(s)
- Virag Sharma
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany.,Max Planck Institute for the Physics of Complex Systems, Dresden, Germany.,Center for Systems Biology Dresden, Dresden, Germany.,CRTD-DFG Center for Regenerative Therapies Dresden, Carl Gustav Carus Faculty of Medicine, Technische Universität Dresden, Dresden; Paul Langerhans Institute Dresden (PLID) of the Helmholtz Center Munich at University Hospital Carl Gustav Carus and Faculty of Medicine, Technische Universität Dresden, Dresden; German Center for Diabetes Research (DZD), Munich, Neuherberg, Germany
| | - Nikolai Hecker
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany.,Max Planck Institute for the Physics of Complex Systems, Dresden, Germany.,Center for Systems Biology Dresden, Dresden, Germany
| | - Felix Walther
- Senckenberg Natural History Collections Dresden, Senckenberg - Leibniz Institution for Biodiversity and Earth System Research, Dresden, Germany
| | - Heiko Stuckas
- Senckenberg Natural History Collections Dresden, Senckenberg - Leibniz Institution for Biodiversity and Earth System Research, Dresden, Germany
| | - Michael Hiller
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany.,Max Planck Institute for the Physics of Complex Systems, Dresden, Germany.,Center for Systems Biology Dresden, Dresden, Germany
| |
Collapse
|
18
|
Jebb D, Huang Z, Pippel M, Hughes GM, Lavrichenko K, Devanna P, Winkler S, Jermiin LS, Skirmuntt EC, Katzourakis A, Burkitt-Gray L, Ray DA, Sullivan KAM, Roscito JG, Kirilenko BM, Dávalos LM, Corthals AP, Power ML, Jones G, Ransome RD, Dechmann DKN, Locatelli AG, Puechmaille SJ, Fedrigo O, Jarvis ED, Hiller M, Vernes SC, Myers EW, Teeling EC. Six reference-quality genomes reveal evolution of bat adaptations. Nature 2020; 583:578-584. [PMID: 32699395 PMCID: PMC8075899 DOI: 10.1038/s41586-020-2486-3] [Citation(s) in RCA: 150] [Impact Index Per Article: 37.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2019] [Accepted: 06/09/2020] [Indexed: 11/08/2022]
Abstract
Bats possess extraordinary adaptations, including flight, echolocation, extreme longevity and unique immunity. High-quality genomes are crucial for understanding the molecular basis and evolution of these traits. Here we incorporated long-read sequencing and state-of-the-art scaffolding protocols1 to generate, to our knowledge, the first reference-quality genomes of six bat species (Rhinolophus ferrumequinum, Rousettus aegyptiacus, Phyllostomus discolor, Myotis myotis, Pipistrellus kuhlii and Molossus molossus). We integrated gene projections from our 'Tool to infer Orthologs from Genome Alignments' (TOGA) software with de novo and homology gene predictions as well as short- and long-read transcriptomics to generate highly complete gene annotations. To resolve the phylogenetic position of bats within Laurasiatheria, we applied several phylogenetic methods to comprehensive sets of orthologous protein-coding and noncoding regions of the genome, and identified a basal origin for bats within Scrotifera. Our genome-wide screens revealed positive selection on hearing-related genes in the ancestral branch of bats, which is indicative of laryngeal echolocation being an ancestral trait in this clade. We found selection and loss of immunity-related genes (including pro-inflammatory NF-κB regulators) and expansions of anti-viral APOBEC3 genes, which highlights molecular mechanisms that may contribute to the exceptional immunity of bats. Genomic integrations of diverse viruses provide a genomic record of historical tolerance to viral infection in bats. Finally, we found and experimentally validated bat-specific variation in microRNAs, which may regulate bat-specific gene-expression programs. Our reference-quality bat genomes provide the resources required to uncover and validate the genomic basis of adaptations of bats, and stimulate new avenues of research that are directly relevant to human health and disease1.
Collapse
Affiliation(s)
- David Jebb
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany
- Max Planck Institute for the Physics of Complex Systems, Dresden, Germany
- Center for Systems Biology Dresden, Dresden, Germany
| | - Zixia Huang
- School of Biology and Environmental Science, University College Dublin, Dublin, Ireland
| | - Martin Pippel
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany
- Center for Systems Biology Dresden, Dresden, Germany
| | - Graham M Hughes
- School of Biology and Environmental Science, University College Dublin, Dublin, Ireland
| | - Ksenia Lavrichenko
- Neurogenetics of Vocal Communication Group, Max Planck Institute for Psycholinguistics, Nijmegen, The Netherlands
| | - Paolo Devanna
- Neurogenetics of Vocal Communication Group, Max Planck Institute for Psycholinguistics, Nijmegen, The Netherlands
| | - Sylke Winkler
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany
| | - Lars S Jermiin
- School of Biology and Environmental Science, University College Dublin, Dublin, Ireland
- Research School of Biology, Australian National University, Canberra, Australian Capital Territory, Australia
- Earth Institute, University College Dublin, Dublin, Ireland
| | - Emilia C Skirmuntt
- Peter Medawar Building for Pathogen Research, Department of Zoology, University of Oxford, Oxford, UK
| | - Aris Katzourakis
- Peter Medawar Building for Pathogen Research, Department of Zoology, University of Oxford, Oxford, UK
| | - Lucy Burkitt-Gray
- Conway Institute of Biomolecular and Biomedical Science, University College Dublin, Dublin, Ireland
| | - David A Ray
- Department of Biological Sciences, Texas Tech University, Lubbock, TX, USA
| | - Kevin A M Sullivan
- Department of Biological Sciences, Texas Tech University, Lubbock, TX, USA
| | - Juliana G Roscito
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany
- Max Planck Institute for the Physics of Complex Systems, Dresden, Germany
- Center for Systems Biology Dresden, Dresden, Germany
| | - Bogdan M Kirilenko
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany
- Max Planck Institute for the Physics of Complex Systems, Dresden, Germany
- Center for Systems Biology Dresden, Dresden, Germany
| | - Liliana M Dávalos
- Department of Ecology and Evolution, Stony Brook University, Stony Brook, NY, USA
- Consortium for Inter-Disciplinary Environmental Research, Stony Brook University, Stony Brook, NY, USA
| | | | - Megan L Power
- School of Biology and Environmental Science, University College Dublin, Dublin, Ireland
| | - Gareth Jones
- School of Biological Sciences, University of Bristol, Bristol, UK
| | - Roger D Ransome
- School of Biological Sciences, University of Bristol, Bristol, UK
| | - Dina K N Dechmann
- Department of Migration, Max Planck Institute of Animal Behavior, Radolfzell, Germany
- Department of Biology, University of Konstanz, Konstanz, Germany
- Smithsonian Tropical Research Institute, Panama City, Panama
| | - Andrea G Locatelli
- School of Biology and Environmental Science, University College Dublin, Dublin, Ireland
| | - Sébastien J Puechmaille
- ISEM, University of Montpellier, Montpellier, France
- Zoological Institute and Museum, University of Greifswald, Greifswald, Germany
| | - Olivier Fedrigo
- Vertebrate Genomes Laboratory, The Rockefeller University, New York, NY, USA
| | - Erich D Jarvis
- Vertebrate Genomes Laboratory, The Rockefeller University, New York, NY, USA
- Laboratory of Neurogenetics of Language, The Rockefeller University, New York, NY, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, USA
| | - Michael Hiller
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany.
- Max Planck Institute for the Physics of Complex Systems, Dresden, Germany.
- Center for Systems Biology Dresden, Dresden, Germany.
| | - Sonja C Vernes
- Neurogenetics of Vocal Communication Group, Max Planck Institute for Psycholinguistics, Nijmegen, The Netherlands.
- Donders Institute for Brain, Cognition and Behaviour, Nijmegen, The Netherlands.
| | - Eugene W Myers
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany.
- Center for Systems Biology Dresden, Dresden, Germany.
- Faculty of Computer Science, Technical University Dresden, Dresden, Germany.
| | - Emma C Teeling
- School of Biology and Environmental Science, University College Dublin, Dublin, Ireland.
| |
Collapse
|