1
|
Jayakrishnan M, Havlová M, Veverka V, Regnard C, Becker PB. Genomic context-dependent histone H3K36 methylation by three Drosophila methyltransferases and implications for dedicated chromatin readers. Nucleic Acids Res 2024:gkae449. [PMID: 38813825 DOI: 10.1093/nar/gkae449] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2024] [Revised: 05/03/2024] [Accepted: 05/28/2024] [Indexed: 05/31/2024] Open
Abstract
Methylation of histone H3 at lysine 36 (H3K36me3) marks active chromatin. The mark is interpreted by epigenetic readers that assist transcription and safeguard the integrity of the chromatin fiber. The chromodomain protein MSL3 binds H3K36me3 to target X-chromosomal genes in male Drosophila for dosage compensation. The PWWP-domain protein JASPer recruits the JIL1 kinase to active chromatin on all chromosomes. Unexpectedly, depletion of K36me3 had variable, locus-specific effects on the interactions of those readers. This observation motivated a systematic and comprehensive study of K36 methylation in a defined cellular model. Contrasting prevailing models, we found that K36me1, K36me2 and K36me3 each contribute to distinct chromatin states. A gene-centric view of the changing K36 methylation landscape upon depletion of the three methyltransferases Set2, NSD and Ash1 revealed local, context-specific methylation signatures. Set2 catalyzes K36me3 predominantly at transcriptionally active euchromatin. NSD places K36me2/3 at defined loci within pericentric heterochromatin and on weakly transcribed euchromatic genes. Ash1 deposits K36me1 at regions with enhancer signatures. The genome-wide mapping of MSL3 and JASPer suggested that they bind K36me2 in addition to K36me3, which was confirmed by direct affinity measurement. This dual specificity attracts the readers to a broader range of chromosomal locations and increases the robustness of their actions.
Collapse
Affiliation(s)
- Muhunden Jayakrishnan
- Biomedical Center, Molecular Biology Division, Ludwig-Maximilians-Universität, Munich, Germany
| | - Magdalena Havlová
- Institute of Organic Chemistry and Biochemistry (IOCB) of the Czech Academy of Sciences, Prague, Czech Republic
| | - Václav Veverka
- Institute of Organic Chemistry and Biochemistry (IOCB) of the Czech Academy of Sciences, Prague, Czech Republic
- Department of Cell Biology, Faculty of Science, Charles University, Prague, Czech Republic
| | - Catherine Regnard
- Biomedical Center, Molecular Biology Division, Ludwig-Maximilians-Universität, Munich, Germany
| | - Peter B Becker
- Biomedical Center, Molecular Biology Division, Ludwig-Maximilians-Universität, Munich, Germany
| |
Collapse
|
2
|
Makova KD, Pickett BD, Harris RS, Hartley GA, Cechova M, Pal K, Nurk S, Yoo D, Li Q, Hebbar P, McGrath BC, Antonacci F, Aubel M, Biddanda A, Borchers M, Bornberg-Bauer E, Bouffard GG, Brooks SY, Carbone L, Carrel L, Carroll A, Chang PC, Chin CS, Cook DE, Craig SJC, de Gennaro L, Diekhans M, Dutra A, Garcia GH, Grady PGS, Green RE, Haddad D, Hallast P, Harvey WT, Hickey G, Hillis DA, Hoyt SJ, Jeong H, Kamali K, Pond SLK, LaPolice TM, Lee C, Lewis AP, Loh YHE, Masterson P, McGarvey KM, McCoy RC, Medvedev P, Miga KH, Munson KM, Pak E, Paten B, Pinto BJ, Potapova T, Rhie A, Rocha JL, Ryabov F, Ryder OA, Sacco S, Shafin K, Shepelev VA, Slon V, Solar SJ, Storer JM, Sudmant PH, Sweetalana, Sweeten A, Tassia MG, Thibaud-Nissen F, Ventura M, Wilson MA, Young AC, Zeng H, Zhang X, Szpiech ZA, Huber CD, Gerton JL, Yi SV, Schatz MC, Alexandrov IA, Koren S, O'Neill RJ, Eichler EE, Phillippy AM. The complete sequence and comparative analysis of ape sex chromosomes. Nature 2024:10.1038/s41586-024-07473-2. [PMID: 38811727 DOI: 10.1038/s41586-024-07473-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2023] [Accepted: 04/26/2024] [Indexed: 05/31/2024]
Abstract
Apes possess two sex chromosomes-the male-specific Y chromosome and the X chromosome, which is present in both males and females. The Y chromosome is crucial for male reproduction, with deletions being linked to infertility1. The X chromosome is vital for reproduction and cognition2. Variation in mating patterns and brain function among apes suggests corresponding differences in their sex chromosomes. However, owing to their repetitive nature and incomplete reference assemblies, ape sex chromosomes have been challenging to study. Here, using the methodology developed for the telomere-to-telomere (T2T) human genome, we produced gapless assemblies of the X and Y chromosomes for five great apes (bonobo (Pan paniscus), chimpanzee (Pan troglodytes), western lowland gorilla (Gorilla gorilla gorilla), Bornean orangutan (Pongo pygmaeus) and Sumatran orangutan (Pongo abelii)) and a lesser ape (the siamang gibbon (Symphalangus syndactylus)), and untangled the intricacies of their evolution. Compared with the X chromosomes, the ape Y chromosomes vary greatly in size and have low alignability and high levels of structural rearrangements-owing to the accumulation of lineage-specific ampliconic regions, palindromes, transposable elements and satellites. Many Y chromosome genes expand in multi-copy families and some evolve under purifying selection. Thus, the Y chromosome exhibits dynamic evolution, whereas the X chromosome is more stable. Mapping short-read sequencing data to these assemblies revealed diversity and selection patterns on sex chromosomes of more than 100 individual great apes. These reference assemblies are expected to inform human evolution and conservation genetics of non-human apes, all of which are endangered species.
Collapse
Affiliation(s)
| | - Brandon D Pickett
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | | | | | - Monika Cechova
- University of California Santa Cruz, Santa Cruz, CA, USA
| | - Karol Pal
- Penn State University, University Park, PA, USA
| | - Sergey Nurk
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - DongAhn Yoo
- University of Washington School of Medicine, Seattle, WA, USA
| | - Qiuhui Li
- Johns Hopkins University, Baltimore, MD, USA
| | - Prajna Hebbar
- University of California Santa Cruz, Santa Cruz, CA, USA
| | | | | | | | | | | | - Erich Bornberg-Bauer
- University of Münster, Münster, Germany
- MPI for Developmental Biology, Tübingen, Germany
| | - Gerard G Bouffard
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Shelise Y Brooks
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Lucia Carbone
- Oregon Health and Science University, Portland, OR, USA
- Oregon National Primate Research Center, Hillsboro, OR, USA
| | - Laura Carrel
- Penn State University School of Medicine, Hershey, PA, USA
| | | | | | - Chen-Shan Chin
- Foundation of Biological Data Sciences, Belmont, CA, USA
| | | | | | | | - Mark Diekhans
- University of California Santa Cruz, Santa Cruz, CA, USA
| | - Amalia Dutra
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Gage H Garcia
- University of Washington School of Medicine, Seattle, WA, USA
| | | | | | - Diana Haddad
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Pille Hallast
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | | | - Glenn Hickey
- University of California Santa Cruz, Santa Cruz, CA, USA
| | - David A Hillis
- University of California Santa Barbara, Santa Barbara, CA, USA
| | | | - Hyeonsoo Jeong
- University of Washington School of Medicine, Seattle, WA, USA
| | | | | | | | - Charles Lee
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | | | - Yong-Hwee E Loh
- University of California Santa Barbara, Santa Barbara, CA, USA
| | - Patrick Masterson
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Kelly M McGarvey
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | | | | | - Karen H Miga
- University of California Santa Cruz, Santa Cruz, CA, USA
| | | | - Evgenia Pak
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Benedict Paten
- University of California Santa Cruz, Santa Cruz, CA, USA
| | | | | | - Arang Rhie
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Joana L Rocha
- University of California Berkeley, Berkeley, CA, USA
| | - Fedor Ryabov
- Masters Program in National Research, University Higher School of Economics, Moscow, Russia
| | | | - Samuel Sacco
- University of California Santa Cruz, Santa Cruz, CA, USA
| | | | | | | | - Steven J Solar
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | | | | | - Sweetalana
- Penn State University, University Park, PA, USA
| | - Alex Sweeten
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
- Johns Hopkins University, Baltimore, MD, USA
| | | | - Françoise Thibaud-Nissen
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Mario Ventura
- Università degli Studi di Bari Aldo Moro, Bari, Italy
| | | | - Alice C Young
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | | | - Xinru Zhang
- Penn State University, University Park, PA, USA
| | | | | | | | - Soojin V Yi
- University of California Santa Barbara, Santa Barbara, CA, USA
| | | | | | - Sergey Koren
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | | | - Evan E Eichler
- University of Washington School of Medicine, Seattle, WA, USA.
- Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA.
| | - Adam M Phillippy
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA.
| |
Collapse
|
3
|
Chen W, Wang X, Sun J, Wang X, Zhu Z, Ayhan DH, Yi S, Yan M, Zhang L, Meng T, Mu Y, Li J, Meng D, Bian J, Wang K, Wang L, Chen S, Chen R, Jin J, Li B, Zhang X, Deng XW, He H, Guo L. Two telomere-to-telomere gapless genomes reveal insights into Capsicum evolution and capsaicinoid biosynthesis. Nat Commun 2024; 15:4295. [PMID: 38769327 PMCID: PMC11106260 DOI: 10.1038/s41467-024-48643-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2023] [Accepted: 05/08/2024] [Indexed: 05/22/2024] Open
Abstract
Chili pepper (Capsicum) is known for its unique fruit pungency due to the presence of capsaicinoids. The evolutionary history of capsaicinoid biosynthesis and the mechanism of their tissue specificity remain obscure due to the lack of high-quality Capsicum genomes. Here, we report two telomere-to-telomere (T2T) gap-free genomes of C. annuum and its wild nonpungent relative C. rhomboideum to investigate the evolution of fruit pungency in chili peppers. We precisely delineate Capsicum centromeres, which lack high-copy tandem repeats but are extensively invaded by CRM retrotransposons. Through phylogenomic analyses, we estimate the evolutionary timing of capsaicinoid biosynthesis. We reveal disrupted coding and regulatory regions of key biosynthesis genes in nonpungent species. We also find conserved placenta-specific accessible chromatin regions, which likely allow for tissue-specific biosynthetic gene coregulation and capsaicinoid accumulation. These T2T genomic resources will accelerate chili pepper genetic improvement and help to understand Capsicum genome evolution.
Collapse
Affiliation(s)
- Weikai Chen
- Peking University Institute of Advanced Agricultural Sciences, Shandong Laboratory of Advanced Agricultural Sciences in Weifang, Weifang, 261325, China
| | - Xiangfeng Wang
- Peking University Institute of Advanced Agricultural Sciences, Shandong Laboratory of Advanced Agricultural Sciences in Weifang, Weifang, 261325, China
| | - Jie Sun
- Peking University Institute of Advanced Agricultural Sciences, Shandong Laboratory of Advanced Agricultural Sciences in Weifang, Weifang, 261325, China
| | - Xinrui Wang
- Peking University Institute of Advanced Agricultural Sciences, Shandong Laboratory of Advanced Agricultural Sciences in Weifang, Weifang, 261325, China
| | - Zhangsheng Zhu
- Peking University Institute of Advanced Agricultural Sciences, Shandong Laboratory of Advanced Agricultural Sciences in Weifang, Weifang, 261325, China
- College of Horticulture, South China Agricultural University, Guangzhou, 510642, China
| | - Dilay Hazal Ayhan
- Peking University Institute of Advanced Agricultural Sciences, Shandong Laboratory of Advanced Agricultural Sciences in Weifang, Weifang, 261325, China
| | - Shu Yi
- Peking University Institute of Advanced Agricultural Sciences, Shandong Laboratory of Advanced Agricultural Sciences in Weifang, Weifang, 261325, China
| | - Ming Yan
- Peking University Institute of Advanced Agricultural Sciences, Shandong Laboratory of Advanced Agricultural Sciences in Weifang, Weifang, 261325, China
| | - Lili Zhang
- Peking University Institute of Advanced Agricultural Sciences, Shandong Laboratory of Advanced Agricultural Sciences in Weifang, Weifang, 261325, China
- College of Modern Agriculture and Environment, Weifang Institute of Technology, Weifang, 262500, China
| | - Tan Meng
- Peking University Institute of Advanced Agricultural Sciences, Shandong Laboratory of Advanced Agricultural Sciences in Weifang, Weifang, 261325, China
| | - Yu Mu
- Peking University Institute of Advanced Agricultural Sciences, Shandong Laboratory of Advanced Agricultural Sciences in Weifang, Weifang, 261325, China
| | - Jun Li
- Peking University Institute of Advanced Agricultural Sciences, Shandong Laboratory of Advanced Agricultural Sciences in Weifang, Weifang, 261325, China
| | - Dian Meng
- Peking University Institute of Advanced Agricultural Sciences, Shandong Laboratory of Advanced Agricultural Sciences in Weifang, Weifang, 261325, China
| | - Jianxin Bian
- Peking University Institute of Advanced Agricultural Sciences, Shandong Laboratory of Advanced Agricultural Sciences in Weifang, Weifang, 261325, China
| | - Ke Wang
- Peking University Institute of Advanced Agricultural Sciences, Shandong Laboratory of Advanced Agricultural Sciences in Weifang, Weifang, 261325, China
- College of Life Sciences, Shandong Agricultural University, Tai'an, 271018, China
| | - Lu Wang
- Peking University Institute of Advanced Agricultural Sciences, Shandong Laboratory of Advanced Agricultural Sciences in Weifang, Weifang, 261325, China
| | - Shaoying Chen
- Peking University Institute of Advanced Agricultural Sciences, Shandong Laboratory of Advanced Agricultural Sciences in Weifang, Weifang, 261325, China
| | - Ruidong Chen
- Peking University Institute of Advanced Agricultural Sciences, Shandong Laboratory of Advanced Agricultural Sciences in Weifang, Weifang, 261325, China
| | - Jingyun Jin
- Peking University Institute of Advanced Agricultural Sciences, Shandong Laboratory of Advanced Agricultural Sciences in Weifang, Weifang, 261325, China
| | - Bosheng Li
- Peking University Institute of Advanced Agricultural Sciences, Shandong Laboratory of Advanced Agricultural Sciences in Weifang, Weifang, 261325, China
| | - Xingping Zhang
- Peking University Institute of Advanced Agricultural Sciences, Shandong Laboratory of Advanced Agricultural Sciences in Weifang, Weifang, 261325, China
| | - Xing Wang Deng
- Peking University Institute of Advanced Agricultural Sciences, Shandong Laboratory of Advanced Agricultural Sciences in Weifang, Weifang, 261325, China
- Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, 100871, China
| | - Hang He
- Peking University Institute of Advanced Agricultural Sciences, Shandong Laboratory of Advanced Agricultural Sciences in Weifang, Weifang, 261325, China.
- Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, 100871, China.
| | - Li Guo
- Peking University Institute of Advanced Agricultural Sciences, Shandong Laboratory of Advanced Agricultural Sciences in Weifang, Weifang, 261325, China.
| |
Collapse
|
4
|
Gafurov A, Vinar T, Medvedev P, Brejova B. Efficient Analysis of Annotation Colocalization Accounting for Genomic Contexts. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.11.22.568259. [PMID: 38045397 PMCID: PMC10690252 DOI: 10.1101/2023.11.22.568259] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/05/2023]
Abstract
An annotation is a set of genomic intervals sharing a particular function or property. Examples include genes or their exons, evolutionarily conserved elements, and regions with a particular epigenetic state. A common task is to compare two annotations to determine if one is enriched or depleted in the regions covered by the other. We study the problem of assigning statistical significance to such a comparison based on a null model representing two random unrelated annotations. To incorporate more background information into such analyses,we propose a new null model based on a Markov chain which differentiates among several genomic contexts. These contexts can capture various confounding factors, such as GC content or assembly gaps. We then develop a new algorithm for estimating p-values by computing the exact expectation and variance of the test statistics and then estimating the p-value using a normal approximation. Compared to the previous algorithm by Gafurov et al., the new algorithm provides three advances: (1) the running time is improved from quadratic to linear or quasi-linear, (2) the algorithm can handle two different test statistics, and (3) the algorithm can handle both simple and context-dependent Markov chain null models. We demonstrate the efficiency and accuracy of our algorithm on synthetic and real data sets, including the recent human telomere-to-telomere assembly. In particular, our algorithm computed p-values for 450 pairs of human genome annotations using 24 threads in under three hours. Moreover, the use of genomic contexts to correct for GC bias resulted in the reversal of some previously published findings.
Collapse
|
5
|
Sacristan C, Samejima K, Ruiz LA, Deb M, Lambers MLA, Buckle A, Brackley CA, Robertson D, Hori T, Webb S, Kiewisz R, Bepler T, van Kwawegen E, Risteski P, Vukušić K, Tolić IM, Müller-Reichert T, Fukagawa T, Gilbert N, Marenduzzo D, Earnshaw WC, Kops GJPL. Vertebrate centromeres in mitosis are functionally bipartite structures stabilized by cohesin. Cell 2024:S0092-8674(24)00409-4. [PMID: 38744280 DOI: 10.1016/j.cell.2024.04.014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2023] [Revised: 01/30/2024] [Accepted: 04/14/2024] [Indexed: 05/16/2024]
Abstract
Centromeres are scaffolds for the assembly of kinetochores that ensure chromosome segregation during cell division. How vertebrate centromeres obtain a three-dimensional structure to accomplish their primary function is unclear. Using super-resolution imaging, capture-C, and polymer modeling, we show that vertebrate centromeres are partitioned by condensins into two subdomains during mitosis. The bipartite structure is found in human, mouse, and chicken cells and is therefore a fundamental feature of vertebrate centromeres. Super-resolution imaging and electron tomography reveal that bipartite centromeres assemble bipartite kinetochores, with each subdomain binding a distinct microtubule bundle. Cohesin links the centromere subdomains, limiting their separation in response to spindle forces and avoiding merotelic kinetochore-spindle attachments. Lagging chromosomes during cancer cell divisions frequently have merotelic attachments in which the centromere subdomains are separated and bioriented. Our work reveals a fundamental aspect of vertebrate centromere biology with implications for understanding the mechanisms that guarantee faithful chromosome segregation.
Collapse
Affiliation(s)
- Carlos Sacristan
- Oncode Institute, Hubrecht Institute, Royal Netherlands Academy of Arts and Sciences (KNAW), and University Medical Center Utrecht, Utrecht, the Netherlands.
| | - Kumiko Samejima
- Wellcome Centre for Cell Biology, Institute of Cell Biology, University of Edinburgh, Edinburgh, UK.
| | - Lorena Andrade Ruiz
- Oncode Institute, Hubrecht Institute, Royal Netherlands Academy of Arts and Sciences (KNAW), and University Medical Center Utrecht, Utrecht, the Netherlands
| | - Moonmoon Deb
- Wellcome Centre for Cell Biology, Institute of Cell Biology, University of Edinburgh, Edinburgh, UK
| | - Maaike L A Lambers
- Oncode Institute, Hubrecht Institute, Royal Netherlands Academy of Arts and Sciences (KNAW), and University Medical Center Utrecht, Utrecht, the Netherlands
| | - Adam Buckle
- MRC Human Genetics Unit, Institute of Genetics & Molecular Medicine, University of Edinburgh, Edinburgh, UK
| | - Chris A Brackley
- SUPA School of Physics and Astronomy, University of Edinburgh, Edinburgh, UK
| | - Daniel Robertson
- Wellcome Centre for Cell Biology, Institute of Cell Biology, University of Edinburgh, Edinburgh, UK
| | - Tetsuya Hori
- Laboratory of Chromosome Biology, Graduate School of Frontier Biosciences, Osaka University, Suita, Osaka, Japan
| | - Shaun Webb
- Wellcome Centre for Cell Biology, Institute of Cell Biology, University of Edinburgh, Edinburgh, UK
| | - Robert Kiewisz
- Simons Machine Learning Center, New York Structural Biology Center, New York, NY 10027, USA; Biocomputing Unit, Centro Nacional de Biotecnologia (CNB-CSIC), Darwin, 3, Campus Universidad Autonoma, Cantoblanco, Madrid 28049, Spain
| | - Tristan Bepler
- Simons Machine Learning Center, New York Structural Biology Center, New York, NY 10027, USA
| | - Eloïse van Kwawegen
- Oncode Institute, Hubrecht Institute, Royal Netherlands Academy of Arts and Sciences (KNAW), and University Medical Center Utrecht, Utrecht, the Netherlands
| | | | | | | | - Thomas Müller-Reichert
- Experimental Center, Faculty of Medicine Carl Gustav Carus, Technische Universität Dresden, Dresden, Germany
| | - Tatsuo Fukagawa
- Laboratory of Chromosome Biology, Graduate School of Frontier Biosciences, Osaka University, Suita, Osaka, Japan
| | - Nick Gilbert
- MRC Human Genetics Unit, Institute of Genetics & Molecular Medicine, University of Edinburgh, Edinburgh, UK
| | - Davide Marenduzzo
- SUPA School of Physics and Astronomy, University of Edinburgh, Edinburgh, UK
| | - William C Earnshaw
- Wellcome Centre for Cell Biology, Institute of Cell Biology, University of Edinburgh, Edinburgh, UK.
| | - Geert J P L Kops
- Oncode Institute, Hubrecht Institute, Royal Netherlands Academy of Arts and Sciences (KNAW), and University Medical Center Utrecht, Utrecht, the Netherlands.
| |
Collapse
|
6
|
Logsdon GA, Rozanski AN, Ryabov F, Potapova T, Shepelev VA, Catacchio CR, Porubsky D, Mao Y, Yoo D, Rautiainen M, Koren S, Nurk S, Lucas JK, Hoekzema K, Munson KM, Gerton JL, Phillippy AM, Ventura M, Alexandrov IA, Eichler EE. The variation and evolution of complete human centromeres. Nature 2024; 629:136-145. [PMID: 38570684 PMCID: PMC11062924 DOI: 10.1038/s41586-024-07278-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2023] [Accepted: 03/07/2024] [Indexed: 04/05/2024]
Abstract
Human centromeres have been traditionally very difficult to sequence and assemble owing to their repetitive nature and large size1. As a result, patterns of human centromeric variation and models for their evolution and function remain incomplete, despite centromeres being among the most rapidly mutating regions2,3. Here, using long-read sequencing, we completely sequenced and assembled all centromeres from a second human genome and compared it to the finished reference genome4,5. We find that the two sets of centromeres show at least a 4.1-fold increase in single-nucleotide variation when compared with their unique flanks and vary up to 3-fold in size. Moreover, we find that 45.8% of centromeric sequence cannot be reliably aligned using standard methods owing to the emergence of new α-satellite higher-order repeats (HORs). DNA methylation and CENP-A chromatin immunoprecipitation experiments show that 26% of the centromeres differ in their kinetochore position by >500 kb. To understand evolutionary change, we selected six chromosomes and sequenced and assembled 31 orthologous centromeres from the common chimpanzee, orangutan and macaque genomes. Comparative analyses reveal a nearly complete turnover of α-satellite HORs, with characteristic idiosyncratic changes in α-satellite HORs for each species. Phylogenetic reconstruction of human haplotypes supports limited to no recombination between the short (p) and long (q) arms across centromeres and reveals that novel α-satellite HORs share a monophyletic origin, providing a strategy to estimate the rate of saltatory amplification and mutation of human centromeric DNA.
Collapse
Affiliation(s)
- Glennis A Logsdon
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
- Department of Genetics, Epigenetics Institute, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Allison N Rozanski
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Fedor Ryabov
- Masters Program in National Research University Higher School of Economics, Moscow, Russia
| | - Tamara Potapova
- Stowers Institute for Medical Research, Kansas City, MO, USA
| | | | - Claudia R Catacchio
- Department of Biosciences, Biotechnology and Environment, University of Bari Aldo Moro, Bari, Italy
| | - David Porubsky
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Yafei Mao
- Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders, Ministry of Education, Shanghai Jiao Tong University, Shanghai, China
| | - DongAhn Yoo
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Mikko Rautiainen
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
- Institute for Molecular Medicine Finland (FIMM), Helsinki Institute of Life Science (HiLIFE), University of Helsinki, Helsinki, Finland
| | - Sergey Koren
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Sergey Nurk
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
- Oxford Nanopore Technologies, Oxford, United Kingdom
| | - Julian K Lucas
- Department of Biomolecular Engineering, University of California, Santa Cruz, Santa Cruz, CA, USA
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
| | - Kendra Hoekzema
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Katherine M Munson
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | | | - Adam M Phillippy
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Mario Ventura
- Department of Biosciences, Biotechnology and Environment, University of Bari Aldo Moro, Bari, Italy
| | - Ivan A Alexandrov
- Department of Human Molecular Genetics and Biochemistry, Tel Aviv University, Tel Aviv, Israel
- Department of Anatomy and Anthropology, Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
- Dan David Center for Human Evolution and Biohistory Research, Tel Aviv University, Tel Aviv, Israel
| | - Evan E Eichler
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA.
- Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA.
| |
Collapse
|
7
|
Zhao N, Lai C, Wang Y, Dai S, Gu H. Understanding the role of DNA methylation in colorectal cancer: Mechanisms, detection, and clinical significance. Biochim Biophys Acta Rev Cancer 2024; 1879:189096. [PMID: 38499079 DOI: 10.1016/j.bbcan.2024.189096] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2023] [Revised: 02/18/2024] [Accepted: 03/13/2024] [Indexed: 03/20/2024]
Abstract
Colorectal cancer (CRC) is one of the deadliest malignancies worldwide, ranking third in incidence and second in mortality. Remarkably, early stage localized CRC has a 5-year survival rate of over 90%; in stark contrast, the corresponding 5-year survival rate for metastatic CRC (mCRC) is only 14%. Compounding this problem is the staggering lack of effective therapeutic strategies. Beyond genetic mutations, which have been identified as critical instigators of CRC initiation and progression, the importance of epigenetic modifications, particularly DNA methylation (DNAm), cannot be underestimated, given that DNAm can be used for diagnosis, treatment monitoring and prognostic evaluation. This review addresses the intricate mechanisms governing aberrant DNAm in CRC and its profound impact on critical oncogenic pathways. In addition, a comprehensive review of the various techniques used to detect DNAm alterations in CRC is provided, along with an exploration of the clinical utility of cancer-specific DNAm alterations.
Collapse
Affiliation(s)
- Ningning Zhao
- Anhui Province Key Laboratory of Medical Physics and Technology, Institute of Health and Medical Technology, Hefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei 230031, China; Hefei Cancer Hospital, Chinese Academy of Sciences, Hefei 230031, China
| | - Chuanxi Lai
- Division of Colorectal Surgery, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang 310016, China
| | - Yunfei Wang
- Zhejiang ShengTing Biotech. Ltd, Hangzhou 310000, China
| | - Sheng Dai
- Division of Colorectal Surgery, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang 310016, China.
| | - Hongcang Gu
- Anhui Province Key Laboratory of Medical Physics and Technology, Institute of Health and Medical Technology, Hefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei 230031, China; Hefei Cancer Hospital, Chinese Academy of Sciences, Hefei 230031, China.
| |
Collapse
|
8
|
Parl FF. Analysis of CENP-B Boxes as Anchor of Kinetochores in Centromeres of Human Chromosomes. Bioinform Biol Insights 2024; 18:11779322241248913. [PMID: 38690324 PMCID: PMC11060027 DOI: 10.1177/11779322241248913] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2023] [Accepted: 04/04/2024] [Indexed: 05/02/2024] Open
Abstract
The kinetochore is a multiprotein structure that attaches at one end to DNA in the centromere and at the other end to microtubules in the mitotic spindle. By connecting centromere and spindle, the kinetochore controls the migration of chromosomes during cell division. The exact position where the kinetochore assembles on each centromere was uncertain because large sections of centromeric DNA had not been sequenced due to highly repetitive alpha-satellite arrays. Embedded in the arrays is a 17 bp consensus sequence, the so-called CENP-B box, which binds the CENP-B protein, the only protein that binds directly to centromeric DNA. Recently, the Telomere-to-Telomere Consortium published the complete centromeric DNA sequences of all chromosomes including their epigenetic modifications in the T2T-CHM13 map. I used data from the T2T-CHM13 map to locate the CENP-B boxes in the centromeres as anchor of kinetochores. Most of the CENP-B boxes in centromeric DNA are methylated with the exception of the so-called centromere dip region (CDR), where CENP-B protein dimers bind to adjacent unmethylated CENP-B boxes and interact with CENP-A and CENP-C proteins to assemble the kinetochore. The centromeres of all chromosomes combined have a size of 407 Mb of which the kinetochores account for 5.0 Mb or 1.2%. There is no correlation between centromere and kinetochore size (P = .77). While the number of CENP-B boxes varies 4-fold between chromosomes, their density (number/Kb) varies less than 2-fold with a mean of 2.61 ± 0.33. The narrow range ensures a uniform pull of the spindle on the centromeres. I illustrate the findings in a model of the human kinetochore anchored at unmethylated CENP-B boxes in the CDR and present circos plots of chromosomes to show the location of kinetochores in their respective centromeres.
Collapse
Affiliation(s)
- Fritz F Parl
- Department of Pathology, Microbiology, and Immunology, Vanderbilt University Medical Center, Nashville, TN, USA
| |
Collapse
|
9
|
Kixmoeller K, Chang YW, Black BE. Centromeric chromatin clearings demarcate the site of kinetochore formation. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.04.26.591177. [PMID: 38712116 PMCID: PMC11071481 DOI: 10.1101/2024.04.26.591177] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/08/2024]
Abstract
The centromere is the chromosomal locus that recruits the kinetochore, directing faithful propagation of the genome during cell division. The kinetochore has been interrogated by electron microscopy since the middle of the last century, but with methodologies that compromised fine structure. Using cryo-ET on human mitotic chromosomes, we reveal a distinctive architecture at the centromere: clustered 20-25 nm nucleosome-associated complexes within chromatin clearings that delineate them from surrounding chromatin. Centromere components CENP-C and CENP-N are each required for the integrity of the complexes, while CENP-C is also required to maintain the chromatin clearing. We further visualize the scaffold of the fibrous corona, a structure amplified at unattached kinetochores, revealing crescent-shaped parallel arrays of fibrils that extend >1 μm. Thus, we reveal how the organization of centromeric chromatin creates a clearing at the site of kinetochore formation as well as the nature of kinetochore amplification mediated by corona fibrils.
Collapse
Affiliation(s)
- Kathryn Kixmoeller
- Department of Biochemistry and Biophysics, Perelman School of Medicine, University of Pennsylvania, PA, USA
- Biochemistry Biophysics Chemical Biology Graduate Group, Perelman School of Medicine, University of Pennsylvania, PA, USA
- Institute of Structural Biology, Perelman School of Medicine, University of Pennsylvania, PA, USA
- Penn Center for Genome Integrity, Perelman School of Medicine, University of Pennsylvania, PA, USA
- Epigenetics Institute, Perelman School of Medicine, University of Pennsylvania, PA, USA
| | - Yi-Wei Chang
- Department of Biochemistry and Biophysics, Perelman School of Medicine, University of Pennsylvania, PA, USA
- Biochemistry Biophysics Chemical Biology Graduate Group, Perelman School of Medicine, University of Pennsylvania, PA, USA
- Institute of Structural Biology, Perelman School of Medicine, University of Pennsylvania, PA, USA
| | - Ben E. Black
- Department of Biochemistry and Biophysics, Perelman School of Medicine, University of Pennsylvania, PA, USA
- Biochemistry Biophysics Chemical Biology Graduate Group, Perelman School of Medicine, University of Pennsylvania, PA, USA
- Institute of Structural Biology, Perelman School of Medicine, University of Pennsylvania, PA, USA
- Penn Center for Genome Integrity, Perelman School of Medicine, University of Pennsylvania, PA, USA
- Epigenetics Institute, Perelman School of Medicine, University of Pennsylvania, PA, USA
| |
Collapse
|
10
|
Teschendorff AE. On epigenetic stochasticity, entropy and cancer risk. Philos Trans R Soc Lond B Biol Sci 2024; 379:20230054. [PMID: 38432318 PMCID: PMC10909509 DOI: 10.1098/rstb.2023.0054] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2023] [Accepted: 09/26/2023] [Indexed: 03/05/2024] Open
Abstract
Epigenetic changes are known to accrue in normal cells as a result of ageing and cumulative exposure to cancer risk factors. Increasing evidence points towards age-related epigenetic changes being acquired in a quasi-stochastic manner, and that they may play a causal role in cancer development. Here, I describe the quasi-stochastic nature of DNA methylation (DNAm) changes in ageing cells as well as in normal cells at risk of neoplastic transformation, discussing the implications of this stochasticity for developing cancer risk prediction strategies, and in particular, how it may require a conceptual paradigm shift in how we select cancer risk markers. I also describe the mounting evidence that a significant proportion of DNAm changes in ageing and cancer development are related to cell proliferation, reflecting tissue-turnover and the opportunity this offers for predicting cancer risk via the development of epigenetic mitotic-like clocks. Finally, I describe how age-associated DNAm changes may be causally implicated in cancer development via an irreversible suppression of tissue-specific transcription factors that increases epigenetic and transcriptomic entropy, promoting a more plastic yet aberrant cancer stem-cell state. This article is part of a discussion meeting issue 'Causes and consequences of stochastic processes in development and disease'.
Collapse
Affiliation(s)
- Andrew E. Teschendorff
- CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, Shanghai Institute for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, 320 Yue Yang Road, Shanghai 200031, People's Republic of China
| |
Collapse
|
11
|
Glunčić M, Vlahović I, Rosandić M, Paar V. Novel Concept of Alpha Satellite Cascading Higher-Order Repeats (HORs) and Precise Identification of 15mer and 20mer Cascading HORs in Complete T2T-CHM13 Assembly of Human Chromosome 15. Int J Mol Sci 2024; 25:4395. [PMID: 38673983 PMCID: PMC11050224 DOI: 10.3390/ijms25084395] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2024] [Revised: 04/08/2024] [Accepted: 04/11/2024] [Indexed: 04/28/2024] Open
Abstract
Unraveling the intricate centromere structure of human chromosomes holds profound implications, illuminating fundamental genetic mechanisms and potentially advancing our comprehension of genetic disorders and therapeutic interventions. This study rigorously identified and structurally analyzed alpha satellite higher-order repeats (HORs) within the centromere of human chromosome 15 in the complete T2T-CHM13 assembly using the high-precision GRM2023 algorithm. The most extensive alpha satellite HOR array in chromosome 15 reveals a novel cascading HOR, housing 429 15mer HOR copies, containing 4-, 7- and 11-monomer subfragments. Within each row of cascading HORs, all alpha satellite monomers are of distinct types, as in regular Willard's HORs. However, different HOR copies within the same cascading 15mer HOR contain more than one monomer of the same type. Each canonical 15mer HOR copy comprises 15 monomers belonging to only 9 different monomer types. Notably, 65% of the 429 15mer cascading HOR copies exhibit canonical structures, while 35% display variant configurations. Identified as the second most extensive alpha satellite HOR, another novel cascading HOR within human chromosome 15 encompasses 164 20mer HOR copies, each featuring two subfragments. Moreover, a distinct pattern emerges as interspersed 25mer/26mer structures differing from regular Willard's HORs and giving rise to a 34-monomer subfragment. Only a minor 18mer HOR array of 12 HOR copies is of the regular Willard's type. These revelations highlight the complexity within the chromosome 15 centromeric region, accentuating deviations from anticipated highly regular patterns and hinting at profound information encoding and functional potential within the human centromere.
Collapse
Affiliation(s)
- Matko Glunčić
- Faculty of Science, University of Zagreb, 10000 Zagreb, Croatia;
| | - Ines Vlahović
- Algebra LAB, Algebra University College, 10000 Zagreb, Croatia;
| | - Marija Rosandić
- Department of Internal Medicine, University Hospital Centre Zagreb, 10000 Zagreb, Croatia;
- Croatian Academy of Sciences and Arts, 10000 Zagreb, Croatia
| | - Vladimir Paar
- Faculty of Science, University of Zagreb, 10000 Zagreb, Croatia;
- Croatian Academy of Sciences and Arts, 10000 Zagreb, Croatia
| |
Collapse
|
12
|
Hogan MP, Holding ML, Nystrom GS, Colston TJ, Bartlett DA, Mason AJ, Ellsworth SA, Rautsaw RM, Lawrence KC, Strickland JL, He B, Fraser P, Margres MJ, Gilbert DM, Gibbs HL, Parkinson CL, Rokyta DR. The genetic regulatory architecture and epigenomic basis for age-related changes in rattlesnake venom. Proc Natl Acad Sci U S A 2024; 121:e2313440121. [PMID: 38578985 PMCID: PMC11032440 DOI: 10.1073/pnas.2313440121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2023] [Accepted: 03/13/2024] [Indexed: 04/07/2024] Open
Abstract
Developmental phenotypic changes can evolve under selection imposed by age- and size-related ecological differences. Many of these changes occur through programmed alterations to gene expression patterns, but the molecular mechanisms and gene-regulatory networks underlying these adaptive changes remain poorly understood. Many venomous snakes, including the eastern diamondback rattlesnake (Crotalus adamanteus), undergo correlated changes in diet and venom expression as snakes grow larger with age, providing models for identifying mechanisms of timed expression changes that underlie adaptive life history traits. By combining a highly contiguous, chromosome-level genome assembly with measures of expression, chromatin accessibility, and histone modifications, we identified cis-regulatory elements and trans-regulatory factors controlling venom ontogeny in the venom glands of C. adamanteus. Ontogenetic expression changes were significantly correlated with epigenomic changes within genes, immediately adjacent to genes (e.g., promoters), and more distant from genes (e.g., enhancers). We identified 37 candidate transcription factors (TFs), with the vast majority being up-regulated in adults. The ontogenetic change is largely driven by an increase in the expression of TFs associated with growth signaling, transcriptional activation, and circadian rhythm/biological timing systems in adults with corresponding epigenomic changes near the differentially expressed venom genes. However, both expression activation and repression contributed to the composition of both adult and juvenile venoms, demonstrating the complexity and potential evolvability of gene regulation for this trait. Overall, given that age-based trait variation is common across the tree of life, we provide a framework for understanding gene-regulatory-network-driven life-history evolution more broadly.
Collapse
Affiliation(s)
- Michael P. Hogan
- Department of Biological Science, Florida State University, Tallahassee, FL32306
| | - Matthew L. Holding
- Department of Biological Science, Florida State University, Tallahassee, FL32306
- Life Sciences Institute, University of Michigan, Ann Arbor, MI48109
| | - Gunnar S. Nystrom
- Department of Biological Science, Florida State University, Tallahassee, FL32306
| | - Timothy J. Colston
- Department of Biological Science, Florida State University, Tallahassee, FL32306
- Department of Biology, University of Puerto Rico at Mayagüez, Mayagüez, PR00681
| | - Daniel A. Bartlett
- Department of Biological Science, Florida State University, Tallahassee, FL32306
| | - Andrew J. Mason
- Department of Biological Sciences, Clemson University, Clemson, SC29634
- Department of Evolution, Ecology and Organismal Biology, The Ohio State University, Columbus, OH43210
| | - Schyler A. Ellsworth
- Department of Biological Science, Florida State University, Tallahassee, FL32306
| | - Rhett M. Rautsaw
- Department of Biological Sciences, Clemson University, Clemson, SC29634
- Department of Integrative Biology, University of South Florida, Tampa, FL33620
- School of Biological Sciences, Washington State University, Pullman, WA99164
| | - Kylie C. Lawrence
- Department of Biological Science, Florida State University, Tallahassee, FL32306
| | - Jason L. Strickland
- Department of Biological Sciences, Clemson University, Clemson, SC29634
- Department of Biology, University of South Alabama, Mobile, AL36688
| | - Bing He
- Department of Biological Science, Florida State University, Tallahassee, FL32306
| | - Peter Fraser
- Department of Biological Science, Florida State University, Tallahassee, FL32306
| | - Mark J. Margres
- Department of Integrative Biology, University of South Florida, Tampa, FL33620
| | - David M. Gilbert
- Laboratory of Chromosome Replication and Epigenome Regulation, San Diego Biomedical Research Institute, San Diego, CA92121
| | - H. Lisle Gibbs
- Department of Evolution, Ecology and Organismal Biology, The Ohio State University, Columbus, OH43210
| | - Christopher L. Parkinson
- Department of Biological Sciences, Clemson University, Clemson, SC29634
- Department of Forestry and Environmental Conservation, Clemson University, Clemson, SC29634
| | - Darin R. Rokyta
- Department of Biological Science, Florida State University, Tallahassee, FL32306
| |
Collapse
|
13
|
Bell CG. Epigenomic insights into common human disease pathology. Cell Mol Life Sci 2024; 81:178. [PMID: 38602535 PMCID: PMC11008083 DOI: 10.1007/s00018-024-05206-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2024] [Revised: 03/11/2024] [Accepted: 03/13/2024] [Indexed: 04/12/2024]
Abstract
The epigenome-the chemical modifications and chromatin-related packaging of the genome-enables the same genetic template to be activated or repressed in different cellular settings. This multi-layered mechanism facilitates cell-type specific function by setting the local sequence and 3D interactive activity level. Gene transcription is further modulated through the interplay with transcription factors and co-regulators. The human body requires this epigenomic apparatus to be precisely installed throughout development and then adequately maintained during the lifespan. The causal role of the epigenome in human pathology, beyond imprinting disorders and specific tumour suppressor genes, was further brought into the spotlight by large-scale sequencing projects identifying that mutations in epigenomic machinery genes could be critical drivers in both cancer and developmental disorders. Abrogation of this cellular mechanism is providing new molecular insights into pathogenesis. However, deciphering the full breadth and implications of these epigenomic changes remains challenging. Knowledge is accruing regarding disease mechanisms and clinical biomarkers, through pathogenically relevant and surrogate tissue analyses, respectively. Advances include consortia generated cell-type specific reference epigenomes, high-throughput DNA methylome association studies, as well as insights into ageing-related diseases from biological 'clocks' constructed by machine learning algorithms. Also, 3rd-generation sequencing is beginning to disentangle the complexity of genetic and DNA modification haplotypes. Cell-free DNA methylation as a cancer biomarker has clear clinical utility and further potential to assess organ damage across many disorders. Finally, molecular understanding of disease aetiology brings with it the opportunity for exact therapeutic alteration of the epigenome through CRISPR-activation or inhibition.
Collapse
Affiliation(s)
- Christopher G Bell
- William Harvey Research Institute, Barts & The London Faculty of Medicine, Queen Mary University of London, Charterhouse Square, London, EC1M 6BQ, UK.
| |
Collapse
|
14
|
Smith T, Olagunju T, Rosen B, Neibergs H, Becker G, Davenport K, Elsik C, Hadfield T, Koren S, Kuhn K, Rhie A, Shira K, Skibiel A, Stegemiller M, Thorne J, Villamediana P, Cockett N, Murdoch B. The first complete T2T Assemblies of Cattle and Sheep Y-Chromosomes uncover remarkable divergence in structure and gene content. RESEARCH SQUARE 2024:rs.3.rs-4033388. [PMID: 38712074 PMCID: PMC11071540 DOI: 10.21203/rs.3.rs-4033388/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/08/2024]
Abstract
Reference genomes of cattle and sheep have lacked contiguous assemblies of the sex-determining Y chromosome. We assembled complete and gapless telomere to telomere (T2T) Y chromosomes for these species. The pseudo-autosomal regions were similar in length, but the total chromosome size was substantially different, with the cattle Y more than twice the length of the sheep Y. The length disparity was accounted for by expanded ampliconic region in cattle. The genic amplification in cattle contrasts with pseudogenization in sheep suggesting opposite evolutionary mechanisms since their divergence 18MYA. The centromeres also differed dramatically despite the close relationship between these species at the overall genome sequence level. These Y chromosome have been added to the current reference assemblies in GenBank opening new opportunities for the study of evolution and variation while supporting efforts to improve sustainability in these important livestock species that generally use sire-driven genetic improvement strategies.
Collapse
Affiliation(s)
- Timothy Smith
- USDA, ARS, U.S. Meat Animal Research Center (USMARC)
| | | | | | | | | | | | | | | | - Sergey Koren
- Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health
| | | | - Arang Rhie
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | | | | | | | | | | | | | | |
Collapse
|
15
|
Choudalakis M, Bashtrykov P, Jeltsch A. RepEnTools: an automated repeat enrichment analysis package for ChIP-seq data reveals hUHRF1 Tandem-Tudor domain enrichment in young repeats. Mob DNA 2024; 15:6. [PMID: 38570859 PMCID: PMC10988844 DOI: 10.1186/s13100-024-00315-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2023] [Accepted: 03/05/2024] [Indexed: 04/05/2024] Open
Abstract
BACKGROUND Repeat elements (REs) play important roles for cell function in health and disease. However, RE enrichment analysis in short-read high-throughput sequencing (HTS) data, such as ChIP-seq, is a challenging task. RESULTS Here, we present RepEnTools, a software package for genome-wide RE enrichment analysis of ChIP-seq and similar chromatin pulldown experiments. Our analysis package bundles together various software with carefully chosen and validated settings to provide a complete solution for RE analysis, starting from raw input files to tabular and graphical outputs. RepEnTools implementations are easily accessible even with minimal IT skills (Galaxy/UNIX). To demonstrate the performance of RepEnTools, we analysed chromatin pulldown data by the human UHRF1 TTD protein domain and discovered enrichment of TTD binding on young primate and hominid specific polymorphic repeats (SVA, L1PA1/L1HS) overlapping known enhancers and decorated with H3K4me1-K9me2/3 modifications. We corroborated these new bioinformatic findings with experimental data by qPCR assays using newly developed primate and hominid specific qPCR assays which complement similar research tools. Finally, we analysed mouse UHRF1 ChIP-seq data with RepEnTools and showed that the endogenous mUHRF1 protein colocalizes with H3K4me1-H3K9me3 on promoters of REs which were silenced by UHRF1. These new data suggest a functional role for UHRF1 in silencing of REs that is mediated by TTD binding to the H3K4me1-K9me3 double mark and conserved in two mammalian species. CONCLUSIONS RepEnTools improves the previously available programmes for RE enrichment analysis in chromatin pulldown studies by leveraging new tools, enhancing accessibility and adding some key functions. RepEnTools can analyse RE enrichment rapidly, efficiently, and accurately, providing the community with an up-to-date, reliable and accessible tool for this important type of analysis.
Collapse
Affiliation(s)
- Michel Choudalakis
- Department of Biochemistry, Institute of Biochemistry and Technical Biochemistry, University of Stuttgart, Allmandring 31, 70569, Stuttgart, Germany
| | - Pavel Bashtrykov
- Department of Biochemistry, Institute of Biochemistry and Technical Biochemistry, University of Stuttgart, Allmandring 31, 70569, Stuttgart, Germany.
| | - Albert Jeltsch
- Department of Biochemistry, Institute of Biochemistry and Technical Biochemistry, University of Stuttgart, Allmandring 31, 70569, Stuttgart, Germany.
| |
Collapse
|
16
|
Martínez Duncker Rebolledo E, Chan D, Christensen KE, Reagan AM, Howell GR, Rozen R, Trasler J. Sperm DNA methylation defects in a new mouse model of the 5,10-methylenetetrahydrofolate reductase 677C>T variant and correction with moderate dose folic acid supplementation. Mol Hum Reprod 2024; 30:gaae008. [PMID: 38366926 PMCID: PMC10980591 DOI: 10.1093/molehr/gaae008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2023] [Revised: 01/30/2024] [Indexed: 02/19/2024] Open
Abstract
5,10-Methylenetetrahydrofolate reductase (MTHFR) is an enzyme that plays a key role in providing methyl groups for DNA methylation, including during spermatogenesis. A common genetic variant in humans (MTHFR 677C>T) results in reduced enzyme activity and has been linked to various disorders, including male infertility. A new animal model has been created by reproducing the human equivalent of the polymorphism in mice using CRISPR/Cas9. Biochemical parameters in the Mthfr 677TT mice recapitulate alterations found in MTHFR 677TT men. Our aims were to characterize the sperm DNA methylome of the Mthfr 677CC and TT mice on a control diet (2 mg folic acid/kg diet) and assess the effects of folic acid supplementation (10 mg/kg diet) on the sperm DNA methylome. Body and reproductive organ weights, testicular sperm counts, and histology were examined. DNA methylation in sperm was assessed using bisulfite pyrosequencing and whole-genome bisulfite sequencing (WGBS). Reproductive parameters and locus-specific imprinted gene methylation were unaffected by genotype or diet. Using WGBS, sperm from 677TT mice had 360 differentially methylated tiles as compared to 677CC mice, predominantly hypomethylation (60% of tiles). Folic acid supplementation mostly caused hypermethylation in sperm of males of both genotypes and was found to partially correct the DNA methylation alterations in sperm associated with the TT genotype. The new mouse model will be useful in understanding the role of MTHFR deficiency in male fertility and in designing folate supplementation regimens for the clinic.
Collapse
Affiliation(s)
- Edgar Martínez Duncker Rebolledo
- Child Health and Human Development Program, Research Institute of the McGill University Health Centre, Montreal, QC, Canada
- Department of Human Genetics, McGill University, Montreal, QC, Canada
| | - Donovan Chan
- Child Health and Human Development Program, Research Institute of the McGill University Health Centre, Montreal, QC, Canada
| | - Karen E Christensen
- Child Health and Human Development Program, Research Institute of the McGill University Health Centre, Montreal, QC, Canada
| | | | - Gareth R Howell
- The Jackson Laboratory, Bar Harbor, ME, USA
- Graduate School of Biomedical Sciences, Tufts University School of Medicine, Boston, MA, USA
- Graduate School of Biomedical Sciences and Engineering, University of Maine, Orono, ME, USA
| | - Rima Rozen
- Child Health and Human Development Program, Research Institute of the McGill University Health Centre, Montreal, QC, Canada
- Department of Human Genetics, McGill University, Montreal, QC, Canada
- Department of Pediatrics, McGill University, Montreal, QC, Canada
| | - Jacquetta Trasler
- Child Health and Human Development Program, Research Institute of the McGill University Health Centre, Montreal, QC, Canada
- Department of Human Genetics, McGill University, Montreal, QC, Canada
- Department of Pediatrics, McGill University, Montreal, QC, Canada
- Department of Pharmacology & Therapeutics, Montreal, QC, Canada
| |
Collapse
|
17
|
Ermini L, Driguez P. The Application of Long-Read Sequencing to Cancer. Cancers (Basel) 2024; 16:1275. [PMID: 38610953 PMCID: PMC11011098 DOI: 10.3390/cancers16071275] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2024] [Revised: 03/20/2024] [Accepted: 03/21/2024] [Indexed: 04/14/2024] Open
Abstract
Cancer is a multifaceted disease arising from numerous genomic aberrations that have been identified as a result of advancements in sequencing technologies. While next-generation sequencing (NGS), which uses short reads, has transformed cancer research and diagnostics, it is limited by read length. Third-generation sequencing (TGS), led by the Pacific Biosciences and Oxford Nanopore Technologies platforms, employs long-read sequences, which have marked a paradigm shift in cancer research. Cancer genomes often harbour complex events, and TGS, with its ability to span large genomic regions, has facilitated their characterisation, providing a better understanding of how complex rearrangements affect cancer initiation and progression. TGS has also characterised the entire transcriptome of various cancers, revealing cancer-associated isoforms that could serve as biomarkers or therapeutic targets. Furthermore, TGS has advanced cancer research by improving genome assemblies, detecting complex variants, and providing a more complete picture of transcriptomes and epigenomes. This review focuses on TGS and its growing role in cancer research. We investigate its advantages and limitations, providing a rigorous scientific analysis of its use in detecting previously hidden aberrations missed by NGS. This promising technology holds immense potential for both research and clinical applications, with far-reaching implications for cancer diagnosis and treatment.
Collapse
Affiliation(s)
- Luca Ermini
- NORLUX Neuro-Oncology Laboratory, Department of Cancer Research, Luxembourg Institute of Health, L-1210 Luxembourg, Luxembourg
| | - Patrick Driguez
- Bioscience Core Lab, King Abdullah University of Science and Technology, Thuwal 23955-6900, Saudi Arabia
| |
Collapse
|
18
|
Scelfo A, Angrisani A, Grillo M, Barnes BM, Muyas F, Sauer CM, Leung CWB, Dumont M, Grison M, Mazaud D, Garnier M, Guintini L, Nelson L, Esashi F, Cortés-Ciriano I, Taylor SS, Déjardin J, Wilhelm T, Fachinetti D. Specialized replication mechanisms maintain genome stability at human centromeres. Mol Cell 2024; 84:1003-1020.e10. [PMID: 38359824 DOI: 10.1016/j.molcel.2024.01.018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2023] [Revised: 12/12/2023] [Accepted: 01/19/2024] [Indexed: 02/17/2024]
Abstract
The high incidence of whole-arm chromosome aneuploidy and translocations in tumors suggests instability of centromeres, unique loci built on repetitive sequences and essential for chromosome separation. The causes behind this fragility and the mechanisms preserving centromere integrity remain elusive. We show that replication stress, hallmark of pre-cancerous lesions, promotes centromeric breakage in mitosis, due to spindle forces and endonuclease activities. Mechanistically, we unveil unique dynamics of the centromeric replisome distinct from the rest of the genome. Locus-specific proteomics identifies specialized DNA replication and repair proteins at centromeres, highlighting them as difficult-to-replicate regions. The translesion synthesis pathway, along with other factors, acts to sustain centromere replication and integrity. Prolonged stress causes centromeric alterations like ruptures and translocations, as observed in ovarian cancer models experiencing replication stress. This study provides unprecedented insights into centromere replication and integrity, proposing mechanistic insights into the origins of centromere alterations leading to abnormal cancerous karyotypes.
Collapse
Affiliation(s)
- Andrea Scelfo
- Institut Curie, PSL Research University, Sorbonne Université, CNRS, UMR144, 26 rue d'Ulm, Paris 75005, France
| | - Annapaola Angrisani
- Institut Curie, PSL Research University, Sorbonne Université, CNRS, UMR144, 26 rue d'Ulm, Paris 75005, France
| | - Marco Grillo
- Institut Curie, PSL Research University, Sorbonne Université, CNRS, UMR144, 26 rue d'Ulm, Paris 75005, France
| | - Bethany M Barnes
- Division of Cancer Sciences, School of Medical Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester Cancer Research Centre, Wilmslow Road, Manchester M20 4GJ, UK
| | - Francesc Muyas
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridge CB10 1SD, UK
| | - Carolin M Sauer
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridge CB10 1SD, UK
| | | | - Marie Dumont
- Institut Curie, PSL Research University, Sorbonne Université, CNRS, UMR144, 26 rue d'Ulm, Paris 75005, France
| | - Marine Grison
- Institut Curie, PSL Research University, Sorbonne Université, CNRS, UMR144, 26 rue d'Ulm, Paris 75005, France
| | - David Mazaud
- Plateforme Imagerie PICT-IBiSA, Institut Curie, PSL Research University, Paris 75005, France; Institut Curie, PSL Research University, Sorbonne Université, CNRS, UMR3664, 26 rue d'Ulm, Paris 75005, France
| | - Mickaël Garnier
- Plateforme Imagerie PICT-IBiSA, Institut Curie, PSL Research University, Paris 75005, France; Institut Curie, PSL Research University, Sorbonne Université, CNRS, UMR3664, 26 rue d'Ulm, Paris 75005, France
| | - Laetitia Guintini
- Institute of Human Genetics, CNRS-Université de Montpellier, Montpellier 34396, France
| | - Louisa Nelson
- Division of Cancer Sciences, School of Medical Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester Cancer Research Centre, Wilmslow Road, Manchester M20 4GJ, UK
| | - Fumiko Esashi
- Sir William Dunn School of Pathology, University of Oxford, Oxford OX1 3RE, UK
| | - Isidro Cortés-Ciriano
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridge CB10 1SD, UK
| | - Stephen S Taylor
- Division of Cancer Sciences, School of Medical Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester Cancer Research Centre, Wilmslow Road, Manchester M20 4GJ, UK
| | - Jérôme Déjardin
- Institute of Human Genetics, CNRS-Université de Montpellier, Montpellier 34396, France
| | - Therese Wilhelm
- Institut Curie, PSL Research University, Sorbonne Université, CNRS, UMR144, 26 rue d'Ulm, Paris 75005, France; Institut Curie, PSL Research University, Sorbonne Université, CNRS, UMR3664, 26 rue d'Ulm, Paris 75005, France.
| | - Daniele Fachinetti
- Institut Curie, PSL Research University, Sorbonne Université, CNRS, UMR144, 26 rue d'Ulm, Paris 75005, France; Institut Curie, PSL Research University, Sorbonne Université, CNRS, UMR3664, 26 rue d'Ulm, Paris 75005, France.
| |
Collapse
|
19
|
Naish M, Henderson IR. The structure, function, and evolution of plant centromeres. Genome Res 2024; 34:161-178. [PMID: 38485193 PMCID: PMC10984392 DOI: 10.1101/gr.278409.123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/22/2024]
Abstract
Centromeres are essential regions of eukaryotic chromosomes responsible for the formation of kinetochore complexes, which connect to spindle microtubules during cell division. Notably, although centromeres maintain a conserved function in chromosome segregation, the underlying DNA sequences are diverse both within and between species and are predominantly repetitive in nature. The repeat content of centromeres includes high-copy tandem repeats (satellites), and/or specific families of transposons. The functional region of the centromere is defined by loading of a specific histone 3 variant (CENH3), which nucleates the kinetochore and shows dynamic regulation. In many plants, the centromeres are composed of satellite repeat arrays that are densely DNA methylated and invaded by centrophilic retrotransposons. In some cases, the retrotransposons become the sites of CENH3 loading. We review the structure of plant centromeres, including monocentric, holocentric, and metapolycentric architectures, which vary in the number and distribution of kinetochore attachment sites along chromosomes. We discuss how variation in CENH3 loading can drive genome elimination during early cell divisions of plant embryogenesis. We review how epigenetic state may influence centromere identity and discuss evolutionary models that seek to explain the paradoxically rapid change of centromere sequences observed across species, including the potential roles of recombination. We outline putative modes of selection that could act within the centromeres, as well as the role of repeats in driving cycles of centromere evolution. Although our primary focus is on plant genomes, we draw comparisons with animal and fungal centromeres to derive a eukaryote-wide perspective of centromere structure and function.
Collapse
Affiliation(s)
- Matthew Naish
- Department of Plant Sciences, University of Cambridge, Cambridge CB2 3EA, United Kingdom
| | - Ian R Henderson
- Department of Plant Sciences, University of Cambridge, Cambridge CB2 3EA, United Kingdom
| |
Collapse
|
20
|
Koren S, Bao Z, Guarracino A, Ou S, Goodwin S, Jenike KM, Lucas J, McNulty B, Park J, Rautiainen M, Rhie A, Roelofs D, Schneiders H, Vrijenhoek I, Nijbroek K, Ware D, Schatz MC, Garrison E, Huang S, McCombie WR, Miga KH, Wittenberg AH, Phillippy AM. Gapless assembly of complete human and plant chromosomes using only nanopore sequencing. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.03.15.585294. [PMID: 38529488 PMCID: PMC10962732 DOI: 10.1101/2024.03.15.585294] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/27/2024]
Abstract
The combination of ultra-long Oxford Nanopore (ONT) sequencing reads with long, accurate PacBio HiFi reads has enabled the completion of a human genome and spurred similar efforts to complete the genomes of many other species. However, this approach for complete, "telomere-to-telomere" genome assembly relies on multiple sequencing platforms, limiting its accessibility. ONT "Duplex" sequencing reads, where both strands of the DNA are read to improve quality, promise high per-base accuracy. To evaluate this new data type, we generated ONT Duplex data for three widely-studied genomes: human HG002, Solanum lycopersicum Heinz 1706 (tomato), and Zea mays B73 (maize). For the diploid, heterozygous HG002 genome, we also used "Pore-C" chromatin contact mapping to completely phase the haplotypes. We found the accuracy of Duplex data to be similar to HiFi sequencing, but with read lengths tens of kilobases longer, and the Pore-C data to be compatible with existing diploid assembly algorithms. This combination of read length and accuracy enables the construction of a high-quality initial assembly, which can then be further resolved using the ultra-long reads, and finally phased into chromosome-scale haplotypes with Pore-C. The resulting assemblies have a base accuracy exceeding 99.999% (Q50) and near-perfect continuity, with most chromosomes assembled as single contigs. We conclude that ONT sequencing is a viable alternative to HiFi sequencing for de novo genome assembly, and has the potential to provide a single-instrument solution for the reconstruction of complete genomes.
Collapse
Affiliation(s)
- Sergey Koren
- Genome Informatics Section, Center for Genomics and Data Science Research, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Zhigui Bao
- Department of Molecular Biology, Max Planck Institute for Biology Tübingen, Tübingen, BadenWürttemberg, Germany
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Andrea Guarracino
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, Tennessee, USA
- Human Technopole, Milan, Italy
| | - Shujun Ou
- Ohio State University, Columbus, OH, USA
| | - Sara Goodwin
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | - Katharine M. Jenike
- Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA
| | - Julian Lucas
- University of California Santa Cruz, Santa Cruz, CA, USA
| | - Brandy McNulty
- University of California Santa Cruz, Santa Cruz, CA, USA
| | - Jimin Park
- University of California Santa Cruz, Santa Cruz, CA, USA
| | - Mikko Rautiainen
- Genome Informatics Section, Center for Genomics and Data Science Research, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Arang Rhie
- Genome Informatics Section, Center for Genomics and Data Science Research, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Dick Roelofs
- KeyGene, Agro Business Park 90, 6708 PW Wageningen, Netherlands
| | | | - Ilse Vrijenhoek
- KeyGene, Agro Business Park 90, 6708 PW Wageningen, Netherlands
| | - Koen Nijbroek
- KeyGene, Agro Business Park 90, 6708 PW Wageningen, Netherlands
| | - Doreen Ware
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | - Michael C. Schatz
- Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA
| | - Erik Garrison
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, Tennessee, USA
| | - Sanwen Huang
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
- State Key Laboratory of Tropical Crop Breeding, Chinese Academy of Tropical Agricultural Sciences, Haikou, Hainan, China
| | | | - Karen H. Miga
- University of California Santa Cruz, Santa Cruz, CA, USA
| | | | - Adam M. Phillippy
- Genome Informatics Section, Center for Genomics and Data Science Research, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| |
Collapse
|
21
|
Di Tommaso E, Giunta S. Dynamic interplay between human alpha-satellite DNA structure and centromere functions. Semin Cell Dev Biol 2024; 156:130-140. [PMID: 37926668 DOI: 10.1016/j.semcdb.2023.10.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2023] [Revised: 10/04/2023] [Accepted: 10/10/2023] [Indexed: 11/07/2023]
Abstract
Maintenance of genome stability relies on functional centromeres for correct chromosome segregation and faithful inheritance of the genetic information. The human centromere is the primary constriction within mitotic chromosomes made up of repetitive alpha-satellite DNA hierarchically organized in megabase-long arrays of near-identical higher order repeats (HORs). Centromeres are epigenetically specified by the presence of the centromere-specific histone H3 variant, CENP-A, which enables the assembly of the kinetochore for microtubule attachment. Notably, centromeric DNA is faithfully inherited as intact haplotypes from the parents to the offspring without intervening recombination, yet, outside of meiosis, centromeres are akin to common fragile sites (CFSs), manifesting crossing-overs and ongoing sequence instability. Consequences of DNA changes within the centromere are just starting to emerge, with unclear effects on intra- and inter-generational inheritance driven by centromere's essential role in kinetochore assembly. Here, we review evidence of meiotic selection operating to mitigate centromere drive, as well as recent reports on centromere damage, recombination and repair during the mitotic cell division. We propose an antagonistic pleiotropy interpretation to reconcile centromere DNA instability as both driver of aneuploidy that underlies degenerative diseases, while also potentially necessary for the maintenance of homogenized HORs for centromere function. We attempt to provide a framework for this conceptual leap taking into consideration the structural interface of centromere-kinetochore interaction and present case scenarios for its malfunctioning. Finally, we offer an integrated working model to connect DNA instability, chromatin, and structural changes with functional consequences on chromosome integrity.
Collapse
Affiliation(s)
- Elena Di Tommaso
- Laboratory of Genome Evolution, Department of Biology & Biotechnology Charles Darwin, Sapienza University of Rome, Rome 00185, Italy
| | - Simona Giunta
- Laboratory of Genome Evolution, Department of Biology & Biotechnology Charles Darwin, Sapienza University of Rome, Rome 00185, Italy.
| |
Collapse
|
22
|
Annapragada AV, Niknafs N, White JR, Bruhm DC, Cherry C, Medina JE, Adleff V, Hruban C, Mathios D, Foda ZH, Phallen J, Scharpf RB, Velculescu VE. Genome-wide repeat landscapes in cancer and cell-free DNA. Sci Transl Med 2024; 16:eadj9283. [PMID: 38478628 DOI: 10.1126/scitranslmed.adj9283] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2023] [Accepted: 02/16/2024] [Indexed: 03/22/2024]
Abstract
Genetic changes in repetitive sequences are a hallmark of cancer and other diseases, but characterizing these has been challenging using standard sequencing approaches. We developed a de novo kmer finding approach, called ARTEMIS (Analysis of RepeaT EleMents in dISease), to identify repeat elements from whole-genome sequencing. Using this method, we analyzed 1.2 billion kmers in 2837 tissue and plasma samples from 1975 patients, including those with lung, breast, colorectal, ovarian, liver, gastric, head and neck, bladder, cervical, thyroid, or prostate cancer. We identified tumor-specific changes in these patients in 1280 repeat element types from the LINE, SINE, LTR, transposable element, and human satellite families. These included changes to known repeats and 820 elements that were not previously known to be altered in human cancer. Repeat elements were enriched in regions of driver genes, and their representation was altered by structural changes and epigenetic states. Machine learning analyses of genome-wide repeat landscapes and fragmentation profiles in cfDNA detected patients with early-stage lung or liver cancer in cross-validated and externally validated cohorts. In addition, these repeat landscapes could be used to noninvasively identify the tissue of origin of tumors. These analyses reveal widespread changes in repeat landscapes of human cancers and provide an approach for their detection and characterization that could benefit early detection and disease monitoring of patients with cancer.
Collapse
Affiliation(s)
- Akshaya V Annapragada
- Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins University School of Medicine, Baltimore, MD 21287, USA
| | - Noushin Niknafs
- Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins University School of Medicine, Baltimore, MD 21287, USA
| | - James R White
- Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins University School of Medicine, Baltimore, MD 21287, USA
| | - Daniel C Bruhm
- Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins University School of Medicine, Baltimore, MD 21287, USA
| | - Christopher Cherry
- Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins University School of Medicine, Baltimore, MD 21287, USA
| | - Jamie E Medina
- Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins University School of Medicine, Baltimore, MD 21287, USA
| | - Vilmos Adleff
- Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins University School of Medicine, Baltimore, MD 21287, USA
| | - Carolyn Hruban
- Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins University School of Medicine, Baltimore, MD 21287, USA
| | - Dimitrios Mathios
- Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins University School of Medicine, Baltimore, MD 21287, USA
| | - Zachariah H Foda
- Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins University School of Medicine, Baltimore, MD 21287, USA
- Department of Medicine, Johns Hopkins University School of Medicine, Baltimore, MD 21287, USA
| | - Jillian Phallen
- Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins University School of Medicine, Baltimore, MD 21287, USA
| | - Robert B Scharpf
- Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins University School of Medicine, Baltimore, MD 21287, USA
| | - Victor E Velculescu
- Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins University School of Medicine, Baltimore, MD 21287, USA
- Department of Medicine, Johns Hopkins University School of Medicine, Baltimore, MD 21287, USA
| |
Collapse
|
23
|
Kovaka S, Hook PW, Jenike KM, Shivakumar V, Morina LB, Razaghi R, Timp W, Schatz MC. Uncalled4 improves nanopore DNA and RNA modification detection via fast and accurate signal alignment. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.03.05.583511. [PMID: 38496646 PMCID: PMC10942365 DOI: 10.1101/2024.03.05.583511] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/19/2024]
Abstract
Nanopore signal analysis enables detection of nucleotide modifications from native DNA and RNA sequencing, providing both accurate genetic/transcriptomic and epigenetic information without additional library preparation. Presently, only a limited set of modifications can be directly basecalled (e.g. 5-methylcytosine), while most others require exploratory methods that often begin with alignment of nanopore signal to a nucleotide reference. We present Uncalled4, a toolkit for nanopore signal alignment, analysis, and visualization. Uncalled4 features an efficient banded signal alignment algorithm, BAM signal alignment file format, statistics for comparing signal alignment methods, and a reproducible de novo training method for k-mer-based pore models, revealing potential errors in ONT's state-of-the-art DNA model. We apply Uncalled4 to RNA 6-methyladenine (m6A) detection in seven human cell lines, identifying 26% more modifications than Nanopolish using m6Anet, including in several genes where m6A has known implications in cancer. Uncalled4 is available open-source at github.com/skovaka/uncalled4.
Collapse
|
24
|
Brenner LM, Meyer F, Yang H, Köhler AR, Bashtrykov P, Guo M, Jeltsch A, Lungu C, Olayioye MA. Repeat DNA methylation is modulated by adherens junction signaling. Commun Biol 2024; 7:286. [PMID: 38454140 PMCID: PMC10920906 DOI: 10.1038/s42003-024-05990-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2023] [Accepted: 02/28/2024] [Indexed: 03/09/2024] Open
Abstract
Through its involvement in gene transcription and heterochromatin formation, DNA methylation regulates how cells interact with their environment. Nevertheless, the extracellular signaling cues that modulate the distribution of this central chromatin modification are largely unclear. DNA methylation is highly abundant at repetitive elements, but its investigation in live cells has been complicated by methodological challenges. Utilizing a CRISPR/dCas9 biosensor that reads DNA methylation of human α-satellite repeats in live cells, we here uncover a signaling pathway linking the chromatin and transcriptional state of repetitive elements to epithelial adherens junction integrity. Specifically, we find that in confluent breast epithelial cell monolayers, α-satellite repeat methylation is reduced by comparison to low density cultures. This is coupled with increased transcriptional activity at repeats. Through comprehensive perturbation experiments, we identify the junctional protein E-cadherin, which links to the actin cytoskeleton, as a central molecular player for signal relay into the nucleus. Furthermore, we find that this pathway is impaired in cancer cells that lack E-cadherin and are not contact-inhibited. This suggests that the molecular connection between cell density and repetitive element methylation could play a role in the maintenance of epithelial tissue homeostasis.
Collapse
Affiliation(s)
- Lisa-Marie Brenner
- Institute of Cell Biology and Immunology, University of Stuttgart, Allmandring 31, 70569, Stuttgart, Germany
| | - Florian Meyer
- Institute of Cell Biology and Immunology, University of Stuttgart, Allmandring 31, 70569, Stuttgart, Germany
| | - Haiqian Yang
- Department of Mechanical Engineering, Massachusetts Institute of Technology, Cambridge, 02139, MA, USA
| | - Anja R Köhler
- Institute of Biochemistry and Technical Biochemistry, University of Stuttgart, Allmandring 31, 70569, Stuttgart, Germany
| | - Pavel Bashtrykov
- Institute of Biochemistry and Technical Biochemistry, University of Stuttgart, Allmandring 31, 70569, Stuttgart, Germany
| | - Ming Guo
- Department of Mechanical Engineering, Massachusetts Institute of Technology, Cambridge, 02139, MA, USA
| | - Albert Jeltsch
- Institute of Biochemistry and Technical Biochemistry, University of Stuttgart, Allmandring 31, 70569, Stuttgart, Germany
| | - Cristiana Lungu
- Institute of Cell Biology and Immunology, University of Stuttgart, Allmandring 31, 70569, Stuttgart, Germany.
- Stuttgart Research Center Systems Biology (SRCSB), University of Stuttgart, Nobelstraße 15, 70569, Stuttgart, Germany.
| | - Monilola A Olayioye
- Institute of Cell Biology and Immunology, University of Stuttgart, Allmandring 31, 70569, Stuttgart, Germany.
- Stuttgart Research Center Systems Biology (SRCSB), University of Stuttgart, Nobelstraße 15, 70569, Stuttgart, Germany.
| |
Collapse
|
25
|
Ramirez P, Sun W, Kazempour Dehkordi S, Zare H, Fongang B, Bieniek KF, Frost B. Nanopore-based DNA long-read sequencing analysis of the aged human brain. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.02.01.578450. [PMID: 38370753 PMCID: PMC10871260 DOI: 10.1101/2024.02.01.578450] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/20/2024]
Abstract
Aging disrupts cellular processes such as DNA repair and epigenetic control, leading to a gradual buildup of genomic alterations that can have detrimental effects in post-mitotic cells. Genomic alterations in regions of the genome that are rich in repetitive sequences, often termed "dark loci," are difficult to resolve using traditional sequencing approaches. New long-read technologies offer promising avenues for exploration of previously inaccessible regions of the genome. Using nanopore-based long-read whole-genome sequencing of DNA extracted from aged 18 human brains, we identify previously unreported structural variants and methylation patterns within repetitive DNA, focusing on transposable elements ("jumping genes") as crucial sources of variation, particularly in dark loci. Our analyses reveal potential somatic insertion variants and provides DNA methylation frequencies for many retrotransposon families. We further demonstrate the utility of this technology for the study of these challenging genomic regions in brains affected by Alzheimer's disease and identify significant differences in DNA methylation in pathologically normal brains versus those affected by Alzheimer's disease. Highlighting the power of this approach, we discover specific polymorphic retrotransposons with altered DNA methylation patterns. These retrotransposon loci have the potential to contribute to pathology, warranting further investigation in Alzheimer's disease research. Taken together, our study provides the first long-read DNA sequencing-based analysis of retrotransposon sequences, structural variants, and DNA methylation in the aging brain affected with Alzheimer's disease neuropathology.
Collapse
Affiliation(s)
- Paulino Ramirez
- Barshop Institute for Longevity and Aging Studies, University of Texas Health San Antonio, San Antonio, Texas
- Glenn Biggs Institute for Alzheimer’s and Neurodegenerative Diseases, University of Texas Health San Antonio, San Antonio, Texas
- Department of Cell Systems and Anatomy, University of Texas Health San Antonio, San Antonio, Texas
| | - Wenyan Sun
- Barshop Institute for Longevity and Aging Studies, University of Texas Health San Antonio, San Antonio, Texas
- Glenn Biggs Institute for Alzheimer’s and Neurodegenerative Diseases, University of Texas Health San Antonio, San Antonio, Texas
- Department of Cell Systems and Anatomy, University of Texas Health San Antonio, San Antonio, Texas
- School of Pharmacy, University of Missouri-Kansas City, Kansas City, Missouri
| | - Shiva Kazempour Dehkordi
- Glenn Biggs Institute for Alzheimer’s and Neurodegenerative Diseases, University of Texas Health San Antonio, San Antonio, Texas
- Department of Cell Systems and Anatomy, University of Texas Health San Antonio, San Antonio, Texas
| | - Habil Zare
- Glenn Biggs Institute for Alzheimer’s and Neurodegenerative Diseases, University of Texas Health San Antonio, San Antonio, Texas
- Department of Cell Systems and Anatomy, University of Texas Health San Antonio, San Antonio, Texas
| | - Bernard Fongang
- Glenn Biggs Institute for Alzheimer’s and Neurodegenerative Diseases, University of Texas Health San Antonio, San Antonio, Texas
- Department of Biochemistry & Structural Biology, University of Texas Health San Antonio, San Antonio, Texas
| | - Kevin F. Bieniek
- Glenn Biggs Institute for Alzheimer’s and Neurodegenerative Diseases, University of Texas Health San Antonio, San Antonio, Texas
- Department of Pathology, University of Texas Health San Antonio, San Antonio, Texas
| | - Bess Frost
- Barshop Institute for Longevity and Aging Studies, University of Texas Health San Antonio, San Antonio, Texas
- Glenn Biggs Institute for Alzheimer’s and Neurodegenerative Diseases, University of Texas Health San Antonio, San Antonio, Texas
- Department of Cell Systems and Anatomy, University of Texas Health San Antonio, San Antonio, Texas
| |
Collapse
|
26
|
Wang J, Nakato R. Churros: a Docker-based pipeline for large-scale epigenomic analysis. DNA Res 2024; 31:dsad026. [PMID: 38102723 DOI: 10.1093/dnares/dsad026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2023] [Revised: 11/23/2023] [Accepted: 12/13/2023] [Indexed: 12/17/2023] Open
Abstract
The epigenome, which reflects the modifications on chromatin or DNA sequences, provides crucial insight into gene expression regulation and cellular activity. With the continuous accumulation of epigenomic datasets such as chromatin immunoprecipitation followed by sequencing (ChIP-seq) data, there is a great demand for a streamlined pipeline to consistently process them, especially for large-dataset comparisons involving hundreds of samples. Here, we present Churros, an end-to-end epigenomic analysis pipeline that is environmentally independent and optimized for handling large-scale data. We successfully demonstrated the effectiveness of Churros by analyzing large-scale ChIP-seq datasets with the hg38 or Telomere-to-Telomere (T2T) human reference genome. We found that applying T2T to the typical analysis workflow has important impacts on read mapping, quality checks, and peak calling. We also introduced a useful feature to study context-specific epigenomic landscapes. Churros will contribute a comprehensive and unified resource for analyzing large-scale epigenomic data.
Collapse
Affiliation(s)
- Jiankang Wang
- School of Biomedical Sciences, Hunan University, Changsha, Hunan, China
- Institute for Quantitative Biosciences, The University of Tokyo, Bunkyo-ku, Tokyo, Japan
| | - Ryuichiro Nakato
- Institute for Quantitative Biosciences, The University of Tokyo, Bunkyo-ku, Tokyo, Japan
| |
Collapse
|
27
|
Li J, Cai X, Jiang P, Wang H, Zhang S, Sun T, Chen C, Fan K. Co-based Nanozymatic Profiling: Advances Spanning Chemistry, Biomedical, and Environmental Sciences. ADVANCED MATERIALS (DEERFIELD BEACH, FLA.) 2024; 36:e2307337. [PMID: 37724878 DOI: 10.1002/adma.202307337] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/24/2023] [Revised: 09/12/2023] [Indexed: 09/21/2023]
Abstract
Nanozymes, next-generation enzyme-mimicking nanomaterials, have entered an era of rational design; among them, Co-based nanozymes have emerged as captivating players over times. Co-based nanozymes have been developed and have garnered significant attention over the past five years. Their extraordinary properties, including regulatable enzymatic activity, stability, and multifunctionality stemming from magnetic properties, photothermal conversion effects, cavitation effects, and relaxation efficiency, have made Co-based nanozymes a rising star. This review presents the first comprehensive profiling of the Co-based nanozymes in the chemistry, biology, and environmental sciences. The review begins by scrutinizing the various synthetic methods employed for Co-based nanozyme fabrication, such as template and sol-gel methods, highlighting their distinctive merits from a chemical standpoint. Furthermore, a detailed exploration of their wide-ranging applications in biosensing and biomedical therapeutics, as well as their contributions to environmental monitoring and remediation is provided. Notably, drawing inspiration from state-of-the-art techniques such as omics, a comprehensive analysis of Co-based nanozymes is undertaken, employing analogous statistical methodologies to provide valuable guidance. To conclude, a comprehensive outlook on the challenges and prospects for Co-based nanozymes is presented, spanning from microscopic physicochemical mechanisms to macroscopic clinical translational applications.
Collapse
Affiliation(s)
- Jingqi Li
- College of Chemistry, Chemical Engineering and Resource Utilization, Northeast Forestry University, Harbin, 150040, P. R. China
- Aulin College, Northeast Forestry University, Harbin, 150040, P. R. China
| | - Xinda Cai
- College of Chemistry, Chemical Engineering and Resource Utilization, Northeast Forestry University, Harbin, 150040, P. R. China
- Aulin College, Northeast Forestry University, Harbin, 150040, P. R. China
| | - Peng Jiang
- College of Chemistry, Chemical Engineering and Resource Utilization, Northeast Forestry University, Harbin, 150040, P. R. China
- Aulin College, Northeast Forestry University, Harbin, 150040, P. R. China
| | - Huayuan Wang
- College of Chemistry, Chemical Engineering and Resource Utilization, Northeast Forestry University, Harbin, 150040, P. R. China
- Aulin College, Northeast Forestry University, Harbin, 150040, P. R. China
| | - Shiwei Zhang
- College of Chemistry, Chemical Engineering and Resource Utilization, Northeast Forestry University, Harbin, 150040, P. R. China
- Aulin College, Northeast Forestry University, Harbin, 150040, P. R. China
| | - Tiedong Sun
- College of Chemistry, Chemical Engineering and Resource Utilization, Northeast Forestry University, Harbin, 150040, P. R. China
- Aulin College, Northeast Forestry University, Harbin, 150040, P. R. China
| | - Chunxia Chen
- College of Chemistry, Chemical Engineering and Resource Utilization, Northeast Forestry University, Harbin, 150040, P. R. China
- Aulin College, Northeast Forestry University, Harbin, 150040, P. R. China
| | - Kelong Fan
- CAS Engineering Laboratory for Nanozyme, Key Laboratory of Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, P. R. China
- Nanozyme Medical Center, School of Basic Medical Sciences, Zhengzhou University, Zhengzhou, 450001, P. R. China
| |
Collapse
|
28
|
Huang J, Yang F, Liu Y, Wang Y. N6-methyladenosine RNA methylation in diabetic kidney disease. Biomed Pharmacother 2024; 171:116185. [PMID: 38237350 DOI: 10.1016/j.biopha.2024.116185] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2023] [Revised: 01/13/2024] [Accepted: 01/16/2024] [Indexed: 02/08/2024] Open
Abstract
Diabetic kidney disease (DKD) is a major microvascular complication of diabetes, and hyperglycemic memory associated with diabetes carries the risk of disease occurrence, even after the termination of blood glucose injury. The existence of hyperglycemic memory supports the concept of an epigenetic mechanism involving n6-methyladenosine (m6A) modification. Several studies have shown that m6A plays a key role in the pathogenesis of DKD. This review addresses the role and mechanism of m6A RNA modification in the progression of DKD, including the regulatory role of m6A modification in pathological processes, such as inflammation, oxidative stress, fibrosis, and non-coding (nc) RNA. This reveals the importance of m6A in the occurrence and development of DKD, suggesting that m6A may play a role in hyperglycemic memory phenomenon. This review also discusses how some gray areas, such as m6A modified multiple enzymes, interact to affect the development of DKD and provides countermeasures. In conclusion, this review enhances our understanding of DKD from the perspective of m6A modifications and provides new targets for future therapeutic strategies. In addition, the insights discussed here support the existence of hyperglycemic memory effects in DKD, which may have far-reaching implications for the development of novel treatments. We hypothesize that m6A RNA modification, as a key factor regulating the development of DKD, provides a new perspective for the in-depth exploration of DKD and provides a novel option for the clinical management of patients with DKD.
Collapse
Affiliation(s)
- Jiaan Huang
- Hebei Provincial Key Laboratory of Integrated Traditional Chinese and Western Medicine for Liver and Kidney Diseases, Shijiazhuang 05000, China; Hebei University of Traditional Chinese Medicine, NO.326, Xinshi South Road, Qiaoxi District, Shijiazhuang 05000, China
| | - Fan Yang
- Hebei Provincial Key Laboratory of Integrated Traditional Chinese and Western Medicine for Liver and Kidney Diseases, Shijiazhuang 05000, China; Hebei University of Traditional Chinese Medicine, NO.326, Xinshi South Road, Qiaoxi District, Shijiazhuang 05000, China
| | - Yan Liu
- Hebei Provincial Key Laboratory of Integrated Traditional Chinese and Western Medicine for Liver and Kidney Diseases, Shijiazhuang 05000, China; Hebei University of Traditional Chinese Medicine, NO.326, Xinshi South Road, Qiaoxi District, Shijiazhuang 05000, China
| | - Yuehua Wang
- Hebei Provincial Key Laboratory of Integrated Traditional Chinese and Western Medicine for Liver and Kidney Diseases, Shijiazhuang 05000, China; Hebei University of Traditional Chinese Medicine, NO.326, Xinshi South Road, Qiaoxi District, Shijiazhuang 05000, China.
| |
Collapse
|
29
|
Fernandes JB, Naish M, Lian Q, Burns R, Tock AJ, Rabanal FA, Wlodzimierz P, Habring A, Nicholas RE, Weigel D, Mercier R, Henderson IR. Structural variation and DNA methylation shape the centromere-proximal meiotic crossover landscape in Arabidopsis. Genome Biol 2024; 25:30. [PMID: 38254210 PMCID: PMC10804481 DOI: 10.1186/s13059-024-03163-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2023] [Accepted: 01/01/2024] [Indexed: 01/24/2024] Open
Abstract
BACKGROUND Centromeres load kinetochore complexes onto chromosomes, which mediate spindle attachment and allow segregation during cell division. Although centromeres perform a conserved cellular function, their underlying DNA sequences are highly divergent within and between species. Despite variability in DNA sequence, centromeres are also universally suppressed for meiotic crossover recombination, across eukaryotes. However, the genetic and epigenetic factors responsible for suppression of centromeric crossovers remain to be completely defined. RESULTS To explore the centromere-proximal meiotic recombination landscape, we map 14,397 crossovers against fully assembled Arabidopsis thaliana (A. thaliana) genomes. A. thaliana centromeres comprise megabase satellite repeat arrays that load nucleosomes containing the CENH3 histone variant. Each chromosome contains a structurally polymorphic region of ~3-4 megabases, which lack crossovers and include the satellite arrays. This polymorphic region is flanked by ~1-2 megabase low-recombination zones. These recombination-suppressed regions are enriched for Gypsy/Ty3 retrotransposons, and additionally contain expressed genes with high genetic diversity that initiate meiotic recombination, yet do not crossover. We map crossovers at high-resolution in proximity to CEN3, which resolves punctate centromere-proximal hotspots that overlap gene islands embedded in heterochromatin. Centromeres are densely DNA methylated and the recombination landscape is remodelled in DNA methylation mutants. We observe that the centromeric low-recombining zones decrease and increase crossovers in CG (met1) and non-CG (cmt3) mutants, respectively, whereas the core non-recombining zones remain suppressed. CONCLUSION Our work relates the genetic and epigenetic organization of A. thaliana centromeres and flanking pericentromeric heterochromatin to the zones of crossover suppression that surround the CENH3-occupied satellite repeat arrays.
Collapse
Affiliation(s)
- Joiselle B Fernandes
- Department of Plant Sciences, University of Cambridge, Cambridge, CB2 3EA, UK
- Department of Chromosome Biology, Max Planck Institute for Plant Breeding Research, D-50829, Cologne, Germany
| | - Matthew Naish
- Department of Plant Sciences, University of Cambridge, Cambridge, CB2 3EA, UK
| | - Qichao Lian
- Department of Chromosome Biology, Max Planck Institute for Plant Breeding Research, D-50829, Cologne, Germany
| | - Robin Burns
- Department of Plant Sciences, University of Cambridge, Cambridge, CB2 3EA, UK
| | - Andrew J Tock
- Department of Plant Sciences, University of Cambridge, Cambridge, CB2 3EA, UK
| | - Fernando A Rabanal
- Department of Molecular Biology, Max Planck Institute for Biology, Tübingen, D-72076, Tübingen, Germany
| | - Piotr Wlodzimierz
- Department of Plant Sciences, University of Cambridge, Cambridge, CB2 3EA, UK
| | - Anette Habring
- Department of Molecular Biology, Max Planck Institute for Biology, Tübingen, D-72076, Tübingen, Germany
| | - Robert E Nicholas
- Department of Plant Sciences, University of Cambridge, Cambridge, CB2 3EA, UK
| | - Detlef Weigel
- Department of Molecular Biology, Max Planck Institute for Biology, Tübingen, D-72076, Tübingen, Germany
- University of Tübingen, Institute for Bioinformatics and Medical Informatics, D-72076, Tübingen, Germany
| | - Raphael Mercier
- Department of Chromosome Biology, Max Planck Institute for Plant Breeding Research, D-50829, Cologne, Germany
| | - Ian R Henderson
- Department of Plant Sciences, University of Cambridge, Cambridge, CB2 3EA, UK.
| |
Collapse
|
30
|
Song Y, Peng Y, Liu L, Li G, Zhao X, Wang X, Cao S, Muyle A, Zhou Y, Zhou H. Phased gap-free genome assembly of octoploid cultivated strawberry illustrates the genetic and epigenetic divergence among subgenomes. HORTICULTURE RESEARCH 2024; 11:uhad252. [PMID: 38269295 PMCID: PMC10807706 DOI: 10.1093/hr/uhad252] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/25/2023] [Accepted: 11/18/2023] [Indexed: 01/26/2024]
Abstract
The genetic and epigenetic mechanisms underlying the coexistence and coordination of the four diverged subgenomes (ABCD) in octoploid strawberries (Fragaria × ananassa) remains poorly understood. In this study, we have assembled a haplotype-phased gap-free octoploid genome for the strawberry, which allowed us to uncover the sequence, structure, and epigenetic divergences among the subgenomes. The diploid progenitors of the octoploid strawberry, apart from subgenome A (Fragaria vesca), have been a subject of public controversy. Phylogenomic analyses revealed a close relationship between diploid species Fragaria iinumae and subgenomes B, C, and D. Subgenome A, closely related to F. vesca, retains the highest number of genes, exhibits the lowest content of transposable elements (TEs), experiences the strongest purifying selection, shows the lowest DNA methylation levels, and displays the highest expression level compared to the other three subgenomes. Transcriptome and DNA methylome analyses revealed that subgenome A-biased genes were enriched in fruit development biological processes. In contrast, although subgenomes B, C, and D contain equivalent amounts of repetitive sequences, they exhibit diverged methylation levels, particularly for TEs located near genes. Taken together, our findings provide valuable insights into the evolutionary patterns of subgenome structure, divergence and epigenetic dynamics in octoploid strawberries, which could be utilized in strawberry genetics and breeding research.
Collapse
Affiliation(s)
- Yanhong Song
- National Key Laboratory for Germplasm Innovation & Utilization of Horticultural Crops, Zhengzhou Fruit Research Institute, Chinese Academy of Agricultural Sciences, Zhengzhou, 450009, China
| | - Yanling Peng
- National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518120, China
| | - Lifeng Liu
- National Key Laboratory for Germplasm Innovation & Utilization of Horticultural Crops, Zhengzhou Fruit Research Institute, Chinese Academy of Agricultural Sciences, Zhengzhou, 450009, China
| | - Gang Li
- National Key Laboratory for Germplasm Innovation & Utilization of Horticultural Crops, Zhengzhou Fruit Research Institute, Chinese Academy of Agricultural Sciences, Zhengzhou, 450009, China
| | - Xia Zhao
- National Key Laboratory for Germplasm Innovation & Utilization of Horticultural Crops, Zhengzhou Fruit Research Institute, Chinese Academy of Agricultural Sciences, Zhengzhou, 450009, China
| | - Xu Wang
- National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518120, China
| | - Shuo Cao
- National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518120, China
| | - Aline Muyle
- CEFE, University of Montpellier, CNRS, EPHE, IRD, Montpellier 34000, France
| | - Yongfeng Zhou
- National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518120, China
- National Key Laboratory of Tropical Crop Breeding, Tropical Crops Genetic Resources Institute, Chinese Academy of Tropical Agricultural Sciences, Haikou 570000, China
| | - Houcheng Zhou
- National Key Laboratory for Germplasm Innovation & Utilization of Horticultural Crops, Zhengzhou Fruit Research Institute, Chinese Academy of Agricultural Sciences, Zhengzhou, 450009, China
| |
Collapse
|
31
|
Volpe E, Corda L, Tommaso ED, Pelliccia F, Ottalevi R, Licastro D, Guarracino A, Capulli M, Formenti G, Tassone E, Giunta S. The complete diploid reference genome of RPE-1 identifies human phased epigenetic landscapes. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.11.01.565049. [PMID: 38168337 PMCID: PMC10760208 DOI: 10.1101/2023.11.01.565049] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/05/2024]
Abstract
Comparative analysis of recent human genome assemblies highlights profound sequence divergence that peaks within polymorphic loci such as centromeres. This raises the question about the adequacy of relying on human reference genomes to accurately analyze sequencing data derived from experimental cell lines. Here, we generated the complete diploid genome assembly for the human retinal epithelial cells (RPE-1), a widely used non-cancer laboratory cell line with a stable karyotype, to use as matched reference for multi-omics sequencing data analysis. Our RPE1v1.0 assembly presents completely phased haplotypes and chromosome-level scaffolds that span centromeres with ultra-high base accuracy (>QV60). We mapped the haplotype-specific genomic variation specific to this cell line including t(Xq;10q), a stable 73.18 Mb duplication of chromosome 10 translocated onto the microdeleted chromosome X telomere t(Xq;10q). Polymorphisms between haplotypes of the same genome reveals genetic and epigenetic variation for all chromosomes, especially at centromeres. The RPE-1 assembly as matched reference genome improves mapping quality of multi-omics reads originating from RPE-1 cells with drastic reduction in alignments mismatches compared to using the most complete human reference to date (CHM13). Leveraging the accuracy achieved using a matched reference, we were able to identify the kinetochore sites at base pair resolution and show unprecedented variation between haplotypes. This work showcases the use of matched reference genomes for multiomics analyses and serves as the foundation for a call to comprehensively assemble experimentally relevant cell lines for widespread application.
Collapse
Affiliation(s)
- Emilia Volpe
- Giunta Laboratory of Genome Evolution, Department of Biology and Biotechnologies Charles Darwin, University of Rome “Sapienza”, Piazzale Aldo Moro 5, 00185 Rome, Italy
| | - Luca Corda
- Giunta Laboratory of Genome Evolution, Department of Biology and Biotechnologies Charles Darwin, University of Rome “Sapienza”, Piazzale Aldo Moro 5, 00185 Rome, Italy
| | - Elena Di Tommaso
- Giunta Laboratory of Genome Evolution, Department of Biology and Biotechnologies Charles Darwin, University of Rome “Sapienza”, Piazzale Aldo Moro 5, 00185 Rome, Italy
| | - Franca Pelliccia
- Giunta Laboratory of Genome Evolution, Department of Biology and Biotechnologies Charles Darwin, University of Rome “Sapienza”, Piazzale Aldo Moro 5, 00185 Rome, Italy
| | - Riccardo Ottalevi
- Department of Bioinformatic, Dante Genomics Corp Inc., 667 Madison Avenue, New York, NY 10065 USA and S.s.17, 67100, L’Aquila, Italy
| | | | - Andrea Guarracino
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN 38163, USA
| | - Mattia Capulli
- Department of Biotechnological and Applied Clinical Sciences, University of L’Aquila, L’Aquila, Italy
| | - Giulio Formenti
- The Rockefeller University, 1230 York Avenue, 10065 New York, USA
| | - Evelyne Tassone
- Giunta Laboratory of Genome Evolution, Department of Biology and Biotechnologies Charles Darwin, University of Rome “Sapienza”, Piazzale Aldo Moro 5, 00185 Rome, Italy
| | - Simona Giunta
- Giunta Laboratory of Genome Evolution, Department of Biology and Biotechnologies Charles Darwin, University of Rome “Sapienza”, Piazzale Aldo Moro 5, 00185 Rome, Italy
| |
Collapse
|
32
|
González-Muñoz JF, Sánchez-Sendra B, Monteagudo C. Diagnostic Algorithm to Subclassify Atypical Spitzoid Tumors in Low and High Risk According to Their Methylation Status. Int J Mol Sci 2023; 25:318. [PMID: 38203489 PMCID: PMC10779069 DOI: 10.3390/ijms25010318] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2023] [Revised: 12/20/2023] [Accepted: 12/21/2023] [Indexed: 01/12/2024] Open
Abstract
Current diagnostic algorithms are insufficient for the optimal clinical and therapeutic management of cutaneous spitzoid tumors, particularly atypical spitzoid tumors (AST). Therefore, it is crucial to identify new markers that allow for reliable and reproducible diagnostic assessment and can also be used as a predictive tool to anticipate the individual malignant potential of each patient, leading to tailored individual therapy. Using Reduced Representation Bisulfite Sequencing (RRBS), we studied genome-wide methylation profiles of a series of Spitz nevi (SN), spitzoid melanoma (SM), and AST. We established a diagnostic algorithm based on the methylation status of seven cg sites located in TETK4P2 (Tektin 4 Pseudogene 2), MYO1D (Myosin ID), and PMF1-BGLAP (PMF1-BGLAP Readthrough), which allows the distinction between SN and SM but is also capable of subclassifying AST according to their similarity to the methylation levels of Spitz nevi or spitzoid melanoma. Thus, our epigenetic algorithm can predict the risk level of AST and predict its potential clinical outcomes.
Collapse
Affiliation(s)
| | - Beatriz Sánchez-Sendra
- Skin Cancer Research Group, Biomedical Research Institute INCLIVA, 46010 Valencia, Spain (B.S.-S.)
| | - Carlos Monteagudo
- Skin Cancer Research Group, Biomedical Research Institute INCLIVA, 46010 Valencia, Spain (B.S.-S.)
- Department of Pathology, University of Valencia, 46010 Valencia, Spain
| |
Collapse
|
33
|
Xu R, Pan Z, Nakagawa T. Gross Chromosomal Rearrangement at Centromeres. Biomolecules 2023; 14:28. [PMID: 38254628 PMCID: PMC10813616 DOI: 10.3390/biom14010028] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2023] [Revised: 12/19/2023] [Accepted: 12/22/2023] [Indexed: 01/24/2024] Open
Abstract
Centromeres play essential roles in the faithful segregation of chromosomes. CENP-A, the centromere-specific histone H3 variant, and heterochromatin characterized by di- or tri-methylation of histone H3 9th lysine (H3K9) are the hallmarks of centromere chromatin. Contrary to the epigenetic marks, DNA sequences underlying the centromere region of chromosomes are not well conserved through evolution. However, centromeres consist of repetitive sequences in many eukaryotes, including animals, plants, and a subset of fungi, including fission yeast. Advances in long-read sequencing techniques have uncovered the complete sequence of human centromeres containing more than thousands of alpha satellite repeats and other types of repetitive sequences. Not only tandem but also inverted repeats are present at a centromere. DNA recombination between centromere repeats can result in gross chromosomal rearrangement (GCR), such as translocation and isochromosome formation. CENP-A chromatin and heterochromatin suppress the centromeric GCR. The key player of homologous recombination, Rad51, safeguards centromere integrity through conservative noncrossover recombination between centromere repeats. In contrast to Rad51-dependent recombination, Rad52-mediated single-strand annealing (SSA) and microhomology-mediated end-joining (MMEJ) lead to centromeric GCR. This review summarizes recent findings on the role of centromere and recombination proteins in maintaining centromere integrity and discusses how GCR occurs at centromeres.
Collapse
Affiliation(s)
- Ran Xu
- Department of Biological Sciences, Graduate School of Science, Osaka University, 1-1 Machikaneyama, Toyonaka 560-0043, Osaka, Japan
- Forefront Research Center, Graduate School of Science, Osaka University, 1-1 Machikaneyama, Toyonaka 560-0043, Osaka, Japan
| | - Ziyi Pan
- Department of Biological Sciences, Graduate School of Science, Osaka University, 1-1 Machikaneyama, Toyonaka 560-0043, Osaka, Japan
- Forefront Research Center, Graduate School of Science, Osaka University, 1-1 Machikaneyama, Toyonaka 560-0043, Osaka, Japan
| | - Takuro Nakagawa
- Department of Biological Sciences, Graduate School of Science, Osaka University, 1-1 Machikaneyama, Toyonaka 560-0043, Osaka, Japan
- Forefront Research Center, Graduate School of Science, Osaka University, 1-1 Machikaneyama, Toyonaka 560-0043, Osaka, Japan
| |
Collapse
|
34
|
Hardikar S, Ren R, Ying Z, Horton JR, Bramble MD, Liu B, Lu Y, Liu B, Dan J, Zhang X, Cheng X, Chen T. The ICF syndrome protein CDCA7 harbors a unique DNA-binding domain that recognizes a CpG dyad in the context of a non-B DNA. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.12.15.571946. [PMID: 38168392 PMCID: PMC10760177 DOI: 10.1101/2023.12.15.571946] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/05/2024]
Abstract
CDCA7 , encoding a protein with a C-terminal cysteine-rich domain (CRD), is mutated in immunodeficiency, centromeric instability and facial anomalies (ICF) syndrome, a disease related to hypomethylation of juxtacentromeric satellite DNA. How CDCA7 directs DNA methylation to juxtacentromeric regions is unknown. Here, we show that the CDCA7 CRD adopts a unique zinc-binding structure that recognizes a CpG dyad in a non-B DNA formed by two sequence motifs. CDCA7, but not ICF mutants, preferentially binds the non-B DNA with strand-specific CpG hemi-methylation. The unmethylated sequence motif is highly enriched at centromeres of human chromosomes, whereas the methylated motif is distributed throughout the genome. At S phase, CDCA7, but not ICF mutants, is concentrated in constitutive heterochromatin foci, and the formation of such foci can be inhibited by exogenous hemi-methylated non-B DNA bound by the CRD. Binding of the non-B DNA formed in juxtacentromeric regions during DNA replication provides a mechanism by which CDCA7 controls the specificity of DNA methylation.
Collapse
|
35
|
Chaisson MJP, Sulovari A, Valdmanis PN, Miller DE, Eichler EE. Advances in the discovery and analyses of human tandem repeats. Emerg Top Life Sci 2023; 7:361-381. [PMID: 37905568 PMCID: PMC10806765 DOI: 10.1042/etls20230074] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2023] [Revised: 10/18/2023] [Accepted: 10/18/2023] [Indexed: 11/02/2023]
Abstract
Long-read sequencing platforms provide unparalleled access to the structure and composition of all classes of tandemly repeated DNA from STRs to satellite arrays. This review summarizes our current understanding of their organization within the human genome, their importance with respect to disease, as well as the advances and challenges in understanding their genetic diversity and functional effects. Novel computational methods are being developed to visualize and associate these complex patterns of human variation with disease, expression, and epigenetic differences. We predict accurate characterization of this repeat-rich form of human variation will become increasingly relevant to both basic and clinical human genetics.
Collapse
Affiliation(s)
- Mark J P Chaisson
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90089, U.S.A
- The Genomic and Epigenomic Regulation Program, USC Norris Cancer Center, University of Southern California, Los Angeles, CA 90089, U.S.A
| | - Arvis Sulovari
- Computational Biology, Cajal Neuroscience Inc, Seattle, WA 98102, U.S.A
| | - Paul N Valdmanis
- Division of Medical Genetics, Department of Medicine, University of Washington School of Medicine, Seattle, WA 98195, U.S.A
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, U.S.A
- Department of Laboratory Medicine and Pathology, University of Washington, Seattle, WA 98195, U.S.A
| | - Danny E Miller
- Department of Laboratory Medicine and Pathology, University of Washington, Seattle, WA 98195, U.S.A
- Brotman Baty Institute for Precision Medicine, University of Washington, Seattle, WA 98195, U.S.A
- Department of Pediatrics, University of Washington, Seattle, WA 98195, U.S.A
| | - Evan E Eichler
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, U.S.A
- Howard Hughes Medical Institute, University of Washington, Seattle, WA 98195, U.S.A
| |
Collapse
|
36
|
Yang Y, Wu Z, Wu Z, Li T, Shen Z, Zhou X, Wu X, Li G, Zhang Y. A near-complete assembly of asparagus bean provides insights into anthocyanin accumulation in pods. PLANT BIOTECHNOLOGY JOURNAL 2023; 21:2473-2489. [PMID: 37558431 PMCID: PMC10651155 DOI: 10.1111/pbi.14142] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/17/2022] [Revised: 07/11/2023] [Accepted: 07/23/2023] [Indexed: 08/11/2023]
Abstract
Asparagus bean (Vigna unguiculata ssp. sesquipedialis), a subspecies of V. unguiculata, is a vital legume crop widely cultivated in Asia for its tender pods consumed as vegetables. However, the existing asparagus bean assemblies still contain numerous gaps and unanchored sequences, which presents challenges to functional genomics research. Here, we present an improved reference genome sequence of an elite asparagus bean variety, Fengchan 6, achieved through the integration of nanopore ultra-long reads, PacBio high-fidelity reads, and Hi-C technology. The improved assembly is 521.3 Mb in length and demonstrates several enhancements, including a higher N50 length (46.4 Mb), an anchor ratio of 99.8%, and the presence of only one gap. Furthermore, we successfully assembled 14 telomeres and all 11 centromeres, including four telomere-to-telomere chromosomes. Remarkably, the centromeric regions cover a total length of 38.1 Mb, providing valuable insights into the complex architecture of centromeres. Among the 30 594 predicted protein-coding genes, we identified 2356 genes that are tandemly duplicated in segmental duplication regions. These findings have implications for defence responses and may contribute to evolutionary processes. By utilizing the reference genome, we were able to effectively identify the presence of the gene VuMYB114, which regulates the accumulation of anthocyanins, thereby controlling the purple coloration of the pods. This discovery holds significant implications for understanding the underlying mechanisms of color determination and the breeding process. Overall, the highly improved reference genome serves as crucial resource and lays a solid foundation for asparagus bean genomic studies and genetic improvement efforts.
Collapse
Affiliation(s)
- Yi Yang
- Vegetable Research InstituteGuangdong Academy of Agricultural SciencesGuangzhouChina
- Guangdong Key Laboratory for New Technology Research of VegetablesGuangzhouChina
| | - Zhikun Wu
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic CenterSun Yat‐Sen UniversityGuangzhouChina
| | - Zengxiang Wu
- Vegetable Research InstituteGuangdong Academy of Agricultural SciencesGuangzhouChina
- Guangdong Key Laboratory for New Technology Research of VegetablesGuangzhouChina
| | - Tinyao Li
- Vegetable Research InstituteGuangdong Academy of Agricultural SciencesGuangzhouChina
- Guangdong Key Laboratory for New Technology Research of VegetablesGuangzhouChina
| | - Zhuo Shen
- Vegetable Research InstituteGuangdong Academy of Agricultural SciencesGuangzhouChina
- Guangdong Key Laboratory for New Technology Research of VegetablesGuangzhouChina
| | - Xuan Zhou
- Vegetable Research InstituteGuangdong Academy of Agricultural SciencesGuangzhouChina
- Guangdong Key Laboratory for New Technology Research of VegetablesGuangzhouChina
| | - Xinyi Wu
- Institute of VegetableZhejiang Academy of Agricultural SciencesHangzhouChina
| | - Guojing Li
- Institute of VegetableZhejiang Academy of Agricultural SciencesHangzhouChina
| | - Yan Zhang
- Vegetable Research InstituteGuangdong Academy of Agricultural SciencesGuangzhouChina
- Guangdong Key Laboratory for New Technology Research of VegetablesGuangzhouChina
| |
Collapse
|
37
|
Makova KD, Pickett BD, Harris RS, Hartley GA, Cechova M, Pal K, Nurk S, Yoo D, Li Q, Hebbar P, McGrath BC, Antonacci F, Aubel M, Biddanda A, Borchers M, Bomberg E, Bouffard GG, Brooks SY, Carbone L, Carrel L, Carroll A, Chang PC, Chin CS, Cook DE, Craig SJ, de Gennaro L, Diekhans M, Dutra A, Garcia GH, Grady PG, Green RE, Haddad D, Hallast P, Harvey WT, Hickey G, Hillis DA, Hoyt SJ, Jeong H, Kamali K, Kosakovsky Pond SL, LaPolice TM, Lee C, Lewis AP, Loh YHE, Masterson P, McCoy RC, Medvedev P, Miga KH, Munson KM, Pak E, Paten B, Pinto BJ, Potapova T, Rhie A, Rocha JL, Ryabov F, Ryder OA, Sacco S, Shafin K, Shepelev VA, Slon V, Solar SJ, Storer JM, Sudmant PH, Sweetalana, Sweeten A, Tassia MG, Thibaud-Nissen F, Ventura M, Wilson MA, Young AC, Zeng H, Zhang X, Szpiech ZA, Huber CD, Gerton JL, Yi SV, Schatz MC, Alexandrov IA, Koren S, O’Neill RJ, Eichler E, Phillippy AM. The Complete Sequence and Comparative Analysis of Ape Sex Chromosomes. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.11.30.569198. [PMID: 38077089 PMCID: PMC10705393 DOI: 10.1101/2023.11.30.569198] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/24/2023]
Abstract
Apes possess two sex chromosomes-the male-specific Y and the X shared by males and females. The Y chromosome is crucial for male reproduction, with deletions linked to infertility. The X chromosome carries genes vital for reproduction and cognition. Variation in mating patterns and brain function among great apes suggests corresponding differences in their sex chromosome structure and evolution. However, due to their highly repetitive nature and incomplete reference assemblies, ape sex chromosomes have been challenging to study. Here, using the state-of-the-art experimental and computational methods developed for the telomere-to-telomere (T2T) human genome, we produced gapless, complete assemblies of the X and Y chromosomes for five great apes (chimpanzee, bonobo, gorilla, Bornean and Sumatran orangutans) and a lesser ape, the siamang gibbon. These assemblies completely resolved ampliconic, palindromic, and satellite sequences, including the entire centromeres, allowing us to untangle the intricacies of ape sex chromosome evolution. We found that, compared to the X, ape Y chromosomes vary greatly in size and have low alignability and high levels of structural rearrangements. This divergence on the Y arises from the accumulation of lineage-specific ampliconic regions and palindromes (which are shared more broadly among species on the X) and from the abundance of transposable elements and satellites (which have a lower representation on the X). Our analysis of Y chromosome genes revealed lineage-specific expansions of multi-copy gene families and signatures of purifying selection. In summary, the Y exhibits dynamic evolution, while the X is more stable. Finally, mapping short-read sequencing data from >100 great ape individuals revealed the patterns of diversity and selection on their sex chromosomes, demonstrating the utility of these reference assemblies for studies of great ape evolution. These complete sex chromosome assemblies are expected to further inform conservation genetics of nonhuman apes, all of which are endangered species.
Collapse
Affiliation(s)
| | - Brandon D. Pickett
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | | | | | - Monika Cechova
- University of California Santa Cruz, Santa Cruz, CA, USA
| | - Karol Pal
- Penn State University, University Park, PA, USA
| | - Sergey Nurk
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - DongAhn Yoo
- University of Washington School of Medicine, Seattle, WA, USA
| | - Qiuhui Li
- Johns Hopkins University, Baltimore, MD, USA
| | - Prajna Hebbar
- University of California Santa Cruz, Santa Cruz, CA, USA
| | | | | | | | | | | | - Erich Bomberg
- University of Münster, Münster, Germany
- MPI for Developmental Biology, Tübingen, Germany
| | - Gerard G. Bouffard
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Shelise Y. Brooks
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Lucia Carbone
- Oregon Health & Science University, Portland, OR, USA
- Oregon National Primate Research Center, Hillsboro, OR, USA
| | - Laura Carrel
- Penn State University School of Medicine, Hershey, PA, USA
| | | | | | - Chen-Shan Chin
- Foundation of Biological Data Sciences, Belmont, CA, USA
| | | | | | | | - Mark Diekhans
- University of California Santa Cruz, Santa Cruz, CA, USA
| | - Amalia Dutra
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Gage H. Garcia
- University of Washington School of Medicine, Seattle, WA, USA
| | | | | | - Diana Haddad
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Pille Hallast
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | | | - Glenn Hickey
- University of California Santa Cruz, Santa Cruz, CA, USA
| | - David A. Hillis
- University of California Santa Barbara, Santa Barbara, CA, USA
| | | | - Hyeonsoo Jeong
- University of Washington School of Medicine, Seattle, WA, USA
| | | | | | | | - Charles Lee
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | | | | | - Patrick Masterson
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | | | | | - Karen H. Miga
- University of California Santa Cruz, Santa Cruz, CA, USA
| | | | - Evgenia Pak
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Benedict Paten
- University of California Santa Cruz, Santa Cruz, CA, USA
| | | | | | - Arang Rhie
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | | | - Fedor Ryabov
- Masters Program in National Research University Higher School of Economics, Moscow, Russia
| | | | - Samuel Sacco
- University of California Santa Cruz, Santa Cruz, CA, USA
| | | | | | | | - Steven J. Solar
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | | | | | - Sweetalana
- Penn State University, University Park, PA, USA
| | - Alex Sweeten
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
- Johns Hopkins University, Baltimore, MD, USA
| | | | - Françoise Thibaud-Nissen
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | | | | | - Alice C. Young
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | | | - Xinru Zhang
- Penn State University, University Park, PA, USA
| | | | | | | | - Soojin V. Yi
- University of California Santa Barbara, Santa Barbara, CA, USA
| | | | | | - Sergey Koren
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | | | - Evan Eichler
- University of Washington School of Medicine, Seattle, WA, USA
- Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA
| | - Adam M. Phillippy
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| |
Collapse
|
38
|
Morselli M, Bennett R, Shaidani NI, Horb M, Peshkin L, Pellegrini M. Age-associated DNA methylation changes in Xenopus frogs. Epigenetics 2023; 18:2201517. [PMID: 37092296 PMCID: PMC10128463 DOI: 10.1080/15592294.2023.2201517] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2022] [Accepted: 04/06/2023] [Indexed: 04/25/2023] Open
Abstract
Age-associated changes in DNA methylation have been characterized across various animals, but not yet in amphibians, which are of particular interest because they include widely studied model organisms. In this study, we present clear evidence that the aquatic vertebrate species Xenopus tropicalis displays patterns of age-associated changes in DNA methylation. We have generated whole-genome bisulfite sequencing (WGBS) profiles from skin samples of nine frogs representing young, mature, and old adults and characterized the gene- and chromosome-scale DNA methylation changes with age. Many of the methylation features and changes we observe are consistent with what is known in mammalian species, suggesting that the mechanism of age-related changes is conserved. Moreover, we selected a few thousand age-associated CpG sites to build an assay based on targeted DNA methylation analysis (TBSseq) to expand our findings in future studies involving larger cohorts of individuals. Preliminary results of a pilot TBSeq experiment recapitulate the findings obtained with WGBS setting the basis for the development of an epigenetic clock assay. The results of this study will allow us to leverage the unique resources available for Xenopus to study how DNA methylation relates to other hallmarks of ageing.
Collapse
Affiliation(s)
- Marco Morselli
- Molecular, Cell & Developmental Biology, UCLA, Los Angeles, CA, USA
| | - Ronan Bennett
- Molecular, Cell & Developmental Biology, UCLA, Los Angeles, CA, USA
| | - Nikko-Ideen Shaidani
- Eugene Bell Center for Regenerative Biology and Tissue Engineering and National Xenopus Resource, Marine Biological Laboratory, Woods Hole, MA, USA
| | - Marko Horb
- Eugene Bell Center for Regenerative Biology and Tissue Engineering and National Xenopus Resource, Marine Biological Laboratory, Woods Hole, MA, USA
| | - Leonid Peshkin
- Eugene Bell Center for Regenerative Biology and Tissue Engineering and National Xenopus Resource, Marine Biological Laboratory, Woods Hole, MA, USA
- Systems Biology, Harvard Medical School, Boston, MA, USA
| | | |
Collapse
|
39
|
Sen Gupta A, Seidel C, Tsuchiya D, McKinney S, Yu Z, Smith SE, Unruh JR, Gerton JL. Defining a core configuration for human centromeres during mitosis. Nat Commun 2023; 14:7947. [PMID: 38040722 PMCID: PMC10692335 DOI: 10.1038/s41467-023-42980-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2023] [Accepted: 10/25/2023] [Indexed: 12/03/2023] Open
Abstract
The centromere components cohesin, CENP-A, and centromeric DNA are essential for biorientation of sister chromatids on the mitotic spindle and accurate sister chromatid segregation. Insight into the 3D organization of centromere components would help resolve how centromeres function on the mitotic spindle. We use ChIP-seq and super-resolution microscopy with single particle averaging to examine the geometry of essential centromeric components on human chromosomes. Both modalities suggest cohesin is enriched at pericentromeric DNA. CENP-A localizes to a subset of the α-satellite DNA, with clusters separated by ~562 nm and a perpendicular intervening ~190 nM wide axis of cohesin in metaphase chromosomes. Differently sized α-satellite arrays achieve a similar core structure. Here we present a working model for a common core configuration of essential centromeric components that includes CENP-A nucleosomes, α-satellite DNA and pericentromeric cohesion. This configuration helps reconcile how centromeres function and serves as a foundation to add components of the chromosome segregation machinery.
Collapse
Affiliation(s)
| | - Chris Seidel
- Stowers Institute for Medical Research, Kansas City, MO, USA
| | - Dai Tsuchiya
- Stowers Institute for Medical Research, Kansas City, MO, USA
| | - Sean McKinney
- Stowers Institute for Medical Research, Kansas City, MO, USA
| | - Zulin Yu
- Stowers Institute for Medical Research, Kansas City, MO, USA
| | - Sarah E Smith
- Stowers Institute for Medical Research, Kansas City, MO, USA
| | - Jay R Unruh
- Stowers Institute for Medical Research, Kansas City, MO, USA
| | - Jennifer L Gerton
- Stowers Institute for Medical Research, Kansas City, MO, USA.
- Department of Biochemistry and Molecular Biology, University of Kansas, Kansas City, KS, USA.
| |
Collapse
|
40
|
He Y, Chu Y, Guo S, Hu J, Li R, Zheng Y, Ma X, Du Z, Zhao L, Yu W, Xue J, Bian W, Yang F, Chen X, Zhang P, Wu R, Ma Y, Shao C, Chen J, Wang J, Li J, Wu J, Hu X, Long Q, Jiang M, Ye H, Song S, Li G, Wei Y, Xu Y, Ma Y, Chen Y, Wang K, Bao J, Xi W, Wang F, Ni W, Zhang M, Yu Y, Li S, Kang Y, Gao Z. T2T-YAO: A Telomere-to-telomere Assembled Diploid Reference Genome for Han Chinese. GENOMICS, PROTEOMICS & BIOINFORMATICS 2023; 21:1085-1100. [PMID: 37595788 PMCID: PMC11082261 DOI: 10.1016/j.gpb.2023.08.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/18/2023] [Revised: 08/01/2023] [Accepted: 08/08/2023] [Indexed: 08/20/2023]
Abstract
Since its initial release in 2001, the human reference genome has undergone continuous improvement in quality, and the recently released telomere-to-telomere (T2T) version - T2T-CHM13 - reaches its highest level of continuity and accuracy after 20 years of effort by working on a simplified, nearly homozygous genome of a hydatidiform mole cell line. Here, to provide an authentic complete diploid human genome reference for the Han Chinese, the largest population in the world, we assembled the genome of a male Han Chinese individual, T2T-YAO, which includes T2T assemblies of all the 22 + X + M and 22 + Y chromosomes in both haploids. The quality of T2T-YAO is much better than those of all currently available diploid assemblies, and its haploid version, T2T-YAO-hp, generated by selecting the better assembly for each autosome, reaches the top quality of fewer than one error per 29.5 Mb, even higher than that of T2T-CHM13. Derived from an individual living in the aboriginal region of the Han population, T2T-YAO shows clear ancestry and potential genetic continuity from the ancient ancestors. Each haplotype of T2T-YAO possesses ∼ 330-Mb exclusive sequences, ∼ 3100 unique genes, and tens of thousands of nucleotide and structural variations as compared with CHM13, highlighting the necessity of a population-stratified reference genome. The construction of T2T-YAO, an accurate and authentic representative of the Chinese population, would enable precise delineation of genomic variations and advance our understandings in the hereditability of diseases and phenotypes, especially within the context of the unique variations of the Chinese population.
Collapse
Affiliation(s)
- Yukun He
- Department of Respiratory and Critical Care Medicine, Peking University People's Hospital, Beijing 100044, China
| | - Yanan Chu
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China
| | - Shuming Guo
- Linfen Clinical Medicine Research Center, Linfen 041000, China; Institute of Chest and Lung Diseases, Shanxi Medical University, Taiyuan 030001, China
| | - Jiang Hu
- GrandOmics Biosciences Co., Ltd, Wuhan 430076, China
| | - Ran Li
- Department of Respiratory and Critical Care Medicine, Peking University People's Hospital, Beijing 100044, China; Beijing Key Laboratory of Genome and Precision Medicine Technologies, Beijing 100101, China
| | - Yali Zheng
- Department of Respiratory, Critical Care and Sleep Medicine, Xiang'an Hospital of Xiamen University, School of Medicine, Xiamen University, Xiamen 361101, China
| | - Xinqian Ma
- Department of Respiratory and Critical Care Medicine, Peking University People's Hospital, Beijing 100044, China; Beijing Key Laboratory of Genome and Precision Medicine Technologies, Beijing 100101, China
| | - Zhenglin Du
- Institute of PSI Genomics, Wenzhou 325024, China
| | - Lili Zhao
- Department of Respiratory and Critical Care Medicine, Peking University People's Hospital, Beijing 100044, China; Beijing Key Laboratory of Genome and Precision Medicine Technologies, Beijing 100101, China
| | - Wenyi Yu
- Department of Respiratory and Critical Care Medicine, Peking University People's Hospital, Beijing 100044, China; Beijing Key Laboratory of Genome and Precision Medicine Technologies, Beijing 100101, China
| | - Jianbo Xue
- Department of Respiratory and Critical Care Medicine, Peking University People's Hospital, Beijing 100044, China; Beijing Key Laboratory of Genome and Precision Medicine Technologies, Beijing 100101, China
| | - Wenjie Bian
- Department of Respiratory and Critical Care Medicine, Peking University People's Hospital, Beijing 100044, China; Beijing Key Laboratory of Genome and Precision Medicine Technologies, Beijing 100101, China
| | - Feifei Yang
- Department of Respiratory and Critical Care Medicine, Peking University People's Hospital, Beijing 100044, China; Beijing Key Laboratory of Genome and Precision Medicine Technologies, Beijing 100101, China
| | - Xi Chen
- Department of Respiratory and Critical Care Medicine, Peking University People's Hospital, Beijing 100044, China; Beijing Key Laboratory of Genome and Precision Medicine Technologies, Beijing 100101, China
| | - Pingan Zhang
- Department of Respiratory and Critical Care Medicine, Peking University People's Hospital, Beijing 100044, China; Beijing Key Laboratory of Genome and Precision Medicine Technologies, Beijing 100101, China
| | - Rihan Wu
- Department of Respiratory and Critical Care Medicine, Peking University People's Hospital, Beijing 100044, China; Beijing Key Laboratory of Genome and Precision Medicine Technologies, Beijing 100101, China
| | - Yifan Ma
- Department of Respiratory and Critical Care Medicine, Peking University People's Hospital, Beijing 100044, China; Beijing Key Laboratory of Genome and Precision Medicine Technologies, Beijing 100101, China
| | - Changjun Shao
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China
| | - Jing Chen
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China
| | - Jian Wang
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China
| | - Jiwei Li
- Department of Respiratory, Critical Care and Sleep Medicine, Xiang'an Hospital of Xiamen University, School of Medicine, Xiamen University, Xiamen 361101, China
| | - Jing Wu
- Department of Respiratory, Critical Care and Sleep Medicine, Xiang'an Hospital of Xiamen University, School of Medicine, Xiamen University, Xiamen 361101, China
| | - Xiaoyi Hu
- Department of Respiratory, Critical Care and Sleep Medicine, Xiang'an Hospital of Xiamen University, School of Medicine, Xiamen University, Xiamen 361101, China
| | - Qiuyue Long
- Department of Respiratory, Critical Care and Sleep Medicine, Xiang'an Hospital of Xiamen University, School of Medicine, Xiamen University, Xiamen 361101, China
| | - Mingzheng Jiang
- Department of Respiratory, Critical Care and Sleep Medicine, Xiang'an Hospital of Xiamen University, School of Medicine, Xiamen University, Xiamen 361101, China
| | - Hongli Ye
- Department of Respiratory, Critical Care and Sleep Medicine, Xiang'an Hospital of Xiamen University, School of Medicine, Xiamen University, Xiamen 361101, China
| | - Shixu Song
- Department of Respiratory, Critical Care and Sleep Medicine, Xiang'an Hospital of Xiamen University, School of Medicine, Xiamen University, Xiamen 361101, China
| | - Guangyao Li
- Linfen Clinical Medicine Research Center, Linfen 041000, China
| | - Yue Wei
- Linfen Clinical Medicine Research Center, Linfen 041000, China
| | - Yu Xu
- Beijing Jishuitan Hospital, Capital Medical University, Beijing 100035, China
| | - Yanliang Ma
- Department of Respiratory and Critical Care Medicine, Peking University People's Hospital, Beijing 100044, China; Beijing Key Laboratory of Genome and Precision Medicine Technologies, Beijing 100101, China
| | - Yanwen Chen
- Department of Respiratory and Critical Care Medicine, Peking University People's Hospital, Beijing 100044, China; Beijing Key Laboratory of Genome and Precision Medicine Technologies, Beijing 100101, China
| | - Keqiang Wang
- Department of Respiratory and Critical Care Medicine, Peking University People's Hospital, Beijing 100044, China; Beijing Key Laboratory of Genome and Precision Medicine Technologies, Beijing 100101, China
| | - Jing Bao
- Department of Respiratory and Critical Care Medicine, Peking University People's Hospital, Beijing 100044, China; Beijing Key Laboratory of Genome and Precision Medicine Technologies, Beijing 100101, China
| | - Wen Xi
- Department of Respiratory and Critical Care Medicine, Peking University People's Hospital, Beijing 100044, China; Beijing Key Laboratory of Genome and Precision Medicine Technologies, Beijing 100101, China
| | - Fang Wang
- Department of Respiratory and Critical Care Medicine, Peking University People's Hospital, Beijing 100044, China; Beijing Key Laboratory of Genome and Precision Medicine Technologies, Beijing 100101, China
| | - Wentao Ni
- Department of Respiratory and Critical Care Medicine, Peking University People's Hospital, Beijing 100044, China; Beijing Key Laboratory of Genome and Precision Medicine Technologies, Beijing 100101, China
| | - Moqin Zhang
- Department of Respiratory and Critical Care Medicine, Peking University People's Hospital, Beijing 100044, China; Beijing Key Laboratory of Genome and Precision Medicine Technologies, Beijing 100101, China
| | - Yan Yu
- Department of Respiratory and Critical Care Medicine, Peking University People's Hospital, Beijing 100044, China; Beijing Key Laboratory of Genome and Precision Medicine Technologies, Beijing 100101, China
| | - Shengnan Li
- Department of Respiratory and Critical Care Medicine, Peking University People's Hospital, Beijing 100044, China; Beijing Key Laboratory of Genome and Precision Medicine Technologies, Beijing 100101, China
| | - Yu Kang
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100490, China.
| | - Zhancheng Gao
- Department of Respiratory and Critical Care Medicine, Peking University People's Hospital, Beijing 100044, China; Institute of Chest and Lung Diseases, Shanxi Medical University, Taiyuan 030001, China; Beijing Key Laboratory of Genome and Precision Medicine Technologies, Beijing 100101, China.
| |
Collapse
|
41
|
Peeters SB, Posynick BJ, Brown CJ. Out of the Silence: Insights into How Genes Escape X-Chromosome Inactivation. EPIGENOMES 2023; 7:29. [PMID: 38131901 PMCID: PMC10742877 DOI: 10.3390/epigenomes7040029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2023] [Revised: 11/08/2023] [Accepted: 11/14/2023] [Indexed: 12/23/2023] Open
Abstract
The silencing of all but one X chromosome in mammalian cells is a remarkable epigenetic process leading to near dosage equivalence in X-linked gene products between the sexes. However, equally remarkable is the ability of a subset of genes to continue to be expressed from the otherwise inactive X chromosome-in some cases constitutively, while other genes are variable between individuals, tissues or cells. In this review we discuss the advantages and disadvantages of the approaches that have been used to identify escapees. The identity of escapees provides important clues to mechanisms underlying escape from XCI, an arena of study now moving from correlation to functional studies. As most escapees show greater expression in females, the not-so-inactive X chromosome is a substantial contributor to sex differences in humans, and we highlight some examples of such impact.
Collapse
Affiliation(s)
| | | | - Carolyn J. Brown
- Molecular Epigenetics Group, Department of Medical Genetics, Life Sciences Institute, University of British Columbia, 2350 Health Sciences Mall, Vancouver, BC V6T 1Z3, Canada
| |
Collapse
|
42
|
Ahmad S, Sands M, Greenberg E, Tangen L, Huang J, Irudayaraj JMK. Mucosal DNA methylome alteration in Crohn's disease: surgical and non-surgical groups. Front Genet 2023; 14:1244513. [PMID: 38046046 PMCID: PMC10691104 DOI: 10.3389/fgene.2023.1244513] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2023] [Accepted: 11/09/2023] [Indexed: 12/05/2023] Open
Abstract
Crohn's disease (CD) is characterized as a chronic, relapsing, and progressive disorder with a complex etiology involving interactions between host, microbiome, and the external environment. Genome wide association studies (GWAS) suggest several genetic variations in the diseased individuals but that explains only a small proportion of susceptibility to disease conditions. This indicates the possible role of epigenome which links environmental factors to the genetic variation in the disease etiology. The current study is focused on the DNA methylome evolution with disease progression. We performed Reduced Representation Bisulfite Sequencing (RRBS) to analyze differential DNA methylation in the diseased and healthy mucosal tissues of 2 different groups of CD patients: non-surgical and surgical, categorized based on the severity of disease and standard of care needed. Patients in both groups have unique DNA methylation signature compared to the healthy tissue. After removing single nucleotide polymorphisms (SNPs), 1,671 differentially methylated loci were found in the non-surgical and 3,334 in the surgical group of which only 206 were found overlapping in both groups. Furthermore, differential DNA methylation was noted in some of the GWAS associated genes implicated in CD. Also, functional enrichment analysis showed high representation of several key pathways where differential methylations were observed, and these can be implicated in CD pathogenesis. We identified specific DNA methylation patterns in the mucosal DNA of surgical and non-surgical CD patients which indicates evolution of the methylome as the disease progresses from initial to the advance stage. These unique patterns can be used as DNA methylation signatures to identify different stages of the disease.
Collapse
Affiliation(s)
- Saeed Ahmad
- Biomedical Research Center, Mills Breast Cancer Institute, Carle Foundation Hospital, Urbana, IL, United States
- Department of Bioengineering, University of Illinois Urbana-Champaign, Champaign, IL, United States
| | - Mia Sands
- Biomedical Research Center, Mills Breast Cancer Institute, Carle Foundation Hospital, Urbana, IL, United States
- Department of Bioengineering, University of Illinois Urbana-Champaign, Champaign, IL, United States
| | - Eugene Greenberg
- Digestive Health Institute, Carle Foundation Hospital, Urbana, IL, United States
| | - Lyn Tangen
- Digestive Health Institute, Carle Foundation Hospital, Urbana, IL, United States
| | - Jiacheng Huang
- Biomedical Research Center, Mills Breast Cancer Institute, Carle Foundation Hospital, Urbana, IL, United States
- Department of Bioengineering, University of Illinois Urbana-Champaign, Champaign, IL, United States
| | - Joseph Maria Kumar Irudayaraj
- Biomedical Research Center, Mills Breast Cancer Institute, Carle Foundation Hospital, Urbana, IL, United States
- Department of Bioengineering, University of Illinois Urbana-Champaign, Champaign, IL, United States
- Carl Woese Institute for Genomic Biology, University of Illinois Urbana-Champaign, Champaign, IL, United States
- Cancer Center at Illinois, University of Illinois Urbana-Champaign, Champaign, IL, United States
- Carle Illinois College of Medicine, University of Illinois Urbana-Champaign, Champaign, IL, United States
| |
Collapse
|
43
|
Delaney K, Weiss N, Almouzni G. The cell-cycle choreography of H3 variants shapes the genome. Mol Cell 2023; 83:3773-3786. [PMID: 37734377 PMCID: PMC10621666 DOI: 10.1016/j.molcel.2023.08.030] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2023] [Revised: 08/07/2023] [Accepted: 08/29/2023] [Indexed: 09/23/2023]
Abstract
Histone variants provide versatility in the basic unit of chromatin, helping to define dynamic landscapes and cell fates. Maintaining genome integrity is paramount for the cell, and it is intimately linked with chromatin dynamics, assembly, and disassembly during DNA transactions such as replication, repair, recombination, and transcription. In this review, we focus on the family of H3 variants and their dynamics in space and time during the cell cycle. We review the distinct H3 variants' specific features along with their escort partners, the histone chaperones, compiled across different species to discuss their distinct importance considering evolution. We place H3 dynamics at different times during the cell cycle with the possible consequences for genome stability. Finally, we examine how their mutation and alteration impact disease. The emerging picture stresses key parameters in H3 dynamics to reflect on how when they are perturbed, they become a source of stress for genome integrity.
Collapse
Affiliation(s)
- Kamila Delaney
- Institut Curie, PSL Research University, CNRS, Sorbonne Université, Nuclear Dynamics Unit, Equipe Labellisée Ligue contre le Cancer, 26 rue d'Ulm, 75005 Paris, France
| | - Nicole Weiss
- Institut Curie, PSL Research University, CNRS, Sorbonne Université, Nuclear Dynamics Unit, Equipe Labellisée Ligue contre le Cancer, 26 rue d'Ulm, 75005 Paris, France
| | - Geneviève Almouzni
- Institut Curie, PSL Research University, CNRS, Sorbonne Université, Nuclear Dynamics Unit, Equipe Labellisée Ligue contre le Cancer, 26 rue d'Ulm, 75005 Paris, France.
| |
Collapse
|
44
|
Miga KH, Eichler EE. Envisioning a new era: Complete genetic information from routine, telomere-to-telomere genomes. Am J Hum Genet 2023; 110:1832-1840. [PMID: 37922882 PMCID: PMC10645551 DOI: 10.1016/j.ajhg.2023.09.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2023] [Revised: 09/19/2023] [Accepted: 09/20/2023] [Indexed: 11/07/2023] Open
Abstract
Advances in long-read sequencing and assembly now mean that individual labs can generate phased genomes that are more accurate and more contiguous than the original human reference genome. With declining costs and increasing democratization of technology, we suggest that complete genome assemblies, where both parental haplotypes are phased telomere to telomere, will become standard in human genetics. Soon, even in clinical settings where rigorous sample-handling standards must be met, affected individuals could have reference-grade genomes fully sequenced and assembled in just a few hours given advances in technology, computational processing, and annotation. Complete genetic variant discovery will transform how we map, catalog, and associate variation with human disease and fundamentally change our understanding of the genetic diversity of all humans.
Collapse
Affiliation(s)
- Karen H Miga
- Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, CA 95064, USA.
| | - Evan E Eichler
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, USA; Howard Hughes Medical Institute, University of Washington, Seattle, WA 98195, USA.
| |
Collapse
|
45
|
Sidhwani P, Straight AF. Epigenetic inheritance and boundary maintenance at human centromeres. Curr Opin Struct Biol 2023; 82:102694. [PMID: 37657353 PMCID: PMC10530090 DOI: 10.1016/j.sbi.2023.102694] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2023] [Revised: 07/31/2023] [Accepted: 08/07/2023] [Indexed: 09/03/2023]
Abstract
Centromeres are chromosomal regions that provide the foundation for microtubule attachment during chromosome segregation. Centromeres are epigenetically defined by nucleosomes containing the histone H3 variant centromere protein A (CENP-A) and, in many organisms, are surrounded by transcriptionally repressed pericentromeric chromatin marked by trimethylation of histone H3 lysine 9 (H3K9me3). Pericentromeric regions facilitate sister chromatid cohesion during mitosis, thereby supporting centromere function. Heterochromatin has a known propensity to spread into adjacent euchromatic domains unless it is properly bounded. Heterochromatin spreading into the centromere can disrupt kinetochore function, perturbing chromosome segregation and genome stability. In the fission yeast Schizosaccharomyces pombe, tRNA genes provide barriers to heterochromatin spread at the centromere, the absence of which results in abnormal meiotic chromosome segregation. How heterochromatin-centromere boundaries are established in humans is not understood. We propose models for stable epigenetic inheritance of centromeric domains in humans and discuss advances that will enable the discovery of novel regulators of this process.
Collapse
Affiliation(s)
- Pragya Sidhwani
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA, United States. https://twitter.com/@pra_sidh
| | - Aaron F Straight
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA, United States.
| |
Collapse
|
46
|
Yang C, Zhou Y, Song Y, Wu D, Zeng Y, Nie L, Liu P, Zhang S, Chen G, Xu J, Zhou H, Zhou L, Qian X, Liu C, Tan S, Zhou C, Dai W, Xu M, Qi Y, Wang X, Guo L, Fan G, Wang A, Deng Y, Zhang Y, Jin J, He Y, Guo C, Guo G, Zhou Q, Xu X, Yang H, Wang J, Xu S, Mao Y, Jin X, Ruan J, Zhang G. The complete and fully-phased diploid genome of a male Han Chinese. Cell Res 2023; 33:745-761. [PMID: 37452091 PMCID: PMC10542383 DOI: 10.1038/s41422-023-00849-5] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2023] [Accepted: 06/29/2023] [Indexed: 07/18/2023] Open
Abstract
Since the release of the complete human genome, the priority of human genomic study has now been shifting towards closing gaps in ethnic diversity. Here, we present a fully phased and well-annotated diploid human genome from a Han Chinese male individual (CN1), in which the assemblies of both haploids achieve the telomere-to-telomere (T2T) level. Comparison of this diploid genome with the CHM13 haploid T2T genome revealed significant variations in the centromere. Outside the centromere, we discovered 11,413 structural variations, including numerous novel ones. We also detected thousands of CN1 alleles that have accumulated high substitution rates and a few that have been under positive selection in the East Asian population. Further, we found that CN1 outperforms CHM13 as a reference genome in mapping and variant calling for the East Asian population owing to the distinct structural variants of the two references. Comparison of SNP calling for a large cohort of 8869 Chinese genomes using CN1 and CHM13 as reference respectively showed that the reference bias profoundly impacts rare SNP calling, with nearly 2 million rare SNPs miss-called with different reference genomes. Finally, applying the CN1 as a reference, we discovered 5.80 Mb and 4.21 Mb putative introgression sequences from Neanderthal and Denisovan, respectively, including many East Asian specific ones undetected using CHM13 as the reference. Our analyses reveal the advances of using CN1 as a reference for population genomic studies and paleo-genomic studies. This complete genome will serve as an alternative reference for future genomic studies on the East Asian population.
Collapse
Affiliation(s)
- Chentao Yang
- Center for Genomic Research, International Institutes of Medicine, The Fourth Affiliated Hospital, Zhejiang University School of Medicine, Yiwu, Zhejiang, China
- Center for Evolutionary & Organismal Biology, & Women's Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, China
- BGI-Shenzhen, Shenzhen, Guangdong, China
| | - Yang Zhou
- BGI-Shenzhen, Shenzhen, Guangdong, China
- BGI Research-Wuhan, BGI, Wuhan, Hubei, China
| | - Yanni Song
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong, China
| | - Dongya Wu
- Center for Genomic Research, International Institutes of Medicine, The Fourth Affiliated Hospital, Zhejiang University School of Medicine, Yiwu, Zhejiang, China
- Center for Evolutionary & Organismal Biology, & Women's Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, China
- Liangzhu Laboratory, Zhejiang University Medical Center, Hangzhou, Zhejiang, China
- Institute of Crop Science & Institute of Bioinformatics, Zhejiang University, Hangzhou, Zhejiang, China
| | - Yan Zeng
- BGI-Shenzhen, Shenzhen, Guangdong, China
| | - Lei Nie
- BGI-Shenzhen, Shenzhen, Guangdong, China
| | | | - Shilong Zhang
- Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders, Ministry of Education, Shanghai Jiao Tong University, Shanghai, China
| | - Guangji Chen
- BGI-Shenzhen, Shenzhen, Guangdong, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China
| | - Jinjin Xu
- BGI-Shenzhen, Shenzhen, Guangdong, China
| | - Hongling Zhou
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong, China
| | - Long Zhou
- Center for Evolutionary & Organismal Biology, & Women's Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, China
- Liangzhu Laboratory, Zhejiang University Medical Center, Hangzhou, Zhejiang, China
- Innovation Center of Yangtze River Delta, Zhejiang University, Hangzhou, Zhejiang, China
| | - Xiaobo Qian
- BGI-Shenzhen, Shenzhen, Guangdong, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China
| | - Chenlu Liu
- Life Sciences Institute, Zhejiang University, Hangzhou, Zhejiang, China
| | | | | | - Wei Dai
- BGI-Shenzhen, Shenzhen, Guangdong, China
| | - Mengyang Xu
- BGI-Shenzhen, Shenzhen, Guangdong, China
- BGI-Qingdao, BGI-Shenzhen, Qingdao, Shandong, China
| | - Yanwei Qi
- BGI-Qingdao, BGI-Shenzhen, Qingdao, Shandong, China
| | - Xiaobo Wang
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong, China
| | - Lidong Guo
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China
- BGI-Qingdao, BGI-Shenzhen, Qingdao, Shandong, China
| | - Guangyi Fan
- BGI-Qingdao, BGI-Shenzhen, Qingdao, Shandong, China
| | - Aijun Wang
- BGI-Qingdao, BGI-Shenzhen, Qingdao, Shandong, China
| | - Yuan Deng
- BGI-Shenzhen, Shenzhen, Guangdong, China
| | - Yong Zhang
- BGI-Shenzhen, Shenzhen, Guangdong, China
| | | | - Yunqiu He
- Center for Genomic Research, International Institutes of Medicine, The Fourth Affiliated Hospital, Zhejiang University School of Medicine, Yiwu, Zhejiang, China
- Center for Evolutionary & Organismal Biology, & Women's Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, China
| | - Chunxue Guo
- BGI-Shenzhen, Shenzhen, Guangdong, China
- BGI-Hangzhou, Hangzhou, Zhejiang, China
| | - Guoji Guo
- School of Medicine, Zhejiang University, Hangzhou, Zhejiang, China
| | - Qing Zhou
- Liangzhu Laboratory, Zhejiang University Medical Center, Hangzhou, Zhejiang, China
- Life Sciences Institute, Zhejiang University, Hangzhou, Zhejiang, China
| | - Xun Xu
- BGI-Shenzhen, Shenzhen, Guangdong, China
| | | | - Jian Wang
- BGI-Shenzhen, Shenzhen, Guangdong, China
| | - Shuhua Xu
- State Key Laboratory of Genetic Engineering, Center for Evolutionary Biology, Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, Shanghai, China
- Human Phenome Institute, Zhangjiang Fudan International Innovation Center, and Ministry of Education Key Laboratory of Contemporary Anthropology, Fudan University, Shanghai, China
- Jiangsu Key Laboratory of Phylogenomics & Comparative Genomics, International Joint Center of Genomics of Jiangsu Province School of Life Sciences, Jiangsu Normal University, Xuzhou, Jiangsu, China
- Department of Liver Surgery and Transplantation Liver Cancer Institute, Zhongshan Hospital, Fudan University, Shanghai, China
- Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, Yunnan, China
| | - Yafei Mao
- Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders, Ministry of Education, Shanghai Jiao Tong University, Shanghai, China
| | - Xin Jin
- BGI-Shenzhen, Shenzhen, Guangdong, China
| | - Jue Ruan
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong, China.
| | - Guojie Zhang
- Center for Genomic Research, International Institutes of Medicine, The Fourth Affiliated Hospital, Zhejiang University School of Medicine, Yiwu, Zhejiang, China.
- Center for Evolutionary & Organismal Biology, & Women's Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, China.
- Liangzhu Laboratory, Zhejiang University Medical Center, Hangzhou, Zhejiang, China.
- Innovation Center of Yangtze River Delta, Zhejiang University, Hangzhou, Zhejiang, China.
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan, China.
| |
Collapse
|
47
|
Farooqi AA, Rakhmetova V, Kapanova G, Tanbayeva G, Mussakhanova A, Abdykulova A, Ryskulova AG. Role of Ubiquitination and Epigenetics in the Regulation of AhR Signaling in Carcinogenesis and Metastasis: "Albatross around the Neck" or "Blessing in Disguise". Cells 2023; 12:2382. [PMID: 37830596 PMCID: PMC10571945 DOI: 10.3390/cells12192382] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2023] [Revised: 09/25/2023] [Accepted: 09/27/2023] [Indexed: 10/14/2023] Open
Abstract
The molecular mechanisms and signal transduction cascades evoked by the activation of aryl hydrocarbon receptor (AhR) are becoming increasingly understandable. AhR is a ligand-activated transcriptional factor that integrates environmental, dietary and metabolic cues for the pleiotropic regulation of a wide variety of mechanisms. AhR mediates transcriptional programming in a ligand-specific, context-specific and cell-type-specific manner. Pioneering cutting-edge research works have provided fascinating new insights into the mechanistic role of AhR-driven downstream signaling in a wide variety of cancers. AhR ligands derived from food, environmental contaminants and intestinal microbiota strategically activated AhR signaling and regulated multiple stages of cancer. Although AhR has classically been viewed and characterized as a ligand-regulated transcriptional factor, its role as a ubiquitin ligase is fascinating. Accordingly, recent evidence has paradigmatically shifted our understanding and urged researchers to drill down deep into these novel and clinically valuable facets of AhR biology. Our rapidly increasing realization related to AhR-mediated regulation of the ubiquitination and proteasomal degradation of different proteins has started to scratch the surface of intriguing mechanisms. Furthermore, AhR and epigenome dynamics have shown previously unprecedented complexity during multiple stages of cancer progression. AhR not only transcriptionally regulated epigenetic-associated molecules, but also worked with epigenetic-modifying enzymes during cancer progression. In this review, we have summarized the findings obtained not only from cell-culture studies, but also from animal models. Different clinical trials are currently being conducted using AhR inhibitors and PD-1 inhibitors (Pembrolizumab and nivolumab), which confirm the linchpin role of AhR-related mechanistic details in cancer progression. Therefore, further studies are required to develop a better comprehension of the many-sided and "diametrically opposed" roles of AhR in the regulation of carcinogenesis and metastatic spread of cancer cells to the secondary organs.
Collapse
Affiliation(s)
- Ammad Ahmad Farooqi
- Institute of Biomedical and Genetic Engineering (IBGE), Islamabad 54000, Pakistan
| | - Venera Rakhmetova
- Department of Internal Diseases, Medical University of Astana, Astana 010000, Kazakhstan
| | - Gulnara Kapanova
- Faculty of Medicine and healthcare, Al-Farabi Kazakh National University, 71 Al-Farabi Ave, Almaty 050040, Kazakhstan (G.T.)
- Scientific Center of Anti-Infectious Drugs, 75 Al-Farabi Ave, Almaty 050040, Kazakhstan
| | - Gulnur Tanbayeva
- Faculty of Medicine and healthcare, Al-Farabi Kazakh National University, 71 Al-Farabi Ave, Almaty 050040, Kazakhstan (G.T.)
| | - Akmaral Mussakhanova
- Department of Public Health and Management, Astana Medical University, Astana 010000, Kazakhstan;
| | - Akmaral Abdykulova
- Department of General Medical Practice, General Medicine Faculty, Asfendiyarov Kazakh National Medical University, Almaty 050000, Kazakhstan;
| | - Alma-Gul Ryskulova
- Department of Public Health and Social Sciences, Kazakhstan Medical University “KSPH”, Utenos Str. 19A, Almaty 050060, Kazakhstan;
| |
Collapse
|
48
|
van Dijk EL, Naquin D, Gorrichon K, Jaszczyszyn Y, Ouazahrou R, Thermes C, Hernandez C. Genomics in the long-read sequencing era. Trends Genet 2023; 39:649-671. [PMID: 37230864 DOI: 10.1016/j.tig.2023.04.006] [Citation(s) in RCA: 11] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2023] [Revised: 04/21/2023] [Accepted: 04/25/2023] [Indexed: 05/27/2023]
Abstract
Long-read sequencing (LRS) technologies have provided extremely powerful tools to explore genomes. While in the early years these methods suffered technical limitations, they have recently made significant progress in terms of read length, throughput, and accuracy and bioinformatics tools have strongly improved. Here, we aim to review the current status of LRS technologies, the development of novel methods, and the impact on genomics research. We will explore the most impactful recent findings made possible by these technologies focusing on high-resolution sequencing of genomes and transcriptomes and the direct detection of DNA and RNA modifications. We will also discuss how LRS methods promise a more comprehensive understanding of human genetic variation, transcriptomics, and epigenetics for the coming years.
Collapse
Affiliation(s)
- Erwin L van Dijk
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198, Gif-sur-Yvette, France.
| | - Delphine Naquin
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198, Gif-sur-Yvette, France
| | - Kévin Gorrichon
- National Center of Human Genomics Research (CNRGH), 91000 Évry-Courcouronnes, France
| | - Yan Jaszczyszyn
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198, Gif-sur-Yvette, France
| | - Rania Ouazahrou
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198, Gif-sur-Yvette, France
| | - Claude Thermes
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198, Gif-sur-Yvette, France
| | - Céline Hernandez
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198, Gif-sur-Yvette, France
| |
Collapse
|
49
|
Mo W, Shu Y, Liu B, Long Y, Li T, Cao X, Deng X, Zhai J. Single-molecule targeted accessibility and methylation sequencing of centromeres, telomeres and rDNAs in Arabidopsis. NATURE PLANTS 2023; 9:1439-1450. [PMID: 37599304 DOI: 10.1038/s41477-023-01498-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/22/2022] [Accepted: 07/24/2023] [Indexed: 08/22/2023]
Abstract
The short read-length of next-generation sequencing makes it challenging to characterize highly repetitive regions (HRRs) such as centromeres, telomeres and ribosomal DNAs. Based on recent strategies that combined long-read sequencing and exogenous enzymatic labelling of open chromatin, we developed single-molecule targeted accessibility and methylation sequencing (STAM-seq) in plants by further integrating nanopore adaptive sampling to investigate the HRRs in wild-type Arabidopsis and DNA methylation mutants that are defective in CG- or non-CG methylation. We found that CEN180 repeats show higher chromatin accessibility and lower DNA methylation on their forward strand, individual rDNA units show a negative correlation between their DNA methylation and accessibility, and both accessibility and CHH methylation levels are lower at telomere compared to adjacent subtelomeric region. Moreover, DNA methylation-deficient mutants showed increased chromatin accessibility at HRRs, consistent with the role of DNA methylation in maintaining heterochromatic status in plants. STAM-seq can be applied to study accessibility and methylation of repetitive sequences across diverse plant species.
Collapse
Affiliation(s)
- Weipeng Mo
- Institute of Plant and Food Science, Department of Biology, School of Life Sciences, Southern University of Science and Technology, Shenzhen, China
| | - Yi Shu
- Institute of Plant and Food Science, Department of Biology, School of Life Sciences, Southern University of Science and Technology, Shenzhen, China
| | - Bo Liu
- Institute of Plant and Food Science, Department of Biology, School of Life Sciences, Southern University of Science and Technology, Shenzhen, China
| | - Yanping Long
- Institute of Plant and Food Science, Department of Biology, School of Life Sciences, Southern University of Science and Technology, Shenzhen, China
| | - Tong Li
- State Key Laboratory of Plant Genomics and National Center for Plant Gene Research, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, China
| | - Xiaofeng Cao
- State Key Laboratory of Plant Genomics and National Center for Plant Gene Research, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Xian Deng
- State Key Laboratory of Plant Genomics and National Center for Plant Gene Research, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, China.
| | - Jixian Zhai
- Institute of Plant and Food Science, Department of Biology, School of Life Sciences, Southern University of Science and Technology, Shenzhen, China.
| |
Collapse
|
50
|
Hallast P, Ebert P, Loftus M, Yilmaz F, Audano PA, Logsdon GA, Bonder MJ, Zhou W, Höps W, Kim K, Li C, Hoyt SJ, Dishuck PC, Porubsky D, Tsetsos F, Kwon JY, Zhu Q, Munson KM, Hasenfeld P, Harvey WT, Lewis AP, Kordosky J, Hoekzema K, O'Neill RJ, Korbel JO, Tyler-Smith C, Eichler EE, Shi X, Beck CR, Marschall T, Konkel MK, Lee C. Assembly of 43 human Y chromosomes reveals extensive complexity and variation. Nature 2023; 621:355-364. [PMID: 37612510 PMCID: PMC10726138 DOI: 10.1038/s41586-023-06425-6] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2022] [Accepted: 07/11/2023] [Indexed: 08/25/2023]
Abstract
The prevalence of highly repetitive sequences within the human Y chromosome has prevented its complete assembly to date1 and led to its systematic omission from genomic analyses. Here we present de novo assemblies of 43 Y chromosomes spanning 182,900 years of human evolution and report considerable diversity in size and structure. Half of the male-specific euchromatic region is subject to large inversions with a greater than twofold higher recurrence rate compared with all other chromosomes2. Ampliconic sequences associated with these inversions show differing mutation rates that are sequence context dependent, and some ampliconic genes exhibit evidence for concerted evolution with the acquisition and purging of lineage-specific pseudogenes. The largest heterochromatic region in the human genome, Yq12, is composed of alternating repeat arrays that show extensive variation in the number, size and distribution, but retain a 1:1 copy-number ratio. Finally, our data suggest that the boundary between the recombining pseudoautosomal region 1 and the non-recombining portions of the X and Y chromosomes lies 500 kb away from the currently established1 boundary. The availability of fully sequence-resolved Y chromosomes from multiple individuals provides a unique opportunity for identifying new associations of traits with specific Y-chromosomal variants and garnering insights into the evolution and function of complex regions of the human genome.
Collapse
Affiliation(s)
- Pille Hallast
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Peter Ebert
- Institute for Medical Biometry and Bioinformatics, Medical Faculty, Heinrich Heine University, Düsseldorf, Germany
- Core Unit Bioinformatics, Medical Faculty, Heinrich Heine University, Düsseldorf, Germany
- Center for Digital Medicine, Heinrich Heine University, Düsseldorf, Germany
| | - Mark Loftus
- Department of Genetics & Biochemistry, Clemson University, Clemson, SC, USA
- Center for Human Genetics, Clemson University, Greenwood, SC, USA
| | - Feyza Yilmaz
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Peter A Audano
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Glennis A Logsdon
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Marc Jan Bonder
- Division of Computational Genomics and Systems Genetics, German Cancer Research Center (DKFZ), Heidelberg, Germany
- Department of Genetics, University Medical Center Groningen, University of Groningen, Groningen, The Netherlands
| | - Weichen Zhou
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI, USA
| | - Wolfram Höps
- Genome Biology Unit, European Molecular Biology Laboratory (EMBL), Heidelberg, Germany
| | - Kwondo Kim
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Chong Li
- Department of Computer and Information Sciences, Temple University, Philadelphia, PA, USA
| | - Savannah J Hoyt
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, USA
| | - Philip C Dishuck
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - David Porubsky
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Fotios Tsetsos
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Jee Young Kwon
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Qihui Zhu
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Katherine M Munson
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Patrick Hasenfeld
- Genome Biology Unit, European Molecular Biology Laboratory (EMBL), Heidelberg, Germany
| | - William T Harvey
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Alexandra P Lewis
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Jennifer Kordosky
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Kendra Hoekzema
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Rachel J O'Neill
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, USA
- Institute for Systems Genomics, University of Connecticut, Storrs, CT, USA
- The University of Connecticut Health Center, Farmington, CT, USA
| | - Jan O Korbel
- Genome Biology Unit, European Molecular Biology Laboratory (EMBL), Heidelberg, Germany
| | | | - Evan E Eichler
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
- Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA
| | - Xinghua Shi
- Department of Computer and Information Sciences, Temple University, Philadelphia, PA, USA
| | - Christine R Beck
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
- Institute for Systems Genomics, University of Connecticut, Storrs, CT, USA
- The University of Connecticut Health Center, Farmington, CT, USA
| | - Tobias Marschall
- Institute for Medical Biometry and Bioinformatics, Medical Faculty, Heinrich Heine University, Düsseldorf, Germany
- Center for Digital Medicine, Heinrich Heine University, Düsseldorf, Germany
| | - Miriam K Konkel
- Department of Genetics & Biochemistry, Clemson University, Clemson, SC, USA
- Center for Human Genetics, Clemson University, Greenwood, SC, USA
| | - Charles Lee
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA.
| |
Collapse
|