1
|
Deng W, Citu C, Liu A, Zhao Z. Dynamic dysregulation of retrotransposons in neurodegenerative diseases at the single-cell level. Genome Res 2024; 34:1687-1699. [PMID: 39424325 PMCID: PMC11529867 DOI: 10.1101/gr.279363.124] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2024] [Accepted: 09/18/2024] [Indexed: 10/21/2024]
Abstract
Retrotransposable elements (RTEs) are common mobile genetic elements comprising ∼42% of the human genome. RTEs play critical roles in gene regulation and function, but how they are specifically involved in complex diseases is largely unknown. Here, we investigate the cellular heterogeneity of RTEs using 12 single-cell transcriptome profiles covering three neurodegenerative diseases, Alzheimer's disease (AD), Parkinson's disease, and multiple sclerosis. We identify cell type marker RTEs in neurons, astrocytes, oligodendrocytes, and oligodendrocyte precursor cells that are related to these diseases. The differential expression analysis reveals the landscape of dysregulated RTE expression, especially L1s, in excitatory neurons of multiple neurodegenerative diseases. Machine learning algorithms for predicting cell disease stage using a combination of RTE and gene expression features suggests dynamic regulation of RTEs in AD. Furthermore, we construct a single-cell atlas of retrotransposable elements in neurodegenerative disease (scARE) using these data sets and features. scARE has six feature analysis modules to explore RTE dynamics in a user-defined condition. To our knowledge, scARE represents the first systematic investigation of RTE dynamics at the single-cell level within the context of neurodegenerative diseases.
Collapse
Affiliation(s)
- Wankun Deng
- Center for Precision Health, McWilliams School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, Texas 77030, USA
| | - Citu Citu
- Center for Precision Health, McWilliams School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, Texas 77030, USA
| | - Andi Liu
- Center for Precision Health, McWilliams School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, Texas 77030, USA
| | - Zhongming Zhao
- Center for Precision Health, McWilliams School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, Texas 77030, USA;
- MD Anderson Cancer Center UTHealth Graduate School of Biomedical Sciences, Houston, Texas 77030, USA
- Faillace Department of Psychiatry and Behavioral Sciences, McGovern Medical School, The University of Texas Health Science Center at Houston, Houston, Texas 77030, USA
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, Tennessee 37203, USA
| |
Collapse
|
2
|
Wang R, Zheng Y, Zhang Z, Song K, Wu E, Zhu X, Wu TP, Ding J. MATES: a deep learning-based model for locus-specific quantification of transposable elements in single cell. Nat Commun 2024; 15:8798. [PMID: 39394211 PMCID: PMC11470080 DOI: 10.1038/s41467-024-53114-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2023] [Accepted: 09/24/2024] [Indexed: 10/13/2024] Open
Abstract
Transposable elements (TEs) are crucial for genetic diversity and gene regulation. Current single-cell quantification methods often align multi-mapping reads to either 'best-mapped' or 'random-mapped' locations and categorize them at the subfamily levels, overlooking the biological necessity for accurate, locus-specific TE quantification. Moreover, these existing methods are primarily designed for and focused on transcriptomics data, which restricts their adaptability to single-cell data of other modalities. To address these challenges, here we introduce MATES, a deep-learning approach that accurately allocates multi-mapping reads to specific loci of TEs, utilizing context from adjacent read alignments flanking the TE locus. When applied to diverse single-cell omics datasets, MATES shows improved performance over existing methods, enhancing the accuracy of TE quantification and aiding in the identification of marker TEs for identified cell populations. This development facilitates the exploration of single-cell heterogeneity and gene regulation through the lens of TEs, offering an effective transposon quantification tool for the single-cell genomics community.
Collapse
Affiliation(s)
- Ruohan Wang
- School of Computer Science, McGill University, Montreal, Quebec, Canada
- Meakins-Christie Laboratories, Translational Research in Respiratory Diseases Program, Research Institute of the McGill University Health Centre, Montreal, Quebec, Canada
- Department of Medicine, McGill University, Montreal, Quebec, Canada
| | - Yumin Zheng
- Meakins-Christie Laboratories, Translational Research in Respiratory Diseases Program, Research Institute of the McGill University Health Centre, Montreal, Quebec, Canada
- Department of Medicine, McGill University, Montreal, Quebec, Canada
- Quantitative Life Sciences, Faculty of Medicine & Health Sciences, McGill University, Montreal, Quebec, Canada
| | - Zijian Zhang
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
| | - Kailu Song
- Meakins-Christie Laboratories, Translational Research in Respiratory Diseases Program, Research Institute of the McGill University Health Centre, Montreal, Quebec, Canada
- Department of Medicine, McGill University, Montreal, Quebec, Canada
- Quantitative Life Sciences, Faculty of Medicine & Health Sciences, McGill University, Montreal, Quebec, Canada
| | - Erxi Wu
- Department of Neurosurgery, Baylor College of Medicine, Temple, TX, USA
- College of Medicine and Irma Lerma Rangel College of Pharmacy, Texas A&M University, College Station, TX, USA
- LIVESTRONG Cancer Institutes and Department of Oncology, Dell Medical School, The University of Texas at Austin, Austin, TX, USA
- Neuroscience Institute and Department of Neurosurgery, Baylor Scott & White Health, Temple, TX, USA
| | | | - Tao P Wu
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA.
| | - Jun Ding
- School of Computer Science, McGill University, Montreal, Quebec, Canada.
- Meakins-Christie Laboratories, Translational Research in Respiratory Diseases Program, Research Institute of the McGill University Health Centre, Montreal, Quebec, Canada.
- Department of Medicine, McGill University, Montreal, Quebec, Canada.
- Quantitative Life Sciences, Faculty of Medicine & Health Sciences, McGill University, Montreal, Quebec, Canada.
- Mila-Quebec AI Institue, Montreal, Quebec, Canada.
| |
Collapse
|
3
|
Vendrell X, de Castro P, Escrich L, Grau N, Gonzalez-Martin R, Quiñonero A, Escribá MJ, Domínguez F. Longitudinal profiling of human androgenotes through single-cell analysis unveils paternal gene expression dynamics in early embryo development. Hum Reprod 2024; 39:1186-1196. [PMID: 38622061 PMCID: PMC11145015 DOI: 10.1093/humrep/deae072] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2023] [Revised: 03/12/2024] [Indexed: 04/17/2024] Open
Abstract
STUDY QUESTION How do transcriptomics vary in haploid human androgenote embryos at single cell level in the first four cell cycles of embryo development? SUMMARY ANSWER Gene expression peaks at the fourth cell cycle, however some androcytes exhibit unique transcriptional behaviors. WHAT IS KNOWN ALREADY The developmental potential of an embryo is determined by the competence of the oocyte and the sperm. However, studies of the contribution of the paternal genome using pure haploid androgenotes are very scarce. STUDY DESIGN, SIZE, DURATION This study was performed analyzing the single-cell transcriptomic sequencing of 38 androcytes obtained from 10 androgenote bioconstructs previously produced in vitro (de Castro et al., 2023). These results were analyzed through different bioinformatics software such as g: Profiler, GSEA, Cytoscape, and Reactome. PARTICIPANTS/MATERIALS, SETTING, METHODS Single cell sequencing was used to obtain the transcriptomic profiles of the different androcytes. The results obtained were compared between the different cycles studied using the DESeq2 program and functional enrichment pathways using g: Profiler, Cytoscape, and Reactome. MAIN RESULTS AND THE ROLE OF CHANCE A wave of paternally driven transcriptomic activation was found during the third-cell cycle, with 1128 upregulated and 225 downregulated genes and the fourth-cell cycle, with 1373 upregulated and 286 downregulated genes, compared to first-cell cycle androcytes. Differentially expressed routes related to cell differentiation, DNA-binding transcription, RNA biosynthesis and RNA polymerase II transcription regulatory complex, and cell death were found in the third and fourth with respect to the first-cell cycle. Conversely, in the fourth cell cycle, 153 downregulated and 332 upregulated genes were found compared with third cell cycle, associated with differentially expressed processes related to E-box binding and zinc finger protein 652 (ZNF652) transcription factor. Further, significant overexpression of LEUTX, PRAMEF1, DUXA, RFPL4A, TRIM43, and ZNF675 found in androgenotes, compared to biparental embryos, highlights the paternal contributions to zygote genome activation. LARGE SCALE DATA All raw sequencing data are available through the Gene Expression Omnibus (GEO) under accessions number: GSE216501. LIMITATIONS, REASONS FOR CAUTION Extrapolation of biological events from uniparental constructs to biparental embryos should be done with caution. Maternal and paternal genomes do not act independently of each other in a natural condition. The absence of one genome may affect gene transcription of the other. In this sense, the haploid condition of the bioconstructs could mask the transcriptomic patterns of the single cells. WIDER IMPLICATIONS OF THE FINDINGS The results obtained demonstrated the level of involvement of the human paternal haploid genome in the early stages of embryo development as well as its evolution at the transcriptomic level, laying the groundwork for the use of these bioconstructs as reliable models to dispel doubts about the genetic role played by the paternal genome in the early cycles of embryo development. STUDY FUNDING/COMPETING INTEREST(S) This study was funded by Instituto de Salud Carlos III (ISCIII) through the project 'PI22/00924', co-funded by European Regional Development Fund (ERDF); 'A way to make Europe'. F.D. was supported by the Spanish Ministry of Economy and Competitiveness through the Miguel Servet program (CPII018/00002). M.J.E. was supported by Instituto de Salud Carlos III (PI19/00577 [M.J.E.]) and FI20/00086. P.dC. was supported by a predoctoral grant for training in research into health (PFIS PI19/00577) from the Instituto de Salud Carlos III. All authors declare having no conflict of interest with regard to this trial.
Collapse
Affiliation(s)
- X Vendrell
- Reproductive Genetics Department, Sistemas Genómicos-Synlab, Valencia, Spain
| | - P de Castro
- Research Department, IVIRMA Global Research Alliance, IVI Foundation—Reproductive Biology and Bioengineering in Human Reproduction, IIS La Fe Health Research, Valencia, Spain
| | - L Escrich
- Embryology Department, IVIRMA Valencia, Valencia, Spain
| | - N Grau
- Embryology Department, IVIRMA Valencia, Valencia, Spain
| | - R Gonzalez-Martin
- Research Department, IVIRMA Global Research Alliance, IVI Foundation—Reproductive Biology and Bioengineering in Human Reproduction, IIS La Fe Health Research, Valencia, Spain
| | - A Quiñonero
- Research Department, IVIRMA Global Research Alliance, IVI Foundation—Reproductive Biology and Bioengineering in Human Reproduction, IIS La Fe Health Research, Valencia, Spain
| | - M J Escribá
- Research Department, IVIRMA Global Research Alliance, IVI Foundation—Reproductive Biology and Bioengineering in Human Reproduction, IIS La Fe Health Research, Valencia, Spain
- Embryology Department, IVIRMA Valencia, Valencia, Spain
| | - F Domínguez
- Research Department, IVIRMA Global Research Alliance, IVI Foundation—Reproductive Biology and Bioengineering in Human Reproduction, IIS La Fe Health Research, Valencia, Spain
| |
Collapse
|
4
|
Li C, Qian Q, Yan C, Lu M, Li L, Li P, Fan Z, Lei W, Shang K, Wang P, Wang J, Lu T, Huang Y, Yang H, Wei H, Han J, Xiao J, Chen F. HervD Atlas: a curated knowledgebase of associations between human endogenous retroviruses and diseases. Nucleic Acids Res 2024; 52:D1315-D1326. [PMID: 37870452 PMCID: PMC10767980 DOI: 10.1093/nar/gkad904] [Citation(s) in RCA: 11] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2023] [Revised: 10/03/2023] [Accepted: 10/06/2023] [Indexed: 10/24/2023] Open
Abstract
Human endogenous retroviruses (HERVs), as remnants of ancient exogenous retrovirus infected and integrated into germ cells, comprise ∼8% of the human genome. These HERVs have been implicated in numerous diseases, and extensive research has been conducted to uncover their specific roles. Despite these efforts, a comprehensive source of HERV-disease association still needs to be added. To address this gap, we introduce the HervD Atlas (https://ngdc.cncb.ac.cn/hervd/), an integrated knowledgebase of HERV-disease associations manually curated from all related published literature. In the current version, HervD Atlas collects 60 726 HERV-disease associations from 254 publications (out of 4692 screened literature), covering 21 790 HERVs (21 049 HERV-Terms and 741 HERV-Elements) belonging to six types, 149 diseases and 610 related/affected genes. Notably, an interactive knowledge graph that systematically integrates all the HERV-disease associations and corresponding affected genes into a comprehensive network provides a powerful tool to uncover and deduce the complex interplay between HERVs and diseases. The HervD Atlas also features a user-friendly web interface that allows efficient browsing, searching, and downloading of all association information, research metadata, and annotation information. Overall, the HervD Atlas is an essential resource for comprehensive, up-to-date knowledge on HERV-disease research, potentially facilitating the development of novel HERV-associated diagnostic and therapeutic strategies.
Collapse
Affiliation(s)
- Cuidan Li
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China
| | - Qiheng Qian
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China
- National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Chenghao Yan
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Mingming Lu
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China
- National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China
| | - Lin Li
- State Key Laboratory of Pathogen and Biosecurity, Beijing Institute of Microbiology and Epidemiology, Beijing 100071, China
| | - Pan Li
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Zhuojing Fan
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China
- National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China
| | - Wenyan Lei
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Kang Shang
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Peihan Wang
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Jie Wang
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Tianyi Lu
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Yuting Huang
- State Key Laboratory of Elemento-Organic Chemistry, College of Chemistry, Nankai University, Tianjin 300071, China
| | - Hongwei Yang
- State Key Laboratory of Elemento-Organic Chemistry, College of Chemistry, Nankai University, Tianjin 300071, China
| | - Haobin Wei
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Jingwan Han
- State Key Laboratory of Pathogen and Biosecurity, Beijing Institute of Microbiology and Epidemiology, Beijing 100071, China
| | - Jingfa Xiao
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China
- National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Fei Chen
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
- Beijing Key Laboratory of Genome and Precision Medicine Technologies, Beijing100101, China
| |
Collapse
|