1
|
Eskier D, Arıbaş A, Karakülah G. PlanTEnrichment: A How-to Guide on Rapid Identification of Transposable Elements Associated with Regions of Interest in Select Plant Genomes. Methods Mol Biol 2023; 2703:59-70. [PMID: 37646937 DOI: 10.1007/978-1-0716-3389-2_5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/01/2023]
Abstract
Transposable elements (TEs) are repeat elements that can relocate or create novel copies of themselves in the genome and contribute to genomic complexity and expansion, via events such as chromosome recombination or regulation of gene expression. However, given the large number of such repeats across the genome, identifying repeats of interest can be a challenge in even well-annotated genomes, especially in more complex, TE-rich plant genomes. Here, we describe a protocol for PlanTEnrichment, a database we created comprising information on 11 plant genomes to analyze stress-associated TEs using publicly available data. By selecting a genome and providing a list of genes or genomic regions whose TE associations the user wants to identify, the user can rapidly obtain TE subfamilies found near the provided regions, as well as their superfamily and class, and the enrichment values of the repeats. The results also provide the locations of individual repeat instances found, alongside the input regions or genes they are associated with, and a bar graph of the top ten most significant repeat subfamilies identified. PlanTEnrichment is freely available at http://tools.ibg.deu.edu.tr/plantenrichment/ and can be used by researchers with rudimentary or no proficiency in computational analysis of TE elements, allowing for expedience in the identification of TEs of interest and helping further our understanding of the potential contributions of TEs in plant genomes.
Collapse
Affiliation(s)
- Doğa Eskier
- İzmir International Biomedicine and Genome Institute, Dokuz Eylül University, İnciraltı, İzmir, Turkey
- Bioinformatics Platform, İzmir Biomedicine and Genome Center (IBG), İnciraltı, İzmir, Turkey
| | - Alirıza Arıbaş
- Bioinformatics Platform, İzmir Biomedicine and Genome Center (IBG), İnciraltı, İzmir, Turkey
| | - Gökhan Karakülah
- İzmir International Biomedicine and Genome Institute, Dokuz Eylül University, İnciraltı, İzmir, Turkey.
- Bioinformatics Platform, İzmir Biomedicine and Genome Center (IBG), İnciraltı, İzmir, Turkey.
| |
Collapse
|
2
|
Li C, Zhang Y, Leng L, Pan X, Zhao D, Li X, Huang J, Bolund L, Lin G, Luo Y, Xu F. The single-cell expression profile of transposable elements and transcription factors in human early biparental and uniparental embryonic development. Front Cell Dev Biol 2022; 10:1020490. [PMID: 36438554 PMCID: PMC9691860 DOI: 10.3389/fcell.2022.1020490] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2022] [Accepted: 10/17/2022] [Indexed: 10/24/2023] Open
Abstract
Transposable elements (TEs) and transcription factors (TFs) are involved in the precise regulation of gene expression during the preimplantation stage. Activation of TEs is a key event for mammalian embryonic genome activation and preimplantation early embryonic development. TFs are involved in the regulation of drastic changes in gene expression patterns, but an inventory of the interplay between TEs and TFs during normal/abnormal human embryonic development is still lacking. Here we used single-cell RNA sequencing data generated from biparental and uniparental embryos to perform an integrative analysis of TE and TF expression. Our results showed that endogenous retroviruses (ERVs) are mainly expressed during the minor embryonic genome activation (EGA) process of early embryos, while Alu is gradually expressed in the middle and later stages. Some important ERVs (e.g., LTR5_Hs, MLT2A1) and Alu TEs are expressed at significantly lower levels in androgenic embryos. Integrative analysis revealed that the expression of the transcription factors CTCF and POU5F1 is correlated with the differential expression of ERV TEs. Comparative coexpression network analysis further showed distinct expression levels of important TFs (e.g., LEUTX and ZSCAN5A) in dizygotic embryos vs. parthenogenetic and androgenic embryos. This systematic investigation of TE and TF expression in human early embryonic development by single-cell RNA sequencing provides valuable insights into mammalian embryonic development.
Collapse
Affiliation(s)
- Conghui Li
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China
- BGI-Shenzhen, Beishan Industrial Zone, Shenzhen, China
- Qingdao-Europe Advanced Institute for Life Sciences, BGI-Shenzhen, Qingdao, China
- Lars Bolund Institute of Regenerative Medicine, BGI-Qingdao, BGI-Shenzhen, Qingdao, China
- China National GeneBank, BGI-Shenzhen, Shenzhen, China
- Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Yue Zhang
- BGI-Qingdao, BGI-Shenzhen, Qingdao, Shandong, China
| | - Lizhi Leng
- Institute of Reproductive and Stem Cell Engineering, School of Basic Medical Science, Central South University, Changsha, China
- Key Laboratory of Reproductive and Stem Cells Engineering, Ministry of Health, Changsha, China
- Reproductive & Genetic Hospital of CITIC-Xiangya, Changsha, China
| | - Xiaoguang Pan
- Qingdao-Europe Advanced Institute for Life Sciences, BGI-Shenzhen, Qingdao, China
- Lars Bolund Institute of Regenerative Medicine, BGI-Qingdao, BGI-Shenzhen, Qingdao, China
| | - Depeng Zhao
- Department of Reproductive Medicine, Affiliated Shenzhen Maternity and Child Healthcare Hospital, Southern Medical University, Shenzhen, China
| | - Xuemei Li
- Department of Reproductive Medicine, Affiliated Shenzhen Maternity and Child Healthcare Hospital, Southern Medical University, Shenzhen, China
| | - Jinrong Huang
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China
- BGI-Shenzhen, Beishan Industrial Zone, Shenzhen, China
- Qingdao-Europe Advanced Institute for Life Sciences, BGI-Shenzhen, Qingdao, China
- Lars Bolund Institute of Regenerative Medicine, BGI-Qingdao, BGI-Shenzhen, Qingdao, China
- China National GeneBank, BGI-Shenzhen, Shenzhen, China
- Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Lars Bolund
- BGI-Shenzhen, Beishan Industrial Zone, Shenzhen, China
- Qingdao-Europe Advanced Institute for Life Sciences, BGI-Shenzhen, Qingdao, China
- Lars Bolund Institute of Regenerative Medicine, BGI-Qingdao, BGI-Shenzhen, Qingdao, China
- China National GeneBank, BGI-Shenzhen, Shenzhen, China
- Department of Biomedicine, Aarhus University, Aarhus, Denmark
- Steno Diabetes Center Aarhus, Aarhus University Hospital, Aarhus, Denmark
| | - Ge Lin
- Institute of Reproductive and Stem Cell Engineering, School of Basic Medical Science, Central South University, Changsha, China
- Key Laboratory of Reproductive and Stem Cells Engineering, Ministry of Health, Changsha, China
- Reproductive & Genetic Hospital of CITIC-Xiangya, Changsha, China
| | - Yonglun Luo
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China
- BGI-Shenzhen, Beishan Industrial Zone, Shenzhen, China
- Qingdao-Europe Advanced Institute for Life Sciences, BGI-Shenzhen, Qingdao, China
- Lars Bolund Institute of Regenerative Medicine, BGI-Qingdao, BGI-Shenzhen, Qingdao, China
- China National GeneBank, BGI-Shenzhen, Shenzhen, China
- Department of Biomedicine, Aarhus University, Aarhus, Denmark
- Steno Diabetes Center Aarhus, Aarhus University Hospital, Aarhus, Denmark
| | - Fengping Xu
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China
- BGI-Shenzhen, Beishan Industrial Zone, Shenzhen, China
- Qingdao-Europe Advanced Institute for Life Sciences, BGI-Shenzhen, Qingdao, China
- Lars Bolund Institute of Regenerative Medicine, BGI-Qingdao, BGI-Shenzhen, Qingdao, China
- China National GeneBank, BGI-Shenzhen, Shenzhen, China
- BGI Cell, BGI-Shenzhen, Shenzhen, China
| |
Collapse
|
3
|
Eskier D, Karakülah G. In Silico Identification of Stress-Associated Transposable Elements in Arabidopsis thaliana Using Public Transcriptome Data. Methods Mol Biol 2021; 2250:15-30. [PMID: 33900589 DOI: 10.1007/978-1-0716-1134-0_2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Transposable elements (TEs) have been associated with stress response in many plants, making them a key target of study. However, the high variability, genomic repeat-heavy nature, and widely noncoding character of TEs have made them difficult to study using non-specialized methods, whether experimental or computational. In this chapter, we introduce two computational workflows to analyze transposable elements using publicly available transcriptome data. In the first of these methods, we identify TEs, which show differential expression under salt stress using sample transcriptome libraries that includes noncoding transcripts. In the second, we identify protein-coding genes with differential expression under the same conditions, and determine which TEs are enriched in the promoter regions of these stress-related genes.
Collapse
Affiliation(s)
- Doğa Eskier
- İzmir International Biomedicine and Genome Institute (iBG-İzmir), Dokuz Eylül University, İnciralti, İzmir, Turkey
- İzmir Biomedicine and Genome Center (IBG), İnciralti, İzmir, Turkey
| | - Gökhan Karakülah
- İzmir International Biomedicine and Genome Institute (iBG-İzmir), Dokuz Eylül University, İnciralti, İzmir, Turkey.
- İzmir Biomedicine and Genome Center (IBG), İnciralti, İzmir, Turkey.
| |
Collapse
|
4
|
Karakülah G, Arslan N, Yandım C, Suner A. TEffectR: an R package for studying the potential effects of transposable elements on gene expression with linear regression model. PeerJ 2019; 7:e8192. [PMID: 31824778 PMCID: PMC6899341 DOI: 10.7717/peerj.8192] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2019] [Accepted: 11/11/2019] [Indexed: 01/24/2023] Open
Abstract
Introduction Recent studies highlight the crucial regulatory roles of transposable elements (TEs) on proximal gene expression in distinct biological contexts such as disease and development. However, computational tools extracting potential TE -proximal gene expression associations from RNA-sequencing data are still missing. Implementation Herein, we developed a novel R package, using a linear regression model, for studying the potential influence of TE species on proximal gene expression from a given RNA-sequencing data set. Our R package, namely TEffectR, makes use of publicly available RepeatMasker TE and Ensembl gene annotations as well as several functions of other R-packages. It calculates total read counts of TEs from sorted and indexed genome aligned BAM files provided by the user, and determines statistically significant relations between TE expression and the transcription of nearby genes under diverse biological conditions. Availability TEffectR is freely available at https://github.com/karakulahg/TEffectR along with a handy tutorial as exemplified by the analysis of RNA-sequencing data including normal and tumour tissue specimens obtained from breast cancer patients.
Collapse
Affiliation(s)
- Gökhan Karakülah
- Izmir Biomedicine and Genome Center, Izmir, Turkey.,Izmir International Biomedicine and Genome Institute, Dokuz Eylül University, Izmir, Turkey
| | | | - Cihangir Yandım
- Izmir Biomedicine and Genome Center, Izmir, Turkey.,Department of Genetics and Bioengineering, Faculty of Engineering, Izmir University of Economics, Izmir, Turkey
| | - Aslı Suner
- Department of Biostatistics and Medical Informatics, Faculty of Medicine, Ege University, Izmir, Turkey
| |
Collapse
|