1
|
Yan Y, Tian Y, Wu Z, Zhang K, Yang R. Interchromosomal Colocalization with Parental Genes Is Linked to the Function and Evolution of Mammalian Retrocopies. Mol Biol Evol 2023; 40:msad265. [PMID: 38060983 PMCID: PMC10733166 DOI: 10.1093/molbev/msad265] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2023] [Revised: 10/25/2023] [Accepted: 11/29/2023] [Indexed: 12/22/2023] Open
Abstract
Retrocopies are gene duplicates arising from reverse transcription of mature mRNA transcripts and their insertion back into the genome. While long being regarded as processed pseudogenes, more and more functional retrocopies have been discovered. How the stripped-down retrocopies recover expression capability and become functional paralogs continually intrigues evolutionary biologists. Here, we investigated the function and evolution of retrocopies in the context of 3D genome organization. By mapping retrocopy-parent pairs onto sequencing-based and imaging-based chromatin contact maps in human and mouse cell lines and onto Hi-C interaction maps in 5 other mammals, we found that retrocopies and their parental genes show a higher-than-expected interchromosomal colocalization frequency. The spatial interactions between retrocopies and parental genes occur frequently at loci in active subcompartments and near nuclear speckles. Accordingly, colocalized retrocopies are more actively transcribed and translated and are more evolutionarily conserved than noncolocalized ones. The active transcription of colocalized retrocopies may result from their permissive epigenetic environment and shared regulatory elements with parental genes. Population genetic analysis of retroposed gene copy number variants in human populations revealed that retrocopy insertions are not entirely random in regard to interchromosomal interactions and that colocalized retroposed gene copy number variants are more likely to reach high frequencies, suggesting that both insertion bias and natural selection contribute to the colocalization of retrocopy-parent pairs. Further dissection implies that reduced selection efficacy, rather than positive selection, contributes to the elevated allele frequency of colocalized retroposed gene copy number variants. Overall, our results hint a role of interchromosomal colocalization in the "resurrection" of initially neutral retrocopies.
Collapse
Affiliation(s)
- Yubin Yan
- College of Life Sciences, Northwest A&F University, Yangling, Shaanxi, China
| | - Yuhan Tian
- College of Life Sciences, Northwest A&F University, Yangling, Shaanxi, China
| | - Zefeng Wu
- College of Life Sciences, Northwest A&F University, Yangling, Shaanxi, China
| | - Kunling Zhang
- College of Life Sciences, Northwest A&F University, Yangling, Shaanxi, China
| | - Ruolin Yang
- College of Life Sciences, Northwest A&F University, Yangling, Shaanxi, China
| |
Collapse
|
2
|
Ma H, Wang M, Zhang YE, Tan S. The power of "controllers": Transposon-mediated duplicated genes evolve towards neofunctionalization. J Genet Genomics 2023; 50:462-472. [PMID: 37068629 DOI: 10.1016/j.jgg.2023.04.003] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2022] [Revised: 04/04/2023] [Accepted: 04/05/2023] [Indexed: 04/19/2023]
Abstract
Since the discovery of the first transposon by Dr. Barbara McClintock, the prevalence and diversity of transposable elements (TEs) have been gradually recognized. As fundamental genetic components, TEs drive organismal evolution not only by contributing functional sequences (e.g., regulatory elements or "controllers" as phrased by Dr. McClintock) but also by shuffling genomic sequences. In the latter respect, TE-mediated gene duplications have contributed to the origination of new genes and attracted extensive interest. In response to the development of this field, we herein attempt to provide an overview of TE-mediated duplication by focusing on common rules emerging across duplications generated by different TE types. Specifically, despite the huge divergence of transposition machinery across TEs, we identify three common features of various TE-mediated duplication mechanisms, including end bypass, template switching, and recurrent transposition. These three features lead to one common functional outcome, namely, TE-mediated duplicates tend to be subjected to exon shuffling and neofunctionalization. Therefore, the intrinsic properties of the mutational mechanism constrain the evolutionary trajectories of these duplicates. We finally discuss the future of this field including an in-depth characterization of both the duplication mechanisms and functions of TE-mediated duplicates.
Collapse
Affiliation(s)
- Huijing Ma
- Key Laboratory of Zoological Systematics and Evolution & State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
| | - Mengxia Wang
- Key Laboratory of Zoological Systematics and Evolution & State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Yong E Zhang
- Key Laboratory of Zoological Systematics and Evolution & State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China; CAS Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, Yunnan 650223, China; Chinese Institute for Brain Research, Beijing 102206, China.
| | - Shengjun Tan
- Key Laboratory of Zoological Systematics and Evolution & State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China.
| |
Collapse
|
3
|
Mustafin RN, Khusnutdinova E. Perspective for Studying the Relationship of miRNAs with Transposable Elements. Curr Issues Mol Biol 2023; 45:3122-3145. [PMID: 37185728 PMCID: PMC10136691 DOI: 10.3390/cimb45040204] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2023] [Revised: 03/07/2023] [Accepted: 04/03/2023] [Indexed: 05/17/2023] Open
Abstract
Transposable elements are important sources of miRNA, long non-coding RNAs genes, and their targets in the composition of protein-coding genes in plants and animals. Therefore, the detection of expression levels of specific non-coding RNAs in various tissues and cells in normal and pathological conditions may indicate a programmed pattern of transposable elements' activation. This reflects the species-specific composition and distribution of transposable elements in genomes, which underlie gene regulation in every cell division, including during aging. TEs' expression is also regulated by epigenetic factors (DNA methylation, histone modifications), SIRT6, cytidine deaminases APOBEC3, APOBEC1, and other catalytic proteins, such as ERCC, TREX1, RB1, HELLS, and MEGP2. In evolution, protein-coding genes and their regulatory elements are derived from transposons. As part of non-coding regions and introns of genes, they are sensors for transcriptional and post-transcriptional control of expression, using miRNAs and long non-coding RNAs, that arose from transposable elements in evolution. Methods (Orbld, ncRNAclassifier) and databases have been created for determining the occurrence of miRNAs from transposable elements in plants (PlanTE-MIR DB, PlaNC-TE), which can be used to design epigenetic gene networks in ontogenesis. Based on the data accumulated in the scientific literature, the presence of 467 transposon-derived miRNA genes in the human genome has been reliably established. It was proposed to create an updated and controlled online bioinformatics database of miRNAs derived from transposable elements in healthy individuals, as well as expression changes of these miRNAs during aging and various diseases, such as cancer and difficult-to-treat diseases. The use of the information obtained can open new horizons in the management of tissue and organ differentiation to aging slow down. In addition, the created database could become the basis for clarifying the mechanisms of pathogenesis of various diseases (imbalance in the activity of transposable elements, reflected in changes in the expression of miRNAs) and designing their targeted therapy using specific miRNAs as targets. This article provides examples of the detection of transposable elements-derived miRNAs involved in the development of specific malignant neoplasms, aging, and idiopathic pulmonary fibrosis.
Collapse
Affiliation(s)
- Rustam Nailevich Mustafin
- Department of Medical Genetics and Fundamental Medicine, Bashkir State Medical University, 450008 Ufa, Russia
| | - Elza Khusnutdinova
- Ufa Federal Research Centre, Institute of Biochemistry and Genetics, Russian Academy of Sciences, 450054 Ufa, Russia
| |
Collapse
|
4
|
Amici DR, Cingoz H, Alasady MJ, Alhayek S, Phoumyvong CM, Sahni N, Yi SS, Mendillo ML. The HAPSTR2 retrogene buffers stress signaling and resilience in mammals. Nat Commun 2023; 14:152. [PMID: 36631436 PMCID: PMC9834230 DOI: 10.1038/s41467-022-35697-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2022] [Accepted: 12/20/2022] [Indexed: 01/12/2023] Open
Abstract
We recently identified HAPSTR1 (C16orf72) as a key component in a novel pathway which regulates the cellular response to molecular stressors, such as DNA damage, nutrient scarcity, and protein misfolding. Here, we identify a functional paralog to HAPSTR1: HAPSTR2. HAPSTR2 formed early in mammalian evolution, via genomic integration of a reverse transcribed HAPSTR1 transcript, and has since been preserved under purifying selection. HAPSTR2, expressed primarily in neural and germline tissues and a subset of cancers, retains established biochemical features of HAPSTR1 to achieve two functions. In normal physiology, HAPSTR2 directly interacts with HAPSTR1, markedly augmenting HAPSTR1 protein stability in a manner independent from HAPSTR1's canonical E3 ligase, HUWE1. Alternatively, in the context of HAPSTR1 loss, HAPSTR2 expression is sufficient to buffer stress signaling and resilience. Thus, we discover a mammalian retrogene which safeguards fitness.
Collapse
Affiliation(s)
- David R Amici
- Dept. of Biochemistry and Molecular Genetics, Northwestern University Feinberg School of Medicine, Chicago, IL, 60610, USA
- Simpson Querrey Center for Epigenetics, Northwestern University Feinberg School of Medicine, Chicago, IL, 60610, USA
- Robert H. Lurie Comprehensive Cancer Center, Northwestern University Feinberg School of Medicine, Chicago, IL, 60610, USA
| | - Harun Cingoz
- Dept. of Biochemistry and Molecular Genetics, Northwestern University Feinberg School of Medicine, Chicago, IL, 60610, USA
- Simpson Querrey Center for Epigenetics, Northwestern University Feinberg School of Medicine, Chicago, IL, 60610, USA
- Robert H. Lurie Comprehensive Cancer Center, Northwestern University Feinberg School of Medicine, Chicago, IL, 60610, USA
| | - Milad J Alasady
- Dept. of Biochemistry and Molecular Genetics, Northwestern University Feinberg School of Medicine, Chicago, IL, 60610, USA
- Simpson Querrey Center for Epigenetics, Northwestern University Feinberg School of Medicine, Chicago, IL, 60610, USA
- Robert H. Lurie Comprehensive Cancer Center, Northwestern University Feinberg School of Medicine, Chicago, IL, 60610, USA
| | - Sammy Alhayek
- Dept. of Biochemistry and Molecular Genetics, Northwestern University Feinberg School of Medicine, Chicago, IL, 60610, USA
- Simpson Querrey Center for Epigenetics, Northwestern University Feinberg School of Medicine, Chicago, IL, 60610, USA
- Robert H. Lurie Comprehensive Cancer Center, Northwestern University Feinberg School of Medicine, Chicago, IL, 60610, USA
| | - Claire M Phoumyvong
- Dept. of Biochemistry and Molecular Genetics, Northwestern University Feinberg School of Medicine, Chicago, IL, 60610, USA
- Simpson Querrey Center for Epigenetics, Northwestern University Feinberg School of Medicine, Chicago, IL, 60610, USA
- Robert H. Lurie Comprehensive Cancer Center, Northwestern University Feinberg School of Medicine, Chicago, IL, 60610, USA
| | - Nidhi Sahni
- Department of Epigenetics and Molecular Carcinogenesis, and Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, TX, 77030, USA
- Quantitative and Computational Biosciences Program, Baylor College of Medicine, Houston, TX, 77030, USA
| | - S Stephen Yi
- Livestrong Cancer Institutes, Department of Oncology, and Department of Biomedical Engineering, The University of Texas at Austin, Austin, TX, 78712, USA
- Interdisciplinary Life Sciences Graduate Programs (ILSGP), and Oden Institute for Computational Engineering and Sciences (ICES), The University of Texas at Austin, Austin, TX, 78712, USA
| | - Marc L Mendillo
- Dept. of Biochemistry and Molecular Genetics, Northwestern University Feinberg School of Medicine, Chicago, IL, 60610, USA.
- Simpson Querrey Center for Epigenetics, Northwestern University Feinberg School of Medicine, Chicago, IL, 60610, USA.
- Robert H. Lurie Comprehensive Cancer Center, Northwestern University Feinberg School of Medicine, Chicago, IL, 60610, USA.
| |
Collapse
|
5
|
Vaidya M, Smith J, Field M, Sugaya K. Analysis of regulatory sequences in exosomal DNA of NANOGP8. PLoS One 2023; 18:e0280959. [PMID: 36696426 PMCID: PMC9876286 DOI: 10.1371/journal.pone.0280959] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2022] [Accepted: 12/20/2022] [Indexed: 01/26/2023] Open
Abstract
Exosomes participate in intercellular communication by transporting functionally active molecules. Such cargo from the original cells comprising proteins, micro-RNA, mRNA, single-stranded (ssDNA) and double-stranded DNA (dsDNA) molecules pleiotropically transforms the target cells. Although cancer cells secrete exosomes carrying a significant level of DNA capable of modulating oncogene expression in a recipient cell, the regulatory mechanism is unknown. We have previously reported that cancer cells produce exosomes containing NANOGP8 DNA. NANOGP8 is an oncogenic paralog of embryonic stem cell transcription factor NANOG and does not express in cells since it is a pseudogene. However, in this study, we evaluated NANOGP8 expression in glioblastoma multiforme (GBM) tissue from a surgically removed brain tumor of a patient. Significantly higher NANOGP8 transcription was observed in GBM cancer stem cells (CSCs) than in GBM cancer cells or neural stem cells (NSCs), despite identical sequences of NANOGP8-upstream genomic region in all the cell lines. This finding suggests that upstream genomic sequences of NANOGP8 may have environment-dependent promoter activity. We also found that the regulatory sequences upstream of exosomal NANOGP8 GBM DNA contain multiple core promoter elements, transcription factor binding sites, and segments of human viruses known for their oncogenic role. The exosomal sequence of NANOGP8-upstream GBM DNA is different from corresponding genomic sequences in CSCs, cancer cells, and NSCs as well as from the sequences reported by NCBI. These sequence dissimilarities suggest that exosomal NANOGP8 GBM DNA may not be a part of the genomic DNA. Exosomes possibly acquire this DNA from other sources where it is synthesized by an unknown mechanism. The significance of exosome-bestowed regulatory elements in the transcription of promoter-less retrogene such as NANOGP8 remains to be determined.
Collapse
Affiliation(s)
- Manjusha Vaidya
- Burnett School of Biomedical Sciences, College of Medicine, University of Central Florida, Orlando, FL, United States of America
| | - Jonhoi Smith
- Burnett School of Biomedical Sciences, College of Medicine, University of Central Florida, Orlando, FL, United States of America
| | - Melvin Field
- Burnett School of Biomedical Sciences, College of Medicine, University of Central Florida, Orlando, FL, United States of America
- AdventHealth Cancer Institute, Orlando, FL, United States of America
| | - Kiminobu Sugaya
- Burnett School of Biomedical Sciences, College of Medicine, University of Central Florida, Orlando, FL, United States of America
- * E-mail:
| |
Collapse
|
6
|
Su Z, Ao J, Zhao F, Xu G, Chen H, Gao C. The roles of long non‑coding RNAs in renal cell carcinoma (Review). Mol Clin Oncol 2022; 18:4. [PMID: 36591597 PMCID: PMC9780631 DOI: 10.3892/mco.2022.2600] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2022] [Accepted: 10/07/2022] [Indexed: 11/29/2022] Open
Abstract
Long non-coding RNAs (lncRNAs) are involved in the gene expression regulation and usually play important roles in various human cancers, including the renal cell carcinoma (RCC). Dysregulation of certain lncRNAs are associated with the prognosis of patients with RCC. In the present review, several recently studied lncRNAs were discussed and their critical roles in proliferation, migration, invasion, apoptosis and drug resistance of renal cancer cells were revealed. The research on lncRNAs further increases our understanding on the development and progression of RCC. It is suggested that lncRNAs can be used as biomarkers or therapeutic targets for diagnosis or treatment of renal cancer.
Collapse
Affiliation(s)
- Zhengming Su
- Department of Urology, Key Laboratory of Biological Targeting Diagnosis, Therapy and Rehabilitation of Guangdong Higher Education Institutes, The Fifth Affiliated Hospital of Guangzhou Medical University, Guangzhou, Guangdong 510700, P.R. China
| | - Jian Ao
- Department of General Surgery, The Fifth Affiliated Hospital of Guangzhou Medical University, Guangzhou, Guangdong 510700, P.R. China
| | - Fengjin Zhao
- Department of Urology, Key Laboratory of Biological Targeting Diagnosis, Therapy and Rehabilitation of Guangdong Higher Education Institutes, The Fifth Affiliated Hospital of Guangzhou Medical University, Guangzhou, Guangdong 510700, P.R. China
| | - Guibin Xu
- Department of Urology, Key Laboratory of Biological Targeting Diagnosis, Therapy and Rehabilitation of Guangdong Higher Education Institutes, The Fifth Affiliated Hospital of Guangzhou Medical University, Guangzhou, Guangdong 510700, P.R. China
| | - Huihua Chen
- Department of Education Management Section, The Fifth Affiliated Hospital of Guangzhou Medical University, Guangzhou, Guangdong 510700, P.R. China,Correspondence to: Professor Huihua Chen, Department of Education Management Section, The Fifth Affiliated Hospital of Guangzhou Medical University, 621 Gangwan Road, Guangzhou, Guangdong 510700, P.R. China
| | - Chen Gao
- Department of Operating Room, Peking University Shenzhen Hospital, Shenzhen, Guangdong 518036, P.R. China,Correspondence to: Professor Huihua Chen, Department of Education Management Section, The Fifth Affiliated Hospital of Guangzhou Medical University, 621 Gangwan Road, Guangzhou, Guangdong 510700, P.R. China
| |
Collapse
|
7
|
Erdman VV, Karimov DD, Tuktarova IA, Timasheva YR, Nasibullin TR, Korytina GF. Alu Deletions in LAMA2 and CDH4 Genes Are Key Components of Polygenic Predictors of Longevity. Int J Mol Sci 2022; 23:13492. [PMID: 36362280 PMCID: PMC9657309 DOI: 10.3390/ijms232113492] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2022] [Revised: 10/27/2022] [Accepted: 10/31/2022] [Indexed: 10/18/2023] Open
Abstract
Longevity is a unique human phenomenon and a highly stable trait, characterized by polygenicity. The longevity phenotype occurs due to the ability to successfully withstand the age-related genomic instability triggered by Alu elements. The purpose of our cross-sectional study was to evaluate the combined contribution of ACE*Ya5ACE, CDH4*Yb8NBC516, COL13A1*Ya5ac1986, HECW1*Ya5NBC182, LAMA2*Ya5-MLS19, PLAT*TPA25, PKHD1L1*Yb8AC702, SEMA6A*Yb8NBC597, STK38L*Ya5ac2145 and TEAD1*Ya5ac2013 Alu elements to longevity. The study group included 2054 unrelated individuals aged from 18 to 113 years who are ethnic Tatars from Russia. We analyzed the dynamics of the allele and genotype frequencies of the studied Alu polymorphic loci in the age groups of young (18-44 years old), middle-aged (45-59 years old), elderly (60-74 years old), old seniors (75-89 years old) and long-livers (90-113 years old). Most significant changes in allele and genotype frequencies were observed between the long-livers and other groups. The search for polygenic predictors of longevity was performed using the APSampler program. Attaining longevity was associated with the combinations LAMA2*ID + CDH4*D (OR = 2.23, PBonf = 1.90 × 10-2) and CDH4*DD + LAMA2*ID + HECW1*D (OR = 4.58, PBonf = 9.00 × 10-3) among persons aged between 18 and 89 years, LAMA2*ID + CDH4*D + SEMA6A*I for individuals below 75 years of age (OR = 3.13, PBonf = 2.00 × 10-2), LAMA2*ID + HECW1*I for elderly people aged 60 and older (OR = 3.13, PBonf = 2.00 × 10-2) and CDH4*DD + LAMA2*D + HECW1*D (OR = 4.21, PBonf = 2.60 × 10-2) and CDH4*DD + LAMA2*D + ACE*I (OR = 3.68, PBonf = 1.90 × 10-2) among old seniors (75-89 years old). The key elements of combinations associated with longevity were the deletion alleles of CDH4 and LAMA2 genes. Our results point to the significance for human longevity of the Alu polymorphic loci in CDH4, LAMA2, HECW1, SEMA6A and ACE genes, involved in the integration systems.
Collapse
Affiliation(s)
- Vera V. Erdman
- Institute of Biochemistry and Genetics, Subdivision of the Ufa Federal Research Centre of the Russian Academy of Sciences, 450054 Ufa, Russia
| | - Denis D. Karimov
- Ufa Research Institute of Labor Medicine and Human Ecology, 450106 Ufa, Russia
| | - Ilsia A. Tuktarova
- Institute of Biochemistry and Genetics, Subdivision of the Ufa Federal Research Centre of the Russian Academy of Sciences, 450054 Ufa, Russia
| | - Yanina R. Timasheva
- Institute of Biochemistry and Genetics, Subdivision of the Ufa Federal Research Centre of the Russian Academy of Sciences, 450054 Ufa, Russia
| | - Timur R. Nasibullin
- Institute of Biochemistry and Genetics, Subdivision of the Ufa Federal Research Centre of the Russian Academy of Sciences, 450054 Ufa, Russia
| | - Gulnaz F. Korytina
- Institute of Biochemistry and Genetics, Subdivision of the Ufa Federal Research Centre of the Russian Academy of Sciences, 450054 Ufa, Russia
| |
Collapse
|
8
|
Zhang X, Hu Y, Smith DR. HSDatabase-a database of highly similar duplicate genes from plants, animals, and algae. Database (Oxford) 2022; 2022:6754190. [PMID: 36208223 PMCID: PMC9547538 DOI: 10.1093/database/baac086] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2022] [Revised: 08/16/2022] [Accepted: 09/20/2022] [Indexed: 11/30/2022]
Abstract
Gene duplication is an important evolutionary mechanism capable of providing new genetic material, which in some instances can help organisms adapt to various environmental conditions. Recent studies, for example, have indicated that highly similar duplicate genes (HSDs) are aiding adaptation to extreme conditions via gene dosage. However, for most eukaryotic genomes HSDs remain uncharacterized, partly because they can be hard to identify and categorize efficiently and effectively. Here, we collected and curated HSDs in nuclear genomes from various model animals, land plants and algae and indexed them in an online, open-access sequence repository called HSDatabase. Currently, this database contains 117 864 curated HSDs from 40 distinct genomes; it includes statistics on the total number of HSDs per genome as well as individual HSD copy numbers/lengths and provides sequence alignments of the duplicate gene copies. HSDatabase also allows users to download sequences of gene copies, access genome browsers, and link out to other databases, such as Pfam and Kyoto Encyclopedia of Genes and Genomes. What is more, a built-in Basic Local Alignment Search Tool option is available to conveniently explore potential homologous sequences of interest within and across species. HSDatabase has a user-friendly interface and provides easy access to the source data. It can be used on its own for comparative analyses of gene duplicates or in conjunction with HSDFinder, a newly developed bioinformatics tool for identifying, annotating, categorizing and visualizing HSDs. Database URL: http://hsdfinder.com/database/
Collapse
Affiliation(s)
- Xi Zhang
- Institute for Comparative Genomics, Dalhousie University, Halifax, Nova Scotia B3H 4R2, Canada.,Department of Biochemistry and Molecular Biology, Dalhousie University, Halifax, Nova Scotia B3H 4R2, Canada
| | - Yining Hu
- Department of Computer Science, University of Western Ontario, London, Ontario N6A 3K7, Canada
| | - David Roy Smith
- Department of Biology, University of Western Ontario, London, Ontario N6A 3K7, Canada
| |
Collapse
|
9
|
Batcher K, Varney S, Affolter VK, Friedenberg SG, Bannasch D. An SNN retrocopy insertion upstream of GPR22 is associated with dark red coat color in Poodles. G3 GENES|GENOMES|GENETICS 2022; 12:6680184. [PMID: 36047852 PMCID: PMC9635648 DOI: 10.1093/g3journal/jkac227] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/12/2022] [Accepted: 08/27/2022] [Indexed: 11/21/2022]
Abstract
Pigment production and distribution is controlled through multiple genes, resulting in a wide range of coat color phenotypes in dogs. Dogs that produce only the pheomelanin pigment vary in intensity from white to deep red. The Poodle breed has a wide range of officially recognized coat colors, including the pheomelanin-based white, cream, apricot, and red coat colors, which are not fully explained by the previously identified genetic variants involved in pigment intensity. Here, a genome-wide association study for pheomelanin intensity was performed in Poodles which identified an association on canine chromosome 18. Whole-genome sequencing data revealed an SNN retrocopy insertion (SNNL1) in apricot and red Poodles within the associated region on chromosome 18. While equal numbers of melanocytes were observed in all Poodle skin hair bulbs, higher melanin content was observed in the darker Poodles. Several genes involved in melanogenesis were also identified as highly overexpressed in red Poodle skin. The most differentially expressed gene however was GPR22, which was highly expressed in red Poodle skin while unexpressed in white Poodle skin (log2 fold change in expression 6.1, P < 0.001). GPR22 is an orphan G-protein-coupled receptor normally expressed exclusively in the brain and heart. The SNNL1 retrocopy inserted 2.8 kb upstream of GPR22 and is likely disrupting regulation of the gene, resulting in atypical expression in the skin. Thus, we identify the SNNL1 insertion as a candidate variant for the CFA18 pheomelanin intensity locus in red Poodles.
Collapse
Affiliation(s)
- Kevin Batcher
- Department of Population Health and Reproduction, University of California , Davis, Davis, CA 95616, USA
| | - Scarlett Varney
- Department of Population Health and Reproduction, University of California , Davis, Davis, CA 95616, USA
| | - Verena K Affolter
- Department of Pathology, Microbiology, & Immunology, University of California, Davis , Davis, CA 95616, USA
| | - Steven G Friedenberg
- Department of Veterinary Clinical Sciences, University of Minnesota , St Paul, MN 55455, USA
| | - Danika Bannasch
- Department of Population Health and Reproduction, University of California , Davis, Davis, CA 95616, USA
| |
Collapse
|
10
|
Rodriguez M, Makałowski W. Software evaluation for de novo detection of transposons. Mob DNA 2022; 13:14. [PMID: 35477485 PMCID: PMC9047281 DOI: 10.1186/s13100-022-00266-2] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2021] [Accepted: 03/16/2022] [Indexed: 11/16/2022] Open
Abstract
Transposable elements (TEs) are major genomic components in most eukaryotic genomes and play an important role in genome evolution. However, despite their relevance the identification of TEs is not an easy task and a number of tools were developed to tackle this problem. To better understand how they perform, we tested several widely used tools for de novo TE detection and compared their performance on both simulated data and well curated genomic sequences. As expected, tools that build TE-models performed better than k-mer counting ones, with RepeatModeler beating competitors in most datasets. However, there is a tendency for most tools to identify TE-regions in a fragmented manner and it is also frequent that small TEs or fragmented TEs are not detected. Consequently, the identification of TEs is still a challenging endeavor and it requires a significant manual curation by an experienced expert. The results will be helpful for identifying common issues associated with TE-annotation and for evaluating how comparable are the results obtained with different tools.
Collapse
Affiliation(s)
- Matias Rodriguez
- Institute of Bioinformatics, Faculty of Medicine, University of Münster, 48149, Münster, Germany
| | - Wojciech Makałowski
- Institute of Bioinformatics, Faculty of Medicine, University of Münster, 48149, Münster, Germany.
| |
Collapse
|
11
|
Rheinemann L, Downhour DM, Davenport KA, McKeown AN, Sundquist WI, Elde NC. Recurrent evolution of an inhibitor of ESCRT-dependent virus budding and LINE-1 retrotransposition in primates. Curr Biol 2022; 32:1511-1522.e6. [PMID: 35245459 PMCID: PMC9007875 DOI: 10.1016/j.cub.2022.02.018] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2021] [Revised: 12/20/2021] [Accepted: 02/03/2022] [Indexed: 12/15/2022]
Abstract
Most antiviral proteins recognize specific features of viruses. In contrast, the recently described antiviral factor retroCHMP3 interferes with the "host endosomal complexes required for transport" (ESCRT) pathway to inhibit the budding of enveloped viruses. RetroCHMP3 arose independently on multiple occasions via duplication and truncation of the gene encoding the ESCRT-III factor CHMP3. However, since the ESCRT pathway is essential for cellular membrane fission reactions, ESCRT inhibition is potentially cytotoxic. This raises fundamental questions about how hosts can repurpose core cellular functions into antiviral functions without incurring a fitness cost due to excess cellular toxicity. We reveal the evolutionary process of detoxification for retroCHMP3 in New World monkeys using a combination of ancestral reconstructions, cytotoxicity, and virus release assays. A duplicated, full-length copy of retroCHMP3 in the ancestors of New World monkeys provides modest inhibition of virus budding while exhibiting subtle cytotoxicity. Ancient retroCHMP3 then accumulated mutations that reduced cytotoxicity but preserved virus inhibition before a truncating stop codon arose in the more recent ancestors of squirrel monkeys, resulting in potent inhibition. In species where full-length copies of retroCHMP3 still exist, their artificial truncation generated potent virus-budding inhibitors with little cytotoxicity, revealing the potential for future antiviral defenses in modern species. In addition, we discovered that retroCHMP3 restricts LINE-1 retrotransposition, revealing how different challenges to genome integrity might explain multiple independent origins of retroCHMP3 in different species to converge on new immune functions.
Collapse
Affiliation(s)
- Lara Rheinemann
- Department of Biochemistry, University of Utah School of Medicine, 15 N Medical Drive East, Salt Lake City, UT 84112, USA
| | - Diane Miller Downhour
- Department of Human Genetics, University of Utah School of Medicine, 15 N 2030 E, Salt Lake City, UT 84112, USA
| | - Kristen A Davenport
- Department of Biochemistry, University of Utah School of Medicine, 15 N Medical Drive East, Salt Lake City, UT 84112, USA; Department of Human Genetics, University of Utah School of Medicine, 15 N 2030 E, Salt Lake City, UT 84112, USA
| | - Alesia N McKeown
- Department of Human Genetics, University of Utah School of Medicine, 15 N 2030 E, Salt Lake City, UT 84112, USA
| | - Wesley I Sundquist
- Department of Biochemistry, University of Utah School of Medicine, 15 N Medical Drive East, Salt Lake City, UT 84112, USA
| | - Nels C Elde
- Department of Human Genetics, University of Utah School of Medicine, 15 N 2030 E, Salt Lake City, UT 84112, USA; Howard Hughes Medical Institute, 4000 Jones Bridge Rd, Chevy Chase, MD 20815, USA.
| |
Collapse
|
12
|
Zhang X, Hu Y, Smith DR. HSDFinder: A BLAST-Based Strategy for Identifying Highly Similar Duplicated Genes in Eukaryotic Genomes. FRONTIERS IN BIOINFORMATICS 2021; 1:803176. [PMID: 36303740 PMCID: PMC9580922 DOI: 10.3389/fbinf.2021.803176] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2021] [Accepted: 11/25/2021] [Indexed: 01/01/2023] Open
Abstract
Gene duplication is an important evolutionary mechanism capable of providing new genetic material for adaptive and nonadaptive evolution. However, bioinformatics tools for identifying duplicate genes are often limited to the detection of paralogs in multiple species or to specific types of gene duplicates, such as retrocopies. Here, we present a user-friendly, BLAST-based web tool, called HSDFinder, which can identify, annotate, categorize, and visualize highly similar duplicate genes (HSDs) in eukaryotic nuclear genomes. HSDFinder includes an online heatmap plotting option, allowing users to compare HSDs among different species and visualize the results in different Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway functional categories. The external software requirements are BLAST, InterProScan, and KEGG. The utility of HSDFinder was tested on various model eukaryotic species, including Chlamydomonas reinhardtii, Arabidopsis thaliana, Oryza sativa, and Zea mays as well as the psychrophilic green alga Chlamydomonas sp. UWO241, and was proven to be a practical and accurate tool for gene duplication analyses. The web tool is free to use at http://hsdfinder.com. Documentation and tutorials can be found via the GitHub: https://github.com/zx0223winner/HSDFinder.
Collapse
Affiliation(s)
- Xi Zhang
- Department of Biochemistry and Molecular Biology, Dalhousie University, Halifax, NS, Canada
- Institute for Comparative Genomics, Dalhousie University, Halifax, NS, Canada
- *Correspondence: Xi Zhang, ; David Roy Smith,
| | - Yining Hu
- Department of Computer Science, Western University, London, ON, Canada
| | - David Roy Smith
- Department of Biology, Western University, London, ON, Canada
- *Correspondence: Xi Zhang, ; David Roy Smith,
| |
Collapse
|
13
|
Rheinemann L, Downhour DM, Bredbenner K, Mercenne G, Davenport KA, Schmitt PT, Necessary CR, McCullough J, Schmitt AP, Simon SM, Sundquist WI, Elde NC. RetroCHMP3 blocks budding of enveloped viruses without blocking cytokinesis. Cell 2021; 184:5419-5431.e16. [PMID: 34597582 PMCID: PMC8929533 DOI: 10.1016/j.cell.2021.09.008] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2020] [Revised: 05/03/2021] [Accepted: 09/03/2021] [Indexed: 11/26/2022]
Abstract
Many enveloped viruses require the endosomal sorting complexes required for transport (ESCRT) pathway to exit infected cells. This highly conserved pathway mediates essential cellular membrane fission events, which restricts the acquisition of adaptive mutations to counteract viral co-option. Here, we describe duplicated and truncated copies of the ESCRT-III factor CHMP3 that block ESCRT-dependent virus budding and arose independently in New World monkeys and mice. When expressed in human cells, these retroCHMP3 proteins potently inhibit release of retroviruses, paramyxoviruses, and filoviruses. Remarkably, retroCHMP3 proteins have evolved to reduce interactions with other ESCRT-III factors and have little effect on cellular ESCRT processes, revealing routes for decoupling cellular ESCRT functions from viral exploitation. The repurposing of duplicated ESCRT-III proteins thus provides a mechanism to generate broad-spectrum viral budding inhibitors without blocking highly conserved essential cellular ESCRT functions.
Collapse
Affiliation(s)
- Lara Rheinemann
- Department of Biochemistry, University of Utah School of Medicine, Salt Lake City, UT 84112, USA
| | - Diane Miller Downhour
- Department of Human Genetics, University of Utah School of Medicine, Salt Lake City, UT 84112, USA
| | - Kate Bredbenner
- Laboratory of Cellular Biophysics, Rockefeller University, New York, NY 10065, USA
| | - Gaelle Mercenne
- Department of Biochemistry, University of Utah School of Medicine, Salt Lake City, UT 84112, USA
| | - Kristen A Davenport
- Department of Biochemistry, University of Utah School of Medicine, Salt Lake City, UT 84112, USA; Department of Human Genetics, University of Utah School of Medicine, Salt Lake City, UT 84112, USA
| | - Phuong Tieu Schmitt
- Department of Veterinary and Biomedical Sciences, The Pennsylvania State University, University Park, PA 16802, USA
| | - Christina R Necessary
- Department of Human Genetics, University of Utah School of Medicine, Salt Lake City, UT 84112, USA
| | - John McCullough
- Department of Biochemistry, University of Utah School of Medicine, Salt Lake City, UT 84112, USA
| | - Anthony P Schmitt
- Department of Veterinary and Biomedical Sciences, The Pennsylvania State University, University Park, PA 16802, USA
| | - Sanford M Simon
- Laboratory of Cellular Biophysics, Rockefeller University, New York, NY 10065, USA.
| | - Wesley I Sundquist
- Department of Biochemistry, University of Utah School of Medicine, Salt Lake City, UT 84112, USA.
| | - Nels C Elde
- Department of Human Genetics, University of Utah School of Medicine, Salt Lake City, UT 84112, USA.
| |
Collapse
|
14
|
Choi IS, Wojciechowski MF, Ruhlman TA, Jansen RK. In and out: Evolution of viral sequences in the mitochondrial genomes of legumes (Fabaceae). Mol Phylogenet Evol 2021; 163:107236. [PMID: 34147655 DOI: 10.1016/j.ympev.2021.107236] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2021] [Revised: 06/11/2021] [Accepted: 06/14/2021] [Indexed: 10/21/2022]
Abstract
Plant specific mitoviruses in the 'genus' Mitovirus (Narnaviridae) and their integrated sequences (non-retroviral endogenous RNA viral elements or NERVEs) have been recently identified in various plant lineages. However, the sparse phylogenetic coverage of complete plant mitochondrial genome (mitogenome) sequences and the non-conserved nature of mitochondrial intergenic regions have hindered comparative studies on mitovirus NERVEs in plants. In this study, 10 new mitogenomes were sequenced from legumes (Fabaceae). Based on comparative genomic analysis of 27 total mitogenomes, we identified mitovirus NERVEs and transposable elements across the family. All legume mitogenomes included NERVEs and total NERVE length varied from ca. 2 kb in the papilionoid Trifolium to 35 kb in the mimosoid Acacia. Most of the NERVE integration sites were in highly variable intergenic regions, however, some were positioned in six cis-spliced mitochondrial introns. In the Acacia mitogenome, there were L1-like transposon sequences including an almost full-length copy with target site duplications (TSDs). The integration sites of NERVEs in four introns showed evidence of L1-like retrotransposition events. Phylogenetic analysis revealed that there were multiple instances of precise deletion of NERVEs between TSDs. This study provides clear evidence that a L1-like retrotransposition mechanism has a long history of contributing to the integration of viral RNA into plant mitogenomes while microhomology-mediated deletion can restore the integration site.
Collapse
Affiliation(s)
- In-Su Choi
- Department of Integrative Biology, University of Texas at Austin, Austin, TX 78712, USA; School of Life Sciences, Arizona State University, Tempe, AZ 85287, USA.
| | | | - Tracey A Ruhlman
- Department of Integrative Biology, University of Texas at Austin, Austin, TX 78712, USA.
| | - Robert K Jansen
- Department of Integrative Biology, University of Texas at Austin, Austin, TX 78712, USA; Centre of Excellence in Bionanoscience Research, Department of Biological Sciences, Faculty of Science, King Abdulaziz University, Jeddah 21589, Saudi Arabia.
| |
Collapse
|
15
|
Almojil D, Bourgeois Y, Falis M, Hariyani I, Wilcox J, Boissinot S. The Structural, Functional and Evolutionary Impact of Transposable Elements in Eukaryotes. Genes (Basel) 2021; 12:genes12060918. [PMID: 34203645 PMCID: PMC8232201 DOI: 10.3390/genes12060918] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2021] [Revised: 06/04/2021] [Accepted: 06/07/2021] [Indexed: 12/22/2022] Open
Abstract
Transposable elements (TEs) are nearly ubiquitous in eukaryotes. The increase in genomic data, as well as progress in genome annotation and molecular biology techniques, have revealed the vast number of ways mobile elements have impacted the evolution of eukaryotes. In addition to being the main cause of difference in haploid genome size, TEs have affected the overall organization of genomes by accumulating preferentially in some genomic regions, by causing structural rearrangements or by modifying the recombination rate. Although the vast majority of insertions is neutral or deleterious, TEs have been an important source of evolutionary novelties and have played a determinant role in the evolution of fundamental biological processes. TEs have been recruited in the regulation of host genes and are implicated in the evolution of regulatory networks. They have also served as a source of protein-coding sequences or even entire genes. The impact of TEs on eukaryotic evolution is only now being fully appreciated and the role they may play in a number of biological processes, such as speciation and adaptation, remains to be deciphered.
Collapse
Affiliation(s)
- Dareen Almojil
- New York University Abu Dhabi, Saadiyat Island, Abu Dhabi P.O. Box 129188, United Arab Emirates; (D.A.); (M.F.); (I.H.); (J.W.)
| | - Yann Bourgeois
- School of Biological Sciences, University of Portsmouth, Portsmouth, UK;
| | - Marcin Falis
- New York University Abu Dhabi, Saadiyat Island, Abu Dhabi P.O. Box 129188, United Arab Emirates; (D.A.); (M.F.); (I.H.); (J.W.)
| | - Imtiyaz Hariyani
- New York University Abu Dhabi, Saadiyat Island, Abu Dhabi P.O. Box 129188, United Arab Emirates; (D.A.); (M.F.); (I.H.); (J.W.)
| | - Justin Wilcox
- New York University Abu Dhabi, Saadiyat Island, Abu Dhabi P.O. Box 129188, United Arab Emirates; (D.A.); (M.F.); (I.H.); (J.W.)
- Center for Genomics and Systems Biology, New York University Abu Dhabi, Saadiyat Island, Abu Dhabi P.O. Box 129188, United Arab Emirates
| | - Stéphane Boissinot
- New York University Abu Dhabi, Saadiyat Island, Abu Dhabi P.O. Box 129188, United Arab Emirates; (D.A.); (M.F.); (I.H.); (J.W.)
- Center for Genomics and Systems Biology, New York University Abu Dhabi, Saadiyat Island, Abu Dhabi P.O. Box 129188, United Arab Emirates
- Correspondence:
| |
Collapse
|
16
|
Tine M, Kuhl H, Teske PR, Reinhardt R. Genome-wide analysis of European sea bass provides insights into the evolution and functions of single-exon genes. Ecol Evol 2021; 11:6546-6557. [PMID: 34141239 PMCID: PMC8207432 DOI: 10.1002/ece3.7507] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2020] [Revised: 01/24/2021] [Accepted: 03/12/2021] [Indexed: 11/17/2022] Open
Abstract
Several studies have attempted to understand the origin and evolution of single-exon genes (SEGs) in eukaryotic organisms, including fishes, but few have examined the functional and evolutionary relationships between SEGs and multiple-exon gene (MEG) paralogs, in particular the conservation of promoter regions. Given that SEGs originate via the reverse transcription of mRNA from a "parental" MEGs, such comparisons may enable identifying evolutionarily-related SEG/MEG paralogs, which might fulfill equivalent physiological functions. Here, the relationship of SEG proportion with MEG count, gene density, intron count, and chromosome size was assessed for the genome of the European sea bass, Dicentrarchus labrax. Then, SEGs with an MEG parent were identified, and promoter sequences of SEG/MEG paralogs were compared, to identify highly conserved functional motifs. The results revealed a total count of 1,585 (8.3% of total genes) SEGs in the European sea bass genome, which was correlated with MEG count but not with gene density. The significant correlation of SEG content with the number of MEGs suggests that SEGs were continuously and independently generated over evolutionary time following species divergence through retrotranscription events, followed by tandem duplications. Functional annotation showed that the majority of SEGs are functional, as is evident from their expression in RNA-seq data used to support homology-based genome annotation. Differences in 5'UTR and 3'UTR lengths between SEG/MEG paralogs observed in this study may contribute to gene expression divergence between them and therefore lead to the emergence of new SEG functions. The comparison of nonsynonymous to synonymous changes (Ka/Ks) between SEG/MEG parents showed that 74 of them are under positive selection (Ka/Ks > 1; p = .0447). An additional fifteen SEGs with an MEG parent have a common promoter, which implies that they are under the influence of common regulatory networks.
Collapse
Affiliation(s)
- Mbaye Tine
- UFR des Sciences Agronomiques, de l'Aquaculture et des Technologies Alimentaires (S2ATA)Université Gaston Berger (UGB)Saint‐LouisSenegal
- Genome Centre at the Max‐Planck Institute for Plant Breeding ResearchKölnGermany
| | - Heiner Kuhl
- Department of Ecophysiology and AquacultureLeibniz‐Institute of Freshwater Ecology and Inland Fisheries (IGB)BerlinGermany
| | - Peter R. Teske
- Department of ZoologyCentre for Ecological Genomics and Wildlife ConservationUniversity of JohannesburgJohannesburgSouth Africa
| | - Richard Reinhardt
- Genome Centre at the Max‐Planck Institute for Plant Breeding ResearchKölnGermany
| |
Collapse
|
17
|
Muneeb M, Henschel A. Eye-color and Type-2 diabetes phenotype prediction from genotype data using deep learning methods. BMC Bioinformatics 2021; 22:198. [PMID: 33874881 PMCID: PMC8056510 DOI: 10.1186/s12859-021-04077-9] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2020] [Accepted: 03/03/2021] [Indexed: 01/08/2023] Open
Abstract
Background Genotype–phenotype predictions are of great importance in genetics. These predictions can help to find genetic mutations causing variations in human beings. There are many approaches for finding the association which can be broadly categorized into two classes, statistical techniques, and machine learning. Statistical techniques are good for finding the actual SNPs causing variation where Machine Learning techniques are good where we just want to classify the people into different categories. In this article, we examined the Eye-color and Type-2 diabetes phenotype. The proposed technique is a hybrid approach consisting of some parts from statistical techniques and remaining from Machine learning. Results The main dataset for Eye-color phenotype consists of 806 people. 404 people have Blue-Green eyes where 402 people have Brown eyes. After preprocessing we generated 8 different datasets, containing different numbers of SNPs, using the mutation difference and thresholding at individual SNP. We calculated three types of mutation at each SNP no mutation, partial mutation, and full mutation. After that data is transformed for machine learning algorithms. We used about 9 classifiers, RandomForest, Extreme Gradient boosting, ANN, LSTM, GRU, BILSTM, 1DCNN, ensembles of ANN, and ensembles of LSTM which gave the best accuracy of 0.91, 0.9286, 0.945, 0.94, 0.94, 0.92, 0.95, and 0.96% respectively. Stacked ensembles of LSTM outperformed other algorithms for 1560 SNPs with an overall accuracy of 0.96, AUC = 0.98 for brown eyes, and AUC = 0.97 for Blue-Green eyes. The main dataset for Type-2 diabetes consists of 107 people where 30 people are classified as cases and 74 people as controls. We used different linear threshold to find the optimal number of SNPs for classification. The final model gave an accuracy of 0.97%. Conclusion Genotype–phenotype predictions are very useful especially in forensic. These predictions can help to identify SNP variant association with traits and diseases. Given more datasets, machine learning model predictions can be increased. Moreover, the non-linearity in the Machine learning model and the combination of SNPs Mutations while training the model increases the prediction. We considered binary classification problems but the proposed approach can be extended to multi-class classification.
Collapse
Affiliation(s)
- Muhammad Muneeb
- Department of Electrical Engineering and Computer Science, Center for Biotechnology Khalifa University, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates
| | - Andreas Henschel
- Department of Electrical Engineering and Computer Science, Center for Biotechnology Khalifa University, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates.
| |
Collapse
|
18
|
Ciomborowska-Basheer J, Staszak K, Kubiak MR, Makałowska I. Not So Dead Genes-Retrocopies as Regulators of Their Disease-Related Progenitors and Hosts. Cells 2021; 10:cells10040912. [PMID: 33921034 PMCID: PMC8071448 DOI: 10.3390/cells10040912] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2021] [Revised: 03/30/2021] [Accepted: 04/13/2021] [Indexed: 12/12/2022] Open
Abstract
Retroposition is RNA-based gene duplication leading to the creation of single exon nonfunctional copies. Nevertheless, over time, many of these duplicates acquire transcriptional capabilities. In human in most cases, these so-called retrogenes do not code for proteins but function as regulatory long noncoding RNAs (lncRNAs). The mechanisms by which they can regulate other genes include microRNA sponging, modulation of alternative splicing, epigenetic regulation and competition for stabilizing factors, among others. Here, we summarize recent findings related to lncRNAs originating from retrocopies that are involved in human diseases such as cancer and neurodegenerative, mental or cardiovascular disorders. Special attention is given to retrocopies that regulate their progenitors or host genes. Presented evidence from the literature and our bioinformatics analyses demonstrates that these retrocopies, often described as unimportant pseudogenes, are significant players in the cell’s molecular machinery.
Collapse
|
19
|
Mustafin RN, Khusnutdinova EK. Involvement of transposable elements in neurogenesis. Vavilovskii Zhurnal Genet Selektsii 2021; 24:209-218. [PMID: 33659801 PMCID: PMC7893149 DOI: 10.18699/vj20.613] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
The article is about the role of transposons in the regulation of functioning of neuronal stem cells and mature neurons of the human brain. Starting from the first division of the zygote, embryonic development is governed by regular activations of transposable elements, which are necessary for the sequential regulation of the expression of genes specific for each cell type. These processes include differentiation of neuronal stem cells, which requires the finest tuning of expression of neuron genes in various regions of the brain. Therefore, in the hippocampus, the center of human neurogenesis, the highest transposon activity has been identified, which causes somatic mosaicism of cells during the formation of specific brain structures. Similar data were obtained in studies on experimental animals. Mobile genetic elements are the most important sources of long non-coding RNAs that are coexpressed with important brain protein-coding genes. Significant activity of long non-coding RNA was detected in the hippocampus, which confirms the role of transposons in the regulation of brain function. MicroRNAs, many of which arise from transposon transcripts, also play an important role in regulating the differentiation of neuronal stem cells. Therefore, transposons, through their own processed transcripts, take an active part in the epigenetic regulation of differentiation of neurons. The global regulatory role of transposons in the human brain is due to the emergence of protein-coding genes in evolution by their exonization, duplication and domestication. These genes are involved in an epigenetic regulatory network with the participation of transposons, since they contain nucleotide sequences complementary to miRNA and long non-coding RNA formed from transposons. In the memory formation, the role of the exchange of virus-like mRNA with the help of the Arc protein of endogenous retroviruses HERV between neurons has been revealed. A possible mechanism for the implementation of this mechanism may be reverse transcription of mRNA and site-specific insertion into the genome with a regulatory effect on the genes involved in the memory.
Collapse
Affiliation(s)
| | - E K Khusnutdinova
- Institute of Biochemistry and Genetics - Subdivision of the Ufa Federal Research Centre of the Russian Academy of Sciences, Ufa, Russia
| |
Collapse
|
20
|
Zhang X, Cvetkovska M, Morgan-Kiss R, Hüner NPA, Smith DR. Draft genome sequence of the Antarctic green alga Chlamydomonas sp. UWO241. iScience 2021; 24:102084. [PMID: 33644715 PMCID: PMC7887394 DOI: 10.1016/j.isci.2021.102084] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2020] [Revised: 12/08/2020] [Accepted: 01/14/2021] [Indexed: 11/22/2022] Open
Abstract
Antarctica is home to an assortment of psychrophilic algae, which have evolved various survival strategies for coping with their frigid environments. Here, we explore Antarctic psychrophily by examining the ∼212 Mb draft nuclear genome of the green alga Chlamydomonas sp. UWO241, which resides within the water column of a perennially ice-covered, hypersaline lake. Like certain other Antarctic algae, UWO241 encodes a large number (≥37) of ice-binding proteins, putatively originating from horizontal gene transfer. Even more striking, UWO241 harbors hundreds of highly similar duplicated genes involved in diverse cellular processes, some of which we argue are aiding its survival in the Antarctic via gene dosage. Gene and partial gene duplication appear to be an ongoing phenomenon within UWO241, one which might be mediated by retrotransposons. Ultimately, we consider how such a process could be associated with adaptation to extreme environments but explore potential non-adaptive hypotheses as well. Chlamydomonas sp. UWO241 is a green alga originating from Lake Bonney, Antarctica We present a draft nuclear genome sequence of UWO241 (∼212 Mb). The UWO genome contains hundreds of highly similar duplicated genes These duplicates, we argue, might be involved in cold adaptation
Collapse
Affiliation(s)
- Xi Zhang
- Department of Biology, University of Western Ontario, London, ON N6A 5B7, Canada
| | - Marina Cvetkovska
- Department of Biology, University of Ottawa, Ottawa, ON K1N 6N5, Canada
| | | | - Norman P A Hüner
- Department of Biology, University of Western Ontario, London, ON N6A 5B7, Canada
| | - David Roy Smith
- Department of Biology, University of Western Ontario, London, ON N6A 5B7, Canada
| |
Collapse
|
21
|
Cancer, Retrogenes, and Evolution. Life (Basel) 2021; 11:life11010072. [PMID: 33478113 PMCID: PMC7835786 DOI: 10.3390/life11010072] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2020] [Revised: 01/14/2021] [Accepted: 01/15/2021] [Indexed: 12/18/2022] Open
Abstract
This review summarizes the knowledge about retrogenes in the context of cancer and evolution. The retroposition, in which the processed mRNA from parental genes undergoes reverse transcription and the resulting cDNA is integrated back into the genome, results in additional copies of existing genes. Despite the initial misconception, retroposition-derived copies can become functional, and due to their role in the molecular evolution of genomes, they have been named the “seeds of evolution”. It is convincing that retrogenes, as important elements involved in the evolution of species, also take part in the evolution of neoplastic tumors at the cell and species levels. The occurrence of specific “resistance mechanisms” to neoplastic transformation in some species has been noted. This phenomenon has been related to additional gene copies, including retrogenes. In addition, the role of retrogenes in the evolution of tumors has been described. Retrogene expression correlates with the occurrence of specific cancer subtypes, their stages, and their response to therapy. Phylogenetic insights into retrogenes show that most cancer-related retrocopies arose in the lineage of primates, and the number of identified cancer-related retrogenes demonstrates that these duplicates are quite important players in human carcinogenesis.
Collapse
|
22
|
Zeng H, Chen X, Li H, Zhang J, Wei Z, Wang Y. Interpopulation differences of retroduplication variations (RDVs) in rice retrogenes and their phenotypic correlations. Comput Struct Biotechnol J 2021; 19:600-611. [PMID: 33510865 PMCID: PMC7811064 DOI: 10.1016/j.csbj.2020.12.046] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2020] [Revised: 12/29/2020] [Accepted: 12/31/2020] [Indexed: 11/21/2022] Open
Abstract
Retroduplication variation (RDV), a type of retrocopy polymorphism, is considered to have essential biological significance, but its effect on gene function and species phenotype is still poorly understood. To this end, we analyzed the retrocopies and RDVs in 3,010 rice genomes. We calculated the RDV frequencies in the genome of each rice population; detected the mutated, ancestral and expressed retrogenes in rice genomes; and analyzed their RDV influence on rice phenotypic traits. Collectively, 73 RDVs were identified, and 14 RDVs in ancestral retrogenes can significantly affect rice phenotypes. Our research reveals that RDV plays an important role in rice migration, domestication and evolution. We think that RDV is a good molecular breeding marker candidate. To our knowledge, this is the first study on the relationship between retrogene function, expression, RDV and species phenotype.
Collapse
Affiliation(s)
- Haiyue Zeng
- State Key Laboratory of Silkworm Genome Biology, Southwest University, Chongqing 400715, China
- Biological Science Research Center, Southwest University, Chongqing 400715, China
- Shennong Class, Southwest University, Chongqing 400715, China
| | - Xingyu Chen
- Shennong Class, Southwest University, Chongqing 400715, China
| | - Hongbo Li
- College of Electronic and Information Engineering, Southwest University, Chongqing 400715
| | - Jun Zhang
- College of Computer & Information Science, Southwest University, Chongqing 400715, China
| | - Zhaoyuan Wei
- State Key Laboratory of Silkworm Genome Biology, Southwest University, Chongqing 400715, China
- Biological Science Research Center, Southwest University, Chongqing 400715, China
| | - Yi Wang
- State Key Laboratory of Silkworm Genome Biology, Southwest University, Chongqing 400715, China
- Biological Science Research Center, Southwest University, Chongqing 400715, China
| |
Collapse
|
23
|
Pan CT, Lin YS. MicroRNA retrocopies generated via L1-mediated retrotransposition in placental mammals help to reveal how their parental genes were transcribed. Sci Rep 2020; 10:20612. [PMID: 33244051 PMCID: PMC7692494 DOI: 10.1038/s41598-020-77381-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2020] [Accepted: 11/10/2020] [Indexed: 02/07/2023] Open
Abstract
In mammalian genomes, most retrocopies emerged via the L1 retrotransposition machinery. The hallmarks of an L1-mediated retrocopy, i.e., the intronlessness, the presence of a 3′ poly-A tail, and the TSDs at both ends, were frequently used to identify retrotransposition events. However, most previous studies only focused on protein-coding genes as their possible parental sources and thus only a few retrocopies derived from non-coding genes were reported. Remarkably, none of them was from microRNAs. Here in this study, we found several retrocopies generated from the mir-302–367 cluster gene (MIR302CHG), and identified a novel alternatively spliced exon encoding mir-302a. The other recognized microRNA retrotransposition events are primate-specific with mir-373 and mir-498 as their parental genes. The 3′ poly-A tracts of these two retrocopy groups were directly attached to the end of the microRNA precursor homologous regions, which suggests that their parental transcripts might alternatively terminate at the end of mir-373 and mir-498. All the three parental microRNAs are highly expressed in specific tissues with elevated retrotransposon activity, such as the embryonic stem cells and the placenta. This might be the reason that our first microRNA retrocopy findings were derived from these three microRNA genes.
Collapse
Affiliation(s)
- Cheng-Tsung Pan
- Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsinchu, 300, Taiwan
| | - Yeong-Shin Lin
- Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsinchu, 300, Taiwan. .,Department of Biological Science and Technology, College of Biological Science and Technology, National Chiao Tung University, Hsinchu, 300, Taiwan. .,Center For Intelligent Drug Systems and Smart Bio-devices (IDS2B), National Chiao Tung University, Hsinchu, 300, Taiwan.
| |
Collapse
|
24
|
Dickinson PJ, Bannasch DL. Current Understanding of the Genetics of Intervertebral Disc Degeneration. Front Vet Sci 2020; 7:431. [PMID: 32793650 PMCID: PMC7393939 DOI: 10.3389/fvets.2020.00431] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2020] [Accepted: 06/15/2020] [Indexed: 11/13/2022] Open
Abstract
Premature degeneration of the intervertebral disc and its association with specific chondrodystrophic dog breeds has been recognized for over a century. Several lines of evidence including disease breed predisposition, studies suggesting heritability of premature intervertebral disc degeneration (IVDD) and association of a dog chromosome 12 (CFA 12) locus with intervertebral disc calcification have strongly supported a genetic component in IVDD in dogs. Recent studies documenting association of IVDD with an overexpressing FGF4 retrogene on CFA 12 have opened up new areas of investigation to further define the pathophysiology of premature IVDD. While preliminary data from studies investigating FGF4 retrogenes in IVDD implicate FGF4 overexpression as a major disease factor, they have also highlighted knowledge gaps in our understanding of intervertebral disc herniation which is a complex and multifactorial disease process.
Collapse
Affiliation(s)
- Peter J Dickinson
- Department of Surgical and Radiological Sciences, School of Veterinary Medicine, University of California, Davis, Davis, CA, United States
| | - Danika L Bannasch
- Department of Population Health and Reproduction, School of Veterinary Medicine, University of California, Davis, Davis, CA, United States
| |
Collapse
|
25
|
Batcher K, Dickinson P, Maciejczyk K, Brzeski K, Rasouliha SH, Letko A, Drögemüller C, Leeb T, Bannasch D. Multiple FGF4 Retrocopies Recently Derived within Canids. Genes (Basel) 2020; 11:genes11080839. [PMID: 32717834 PMCID: PMC7465015 DOI: 10.3390/genes11080839] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2020] [Revised: 07/21/2020] [Accepted: 07/21/2020] [Indexed: 12/17/2022] Open
Abstract
Two transcribed retrocopies of the fibroblast growth factor 4 (FGF4) gene have previously been described in the domestic dog. An FGF4 retrocopy on chr18 is associated with disproportionate dwarfism, while an FGF4 retrocopy on chr12 is associated with both disproportionate dwarfism and intervertebral disc disease (IVDD). In this study, whole-genome sequencing data were queried to identify other FGF4 retrocopies that could be contributing to phenotypic diversity in canids. Additionally, dogs with surgically confirmed IVDD were assayed for novel FGF4 retrocopies. Five additional and distinct FGF4 retrocopies were identified in canids including a copy unique to red wolves (Canis rufus). The FGF4 retrocopies identified in domestic dogs were identical to domestic dog FGF4 haplotypes, which are distinct from modern wolf FGF4 haplotypes, indicating that these retrotransposition events likely occurred after domestication. The identification of multiple, full length FGF4 retrocopies with open reading frames in canids indicates that gene retrotransposition events occur much more frequently than previously thought and provide a mechanism for continued genetic and phenotypic diversity in canids.
Collapse
Affiliation(s)
- Kevin Batcher
- Department of Population Health and Reproduction, University of California-Davis, Davis, CA 95616, USA; (K.B.); (K.M.)
| | - Peter Dickinson
- Department of Surgical and Radiological Sciences, University of California-Davis, Davis, CA 95616, USA;
| | - Kimberly Maciejczyk
- Department of Population Health and Reproduction, University of California-Davis, Davis, CA 95616, USA; (K.B.); (K.M.)
| | - Kristin Brzeski
- College of Forest Resources and Environmental Science, Michigan Technological University, Houghton, MI 49931, USA;
| | - Sheida Hadji Rasouliha
- Institute of Genetics, Vetsuisse Faculty, University of Bern, 3012 Bern, Switzerland; (S.H.R.); (A.L.); (C.D.); (T.L.)
| | - Anna Letko
- Institute of Genetics, Vetsuisse Faculty, University of Bern, 3012 Bern, Switzerland; (S.H.R.); (A.L.); (C.D.); (T.L.)
| | - Cord Drögemüller
- Institute of Genetics, Vetsuisse Faculty, University of Bern, 3012 Bern, Switzerland; (S.H.R.); (A.L.); (C.D.); (T.L.)
| | - Tosso Leeb
- Institute of Genetics, Vetsuisse Faculty, University of Bern, 3012 Bern, Switzerland; (S.H.R.); (A.L.); (C.D.); (T.L.)
| | - Danika Bannasch
- Department of Population Health and Reproduction, University of California-Davis, Davis, CA 95616, USA; (K.B.); (K.M.)
- Correspondence:
| |
Collapse
|
26
|
|
27
|
Complex Analysis of Retroposed Genes' Contribution to Human Genome, Proteome and Transcriptome. Genes (Basel) 2020; 11:genes11050542. [PMID: 32408516 PMCID: PMC7290577 DOI: 10.3390/genes11050542] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2020] [Revised: 05/06/2020] [Accepted: 05/08/2020] [Indexed: 02/07/2023] Open
Abstract
Gene duplication is a major driver of organismal evolution. One of the main mechanisms of gene duplications is retroposition, a process in which mRNA is first transcribed into DNA and then reintegrated into the genome. Most gene retrocopies are depleted of the regulatory regions. Nevertheless, examples of functional retrogenes are rapidly increasing. These functions come from the gain of new spatio-temporal expression patterns, imposed by the content of the genomic sequence surrounding inserted cDNA and/or by selectively advantageous mutations, which may lead to the switch from protein coding to regulatory RNA. As recent studies have shown, these genes may lead to new protein domain formation through fusion with other genes, new regulatory RNAs or other regulatory elements. We utilized existing data from high-throughput technologies to create a complex description of retrogenes functionality. Our analysis led to the identification of human retroposed genes that substantially contributed to transcriptome and proteome. These retrocopies demonstrated the potential to encode proteins or short peptides, act as cis- and trans- Natural Antisense Transcripts (NATs), regulate their progenitors’ expression by competing for the same microRNAs, and provide a sequence to lncRNA and novel exons to existing protein-coding genes. Our study also revealed that retrocopies, similarly to retrotransposons, may act as recombination hot spots. To our best knowledge this is the first complex analysis of these functions of retrocopies.
Collapse
|
28
|
LncRNA-ATB in cancers: what do we know so far? Mol Biol Rep 2020; 47:4077-4086. [PMID: 32248383 DOI: 10.1007/s11033-020-05415-5] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2020] [Accepted: 03/30/2020] [Indexed: 01/06/2023]
Abstract
Cancer-related deaths did not apparently decrease in the past decades despite aggressive treatments. It's reported that cancer will become the leading cause of death worldwide in the twenty-first century. Increasing evidence has revealed that lncRNAs will emerge as promising cancer biomarkers or therapeutic targets in cancer treatment. LncRNA-ATB, a long noncoding RNA activated by TGF-β, was found to be abnormally expressed in certain cancers and participate in the development and progression of tumors. In addition, aberrant lncRNA-ATB expression was also associated with clinical characteristics of tumors. The purpose of this review is to summarize functions and underlying mechanisms of lncRNA-ATB in tumors, and discuss whether lncRNA-ATB can be a biomarker and therapeutic target in cancers.
Collapse
|
29
|
Kubota S, Ishikawa T, Kawata K, Hattori T, Nishida T. Retrotransposons Manipulating Mammalian Skeletal Development in Chondrocytes. Int J Mol Sci 2020; 21:ijms21051564. [PMID: 32106563 PMCID: PMC7084347 DOI: 10.3390/ijms21051564] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2020] [Revised: 02/19/2020] [Accepted: 02/21/2020] [Indexed: 12/14/2022] Open
Abstract
Retrotransposons are genetic elements that copy and paste themselves in the host genome through transcription, reverse-transcription, and integration processes. Along with their proliferation in the genome, retrotransposons inevitably modify host genes around the integration sites, and occasionally create novel genes. Even now, a number of retrotransposons are still actively editing our genomes. As such, their profound role in the evolution of mammalian genomes is obvious; thus, their contribution to mammalian skeletal evolution and development is also unquestionable. In mammals, most of the skeletal parts are formed and grown through a process entitled endochondral ossification, in which chondrocytes play central roles. In this review, current knowledge on the evolutional, physiological, and pathological roles of retrotransposons in mammalian chondrocyte differentiation and cartilage development is summarized. The possible biological impact of these mobile genetic elements in the future is also discussed.
Collapse
|
30
|
Springer MS, Molloy EK, Sloan DB, Simmons MP, Gatesy J. ILS-Aware Analysis of Low-Homoplasy Retroelement Insertions: Inference of Species Trees and Introgression Using Quartets. J Hered 2019; 111:147-168. [DOI: 10.1093/jhered/esz076] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2019] [Accepted: 12/12/2019] [Indexed: 12/20/2022] Open
Abstract
Abstract
DNA sequence alignments have provided the majority of data for inferring phylogenetic relationships with both concatenation and coalescent methods. However, DNA sequences are susceptible to extensive homoplasy, especially for deep divergences in the Tree of Life. Retroelement insertions have emerged as a powerful alternative to sequences for deciphering evolutionary relationships because these data are nearly homoplasy-free. In addition, retroelement insertions satisfy the “no intralocus-recombination” assumption of summary coalescent methods because they are singular events and better approximate neutrality relative to DNA loci commonly sampled in phylogenomic studies. Retroelements have traditionally been analyzed with parsimony, distance, and network methods. Here, we analyze retroelement data sets for vertebrate clades (Placentalia, Laurasiatheria, Balaenopteroidea, Palaeognathae) with 2 ILS-aware methods that operate by extracting, weighting, and then assembling unrooted quartets into a species tree. The first approach constructs a species tree from retroelement bipartitions with ASTRAL, and the second method is based on split-decomposition with parsimony. We also develop a Quartet-Asymmetry test to detect hybridization using retroelements. Both ILS-aware methods recovered the same species-tree topology for each data set. The ASTRAL species trees for Laurasiatheria have consecutive short branch lengths in the anomaly zone whereas Palaeognathae is outside of this zone. For the Balaenopteroidea data set, which includes rorquals (Balaenopteridae) and gray whale (Eschrichtiidae), both ILS-aware methods resolved balaeonopterids as paraphyletic. Application of the Quartet-Asymmetry test to this data set detected 19 different quartets of species for which historical introgression may be inferred. Evidence for introgression was not detected in the other data sets.
Collapse
Affiliation(s)
- Mark S Springer
- Department of Evolution, Ecology, and Organismal Biology, University of California, Riverside, CA
| | - Erin K Molloy
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL
| | - Daniel B Sloan
- Department of Biology, Colorado State University, Fort Collins, CO
| | - Mark P Simmons
- Department of Biology, Colorado State University, Fort Collins, CO
| | - John Gatesy
- Division of Vertebrate Zoology and Sackler Institute for Comparative Genomics, American Museum of Natural History, New York, NY
| |
Collapse
|
31
|
Jiao Y, Cao Y, Zheng Z, Liu M, Guo X. Massive expansion and diversity of nicotinic acetylcholine receptors in lophotrochozoans. BMC Genomics 2019; 20:937. [PMID: 31805848 PMCID: PMC6896357 DOI: 10.1186/s12864-019-6278-9] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2019] [Accepted: 11/13/2019] [Indexed: 02/07/2023] Open
Abstract
Background Nicotinic acetylcholine receptors (nAChRs) are among the oldest and most conserved transmembrane receptors involved in signal transduction. Despite the prevalence and significance of cholinergic signaling, the diversity and evolution of nAChRs are not fully understood. Result By comparative genomic analysis, we found massive expansions of nAChR genes in molluscs and some other lophotrochozoans. The expansion is particularly pronounced in stationary bivalve molluscs with simple nervous systems, with the number of nAChR genes ranging from 99 to 217 in five bivalves, compared with 10 to 29 in five ecdysozoans and vertebrates. The expanded molluscan nAChR genes tend to be intronless and in tandem arrays due to retroposition followed by tandem duplication. Phylogenetic analysis revealed diverse nAChR families in the common ancestor of bilaterians, which subsequently experienced lineage-specific expansions or contractions. The expanded molluscan nAChR genes are highly diverse in sequence, domain structure, temporal and spatial expression profiles, implying diversified functions. Some molluscan nAChR genes are expressed in early development before the development of the nervous system, while others are involved in immune and stress responses. Conclusion The massive expansion and diversification of nAChR genes in bivalve molluscs may be a compensation for reduced nervous systems as part of adaptation to stationary life under dynamic environments, while in vertebrates a subset of specialized nAChRs are retained to work with advanced nervous systems. The unprecedented diversity identified in molluscs broadens our view on the evolution and function of nAChRs that are critical to animal physiology and human health.
Collapse
Affiliation(s)
- Yu Jiao
- Fishery College, Guangdong Ocean University, Zhanjiang, 524025, Guangdong, China.,Haskin Shellfish Research Laboratory, Department of Marine and Coastal Sciences, Rutgers University, 6959 Miller Avenue, Port Norris, NJ, 08349, USA
| | - Yanfei Cao
- Fishery College, Guangdong Ocean University, Zhanjiang, 524025, Guangdong, China
| | - Zhe Zheng
- Fishery College, Guangdong Ocean University, Zhanjiang, 524025, Guangdong, China
| | - Ming Liu
- Haskin Shellfish Research Laboratory, Department of Marine and Coastal Sciences, Rutgers University, 6959 Miller Avenue, Port Norris, NJ, 08349, USA
| | - Ximing Guo
- Haskin Shellfish Research Laboratory, Department of Marine and Coastal Sciences, Rutgers University, 6959 Miller Avenue, Port Norris, NJ, 08349, USA.
| |
Collapse
|
32
|
Gardner EJ, Prigmore E, Gallone G, Danecek P, Samocha KE, Handsaker J, Gerety SS, Ironfield H, Short PJ, Sifrim A, Singh T, Chandler KE, Clement E, Lachlan KL, Prescott K, Rosser E, FitzPatrick DR, Firth HV, Hurles ME. Contribution of retrotransposition to developmental disorders. Nat Commun 2019; 10:4630. [PMID: 31604926 PMCID: PMC6789007 DOI: 10.1038/s41467-019-12520-y] [Citation(s) in RCA: 33] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2018] [Accepted: 09/11/2019] [Indexed: 02/08/2023] Open
Abstract
Mobile genetic Elements (MEs) are segments of DNA which can copy themselves and other transcribed sequences through the process of retrotransposition (RT). In humans several disorders have been attributed to RT, but the role of RT in severe developmental disorders (DD) has not yet been explored. Here we identify RT-derived events in 9738 exome sequenced trios with DD-affected probands. We ascertain 9 de novo MEs, 4 of which are likely causative of the patient's symptoms (0.04%), as well as 2 de novo gene retroduplications. Beyond identifying likely diagnostic RT events, we estimate genome-wide germline ME mutation rate and selective constraint and demonstrate that coding RT events have signatures of purifying selection equivalent to those of truncating mutations. Overall, our analysis represents a comprehensive interrogation of the impact of retrotransposition on protein coding genes and a framework for future evolutionary and disease studies.
Collapse
Affiliation(s)
- Eugene J Gardner
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, Hinxton, CB10 1SA, UK
| | - Elena Prigmore
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, Hinxton, CB10 1SA, UK
| | - Giuseppe Gallone
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, Hinxton, CB10 1SA, UK
| | - Petr Danecek
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, Hinxton, CB10 1SA, UK
| | - Kaitlin E Samocha
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, Hinxton, CB10 1SA, UK
| | - Juliet Handsaker
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, Hinxton, CB10 1SA, UK
| | - Sebastian S Gerety
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, Hinxton, CB10 1SA, UK
| | - Holly Ironfield
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, Hinxton, CB10 1SA, UK
| | - Patrick J Short
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, Hinxton, CB10 1SA, UK
| | - Alejandro Sifrim
- Department of Human Genetics, KU Leuven, Herestraat 49, Box 602, Leuven, B-3000, Belgium
| | - Tarjinder Singh
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, Hinxton, CB10 1SA, UK
| | - Kate E Chandler
- Manchester Centre for Genomic Medicine, Manchester University Hospitals NHS Foundation Trust, Manchester Academic Health Science Centre, Manchester, Greater, Manchester, M13 9WL, UK
| | - Emma Clement
- Department of Clinical Genetics, North East Thames Regional Genetics Service, Great Ormond Street Hospital for Children NHS Trust, Holborn, London, WC1N 3JH, UK
| | - Katherine L Lachlan
- Wessex Clinical Genetics Service, Southampton University Hospitals NHS Foundation Trust, Princess Anne Hospital, Southampton, SO16 5YA, UK.,Faculty of Medicine, Human Development and Health, University of Southampton, Southampton, SO17 1BJ, UK
| | - Katrina Prescott
- Clinical Genetics Department, Yorkshire Regional Genetics Service, Leeds Teaching Hospitals NHS Trust, Chapel Allerton Hospital, Leeds, LS7 4SA, UK
| | - Elisabeth Rosser
- Department of Clinical Genetics, North East Thames Regional Genetics Service, Great Ormond Street Hospital for Children NHS Trust, Holborn, London, WC1N 3JH, UK
| | - David R FitzPatrick
- MRC Human Genetics Unit, MRC IGMM, University of Edinburgh, WGH, Edinburgh, EH4 2SP, UK
| | - Helen V Firth
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, Hinxton, CB10 1SA, UK.,East Anglian Medical Genetics Service, Box 134, Cambridge University Hospitals NHS Foundation Trust, Cambridge Biomedical Campus, Cambridge, CB2 0QQ, UK
| | - Matthew E Hurles
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, Hinxton, CB10 1SA, UK.
| |
Collapse
|
33
|
Mustafin RN, Khusnutdinova EK. The Role of Reverse Transcriptase in the Origin of Life. BIOCHEMISTRY (MOSCOW) 2019; 84:870-883. [DOI: 10.1134/s0006297919080030] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
34
|
Bim LV, Navarro FCP, Valente FOF, Lima-Junior JV, Delcelo R, Dias-da-Silva MR, Maciel RMB, Galante PAF, Cerutti JM. Retroposed copies of RET gene: a somatically acquired event in medullary thyroid carcinoma. BMC Med Genomics 2019; 12:104. [PMID: 31288802 PMCID: PMC6617568 DOI: 10.1186/s12920-019-0552-1] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2019] [Accepted: 06/17/2019] [Indexed: 12/21/2022] Open
Abstract
BACKGROUND Different pathogenic germline mutations in the RET oncogene are identified in MEN 2, a hereditary syndrome characterized by medullary thyroid carcinoma (MTC) and other endocrine tumors. Although genetic predisposition is recognized, not all RET mutation carriers will develop the disease during their lifetime or, likewise, RET mutation carriers belonging to the same family may present clinical heterogeneity. It has been suggested that a single germline mutation might not be sufficient for development of MEN 2-associated tumors and a somatic bi-allelic alteration might be required. Here we investigated the presence of somatic second hit mutation in the RET gene in MTC. METHODS We integrated Multiplex Ligation-dependent Probe Amplification (MLPA) and whole exome sequencing (WES) to search for copy number alteration (CNA) in the RET gene in MTC samples and medullary thyroid cell lines (TT and MZ-CR-1). We next found reads spanning exon-exon boundaries on RET, an indicative of retrocopy. We subsequently searched for RET retrocopies in the human reference genome (GRCh37) and in the 1000 Genomes Project data, by looking for reads reporting joined exons in the RET locus or distinct genomic regions. To determine RET retrocopy specificity and recurrence, DNA isolated from sporadic and MEN 2-associated MTC (n = 37), peripheral blood (n = 3) and papillary thyroid carcinomas with RET fusion (n = 10) samples were tested using PCR-sequencing methodology. RESULTS Through MLPA we have found evidence of CNA in the RET gene in MTC samples and MTC cell lines. WES analysis reinforced the presence of the CNA and hinted for a retroposed copy of RET not found in the human reference genome and 1.000 Genomes Project. Extended analysis confirmed the presence of a somatic MTC-related retrocopy of RET in both sporadic and hereditary tumors. We further unveiled a recurrent (28%) novel point mutation (p.G548 V) found exclusively in the retrocopy of RET. The mutation was also found in cDNA of mutated samples, suggesting it might be functional. CONCLUSION We here report a somatic specific RET retroposed copy in MTC samples and cell lines. Our results support the idea that generation of retrocopies in somatic cells is likely to contribute to MTC genesis and progression.
Collapse
Affiliation(s)
- Larissa V Bim
- Laboratório As Bases Genéticas dos Tumores da Tiroide, Universidade Federal de São Paulo, São Paulo, SP, Brazil
| | - Fábio C P Navarro
- Centro de Oncologia Molecular, Hospital Sírio-libanês, São Paulo, SP, Brazil.,Departamento de Bioquímica, Universidade de São Paulo, São Paulo, SP, Brazil
| | - Flávia O F Valente
- Laboratório de Endocrinologia Molecular e Translacional, Universidade Federal de São Paulo, São Paulo, SP, Brazil
| | - José V Lima-Junior
- Laboratório As Bases Genéticas dos Tumores da Tiroide, Universidade Federal de São Paulo, São Paulo, SP, Brazil
| | - Rosana Delcelo
- Departamento de Patologia, Universidade Federal de São Paulo, São Paulo, SP, Brazil
| | - Magnus R Dias-da-Silva
- Laboratório de Endocrinologia Molecular e Translacional, Universidade Federal de São Paulo, São Paulo, SP, Brazil
| | - Rui M B Maciel
- Laboratório de Endocrinologia Molecular e Translacional, Universidade Federal de São Paulo, São Paulo, SP, Brazil
| | - Pedro A F Galante
- Centro de Oncologia Molecular, Hospital Sírio-libanês, São Paulo, SP, Brazil
| | - Janete M Cerutti
- Laboratório As Bases Genéticas dos Tumores da Tiroide, Universidade Federal de São Paulo, São Paulo, SP, Brazil.
| |
Collapse
|
35
|
Mustafin RN, Khusnutdinova EK. The role of transposable elements in the ecological morphogenesis under the influence of stress. Vavilovskii Zhurnal Genet Selektsii 2019. [DOI: 10.18699/vj19.506] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
In natural selection, insertional mutagenesis is an important source of genome variability. Transposons are sensors of environmental stress effects, which contribute to adaptation and speciation. These effects are due to changes in the mechanisms of morphogenesis, since transposons contain regulatory sequences that have cis and trans effects on specific protein-coding genes. In variability of genomes, the horizontal transfer of transposons plays an important role, because it contributes to changing the composition of transposons and the acquisition of new properties. Transposons are capable of site-specific transpositions, which lead to the activation of stress response genes. Transposons are sources of non-coding RNA, transcription factors binding sites and protein-coding genes due to domestication, exonization, and duplication. These genes contain nucleotide sequences that interact with non-coding RNAs processed from transposons transcripts, and therefore they are under the control of epigenetic regulatory networks involving transposons. Therefore, inherited features of the location and composition of transposons, along with a change in the phenotype, play an important role in the characteristics of responding to a variety of environmental stressors. This is the basis for the selection and survival of organisms with a specific composition and arrangement of transposons that contribute to adaptation under certain environmental conditions. In evolution, the capability to transpose into specific genome sites, regulate gene expression, and interact with transcription factors, along with the ability to respond to stressors, is the basis for rapid variability and speciation by altering the regulation of ontogenesis. The review presents evidence of tissue-specific and stage-specific features of transposon activation and their role in the regulation of cell differentiation to confirm their role in ecological morphogenesis.
Collapse
Affiliation(s)
| | - E. K. Khusnutdinova
- Bashkir State Medical University;
Institute of Biochemistry and Genetics – Subdivision of the Ufa Federal Research Centre of RAS
| |
Collapse
|
36
|
Rapid evolution of a retro-transposable hotspot of ovine genome underlies the alteration of BMP2 expression and development of fat tails. BMC Genomics 2019; 20:261. [PMID: 30940097 PMCID: PMC6445056 DOI: 10.1186/s12864-019-5620-6] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2018] [Accepted: 03/19/2019] [Indexed: 12/21/2022] Open
Abstract
Background Sheep have developed the ability to store fat in their tails, which is a unique way of reserving energy to survive a harsh environment. However, the mechanism underlying this adaptive trait remains largely unsolved. Results In the present study, we provide evidence for the genetic determinants of fat tails, based on whole genome sequences of 89 individual sheep. A genome-wide scan of selective sweep identified several candidate loci including a region at chromosome 13, a haplotype of which underwent rapid evolution and spread through fat-tailed populations in China and the Middle East. Sequence analysis revealed an inter-genic origin of this locus, which later became a hotspot of ruminant-specific retro-transposon named BovB. Additionally, the candidate locus was validated based on a fat- and thin-tailed cross population. The expression of an upstream gene BMP2 was differentially regulated between fat-tailed and thin-tailed individuals in tail adipose and several other tissue types. Conclusions Our findings suggest the fixation of fat tails in domestic sheep is caused by a selective sweep near a retro-transposable hotspot at chromosome 13, the diversity of which specifically affects the expression of BMP2. The present study has shed light onto the understanding of fat metabolism. Electronic supplementary material The online version of this article (10.1186/s12864-019-5620-6) contains supplementary material, which is available to authorized users.
Collapse
|
37
|
Mustafin RN. Functional Dualism of Transposon Transcripts in Evolution of Eukaryotic Genomes. Russ J Dev Biol 2019. [DOI: 10.1134/s1062360418070019] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
|
38
|
Mustafin RN. The Relationship between Transposons and Transcription Factors in the Evolution of Eukaryotes. J EVOL BIOCHEM PHYS+ 2019. [DOI: 10.1134/s0022093019010022] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
39
|
Kovalenko TF, Patrushev LI. Pseudogenes as Functionally Significant Elements of the Genome. BIOCHEMISTRY (MOSCOW) 2018; 83:1332-1349. [PMID: 30482145 DOI: 10.1134/s0006297918110044] [Citation(s) in RCA: 34] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Pseudogene is a gene copy that has lost its original function. For a long time, pseudogenes have been considered as "junk DNA" that inevitably arises as a result of ongoing evolutionary process. However, experimental data obtained during recent years indicate this understanding of the nature of pseudogenes is not entirely correct, and many pseudogenes perform important genetic functions. In the review, we have addressed classification of pseudogenes, methods of their detection in the genome, and the problem of their evolutionary conservatism and prevalence among species belonging to different taxonomic groups in the light of modern data. The mechanisms of gene expression regulation by pseudogenes and the role of pseudogenes in pathogenesis of various human diseases are discussed.
Collapse
Affiliation(s)
- T F Kovalenko
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow, 117997, Russia.
| | - L I Patrushev
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow, 117997, Russia
| |
Collapse
|
40
|
Bourque G, Burns KH, Gehring M, Gorbunova V, Seluanov A, Hammell M, Imbeault M, Izsvák Z, Levin HL, Macfarlan TS, Mager DL, Feschotte C. Ten things you should know about transposable elements. Genome Biol 2018; 19:199. [PMID: 30454069 PMCID: PMC6240941 DOI: 10.1186/s13059-018-1577-z] [Citation(s) in RCA: 591] [Impact Index Per Article: 98.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
Transposable elements (TEs) are major components of eukaryotic genomes. However, the extent of their impact on genome evolution, function, and disease remain a matter of intense interrogation. The rise of genomics and large-scale functional assays has shed new light on the multi-faceted activities of TEs and implies that they should no longer be marginalized. Here, we introduce the fundamental properties of TEs and their complex interactions with their cellular environment, which are crucial to understanding their impact and manifold consequences for organismal biology. While we draw examples primarily from mammalian systems, the core concepts outlined here are relevant to a broad range of organisms.
Collapse
Affiliation(s)
- Guillaume Bourque
- Department of Human Genetics, McGill University, Montréal, Québec, H3A 0G1, Canada.
- Canadian Center for Computational Genomics, McGill University, Montréal, Québec, H3A 0G1, Canada.
| | - Kathleen H Burns
- Department of Pathology, Johns Hopkins University School of Medicine, Baltimore, MD, 21205, USA
| | - Mary Gehring
- Whitehead Institute for Biomedical Research and Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, 02142, USA
| | - Vera Gorbunova
- Department of Biology, University of Rochester, Rochester, NY, 14627, USA
| | - Andrei Seluanov
- Department of Biology, University of Rochester, Rochester, NY, 14627, USA
| | - Molly Hammell
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 11724, USA
| | - Michaël Imbeault
- Department of Genetics, University of Cambridge, Cambridge, CB2 3EH, UK
| | - Zsuzsanna Izsvák
- Max Delbrück Center for Molecular Medicine in the Helmholtz Association, 13125, Berlin, Germany
| | - Henry L Levin
- The Eunice Kennedy Shriver National Institute of Child Health and Human Development, The National Institutes of Health, Bethesda, Maryland, USA
| | - Todd S Macfarlan
- The Eunice Kennedy Shriver National Institute of Child Health and Human Development, The National Institutes of Health, Bethesda, Maryland, USA
| | - Dixie L Mager
- Terry Fox Laboratory, British Columbia Cancer Agency and Department of Medical Genetics, University of BC, Vancouver, BC, V5Z1L3, Canada
| | - Cédric Feschotte
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY, 14850, USA.
| |
Collapse
|
41
|
Schrader L, Schmitz J. The impact of transposable elements in adaptive evolution. Mol Ecol 2018; 28:1537-1549. [PMID: 30003608 DOI: 10.1111/mec.14794] [Citation(s) in RCA: 134] [Impact Index Per Article: 22.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2018] [Accepted: 07/06/2018] [Indexed: 12/16/2022]
Abstract
The growing knowledge about the influence of transposable elements (TEs) on (a) long-term genome and transcriptome evolution; (b) genomic, transcriptomic and epigenetic variation within populations; and (c) patterns of somatic genetic differences in individuals continues to spur the interest of evolutionary biologists in the role of TEs in adaptive evolution. As TEs can trigger a broad range of molecular variation in a population with potentially severe fitness and phenotypic consequences for individuals, different mechanisms evolved to keep TE activity in check, allowing for a dynamic interplay between the host, its TEs and the environment in evolution. Here, we review evidence for adaptive phenotypic changes associated with TEs and the basic molecular mechanisms by which the underlying genetic changes arise: (a) domestication, (b) exaptation, (c) host gene regulation, (d) TE-mediated formation of intronless gene copies-so-called retrogenes and (e) overall increased genome plasticity. Furthermore, we review and discuss how the stress-dependent incapacitation of defence mechanisms against the activity of TEs might facilitate adaptive responses to environmental challenges and how such mechanisms might be particularly relevant in species frequently facing novel environments, such as invasive, pathogenic or parasitic species.
Collapse
Affiliation(s)
- Lukas Schrader
- Institute for Evolution and Biodiversity (IEB), University of Münster, Münster, Germany
| | - Jürgen Schmitz
- Institute of Experimental Pathology, University of Münster, Münster, Germany
| |
Collapse
|
42
|
Mustafin RN, Khusnutdinova EK. The Role of Transposable Elements in Emergence of Metazoa. BIOCHEMISTRY (MOSCOW) 2018; 83:185-199. [PMID: 29625540 DOI: 10.1134/s000629791803001x] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
Systems initially emerged for protecting genomes against insertions of transposable elements and represented by mechanisms of splicing regulation, RNA-interference, and epigenetic factors have played a key role in the evolution of animals. Many studies have shown inherited transpositions of mobile elements in embryogenesis and preservation of their activities in certain tissues of adult organisms. It was supposed that on the emergence of Metazoa the self-regulation mechanisms of transposons related with the gene networks controlling their activity could be involved in intercellular cell coordination in the cascade of successive divisions with differentiated gene expression for generation of tissues and organs. It was supposed that during evolution species-specific features of transposons in the genomes of eukaryotes could form the basis for creation of dynamically related complexes of systems for epigenetic regulation of gene expression. These complexes could be produced due to the influence of noncoding transposon-derived RNAs on DNA methylation, histone modifications, and processing of alternative splicing variants, whereas the mobile elements themselves could be directly involved in the regulation of gene expression in cis and in trans. Transposons are widely distributed in the genomes of eukaryotes; therefore, their activation can change the expression of specific genes. In turn, this can play an important role in cell differentiation during ontogenesis. It is supposed that transposons can form a species-specific pattern for control of gene expression, and that some variants of this pattern can be favorable for adaptation. The presented data indicate the possible influence of transposons in karyotype formation. It is supposed that transposon localization relative to one another and to protein-coding genes can influence the species-specific epigenetic regulation of ontogenesis.
Collapse
|
43
|
Jorquera R, González C, Clausen P, Petersen B, Holmes DS. Improved ontology for eukaryotic single-exon coding sequences in biological databases. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2018; 2018:1-6. [PMID: 30239665 PMCID: PMC6146118 DOI: 10.1093/database/bay089] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/03/2018] [Accepted: 07/30/2018] [Indexed: 12/21/2022]
Abstract
Efficient extraction of knowledge from biological data requires the development of structured vocabularies to unambiguously define biological terms. This paper proposes descriptions and definitions to disambiguate the term 'single-exon gene'. Eukaryotic Single-Exon Genes (SEGs) have been defined as genes that do not have introns in their protein coding sequences. They have been studied not only to determine their origin and evolution but also because their expression has been linked to several types of human cancer and neurological/developmental disorders and many exhibit tissue-specific transcription. Unfortunately, the term 'SEGs' is rife with ambiguity, leading to biological misinterpretations. In the classic definition, no distinction is made between SEGs that harbor introns in their untranslated regions (UTRs) versus those without. This distinction is important to make because the presence of introns in UTRs affects transcriptional regulation and post-transcriptional processing of the mRNA. In addition, recent whole-transcriptome shotgun sequencing has led to the discovery of many examples of single-exon mRNAs that arise from alternative splicing of multi-exon genes, these single-exon isoforms are being confused with SEGs despite their clearly different origin. The increasing expansion of RNA-seq datasets makes it imperative to distinguish the different SEG types before annotation errors become indelibly propagated in biological databases. This paper develops a structured vocabulary for their disambiguation, allowing a major reassessment of their evolutionary trajectories, regulation, RNA processing and transport, and provides the opportunity to improve the detection of gene associations with disorders including cancers, neurological and developmental diseases.
Collapse
Affiliation(s)
- Roddy Jorquera
- Center for Bioinformatics and Genome Biology, Fundacion Ciencia & Vida, Avenida Zañartu 1482, Ñuñoa, Santiago, Chile.,Facultad de Ciencias Biologicas, Universidad Andres Bello, Santiago, Chile
| | - Carolina González
- Center for Bioinformatics and Genome Biology, Fundacion Ciencia & Vida, Avenida Zañartu 1482, Ñuñoa, Santiago, Chile
| | - Philip Clausen
- Department of Bio and Health Informatics, Technical University of Denmark, Kgs. Lyngby, Denmark
| | - Bent Petersen
- Department of Bio and Health Informatics, Technical University of Denmark, Kgs. Lyngby, Denmark.,Centre of Excellence for Omics-Driven Computational Biodiscovery (COMBio), Faculty of Applied Sciences, AIMST University, Kedah, Malaysia
| | - David S Holmes
- Center for Bioinformatics and Genome Biology, Fundacion Ciencia & Vida, Avenida Zañartu 1482, Ñuñoa, Santiago, Chile.,Centro de Genómica y Bioinformática Facultad de Ciencias, Universidad Mayor, Santiago, Chile
| |
Collapse
|
44
|
Alonso J, Martinez M. Insights into the molecular evolution of peptidase inhibitors in arthropods. PLoS One 2017; 12:e0187643. [PMID: 29108008 PMCID: PMC5673224 DOI: 10.1371/journal.pone.0187643] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2017] [Accepted: 10/23/2017] [Indexed: 11/19/2022] Open
Abstract
Peptidase inhibitors are key proteins involved in the control of peptidases. In arthropods, peptidase inhibitors modulate the activity of peptidases involved in endogenous physiological processes and peptidases of the organisms with which they interact. Exploring available arthropod genomic sequences is a powerful way to obtain the repertoire of peptidase inhibitors in every arthropod species and to understand the evolutionary mechanisms involved in the diversification of this kind of proteins. A genomic comparative analysis of peptidase inhibitors in species belonging to different arthropod taxonomic groups was performed. The results point out: i) species or clade-specific presence is shown for several families of peptidase inhibitors; ii) multidomain peptidase inhibitors are commonly found in many peptidase inhibitor families; iii) several families have a wide range of members in different arthropod species; iv) several peptidase inhibitor families show species-specific (or clade-specific) gene family expansions; v) functional divergence may be assumed for particular clades; vi) passive expansions may be used by natural selection to fix adaptations. In conclusion, conservation and divergence of duplicated genes and the potential recruitment as peptidase inhibitors of proteins from other families are the main mechanisms used by arthropods to fix diversity. This diversity would be associated to the control of target peptidases and, as consequence, to adapt to specific environments.
Collapse
Affiliation(s)
- Joaquin Alonso
- Centro de Biotecnología y Genómica de Plantas, Universidad Politécnica de Madrid (UPM)—Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA), Campus Montegancedo UPM, Pozuelo de Alarcón (Madrid), Spain
| | - Manuel Martinez
- Centro de Biotecnología y Genómica de Plantas, Universidad Politécnica de Madrid (UPM)—Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA), Campus Montegancedo UPM, Pozuelo de Alarcón (Madrid), Spain
- Departamento de Biotecnología-Biología Vegetal, Escuela Técnica Superior de Ingeniería Agronómica, Alimentaria y de Biosistemas, UPM, Madrid, Spain
| |
Collapse
|
45
|
FGF4 retrogene on CFA12 is responsible for chondrodystrophy and intervertebral disc disease in dogs. Proc Natl Acad Sci U S A 2017; 114:11476-11481. [PMID: 29073074 PMCID: PMC5664524 DOI: 10.1073/pnas.1709082114] [Citation(s) in RCA: 81] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023] Open
Abstract
Chondrodystrophy, characterized by short limbs and intervertebral disc disease (IVDD), is a common phenotype in many of the most popular dog breeds, including the dachshund, beagle, and French bulldog. Here, we report the identification of a FGF4 retrogene insertion on chromosome 12, the second FGF4 retrogene reported in the dog, as responsible for chondrodystrophy and IVDD. Identification of the causative mutation for IVDD will impact an incredibly large proportion of the dog population and provides a model for IVDD in humans, as FGF-associated mutations are responsible for IVDD and short stature in human achondroplasia. This is a report of a second retrogene copy of the same parental gene, each causing complementary disease phenotypes in a mammalian species. Chondrodystrophy in dogs is defined by dysplastic, shortened long bones and premature degeneration and calcification of intervertebral discs. Independent genome-wide association analyses for skeletal dysplasia (short limbs) within a single breed (PBonferroni = 0.01) and intervertebral disc disease (IVDD) across breeds (PBonferroni = 4.0 × 10−10) both identified a significant association to the same region on CFA12. Whole genome sequencing identified a highly expressed FGF4 retrogene within this shared region. The FGF4 retrogene segregated with limb length and had an odds ratio of 51.23 (95% CI = 46.69, 56.20) for IVDD. Long bone length in dogs is a unique example of multiple disease-causing retrocopies of the same parental gene in a mammalian species. FGF signaling abnormalities have been associated with skeletal dysplasia in humans, and our findings present opportunities for both selective elimination of a medically and financially devastating disease in dogs and further understanding of the ever-growing complexity of retrogene biology.
Collapse
|