1
Pereira AB, Marano M, Bathala R, Zaragoza RA, Neira A, Samano A, Owoyemi A, Casola C. Orphan genes are not a distinct biological entity. Bioessays 2025; 47:e2400146. [PMID: 39491810 DOI: 10.1002/bies.202400146] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/16/2024] [Revised: 10/06/2024] [Accepted: 10/11/2024] [Indexed: 11/05/2024]
Abstract
The genome sequencing revolution has revealed that all species possess a large number of unique genes critical for trait variation, adaptation, and evolutionary innovation. One widely used approach to identify such genes consists of detecting protein-coding sequences with no homology in other genomes, termed orphan genes. These genes have been extensively studied, under the assumption that they represent valid proxies for species-specific genes. Here, we critically evaluate taxonomic, phylogenetic, and sequence evolution evidence showing that orphan genes belong to a range of evolutionary ages and thus cannot be assigned to a single lineage. Furthermore, we show that the processes generating orphan genes are substantially more diverse than generally thought and include horizontal gene transfer, transposable element domestication, and overprinting. Thus, orphan genes represent a heterogeneous collection of genes rather than a single biological entity, making them unsuitable as a subject for meaningful investigation of gene evolution and phenotypic innovation.
Affiliation(s)
- Andres Barboza Pereira
  - Interdisciplinary Graduate Program in Genetics & Genomics, Texas A&M University, College Station, Texas, USA
  - Interdisciplinary Doctoral Program in Ecology and Evolutionary Biology, Texas A&M University, College Station, Texas, USA
- Matthew Marano
  - Interdisciplinary Doctoral Program in Ecology and Evolutionary Biology, Texas A&M University, College Station, Texas, USA
- Ramya Bathala
  - Department of Biochemistry and Biophysics, Texas A&M University, College Station, Texas, USA
- Andres Neira
  - School of Pharmacy, Texas A&M University, College Station, Texas, USA
- Alex Samano
  - Department of Biology, Texas A&M University, College Station, Texas, USA
- Adekola Owoyemi
  - Department of Ecology and Conservation Biology, Texas A&M University, College Station, Texas, USA
- Claudio Casola
  - Interdisciplinary Graduate Program in Genetics & Genomics, Texas A&M University, College Station, Texas, USA
  - Interdisciplinary Doctoral Program in Ecology and Evolutionary Biology, Texas A&M University, College Station, Texas, USA
  - Department of Ecology and Conservation Biology, Texas A&M University, College Station, Texas, USA
2
Zhao Q, Zheng Y, Li Y, Shi L, Zhang J, Ma D, You M. An Orphan Gene Enhances Male Reproductive Success in Plutella xylostella. Mol Biol Evol 2024; 41:msae142. [PMID: 38990889 PMCID: PMC11290247 DOI: 10.1093/molbev/msae142] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/24/2023] [Revised: 06/28/2024] [Accepted: 07/05/2024] [Indexed: 07/13/2024]
Abstract
Plutella xylostella exhibits exceptional reproduction ability, yet the genetic basis underlying the high reproductive capacity remains unknown. Here, we demonstrate that an orphan gene, lushu, which encodes a sperm protein, plays a crucial role in male reproductive success. Lushu is located on the Z chromosome and is prevalent across different P. xylostella populations worldwide. We subsequently generated lushu mutants using transgenic CRISPR/Cas9 system. Knockout of Lushu results in reduced male mating efficiency and accelerated death in adult males. Furthermore, our findings highlight that the deficiency of lushu reduced the transfer of sperms from males to females, potentially resulting in hindered sperm competition. Additionally, the knockout of Lushu results in disrupted gene expression in energy-related pathways and elevated insulin levels in adult males. Our findings reveal that male reproductive performance has evolved through the birth of a newly evolved, lineage-specific gene with enormous potentiality in fecundity success. These insights hold valuable implications for identifying the target for genetic control, particularly in relation to species-specific traits that are pivotal in determining high levels of fecundity.
Affiliation(s)
- Qian Zhao
  - State Key Laboratory for Ecological Pest Control of Fujian/Taiwan Crops and College of Life Science, Fujian Agriculture and Forestry University, Fuzhou 350002, China
  - Ministerial and Provincial Joint Innovation Centre for Safety Production of Cross-Strait Crops, Fujian Agriculture and Forestry University, Fuzhou 350002, China
  - Joint International Research Laboratory of Ecological Pest Control, Ministry of Education, Fuzhou 350002, China
- Yahong Zheng
  - State Key Laboratory for Ecological Pest Control of Fujian/Taiwan Crops and College of Life Science, Fujian Agriculture and Forestry University, Fuzhou 350002, China
- Yiying Li
  - State Key Laboratory for Ecological Pest Control of Fujian/Taiwan Crops and College of Life Science, Fujian Agriculture and Forestry University, Fuzhou 350002, China
- Lingping Shi
  - State Key Laboratory for Ecological Pest Control of Fujian/Taiwan Crops and College of Life Science, Fujian Agriculture and Forestry University, Fuzhou 350002, China
- Jing Zhang
  - State Key Laboratory for Ecological Pest Control of Fujian/Taiwan Crops and College of Life Science, Fujian Agriculture and Forestry University, Fuzhou 350002, China
  - Ministerial and Provincial Joint Innovation Centre for Safety Production of Cross-Strait Crops, Fujian Agriculture and Forestry University, Fuzhou 350002, China
  - Joint International Research Laboratory of Ecological Pest Control, Ministry of Education, Fuzhou 350002, China
- Dongna Ma
  - State Key Laboratory for Ecological Pest Control of Fujian/Taiwan Crops and College of Life Science, Fujian Agriculture and Forestry University, Fuzhou 350002, China
- Minsheng You
  - State Key Laboratory for Ecological Pest Control of Fujian/Taiwan Crops and College of Life Science, Fujian Agriculture and Forestry University, Fuzhou 350002, China
  - Ministerial and Provincial Joint Innovation Centre for Safety Production of Cross-Strait Crops, Fujian Agriculture and Forestry University, Fuzhou 350002, China
  - Joint International Research Laboratory of Ecological Pest Control, Ministry of Education, Fuzhou 350002, China
3
Wu W, Ye Z, Mao Q, Shan HW, Li JM, Chen JP. Combined Transcriptome and Proteome Analysis of the Protein Composition of the Brochosomes of the Leafhopper Nephotettix cincticeps. Insects 2023; 14:784. [PMID: 37887796 PMCID: PMC10607721 DOI: 10.3390/insects14100784] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/29/2023] [Revised: 09/22/2023] [Accepted: 09/25/2023] [Indexed: 10/28/2023]
Abstract
Brochosomes, unique coatings on the integuments of Cicadellidae, are synthesized in specialized glandular sections of Malpighian tubules. However, limited knowledge exists regarding the protein composition of brochosomes. In this study, we conducted transcriptomic and proteomic profiling to characterize the brochosome protein composition in the rice green leafhopper Nephotettix cincticeps. Brochosomes were collected from the forewings of leafhoppers using ultrasonic treatment, allowing for more effective brochosome collection and shaking treatment, resulting in purer brochosomes. Transcriptome sequencing analysis identified 106 genes specifically expressed in the Malpighian tubules; combined with proteomic data, we identified 22 candidate brochosome proteins. These proteins were classified into 12 brochosomins (BSM) and 10 brochosome-associated proteins (BSAP) based on previous research. Conserved motif analysis and functional predictions unveiled unique motifs in each BSM, while BSAP appeared to play a crucial role in BSM folding and pathogen resistance. Comparative analysis of other Hemiptera species demonstrated that all BSM and some BSAP are specific to the Cicadellidae family. Our findings could contribute to understanding the mechanism of brochosome synthesis, its function, and evolutionary genesis.
Affiliation(s)
- Wei Wu
  - State Key Laboratory for Managing Biotic and Chemical Threats to the Quality and Safety of Agro-Products, Key Laboratory of Biotechnology in Plant Protection of Ministry of Agriculture and Zhejiang Province, Institute of Plant Virology, Ningbo University, Ningbo 315211, China
- Jian-Ping Chen
  - State Key Laboratory for Managing Biotic and Chemical Threats to the Quality and Safety of Agro-Products, Key Laboratory of Biotechnology in Plant Protection of Ministry of Agriculture and Zhejiang Province, Institute of Plant Virology, Ningbo University, Ningbo 315211, China
4
Fakhar AZ, Liu J, Pajerowska-Mukhtar KM, Mukhtar MS. The Lost and Found: Unraveling the Functions of Orphan Genes. J Dev Biol 2023; 11:27. [PMID: 37367481 PMCID: PMC10299390 DOI: 10.3390/jdb11020027] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Received: 02/08/2023] [Revised: 05/19/2023] [Accepted: 05/26/2023] [Indexed: 06/28/2023]
Abstract
Orphan Genes (OGs) are a mysterious class of genes that have recently gained significant attention. Despite lacking a clear evolutionary history, they are found in nearly all living organisms, from bacteria to humans, and they play important roles in diverse biological processes. The discovery of OGs was first made through comparative genomics followed by the identification of unique genes across different species. OGs tend to be more prevalent in species with larger genomes, such as plants and animals, and their evolutionary origins remain unclear but potentially arise from gene duplication, horizontal gene transfer (HGT), or de novo origination. Although their precise function is not well understood, OGs have been implicated in crucial biological processes such as development, metabolism, and stress responses. To better understand their significance, researchers are using a variety of approaches, including transcriptomics, functional genomics, and molecular biology. This review offers a comprehensive overview of the current knowledge of OGs in all domains of life, highlighting the possible role of dark transcriptomics in their evolution. More research is needed to fully comprehend the role of OGs in biology and their impact on various biological processes.
Affiliation(s)
- M. Shahid Mukhtar
  - Department of Biology, University of Alabama at Birmingham, 1300 University Blvd., Birmingham, AL 35294, USA
5
Poretti M, Praz CR, Sotiropoulos AG, Wicker T. A survey of lineage-specific genes in Triticeae reveals de novo gene evolution from genomic raw material. Plant Direct 2023; 7:e484. [PMID: 36937792 PMCID: PMC10020141 DOI: 10.1002/pld3.484] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Received: 06/27/2022] [Revised: 01/26/2023] [Accepted: 01/27/2023] [Indexed: 06/18/2023]
Abstract
Diploid plant genomes typically contain ~35,000 genes, almost all belonging to highly conserved gene families. Only a small fraction are lineage-specific, which are found in only one or few closely related species. Little is known about how genes arise de novo in plant genomes and how often this occurs; however, they are believed to be important for plants diversification and adaptation. We developed a pipeline to identify lineage-specific genes in Triticeae, using newly available genome assemblies of wheat, barley, and rye. Applying a set of stringent criteria, we identified 5942 candidate Triticeae-specific genes (TSGs), of which 2337 were validated as protein-coding genes in wheat. Differential gene expression analyses revealed that stress-induced wheat TSGs are strongly enriched in putative secreted proteins. Some were previously described to be involved in Triticeae non-host resistance and cold response. Additionally, we show that 1079 TSGs have sequence homology to transposable elements (TEs), ~68% of them deriving from regulatory non-coding regions of Gypsy retrotransposons. Most importantly, we demonstrate that these TSGs are enriched in transmembrane domains and are among the most highly expressed wheat genes overall. To summarize, we conclude that de novo gene formation is relatively rare and that Triticeae probably possess ~779 lineage-specific genes per haploid genome. TSGs, which respond to pathogen and environmental stresses, may be interesting candidates for future targeted resistance breeding in Triticeae. Finally, we propose that non-coding regions of TEs might provide important genetic raw material for the functional innovation of TM domains and the evolution of novel secreted proteins.
Affiliation(s)
- Manuel Poretti
  - Department of Plant and Microbial Biology, University of Zurich, Zurich, Switzerland
  - Department of Biology, University of Fribourg, Fribourg, Switzerland
- Coraline R. Praz
  - Department of Plant and Microbial Biology, University of Zurich, Zurich, Switzerland
  - Centro de Biotecnología y Genómica de Plantas, Universidad Politécnica de Madrid (UPM)–Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA), Madrid, Spain
- Thomas Wicker
  - Department of Plant and Microbial Biology, University of Zurich, Zurich, Switzerland
6
Chenevert M, Miller B, Karkoutli A, Rusnak A, Lott SE, Atallah J. The early embryonic transcriptome of a Hawaiian Drosophila picture-wing fly shows evidence of altered gene expression and novel gene evolution. J Exp Zool B Mol Dev Evol 2022; 338:277-291. [PMID: 35322942 DOI: 10.1002/jez.b.23129] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/15/2021] [Revised: 01/14/2022] [Accepted: 02/13/2022] [Indexed: 06/14/2023]
Abstract
A massive adaptive radiation on the Hawaiian archipelago has produced approximately one-quarter of the fly species in the family Drosophilidae. The Hawaiian Drosophila clade has long been recognized as a model system for the study of both the ecology of island endemics and the evolution of developmental mechanisms, but relatively few genomic and transcriptomic datasets are available for this group. We present here a differential expression analysis of the transcriptional profiles of two highly conserved embryonic stages in the Hawaiian picture-wing fly Drosophila grimshawi. When we compared our results to previously published datasets across the family Drosophilidae, we identified cases of both gains and losses of gene representation in D. grimshawi, including an apparent delay in Hox gene activation. We also found a high expression of unannotated genes. Most transcripts of unannotated genes with open reading frames do not have identified homologs in non-Hawaiian Drosophila species, although the vast majority have sequence matches in genomes of other Hawaiian picture-wing flies. Some of these unannotated genes may have arisen from noncoding sequence in the ancestor of Hawaiian flies or during the evolution of the clade. Our results suggest that both the modified use of ancestral genes and the evolution of new ones may occur in rapid radiations.
Affiliation(s)
- Madeline Chenevert
  - Department of Biological Sciences, University of New Orleans, New Orleans, Louisiana, USA
  - Hayward Genetics Center, Tulane University School of Medicine, New Orleans, Louisiana, USA
- Bronwyn Miller
  - Department of Biological Sciences, University of New Orleans, New Orleans, Louisiana, USA
- Ahmad Karkoutli
  - Department of Biological Sciences, University of New Orleans, New Orleans, Louisiana, USA
  - LSUHSC School of Medicine, New Orleans, Louisiana, USA
- Anna Rusnak
  - Department of Biological Sciences, University of New Orleans, New Orleans, Louisiana, USA
  - Center for Biomedical Engineering, Brown University, Box A-2, Arnold Lab, Providence, Rhode Island, USA
- Susan E Lott
  - Department of Evolution & Ecology, University of California-Davis, Davis, California, USA
- Joel Atallah
  - Department of Biological Sciences, University of New Orleans, New Orleans, Louisiana, USA
7
Ma D, Lai Z, Ding Q, Zhang K, Chang K, Li S, Zhao Z, Zhong F. Identification, Characterization and Function of Orphan Genes Among the Current Cucurbitaceae Genomes. Front Plant Sci 2022; 13:872137. [PMID: 35599909 PMCID: PMC9114813 DOI: 10.3389/fpls.2022.872137] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/09/2022] [Accepted: 03/28/2022] [Indexed: 06/15/2023]
Abstract
Orphan genes (OGs) that are missing identifiable homologs in other lineages may potentially make contributions to a variety of biological functions. The Cucurbitaceae family consists of a wide range of fruit crops of worldwide or local economic significance. To date, very few functional mechanisms of OGs in Cucurbitaceae are known. In this study, we systematically identified the OGs of eight Cucurbitaceae species using a comparative genomics approach. The content of OGs varied widely among the eight Cucurbitaceae species, ranging from 1.63% in chayote to 16.55% in wax gourd. Genetic structure analysis showed that OGs have significantly shorter protein lengths and fewer exons in Cucurbitaceae. The subcellular localizations of OGs were basically the same, with only subtle differences. Except for aggregation in some chromosomal regions, the distribution density of OGs was higher near the telomeres and relatively evenly distributed on the chromosomes. Gene expression analysis revealed that OGs had less abundantly and highly tissue-specific expression. Interestingly, the largest proportion of these OGs was significantly more tissue-specific expressed in the flower than in other tissues, and more detectable expression was found in the male flower. Functional prediction of OGs showed that (1) 18 OGs associated with male sterility in watermelon; (2) 182 OGs associated with flower development in cucumber; (3) 51 OGs associated with environmental adaptation in watermelon; (4) 520 OGs may help with the large fruit size in wax gourd. Our results provide the molecular basis and research direction for some important mechanisms in Cucurbitaceae species and domesticated crops.
Affiliation(s)
- Dongna Ma
  - College of Horticulture, Fujian Agriculture and Forestry University, Fujian, China
  - College of the Environment and Ecology, Xiamen University, Fujian, China
- Zhengfeng Lai
  - Subtropical Agricultural Research Institute, Fujian Academy of Agriculture Sciences, Fujian, China
- Qiansu Ding
  - College of the Environment and Ecology, Xiamen University, Fujian, China
- Kun Zhang
  - College of Horticulture, Fujian Agriculture and Forestry University, Fujian, China
- Kaizhen Chang
  - College of Horticulture, Fujian Agriculture and Forestry University, Fujian, China
- Shuhao Li
  - College of Horticulture, Fujian Agriculture and Forestry University, Fujian, China
- Zhizhu Zhao
  - College of the Environment and Ecology, Xiamen University, Fujian, China
- Fenglin Zhong
  - College of Horticulture, Fujian Agriculture and Forestry University, Fujian, China
8
Zhao Z, Ma D. Genome-Wide Identification, Characterization and Function Analysis of Lineage-Specific Genes in the Tea Plant Camellia sinensis. Front Genet 2021; 12:770570. [PMID: 34858483 PMCID: PMC8631334 DOI: 10.3389/fgene.2021.770570] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Received: 09/04/2021] [Accepted: 10/14/2021] [Indexed: 11/22/2022]
Abstract
Genes that have no homologous sequences with other species are called lineage-specific genes (LSGs), are common in living organisms, and have an important role in the generation of new functions, adaptive evolution and phenotypic alteration of species. Camellia sinensis var. sinensis (CSS) is one of the most widely distributed cultivars for quality green tea production. The rich catechins in tea have antioxidant, free radical elimination, fat loss and cancer prevention potential. To further understand the evolution and utilize the function of LSGs in tea, we performed a comparative genomics approach to identify Camellia-specific genes (CSGs). Our result reveals that 1701 CSGs were identified specific to CSS, accounting for 3.37% of all protein-coding genes. The majority of CSGs (57.08%) were generated by gene duplication, and the time of duplication occurrence coincide with the time of two genome-wide replication (WGD) events that happened in CSS genome. Gene structure analysis revealed that CSGs have shorter gene lengths, fewer exons, higher GC content and higher isoelectric point. Gene expression analysis showed that CSG had more tissue-specific expression compared to evolutionary conserved genes (ECs). Weighted gene co-expression network analysis (WGCNA) showed that 18 CSGs are mainly associated with catechin synthesis-related pathways, including phenylalanine biosynthesis, biosynthesis of amino acids, pentose phosphate pathway, photosynthesis and carbon metabolism. Besides, we found that the expression of three CSGs (CSS0030246, CSS0002298, and CSS0030939) was significantly down-regulated in response to both types of stresses (salt and drought). Our study first systematically identified LSGs in CSS, and comprehensively analyzed the features and potential functions of CSGs. We also identified key candidate genes, which will provide valuable assistance for further studies on catechin synthesis and provide a molecular basis for the excavation of excellent germplasm resources.
Affiliation(s)
- Zhizhu Zhao
  - Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen, China
- Dongna Ma
  - Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen, China
9
Ma D, Ding Q, Guo Z, Zhao Z, Wei L, Li Y, Song S, Zheng HL. Identification, characterization and expression analysis of lineage-specific genes within mangrove species Aegiceras corniculatum. Mol Genet Genomics 2021; 296:1235-1247. [PMID: 34363105 DOI: 10.1007/s00438-021-01810-0] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Received: 01/06/2021] [Accepted: 07/22/2021] [Indexed: 11/25/2022]
Abstract
Lineage-specific genes (LSGs) are the genes that have no recognizable homology to any sequences in other species, which are important drivers for the generation of new functions, phenotypic changes, and facilitating species adaptation to environment. Aegiceras corniculatum is one of major mangrove plant species adapted to waterlogging and saline conditions, and the exploration of aegiceras-specific genes (ASGs) is important to reveal its adaptation to the harsh environment. Here, we performed a systematic analysis on ASGs, focusing on their sequence characterization, origination and expression patterns. Our results reveal that there are 4823 ASGs in the genome, approximately 11.84% of all protein-coding genes. High proportion (45.78%) of ASGs originate from gene duplication, and the time of gene duplication of ASGs is consistent with the timing of two genome-wide replication (WGD) events that occurred in A. corniculatum, and also coincides with a short period of global warming during the Paleocene-Eocene Maximum (PETM, 55.5 million years ago). Gene structure analysis showed that ASGs have shorter protein lengths, fewer exons, and higher isoelectric point. Expression patterns analysis showed that ASGs had low levels of expression and more tissue-specific expression. Weighted gene co-expression network analysis (WGCNA) revealed that 86 ASGs co-expressed gene modules were primarily involved in pathways related to adversity stress, including plant hormone signal transduction, phenylpropanoid biosynthesis, photosynthesis, peroxisome and pentose phosphate pathway. This study provides a comprehensive analysis of the characteristics and potential functions of ASGs and identifies key candidate genes, which will contribute to the subsequent further investigation of the adaptation of A. corniculatum to intertidal coastal wetland habitats.
Affiliation(s)
- Dongna Ma
  - Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen, 361005, China
- Qiansu Ding
  - Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen, 361005, China
- Zejun Guo
  - Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen, 361005, China
- Zhizhu Zhao
  - Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen, 361005, China
- Liufeng Wei
  - Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen, 361005, China
- Yiying Li
  - State Key Laboratory of Ecological Pest Control for Fujian and Taiwan Crops, Institute of Applied Ecology, Fujian Agriculture and Forestry University, Fuzhou, 350002, China
- Shiwei Song
  - Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen, 361005, China
- Hai-Lei Zheng
  - Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen, 361005, China
10
Zile K, Dessimoz C, Wurm Y, Masel J. Only a Single Taxonomically Restricted Gene Family in the Drosophila melanogaster Subgroup Can Be Identified with High Confidence. Genome Biol Evol 2020; 12:1355-1366. [PMID: 32589737 PMCID: PMC8059200 DOI: 10.1093/gbe/evaa127] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Accepted: 06/19/2020] [Indexed: 12/12/2022]
Abstract
Taxonomically restricted genes (TRGs) are genes that are present only in one clade. Protein-coding TRGs may evolve de novo from previously noncoding sequences: functional ncRNA, introns, or alternative reading frames of older protein-coding genes, or intergenic sequences. A major challenge in studying de novo genes is the need to avoid both false-positives (nonfunctional open reading frames and/or functional genes that did not arise de novo) and false-negatives. Here, we search conservatively for high-confidence TRGs as the most promising candidates for experimental studies, ensuring functionality through conservation across at least two species, and ensuring de novo status through examination of homologous noncoding sequences. Our pipeline also avoids ascertainment biases associated with preconceptions of how de novo genes are born. We identify one TRG family that evolved de novo in the Drosophila melanogaster subgroup. This TRG family contains single-copy genes in Drosophila simulans and Drosophila sechellia. It originated in an intron of a well-established gene, sharing that intron with another well-established gene upstream. These TRGs contain an intron that predates their open reading frame. These genes have not been previously reported as de novo originated, and to our knowledge, they are the best Drosophila candidates identified so far for experimental studies aimed at elucidating the properties of de novo genes.
Affiliation(s)
- Karina Zile
  - Division of Biosciences, University College London, United Kingdom
- Christophe Dessimoz
  - Swiss Institute of Bioinformatics, Lausanne, Switzerland
  - Department of Computational Biology, University of Lausanne, Switzerland
  - Center for Integrative Genomics, University of Lausanne, Switzerland
  - Department of Genetics, Evolution and Environment, University College London, United Kingdom
  - Department of Computer Science, University College London, United Kingdom
- Yannick Wurm
  - School of Biological and Chemical Sciences, Queen Mary University of London, United Kingdom
  - Alan Turing Institute, London, United Kingdom
- Joanna Masel
  - Department of Ecology and Evolutionary Biology, University of Arizona
11
Affiliation(s)
- Stephen Branden Van Oss
  - Department of Computational and Systems Biology, Pittsburgh Center for Evolutionary Biology and Medicine, School of Medicine, University of Pittsburgh, Pittsburgh, PA, United States of America
- Anne-Ruxandra Carvunis
  - Department of Computational and Systems Biology, Pittsburgh Center for Evolutionary Biology and Medicine, School of Medicine, University of Pittsburgh, Pittsburgh, PA, United States of America
12
Origination and evolution of orphan genes and de novo genes in the genome of Caenorhabditis elegans. Sci China Life Sci 2019; 62:579-593. [DOI: 10.1007/s11427-019-9482-0] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Received: 01/11/2019] [Accepted: 01/22/2019] [Indexed: 12/31/2022]
13
14
Carlson DE, Hedin M. Comparative transcriptomics of Entelegyne spiders (Araneae, Entelegynae), with emphasis on molecular evolution of orphan genes. PLoS One 2017; 12:e0174102. [PMID: 28379977 PMCID: PMC5381867 DOI: 10.1371/journal.pone.0174102] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Received: 10/15/2016] [Accepted: 03/04/2017] [Indexed: 11/18/2022]
Abstract
Next-generation sequencing technology is rapidly transforming the landscape of evolutionary biology, and has become a cost-effective and efficient means of collecting exome information for non-model organisms. Due to their taxonomic diversity, production of interesting venom and silk proteins, and the relative scarcity of existing genomic resources, spiders in particular are excellent targets for next-generation sequencing (NGS) methods. In this study, the transcriptomes of six entelegyne spider species from three genera (Cicurina travisae, C. vibora, Habronattus signatus, H. ustulatus, Nesticus bishopi, and N. cooperi) were sequenced and de novo assembled. Each assembly was assessed for quality and completeness and functionally annotated using gene ontology information. Approximately 100 transcripts with evidence of homology to venom proteins were discovered. After identifying more than 3,000 putatively orthologous genes across all six taxa, we used comparative analyses to identify 24 instances of positively selected genes. In addition, between ~ 550 and 1,100 unique orphan genes were found in each genus. These unique, uncharacterized genes exhibited elevated rates of amino acid substitution, potentially consistent with lineage-specific adaptive evolution. The data generated for this study represent a valuable resource for future phylogenetic and molecular evolutionary research, and our results provide new insight into the forces driving genome evolution in taxa that span the root of entelegyne spider phylogeny.
Affiliation(s)
- David E. Carlson
  - Department of Biology, San Diego State University, San Diego, California, United States of America
  - Department of Ecology & Evolution, Stony Brook University, Stony Brook, New York, United States of America
- Marshal Hedin
  - Department of Biology, San Diego State University, San Diego, California, United States of America
15
Yi S, Wang S, Zhong J, Wang W. Comprehensive Transcriptome Analysis Provides Evidence of Local Thermal Adaptation in Three Loaches (Genus: Misgurnus). Int J Mol Sci 2016; 17:ijms17121943. [PMID: 27886141 PMCID: PMC5187763 DOI: 10.3390/ijms17121943] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Received: 08/23/2016] [Revised: 10/27/2016] [Accepted: 11/14/2016] [Indexed: 02/07/2023]
Abstract
The geographic distribution of three Misgurnus species, M. anguillicaudatus, M. bipartitus, and M. mohoity, displays a specific pattern in China, coincident with temperature zones. In this study, we sequenced the transcriptomes of these three species and used the sequences to investigate the lineage-specific adaptations within the genus Misgurnus. In total, 51 orphan genes (19 in M. anguillicaudatus, 18 in M. bipartitus, and 14 in M. mohoity) that may contribute to the species-specific adaptations were identified. An analysis of 1392 one-to-one orthologous genes revealed significantly higher ratios of nonsynonymous-to-synonymous substitutions in the M. mohoity lineage than in M. anguillicaudatus. The genes displaying signatures of positive selection and rapid evolution in Misgurnus were involved in four functional categories: (1) energy metabolism; (2) signal transduction; (3) membrane; and (4) cell proliferation or apoptosis, implying that these candidate genes play critical roles in the thermal adaptation of the fish to their living environments. We also detected more than five positively selected sites in cldn15lb and isca1, which function as important factors in paracellular Na+ transport and Fe/S cluster assembly, respectively. Overall, our study provides valuable insights into the adaptive evolution of loaches from different temperature zones in China and is a foundation for future studies to clarify the genetic basis of temperature adaptation in fishes.
Affiliation(s)
- Shaokui Yi
  - Key Lab of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education, College of Fisheries, Huazhong Agricultural University, Wuhan 430070, China
  - Key Lab of Freshwater Animal Breeding, Ministry of Agriculture, College of Fisheries, Huazhong Agricultural University, Wuhan 430070, China
- Sai Wang
  - Key Lab of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education, College of Fisheries, Huazhong Agricultural University, Wuhan 430070, China
  - Key Lab of Freshwater Animal Breeding, Ministry of Agriculture, College of Fisheries, Huazhong Agricultural University, Wuhan 430070, China
- Jia Zhong
  - Key Lab of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education, College of Fisheries, Huazhong Agricultural University, Wuhan 430070, China
  - Key Lab of Freshwater Animal Breeding, Ministry of Agriculture, College of Fisheries, Huazhong Agricultural University, Wuhan 430070, China
- Weimin Wang
  - Key Lab of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education, College of Fisheries, Huazhong Agricultural University, Wuhan 430070, China
  - Key Lab of Freshwater Animal Breeding, Ministry of Agriculture, College of Fisheries, Huazhong Agricultural University, Wuhan 430070, China
16
Prabh N, Rödelsperger C. Are orphan genes protein-coding, prediction artifacts, or non-coding RNAs? BMC Bioinformatics 2016; 17:226. [PMID: 27245157 PMCID: PMC4888513 DOI: 10.1186/s12859-016-1102-x] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.7] [Received: 02/24/2016] [Accepted: 05/24/2016] [Indexed: 12/26/2022]
Abstract
Background: Current genome sequencing projects reveal substantial numbers of taxonomically restricted, so-called orphan genes that lack homology with genes from other evolutionary lineages. However, it is not clear to what extent orphan genes are real, genomic artifacts, or represent non-coding RNAs. Results: Here, we use a simple set of assumptions to test the nature of orphan genes. First, a sequence that is transcribed is considered a real biological entity. Second, every sequence that is supported by proteome data or shows a depletion of non-synonymous substitutions is a protein-coding gene. Using genomic, transcriptomic and proteomic data for the nematode Pristionchus pacificus, we show that between 4129–7997 (42–81%) of predicted orphan genes are expressed and 3818–7545 (39–76%) of orphan genes are under negative selection. In three cases that exhibited strong evolutionary constraint but lacked expression evidence in 14 RNA-seq samples, we could experimentally validate the predicted gene structures. Comparing different data sets to infer selection on orphan gene clusters, we find that the presence of a closely related genome provides the most powerful resource to robustly identify evidence of negative selection. However, even in the absence of other genomic data, the availability of paralogous sequences was enough to show negative selection in 8–10% of orphan genes. Conclusions: Our study shows that the great majority of previously identified orphan genes in P. pacificus are indeed protein-coding genes. Even though this work represents a case study on a single species, our approach can be transferred to genomic data of other non-model organisms in order to ascertain the protein-coding nature of orphan genes.
Affiliation(s)
- Neel Prabh
  - Department for Evolutionary Biology, Max-Planck-Institute for Developmental Biology, Spemannstrasse 35, 72076, Tübingen, Germany
- Christian Rödelsperger
  - Department for Evolutionary Biology, Max-Planck-Institute for Developmental Biology, Spemannstrasse 35, 72076, Tübingen, Germany
17
Proteome analysis of male accessory gland secretions in oriental fruit flies reveals juvenile hormone-binding protein, suggesting impact on female reproduction. Sci Rep 2015; 5:16845. [PMID: 26582577 PMCID: PMC4652233 DOI: 10.1038/srep16845] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Received: 01/20/2015] [Accepted: 10/21/2015] [Indexed: 01/14/2023]
Abstract
In insects, the accessory gland proteins (Acps) secreted by male accessory glands (MAGs) account for the majority of seminal fluid proteins. Mixed with sperm, they are transferred to the female at mating and so impact reproduction. In this project, we identified 2,927 proteins in the MAG secretions of the oriental fruit fly Bactrocera dorsalis, an important agricultural pest worldwide, using LC-MS analysis, and all sequences containing open reading frames were analyzed using SignalP. In total, 90 Acps were identified. About one third (26) of these 90 Acps had a specific functional description, while the other two thirds (64) had no functional description, including dozens of new classes of proteins. Several of these novel Acps were abundant in the MAG secretions, and we confirmed their MAG-specific expression by qPCR. Finally, one of these novel proteins was functionally predicted to be a juvenile hormone-binding protein, suggesting that Acps affect reproductive events in the female. Our results will aid in the development of an experimental method to identify Acps in insects, and the newly identified Acps in B. dorsalis will pave the way for further exploration of their function in reproduction and their potential development as new insecticide targets.