1
|
Wang Z, Wang Y, Kasuga T, Hassler H, Lopez‐Giraldez F, Dong C, Yarden O, Townsend JP. Origins of lineage-specific elements via gene duplication, relocation, and regional rearrangement in Neurospora crassa. Mol Ecol 2024; 33:e17168. [PMID: 37843462 PMCID: PMC11628664 DOI: 10.1111/mec.17168] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2023] [Revised: 09/20/2023] [Accepted: 09/27/2023] [Indexed: 10/17/2023]
Abstract
The origin of new genes has long been a central interest of evolutionary biologists. However, their novelty means that they evade reconstruction by the classical tools of evolutionary modelling. This evasion of deep ancestral investigation necessitates intensive study of model species within well-sampled, recently diversified, clades. One such clade is the model genus Neurospora, members of which lack recent gene duplications. Several Neurospora species are comprehensively characterized organisms apt for studying the evolution of lineage-specific genes (LSGs). Using gene synteny, we documented that 78% of Neurospora LSG clusters are located adjacent to the telomeres featuring extensive tracts of non-coding DNA and duplicated genes. Here, we report several instances of LSGs that are likely from regional rearrangements and potentially from gene rebirth. To broadly investigate the functions of LSGs, we assembled transcriptomics data from 68 experimental data points and identified co-regulatory modules using Weighted Gene Correlation Network Analysis, revealing that LSGs are widely but peripherally involved in known regulatory machinery for diverse functions. The ancestral status of the LSG mas-1, a gene with roles in cell-wall integrity and cellular sensitivity to antifungal toxins, was investigated in detail alongside its genomic neighbours, indicating that it arose from an ancient lysophospholipase precursor that is ubiquitous in lineages of the Sordariomycetes. Our discoveries illuminate a "rummage region" in the N. crassa genome that enables the formation of new genes and functions to arise via gene duplication and relocation, followed by fast mutation and recombination facilitated by sequence repeats and unconstrained non-coding sequences.
Collapse
Affiliation(s)
- Zheng Wang
- Department of BiostatisticsYale School of Public HealthNew HavenConnecticutUSA
| | - Yen‐Wen Wang
- Department of BiostatisticsYale School of Public HealthNew HavenConnecticutUSA
| | - Takao Kasuga
- College of Biological SciencesUniversity of California, DavisDavisCaliforniaUSA
| | - Hayley Hassler
- Department of BiostatisticsYale School of Public HealthNew HavenConnecticutUSA
| | | | - Caihong Dong
- Institute of MicrobiologyChinese Academy of SciencesBeijingChina
| | - Oded Yarden
- Department of Plant Pathology and Microbiology, The Robert H. Smith Faculty of Agriculture, Food and EnvironmentThe Hebrew University of JerusalemRehovotIsrael
| | - Jeffrey P. Townsend
- Department of BiostatisticsYale School of Public HealthNew HavenConnecticutUSA
- Department of Ecology and Evolutionary Biology, Program in Microbiology, and Program in Computational Biology and BioinformaticsYale UniversityNew HavenConnecticutUSA
| |
Collapse
|
2
|
Harry ND, Zakas C. The role of heterochronic gene expression and regulatory architecture in early developmental divergence. eLife 2024; 13:RP93062. [PMID: 39177024 PMCID: PMC11343563 DOI: 10.7554/elife.93062] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/24/2024] Open
Abstract
New developmental programs can evolve through adaptive changes to gene expression. The annelid Streblospio benedicti has a developmental dimorphism, which provides a unique intraspecific framework for understanding the earliest genetic changes that take place during developmental divergence. Using comparative RNAseq through ontogeny, we find that only a small proportion of genes are differentially expressed at any time, despite major differences in larval development and life history. These genes shift expression profiles across morphs by either turning off any expression in one morph or changing the timing or amount of gene expression. We directly connect the contributions of these mechanisms to differences in developmental processes. We examine F1 offspring - using reciprocal crosses - to determine maternal mRNA inheritance and the regulatory architecture of gene expression. These results highlight the importance of both novel gene expression and heterochronic shifts in developmental evolution, as well as the trans-acting regulatory factors in initiating divergence.
Collapse
Affiliation(s)
- Nathan D Harry
- Department of Biological Sciences, North Carolina State UniversityRaleighUnited States
| | - Christina Zakas
- Department of Biological Sciences, North Carolina State UniversityRaleighUnited States
| |
Collapse
|
3
|
Boland DJ, Cornejo-Corona I, Browne DR, Murphy RL, Mullet J, Okada S, Devarenne TP. Reclassification of Botryococcus braunii chemical races into separate species based on a comparative genomics analysis. PLoS One 2024; 19:e0304144. [PMID: 39074348 DOI: 10.1371/journal.pone.0304144] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2023] [Accepted: 05/07/2024] [Indexed: 07/31/2024] Open
Abstract
The colonial green microalga Botryococcus braunii is well known for producing liquid hydrocarbons that can be utilized as biofuel feedstocks. B. braunii is taxonomically classified as a single species made up of three chemical races, A, B, and L, that are mainly distinguished by the hydrocarbons produced. We previously reported a B race draft nuclear genome, and here we report the draft nuclear genomes for the A and L races. A comparative genomic study of the three B. braunii races and 14 other algal species within Chlorophyta revealed significant differences in the genomes of each race of B. braunii. Phylogenomically, there was a clear divergence of the three races with the A race diverging earlier than both the B and L races, and the B and L races diverging from a later common ancestor not shared by the A race. DNA repeat content analysis suggested the B race had more repeat content than the A or L races. Orthogroup analysis revealed the B. braunii races displayed more gene orthogroup diversity than three closely related Chlamydomonas species, with nearly 24-36% of all genes in each B. braunii race being specific to each race. This analysis suggests the three races are distinct species based on sufficient differences in their respective genomes. We propose reclassification of the three chemical races to the following species names: Botryococcus alkenealis (A race), Botryococcus braunii (B race), and Botryococcus lycopadienor (L race).
Collapse
Affiliation(s)
- Devon J Boland
- Department of Biochemistry and Biophysics, Texas A & M University, College Station, Texas, United States of America
- Texas A&M Institute for Genome Sciences & Society (TIGSS), College Station, Texas, United States of America
| | - Ivette Cornejo-Corona
- Department of Biochemistry and Biophysics, Texas A & M University, College Station, Texas, United States of America
| | - Daniel R Browne
- Department of Biochemistry and Biophysics, Texas A & M University, College Station, Texas, United States of America
- AI & Computational Biology, LanzaTech Inc., Skokie, Illinois, United States of America
| | - Rebecca L Murphy
- Department of Biochemistry and Biophysics, Texas A & M University, College Station, Texas, United States of America
- Biology Department, Centenary College of Louisiana, Shreveport, Louisiana, United States of America
| | - John Mullet
- Department of Biochemistry and Biophysics, Texas A & M University, College Station, Texas, United States of America
| | - Shigeru Okada
- Laboratory of Aquatic Natural Products Chemistry, Graduate School of Agricultural and Life Sciences, The University of Tokyo, Yayoi, Bunkyo, Tokyo, Japan
| | - Timothy P Devarenne
- Department of Biochemistry and Biophysics, Texas A & M University, College Station, Texas, United States of America
| |
Collapse
|
4
|
Maurer-Alcalá XX, Cote-L’Heureux A, Kosakovsky Pond SL, Katz LA. Somatic genome architecture and molecular evolution are decoupled in "young" linage-specific gene families in ciliates. PLoS One 2024; 19:e0291688. [PMID: 38271450 PMCID: PMC10810533 DOI: 10.1371/journal.pone.0291688] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2023] [Accepted: 09/02/2023] [Indexed: 01/27/2024] Open
Abstract
The evolution of lineage-specific gene families remains poorly studied across the eukaryotic tree of life, with most analyses focusing on the recent evolution of de novo genes in model species. Here we explore the origins of lineage-specific genes in ciliates, a ~1 billion year old clade of microeukaryotes that are defined by their division of somatic and germline functions into distinct nuclei. Previous analyses on conserved gene families have shown the effect of ciliates' unusual genome architecture on gene family evolution: extensive genome processing-the generation of thousands of gene-sized somatic chromosomes from canonical germline chromosomes-is associated with larger and more diverse gene families. To further study the relationship between ciliate genome architecture and gene family evolution, we analyzed lineage specific gene families from a set of 46 transcriptomes and 12 genomes representing x species from eight ciliate classes. We assess how the evolution lineage-specific gene families occurs among four groups of ciliates: extensive fragmenters with gene-size somatic chromosomes, non-extensive fragmenters with "large'' multi-gene somatic chromosomes, Heterotrichea with highly polyploid somatic genomes and Karyorelictea with 'paradiploid' somatic genomes. Our analyses demonstrate that: 1) most lineage-specific gene families are found at shallow taxonomic scales; 2) extensive genome processing (i.e., gene unscrambling) during development likely influences the size and number of young lineage-specific gene families; and 3) the influence of somatic genome architecture on molecular evolution is increasingly apparent in older gene families. Altogether, these data highlight the influences of genome architecture on the evolution of lineage-specific gene families in eukaryotes.
Collapse
Affiliation(s)
- Xyrus X. Maurer-Alcalá
- Institute of Cell Biology, University of Bern, Bern, Switzerland
- Department of Invertebrate Zoology, American Museum of Natural History, New York, New York, United States of America
| | - Auden Cote-L’Heureux
- Department of Biological Sciences, Smith College, Northampton, Massachusetts, United States of America
| | - Sergei L. Kosakovsky Pond
- Institute for Genomics and Evolutionary Medicine, Temple University, Philadelphia, Pennsylvania, United States of America
| | - Laura A. Katz
- Department of Biological Sciences, Smith College, Northampton, Massachusetts, United States of America
- Program in Organismic and Evolutionary Biology, University of Massachusetts Amherst, Amherst, Massachusetts, United States of America
| |
Collapse
|
5
|
Coombes B, Fellers JP, Grewal S, Rusholme‐Pilcher R, Hubbart‐Edwards S, Yang C, Joynson R, King IP, King J, Hall A. Whole-genome sequencing uncovers the structural and transcriptomic landscape of hexaploid wheat/Ambylopyrum muticum introgression lines. PLANT BIOTECHNOLOGY JOURNAL 2023; 21:482-496. [PMID: 35598169 PMCID: PMC9946142 DOI: 10.1111/pbi.13859] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/22/2021] [Revised: 04/28/2022] [Accepted: 05/15/2022] [Indexed: 05/29/2023]
Abstract
Wheat is a globally vital crop, but its limited genetic variation creates a challenge for breeders aiming to maintain or accelerate agricultural improvements over time. Introducing novel genes and alleles from wheat's wild relatives into the wheat breeding pool via introgression lines is an important component of overcoming this low variation but is constrained by poor genomic resolution and limited understanding of the genomic impact of introgression breeding programmes. By sequencing 17 hexaploid wheat/Ambylopyrum muticum introgression lines and the parent lines, we have precisely pinpointed the borders of introgressed segments, most of which occur within genes. We report a genome assembly and annotation of Am. muticum that has facilitated the identification of Am. muticum resistance genes commonly introgressed in lines resistant to stripe rust. Our analysis has identified an abundance of structural disruption and homoeologous pairing across the introgression lines, likely caused by the suppressed Ph1 locus. mRNAseq analysis of six of these introgression lines revealed that novel introgressed genes are rarely expressed and those that directly replace a wheat orthologue have a tendency towards downregulation, with no discernible compensation in the expression of homoeologous copies. This study explores the genomic impact of introgression breeding and provides a schematic that can be followed to characterize introgression lines and identify segments and candidate genes underlying the phenotype. This will facilitate more effective utilization of introgression pre-breeding material in wheat breeding programmes.
Collapse
Affiliation(s)
| | - John P. Fellers
- USDA–ARS Hard Winter Wheat Genetics Research UnitManhattanKansas66506USA
| | - Surbhi Grewal
- School of BiosciencesThe University of Nottingham, Sutton Bonington Campus, LoughboroughLeicestershireLE12 5RDUK
| | | | - Stella Hubbart‐Edwards
- School of BiosciencesThe University of Nottingham, Sutton Bonington Campus, LoughboroughLeicestershireLE12 5RDUK
| | - Cai‐yun Yang
- School of BiosciencesThe University of Nottingham, Sutton Bonington Campus, LoughboroughLeicestershireLE12 5RDUK
| | | | - Ian P. King
- School of BiosciencesThe University of Nottingham, Sutton Bonington Campus, LoughboroughLeicestershireLE12 5RDUK
| | - Julie King
- School of BiosciencesThe University of Nottingham, Sutton Bonington Campus, LoughboroughLeicestershireLE12 5RDUK
| | | |
Collapse
|
6
|
Jiang M, Zhang Y, Yang X, Li X, Lang H. Brassica rapa orphan gene BR1 delays flowering time in Arabidopsis. FRONTIERS IN PLANT SCIENCE 2023; 14:1135684. [PMID: 36909380 PMCID: PMC9998908 DOI: 10.3389/fpls.2023.1135684] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/01/2023] [Accepted: 02/15/2023] [Indexed: 06/18/2023]
Abstract
Orphan genes are essential to the emergence of species-specific traits and the process of evolution, lacking sequence similarity to any other identified genes. As they lack recognizable domains or functional motifs, however, efforts to characterize these orphan genes are often difficult. Flowering is a key trait in Brassica rapa, as premature bolting can have a pronounced adverse impact on plant quality and yield. Bolting resistance-related orphan genes, however, have yet to be characterized. In this study, an orphan gene designated BOLTING RESISTANCE 1 (BR1) was identified and found through gene structural variation analyses to be more highly conserved in Chinese cabbage than in other available accessions. The expression of BR1 was increased in bolting resistant Chinese cabbage and decreased in bolting non-resistant type, and the expression of some mark genes were consist with bolting resistance phenotype. BR1 is primarily expressed in leaves at the vegetative growth stage, and the highest BR1 expression levels during the flowering stage were observed in the flower buds and silique as compared to other tissue types. The overexpression of BR1 in Arabidopsis was associated with enhanced bolting resistance under long day (LD) conditions, with these transgenic plants exhibiting significant decreases in stem height, rosette radius, and chlorophyll content. Transcriptomic sequencing of WT and BR1OE plants showed the association of BR1 with other bolting resistance genes. Transcriptomic sequencing and qPCR revealed that six flowering integrator genes and one chlorophyll biosynthesis-related gene were downregulated following BR1 overexpression. Six key genes in photoperiodic flowering pathway exhibited downward expression trends in BR1OE plants, while the expression of floral repressor AtFLC gene was upregulated. The transcripts of these key genes were consistent with observed phenotypes in BR1OE plants, and the results indicated that BR1 may function through vernalization and photoperiodic pathway. Instead, the protein encoded by BR1 gene was subsequently found to localize to the nucleus. Taken together, we first propose that orphan gene BR1 functions as a novel regulator of flowering time, and these results suggested that BR1 may represent a promising candidate gene to support the selective breeding of Chinese cabbage cultivars with enhanced bolting resistance.
Collapse
Affiliation(s)
- Mingliang Jiang
- School of Agriculture, Jilin Agricultural Science and Technology College, Jilin, China
| | - Yuting Zhang
- College of Horticulture, Shenyang Agricultural University, Shenyang, China
| | - Xiaolong Yang
- College of Horticulture, South China Agricultural University, Guangzhou, China
| | - Xiaonan Li
- College of Horticulture, Shenyang Agricultural University, Shenyang, China
| | - Hong Lang
- School of Agriculture, Jilin Agricultural Science and Technology College, Jilin, China
| |
Collapse
|
7
|
Liu K, Chen Q, Huang GH. An Efficient Feature Selection Algorithm for Gene Families Using NMF and ReliefF. Genes (Basel) 2023; 14:421. [PMID: 36833348 PMCID: PMC9957060 DOI: 10.3390/genes14020421] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2022] [Revised: 01/24/2023] [Accepted: 01/25/2023] [Indexed: 02/10/2023] Open
Abstract
Gene families, which are parts of a genome's information storage hierarchy, play a significant role in the development and diversity of multicellular organisms. Several studies have focused on the characteristics of gene families, such as function, homology, or phenotype. However, statistical and correlation analyses on the distribution of gene family members in the genome have yet to be conducted. Here, a novel framework incorporating gene family analysis and genome selection based on NMF-ReliefF is reported. Specifically, the proposed method starts by obtaining gene families from the TreeFam database and determining the number of gene families within the feature matrix. Then, NMF-ReliefF is used to select features from the gene feature matrix, which is a new feature selection algorithm that overcomes the inefficiencies of traditional methods. Finally, a support vector machine is utilized to classify the acquired features. The results show that the framework achieved an accuracy of 89.1% and an AUC of 0.919 on the insect genome test set. We also employed four microarray gene data sets to evaluate the performance of the NMF-ReliefF algorithm. The outcomes show that the proposed method may strike a delicate balance between robustness and discrimination. Additionally, the proposed method's categorization is superior to state-of-the-art feature selection approaches.
Collapse
Affiliation(s)
- Kai Liu
- College of Plant Protection, Hunan Agricultural University, Changsha 410128, China
- Hunan Provincial Key Laboratory for Biology and Control of Plant Diseases and Insect Pests, Hunan Agricultural University, Nongda Road, Furong District, Changsha 410128, China
- College of Information and Intelligence, Hunan Agricultural University, Changsha 410128, China
| | - Qi Chen
- College of Plant Protection, Hunan Agricultural University, Changsha 410128, China
- Hunan Provincial Key Laboratory for Biology and Control of Plant Diseases and Insect Pests, Hunan Agricultural University, Nongda Road, Furong District, Changsha 410128, China
| | - Guo-Hua Huang
- College of Plant Protection, Hunan Agricultural University, Changsha 410128, China
- Hunan Provincial Key Laboratory for Biology and Control of Plant Diseases and Insect Pests, Hunan Agricultural University, Nongda Road, Furong District, Changsha 410128, China
| |
Collapse
|
8
|
Metivier JC, Chain FJJ. Diversity in Expression Biases of Lineage-Specific Genes During Development and Anhydrobiosis Among Tardigrade Species. Evol Bioinform Online 2022; 18:11769343221140277. [PMID: 36578471 PMCID: PMC9791283 DOI: 10.1177/11769343221140277] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2022] [Accepted: 10/27/2022] [Indexed: 12/24/2022] Open
Abstract
Lineage-specific genes can contribute to the emergence and evolution of novel traits and adaptations. Tardigrades are animals that have adapted to tolerate extreme conditions by undergoing a form of cryptobiosis called anhydrobiosis, a physical transformation to an inactive desiccated state. While studies to understand the genetics underlying the interspecies diversity in anhydrobiotic transitions have identified tardigrade-specific genes and family expansions involved in this process, the contributions of species-specific genes to the variation in tardigrade development and cryptobiosis are less clear. We used previously published transcriptomes throughout development and anhydrobiosis (5 embryonic stages, 7 juvenile stages, active adults, and tun adults) to assess the transcriptional biases of different classes of genes between 2 tardigrade species, Hypsibius exemplaris and Ramazzottius varieornatus. We also used the transcriptomes of 2 other tardigrades, Echiniscoides sigismundi and Richtersius coronifer, and data from 3 non-tardigrade species (Adenita vaga, Drosophila melanogaster, and Caenorhabditis elegans) to help identify lineage-specific genes. We found that lineage-specific genes have generally low and narrow expression but are enriched among biased genes in different stages of development depending on the species. Biased genes tend to be specific to early and late development, but there is little overlap in functional enrichment of biased genes between species. Gene expansions in the 2 tardigrades also involve families with different functions despite homologous genes being expressed during anhydrobiosis in both species. Our results demonstrate the interspecific variation in transcriptional contributions and biases of lineage-specific genes during development and anhydrobiosis in 2 tardigrades.
Collapse
Affiliation(s)
| | - Frédéric J J Chain
- Frédéric J J Chain, Department of Biological Sciences, University of Massachusetts Lowell, One University Ave, Lowell, MA 01854, USA.
| |
Collapse
|
9
|
Evolutionary New Genes in a Growing Paradigm. Genes (Basel) 2022; 13:genes13091605. [PMID: 36140774 PMCID: PMC9498540 DOI: 10.3390/genes13091605] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2022] [Accepted: 08/28/2022] [Indexed: 11/26/2022] Open
|