1
|
Perry JA, Werner ME, Heck BW, Maddox PS, Maddox AS. Septins throughout phylogeny are predicted to have a transmembrane domain, which in Caenorhabditis elegans is functionally important. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.11.20.567915. [PMID: 38045322 PMCID: PMC10690161 DOI: 10.1101/2023.11.20.567915] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/05/2023]
Abstract
Septins, a conserved family of filament-forming proteins, contribute to eukaryotic cell division, polarity, and membrane trafficking. Septins are thought to act in these processes by scaffolding other proteins to the plasma membrane. The mechanisms by which septins associate with the plasma membrane are not well understood but can involve two polybasic domains and/or an amphipathic helix. We discovered that the genomes of organisms throughout phylogeny, but not most commonly used model organisms, encode one or more septins predicted to have transmembrane domains. The nematode Caenorhabditis elegans, which was thought to express only two septin proteins, UNC-59 and UNC-61, translates multiple isoforms of UNC-61, and one isoform, UNC-61a, is predicted to contain a transmembrane domain. UNC-61a localizes specifically to the apical membrane of the C. elegans vulva and is important for maintaining vulval morphology. UNC-61a partially compensates for the loss of the other two UNC-61 isoforms, UNC-61b and UNC-61c. The UNC-61a transmembrane domain is sufficient to localize a fluorophore to membranes in mammalian cells, and its deletion from UNC-61a recapitulates the phenotypes of unc-61a null animals. The localization and loss-of-function phenotypes of UNC-61a and its transmembrane domain suggest roles in cell polarity and secretion and help explain the cellular and tissue biological underpinnings of C. elegans septin null alleles' enigmatically hypomorphic phenotypes. Together, our findings reveal a novel mechanism of septin-membrane association with profound implications for the dynamics and regulation of this association.
Collapse
Affiliation(s)
- Jenna A Perry
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC
| | - Michael E Werner
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC
| | - Bryan W Heck
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC
| | - Paul S Maddox
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC
| | - Amy Shaub Maddox
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC
| |
Collapse
|
2
|
Liu X, Zhang Y, Pu Y, Ma Y, Jiang L. Whole-genome identification of transposable elements reveals the equine repetitive element insertion polymorphism in Chinese horses. Anim Genet 2023; 54:144-154. [PMID: 36464985 DOI: 10.1111/age.13277] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2022] [Revised: 10/29/2022] [Accepted: 11/14/2022] [Indexed: 12/12/2022]
Abstract
Transposable elements (TEs) are diverse, abundant, and complicated in genomes. They not only can drive the genome evolution process but can also act as special resources for adaptation. However, little is known about the evolutionary processes that shaped horses. In this work, 126 horse assemblages involved in most horse breeds in China were used to investigate the patterns of TE variation for the first time. By using RepeatMasker and melt software, we found that the horse-specific short interspersed repetitive elements family, equine repetitive elements (ERE1), exhibited polymorphisms in horse genomes. Phylogenetic analysis based on these ERE1 loci (minor allele frequency ≥0.05) revealed three major horse groups, namely, those in northern China, southern China, and Qinghai-Tibetan, which mirrors the result determined by SNPs to some extent. The present ERE1 family emerged ~0.26 to 1.77 Mya ago, with an activity peak at ~0.49 Mya, which matches the early stage of the horse lineage and decreases after the divergence of Equus caballus and Equus ferus przewalskii. To detect the functional ERE1(s) associated with adaptation, locus-specific branch length, genome-wide association study, and absolute allele frequency difference analyses were conducted and resulted in two common protein-coding genes annotated by candidate ERE1s. They were clustered into the vascular smooth muscle contraction (p = 0.01, EDNRA) and apelin signalling pathways (p = 0.02, NRF1). Notably, ERE1 insertion into the EDNRA gene showed a higher association with adaptation among southern China horses and other horses in 15 populations and 451 individuals (p = 4.55 e-8). Our results provide a comprehensive understanding of TE variations to analyse the phylogenetic relationships and traits relevant to adaptive evolution in horses.
Collapse
Affiliation(s)
- Xuexue Liu
- National Germplasm Centre of Domestic Animal Resources, Institute of Animal Science, Chinese Academy of Agricultural Sciences (CAAS), Beijing, China.,Centre d'Anthropobiologie et de Génomique de Toulouse, Université Paul Sabatier, Toulouse, France
| | - Yanli Zhang
- National Germplasm Centre of Domestic Animal Resources, Institute of Animal Science, Chinese Academy of Agricultural Sciences (CAAS), Beijing, China.,CAAS-ILRI Joint Laboratory on Livestock and Forage Genetic Resources, Institute of Animal Science, Chinese Academy of Agricultural Sciences (CAAS), Beijing, China
| | - Yabin Pu
- National Germplasm Centre of Domestic Animal Resources, Institute of Animal Science, Chinese Academy of Agricultural Sciences (CAAS), Beijing, China.,CAAS-ILRI Joint Laboratory on Livestock and Forage Genetic Resources, Institute of Animal Science, Chinese Academy of Agricultural Sciences (CAAS), Beijing, China
| | - Yuehui Ma
- National Germplasm Centre of Domestic Animal Resources, Institute of Animal Science, Chinese Academy of Agricultural Sciences (CAAS), Beijing, China.,CAAS-ILRI Joint Laboratory on Livestock and Forage Genetic Resources, Institute of Animal Science, Chinese Academy of Agricultural Sciences (CAAS), Beijing, China
| | - Lin Jiang
- National Germplasm Centre of Domestic Animal Resources, Institute of Animal Science, Chinese Academy of Agricultural Sciences (CAAS), Beijing, China.,CAAS-ILRI Joint Laboratory on Livestock and Forage Genetic Resources, Institute of Animal Science, Chinese Academy of Agricultural Sciences (CAAS), Beijing, China
| |
Collapse
|
3
|
Deng S. The origin of genetic and metabolic systems: Evolutionary structuralinsights. Heliyon 2023; 9:e14466. [PMID: 36967965 PMCID: PMC10036676 DOI: 10.1016/j.heliyon.2023.e14466] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2022] [Revised: 02/27/2023] [Accepted: 03/06/2023] [Indexed: 03/16/2023] Open
Abstract
DNA is derived from reverse transcription and its origin is related to reverse transcriptase, DNA polymerase and integrase. The gene structure originated from the evolution of the first RNA polymerase. Thus, an explanation of the origin of the genetic system must also explain the evolution of these enzymes. This paper proposes a polymer structure model, termed the stable complex evolution model, which explains the evolution of enzymes and functional molecules. Enzymes evolved their functions by forming locally tightly packed complexes with specific substrates. A metabolic reaction can therefore be considered to be the result of adaptive evolution in this way when a certain essential molecule is lacking in a cell. The evolution of the primitive genetic and metabolic systems was thus coordinated and synchronized. According to the stable complex model, almost all functional molecules establish binding affinity and specific recognition through complementary interactions, and functional molecules therefore have the nature of being auto-reactive. This is thermodynamically favorable and leads to functional duplication and self-organization. Therefore, it can be speculated that biological systems have a certain tendency to maintain functional stability or are influenced by an inherent selective power. The evolution of dormant bacteria may support this hypothesis, and inherent selectivity can be unified with natural selection at the molecular level.
Collapse
Affiliation(s)
- Shaojie Deng
- Chongqing (Fengjie) Municipal Bureau of Planning and Natural Resources, China
| |
Collapse
|
4
|
Li Y, Xue Y, Peng Z, Zhang L. Immune diversity in lophotrochozoans, with a focus on recognition and effector systems. Comput Struct Biotechnol J 2023; 21:2262-2275. [PMID: 37035545 PMCID: PMC10073891 DOI: 10.1016/j.csbj.2023.03.031] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2022] [Revised: 03/11/2023] [Accepted: 03/19/2023] [Indexed: 03/30/2023] Open
Abstract
Lophotrochozoa is one of the most species-rich but immunologically poorly explored phyla. Although lack of acquired response in a narrow sense, lophotrochozoans possess various genetic mechanisms that enhance the diversity and specificity of innate immune system. Here, we review the recent advances of comparative immunology studies in lophotrochozoans with focus on immune recognition and effector systems. Haemocytes and coelomocytes are general important yet understudied player. Comparative genomics studies suggest expansion and functional divergence of lophotrochozoan immune reorganization systems is not as "homogeneous and simple" as we thought including the large-scale expansion and molecular divergence of pattern recognition receptors (PRRs) (TLRs, RLRs, lectins, etc.) and signaling adapters (MyD88s etc.), significant domain recombination of immune receptors (RLR, NLRs, lectins, etc.), extensive somatic recombination of fibrinogenrelated proteins (FREPs) in snails. Furthermore, there are repeatedly identified molecular mechanisms that generate immune effector diversity, including high polymorphism of antimicrobial peptides and proteins (AMPs), reactive oxygen and nitrogen species (RONS) and cytokines. Finally, we argue that the next generation omics tools and the recently emerged genome editing technicism will revolutionize our understanding of innate immune system in a comparative immunology perspective.
Collapse
Affiliation(s)
- Yongnan Li
- CAS and Shandong Province Key Laboratory of Experimental Marine Biology & Center of Deep Sea Research, Center for Ocean Mega-Science, Institute of Oceanology, Chinese Academy of Sciences, Qingdao, China
- Laboratory for Marine Biology and Biotechnology, Qingdao National Laboratory for Marine Science and Technology, Qingdao, China
| | - Yu Xue
- CAS and Shandong Province Key Laboratory of Experimental Marine Biology & Center of Deep Sea Research, Center for Ocean Mega-Science, Institute of Oceanology, Chinese Academy of Sciences, Qingdao, China
- Qingdao Agricultural University, Qingdao, China
| | - Zhangjie Peng
- CAS and Shandong Province Key Laboratory of Experimental Marine Biology & Center of Deep Sea Research, Center for Ocean Mega-Science, Institute of Oceanology, Chinese Academy of Sciences, Qingdao, China
- College of Marine Science, University of Chinese Academy of Sciences, Beijing, China
| | - Linlin Zhang
- CAS and Shandong Province Key Laboratory of Experimental Marine Biology & Center of Deep Sea Research, Center for Ocean Mega-Science, Institute of Oceanology, Chinese Academy of Sciences, Qingdao, China
- Laboratory for Marine Biology and Biotechnology, Qingdao National Laboratory for Marine Science and Technology, Qingdao, China
- College of Marine Science, University of Chinese Academy of Sciences, Beijing, China
- Corresponding author at: CAS and Shandong Province Key Laboratory of Experimental Marine Biology & Center of Deep Sea Research, Center for Ocean Mega-Science, Institute of Oceanology, Chinese Academy of Sciences, Qingdao, China.
| |
Collapse
|
5
|
Genome-Wide Identification of Cotton (Gossypium spp.) Trehalose-6-Phosphate Phosphatase (TPP) Gene Family Members and the Role of GhTPP22 in the Response to Drought Stress. PLANTS 2022; 11:plants11081079. [PMID: 35448808 PMCID: PMC9024796 DOI: 10.3390/plants11081079] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/22/2022] [Revised: 04/11/2022] [Accepted: 04/12/2022] [Indexed: 01/10/2023]
Abstract
Trehalose-6-phosphate phosphatase (TPP) is a key enzyme involved in trehalose synthesis in higher plants. Previous studies have shown that TPP family genes increase yields without affecting plant growth under drought conditions, but their functions in cotton have not been reported. In this study, 17, 12, 26 and 24 TPP family genes were identified in Gossypium arboreum, Gossypium raimondii, Gossypium barbadense and Gossypium hirsutum, respectively. The 79 TPP family genes were divided into three subgroups by phylogenetic analysis. Virus-induced gene silencing (VIGS) of GhTPP22 produced TRV::GhTPP22 plants that were more sensitive to drought stress than the control plants, and the relative expression of GhTPP22 was decreased, as shown by qRT–PCR. Moreover, we analysed the gene structure, targeted small RNAs, and gene expression patterns of TPP family members and the physicochemical properties of their encoded proteins. Overall, members of the TPP gene family in cotton were systematically identified, and the function of GhTPP22 under drought stress conditions was preliminarily verified. These findings provide new information for improving drought resistance for cotton breeding in the future.
Collapse
|
6
|
Zhou SS, Yan XM, Zhang KF, Liu H, Xu J, Nie S, Jia KH, Jiao SQ, Zhao W, Zhao YJ, Porth I, El Kassaby YA, Wang T, Mao JF. A comprehensive annotation dataset of intact LTR retrotransposons of 300 plant genomes. Sci Data 2021; 8:174. [PMID: 34267227 PMCID: PMC8282616 DOI: 10.1038/s41597-021-00968-x] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2021] [Accepted: 06/07/2021] [Indexed: 12/11/2022] Open
Abstract
LTR retrotransposons (LTR-RTs) are ubiquitous and represent the dominant repeat element in plant genomes, playing important roles in functional variation, genome plasticity and evolution. With the advent of new sequencing technologies, a growing number of whole-genome sequences have been made publicly available, making it possible to carry out systematic analyses of LTR-RTs. However, a comprehensive and unified annotation of LTR-RTs in plant groups is still lacking. Here, we constructed a plant intact LTR-RTs dataset, which is designed to classify and annotate intact LTR-RTs with a standardized procedure. The dataset currently comprises a total of 2,593,685 intact LTR-RTs from genomes of 300 plant species representing 93 families of 46 orders. The dataset is accompanied by sequence, diverse structural and functional annotation, age determination and classification information associated with the LTR-RTs. This dataset will contribute valuable resources for investigating the evolutionary dynamics and functional implications of LTR-RTs in plant genomes.
Collapse
Affiliation(s)
- Shan-Shan Zhou
- Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, National Engineering Laboratory for Tree Breeding, Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, 100083, China
| | - Xue-Mei Yan
- Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, National Engineering Laboratory for Tree Breeding, Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, 100083, China
| | - Kai-Fu Zhang
- College of Big data and Intelligent Engineering, Southwest Forestry University, Yunnan, 650224, China
| | - Hui Liu
- Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, National Engineering Laboratory for Tree Breeding, Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, 100083, China
| | - Jie Xu
- Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, National Engineering Laboratory for Tree Breeding, Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, 100083, China
| | - Shuai Nie
- Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, National Engineering Laboratory for Tree Breeding, Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, 100083, China
| | - Kai-Hua Jia
- Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, National Engineering Laboratory for Tree Breeding, Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, 100083, China
| | - Si-Qian Jiao
- Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, National Engineering Laboratory for Tree Breeding, Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, 100083, China
| | - Wei Zhao
- Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, National Engineering Laboratory for Tree Breeding, Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, 100083, China
| | - You-Jie Zhao
- College of Big data and Intelligent Engineering, Southwest Forestry University, Yunnan, 650224, China
| | - Ilga Porth
- Départment des Sciences du Bois et de la Forêt, Faculté de Foresterie, de Géographie et Géomatique, Université Laval Québec, Québec, QC, G1V 0A6, Canada
| | - Yousry A El Kassaby
- Department of Forest and Conservation Sciences, Faculty of Forestry, The University of British Columbia, 2424 Main Mall, Vancouver, BC, V6T 1Z4, Canada
| | - Tongli Wang
- Department of Forest and Conservation Sciences, Faculty of Forestry, The University of British Columbia, 2424 Main Mall, Vancouver, BC, V6T 1Z4, Canada
| | - Jian-Feng Mao
- Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, National Engineering Laboratory for Tree Breeding, Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, 100083, China.
| |
Collapse
|
7
|
Sexually dimorphic expression and regulatory sequence of dnali1 in the olive flounder Paralichthys olivaceus. Mol Biol Rep 2021; 48:3529-3540. [PMID: 33877529 DOI: 10.1007/s11033-021-06342-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2021] [Accepted: 04/07/2021] [Indexed: 10/21/2022]
Abstract
Dynein axonemal light intermediate chain 1 (dnali1) is an important part of axonemal dyneins and plays an important role in the growth and development of animals. However, there is little information about dnali1 in fish. Herein, we cloned dnali1 gene from the genome of olive flounder (Paralichthys olivaceus), a commercially important maricultured fish in China, Japan, and Korea, and analyzed its expression patterns in different gender fish. The flounder dnali1 DNA sequence contained a 771 bp open reading frame (ORF), two different sizes of 5' untranslated region (5'UTR), and a 1499 bp 3' untranslated region (3'UTR). Two duplicated 922 nt fragments were found in dnali1 mRNA. The first fragment contained the downstream coding region and the front portion of 3'UTR, and the second fragment was entirely located in 3'UTR. Multiple alignments indicated that the flounder Dnali1 protein contained the putative conserved coiled-coil domain. Its expression showed sexually dimorphic with predominant expression in the flounder testis, and lower expression in other tissues. The gene with the longer 5'UTR was specifically expressed in the testis. The highest expression level in the testis was detected at stages IV and V. Transient expression analysis showed that the 922 bp repeated sequence 3'UTR of dnali1 down-regulated the expression of GFP at the early stage in zebrafish. The flounder dnali1 might play an important role in the testis, especially in the period of spermatogenesis, and the 5'UTR and the repetitive sequences in 3'UTR might contain some regulatory elements for the cilia.
Collapse
|
8
|
Liu Z, Wang X, Sun Z, Zhang Y, Meng C, Chen B, Wang G, Ke H, Wu J, Yan Y, Wu L, Li Z, Yang J, Zhang G, Ma Z. Evolution, expression and functional analysis of cultivated allotetraploid cotton DIR genes. BMC PLANT BIOLOGY 2021; 21:89. [PMID: 33568051 PMCID: PMC7876823 DOI: 10.1186/s12870-021-02859-0] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/25/2020] [Accepted: 01/27/2021] [Indexed: 05/13/2023]
Abstract
BACKGROUND Dirigent (DIR) proteins mediate regioselectivity and stereoselectivity during lignan biosynthesis and are also involved in lignin, gossypol and pterocarpan biosynthesis. This gene family plays a vital role in enhancing stress resistance and in secondary cell-wall development, but systematical understanding is lacking in cotton. RESULTS In this study, 107 GbDIRs and 107 GhDIRs were identified in Gossypium barbadense and Gossypium hirsutum, respectively. Most of these genes have a classical gene structure without intron and encode proteins containing a signal peptide. Phylogenetic analysis showed that cotton DIR genes were classified into four distinct subfamilies (a, b/d, e, and f). Of these groups, DIR-a and DIR-e were evolutionarily conserved, and segmental and tandem duplications contributed equally to their formation. In contrast, DIR-b/d mainly expanded by recent tandem duplications, accompanying with a number of gene clusters. With the rapid evolution, DIR-b/d-III was a Gossypium-specific clade involved in atropselective synthesis of gossypol. RNA-seq data highlighted GhDIRs in response to Verticillium dahliae infection and suggested that DIR gene family could confer Verticillium wilt resistance. We also identified candidate DIR genes related to fiber development in G. barbadense and G. hirsutum and revealed their differential expression. To further determine the involvement of DIR genes in fiber development, we overexpressed a fiber length-related gene GbDIR78 in Arabidopsis and validated its function in trichomes and hypocotyls. CONCLUSIONS These findings contribute novel insights towards the evolution of DIR gene family and provide valuable information for further understanding the roles of DIR genes in cotton fiber development as well as in stress responses.
Collapse
Affiliation(s)
- Zhengwen Liu
- State Key Laboratory of North China Crop Improvement and Regulation, North China Key Laboratory for Crop Germplasm Resources of Education Ministry, Hebei Agricultural University, Baoding, 071001, China
| | - Xingfen Wang
- State Key Laboratory of North China Crop Improvement and Regulation, North China Key Laboratory for Crop Germplasm Resources of Education Ministry, Hebei Agricultural University, Baoding, 071001, China
| | - Zhengwen Sun
- State Key Laboratory of North China Crop Improvement and Regulation, North China Key Laboratory for Crop Germplasm Resources of Education Ministry, Hebei Agricultural University, Baoding, 071001, China
| | - Yan Zhang
- State Key Laboratory of North China Crop Improvement and Regulation, North China Key Laboratory for Crop Germplasm Resources of Education Ministry, Hebei Agricultural University, Baoding, 071001, China
| | - Chengsheng Meng
- State Key Laboratory of North China Crop Improvement and Regulation, North China Key Laboratory for Crop Germplasm Resources of Education Ministry, Hebei Agricultural University, Baoding, 071001, China
| | - Bin Chen
- State Key Laboratory of North China Crop Improvement and Regulation, North China Key Laboratory for Crop Germplasm Resources of Education Ministry, Hebei Agricultural University, Baoding, 071001, China
| | - Guoning Wang
- State Key Laboratory of North China Crop Improvement and Regulation, North China Key Laboratory for Crop Germplasm Resources of Education Ministry, Hebei Agricultural University, Baoding, 071001, China
| | - Huifeng Ke
- State Key Laboratory of North China Crop Improvement and Regulation, North China Key Laboratory for Crop Germplasm Resources of Education Ministry, Hebei Agricultural University, Baoding, 071001, China
| | - Jinhua Wu
- State Key Laboratory of North China Crop Improvement and Regulation, North China Key Laboratory for Crop Germplasm Resources of Education Ministry, Hebei Agricultural University, Baoding, 071001, China
| | - Yuanyuan Yan
- State Key Laboratory of North China Crop Improvement and Regulation, North China Key Laboratory for Crop Germplasm Resources of Education Ministry, Hebei Agricultural University, Baoding, 071001, China
| | - Liqiang Wu
- State Key Laboratory of North China Crop Improvement and Regulation, North China Key Laboratory for Crop Germplasm Resources of Education Ministry, Hebei Agricultural University, Baoding, 071001, China
| | - Zhikun Li
- State Key Laboratory of North China Crop Improvement and Regulation, North China Key Laboratory for Crop Germplasm Resources of Education Ministry, Hebei Agricultural University, Baoding, 071001, China
| | - Jun Yang
- State Key Laboratory of North China Crop Improvement and Regulation, North China Key Laboratory for Crop Germplasm Resources of Education Ministry, Hebei Agricultural University, Baoding, 071001, China
| | - Guiyin Zhang
- State Key Laboratory of North China Crop Improvement and Regulation, North China Key Laboratory for Crop Germplasm Resources of Education Ministry, Hebei Agricultural University, Baoding, 071001, China.
| | - Zhiying Ma
- State Key Laboratory of North China Crop Improvement and Regulation, North China Key Laboratory for Crop Germplasm Resources of Education Ministry, Hebei Agricultural University, Baoding, 071001, China.
| |
Collapse
|
9
|
Amalgamated cross-species transcriptomes reveal organ-specific propensity in gene expression evolution. Nat Commun 2020; 11:4459. [PMID: 32900997 PMCID: PMC7479108 DOI: 10.1038/s41467-020-18090-8] [Citation(s) in RCA: 29] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2019] [Accepted: 07/29/2020] [Indexed: 12/24/2022] Open
Abstract
The origins of multicellular physiology are tied to evolution of gene expression. Genes can shift expression as organisms evolve, but how ancestral expression influences altered descendant expression is not well understood. To examine this, we amalgamate 1,903 RNA-seq datasets from 182 research projects, including 6 organs in 21 vertebrate species. Quality control eliminates project-specific biases, and expression shifts are reconstructed using gene-family-wise phylogenetic Ornstein-Uhlenbeck models. Expression shifts following gene duplication result in more drastic changes in expression properties than shifts without gene duplication. The expression properties are tightly coupled with protein evolutionary rate, depending on whether and how gene duplication occurred. Fluxes in expression patterns among organs are nonrandom, forming modular connections that are reshaped by gene duplication. Thus, if expression shifts, ancestral expression in some organs induces a strong propensity for expression in particular organs in descendants. Regardless of whether the shifts are adaptive or not, this supports a major role for what might be termed preadaptive pathways of gene expression evolution.
Collapse
|
10
|
Ghosh A, Platt RN, Vandewege MW, Tabassum R, Hsu CY, Isberg SR, Peterson DG, Finger JW, Kieran TJ, Glenn TC, Gongora J, Ray DA. Identification and characterization of microRNAs (miRNAs) and their transposable element origins in the saltwater crocodile, Crocodylus porosus. Anal Biochem 2020; 602:113781. [PMID: 32485163 DOI: 10.1016/j.ab.2020.113781] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2020] [Revised: 05/11/2020] [Accepted: 05/19/2020] [Indexed: 12/23/2022]
Abstract
MicroRNAs (miRNAs) are 18-24 nucleotide regulatory RNAs. They are involved in the regulation of genetic and biological pathways through post transcriptional gene silencing and/or translational repression. Data suggests a slow evolutionary rate for the saltwater crocodile (Crocodylus porosus) over the past several million years when compared to birds, the closest extant relatives of crocodilians. Understanding gene regulation in the saltwater crocodile in the context of relatively slow genomic change thus holds potential for the investigation of genomics, evolution, and adaptation. Utilizing eleven tissue types and sixteen small RNA libraries, we report 644 miRNAs in the saltwater crocodile with >78% of miRNAs being novel to crocodilians. We also identified potential targets for the miRNAs and analyzed the relationship of the miRNA repertoire to transposable elements (TEs). Results suggest an increased association of DNA transposons with miRNAs when compared to retrotransposons. This work reports the first comprehensive analysis of miRNAs in Crocodylus porosus and addresses the potential impacts of miRNAs in regulating the genome in the saltwater crocodile. In addition, the data suggests a supporting role of TEs as a source for miRNAs, adding to the increasing evidence that TEs play a significant role in the evolution of gene regulation.
Collapse
Affiliation(s)
- Arnab Ghosh
- Department of Biological Sciences, Texas Tech University, Lubbock, TX, USA
| | - Roy N Platt
- Department of Biological Sciences, Texas Tech University, Lubbock, TX, USA
| | - Michael W Vandewege
- Department of Biological Sciences, Texas Tech University, Lubbock, TX, USA; Department of Biology, Eastern New Mexico University, Portales, NM, USA
| | | | - Chuan-Yu Hsu
- Institute for Genomics, Biocomputing and Biotechnology, Mississippi State University, MS, USA
| | - Sally R Isberg
- Sydney School of Veterinary Science, Faculty of Science, University of Sydney, Sydney, NSW, Australia; The Centre for Crocodile Research, Darwin, NT, Australia
| | - Daniel G Peterson
- Institute for Genomics, Biocomputing and Biotechnology, Mississippi State University, MS, USA
| | - John W Finger
- Department of Environmental Health Science, University of Georgia, Athens, GA, USA; Department of Biological Sciences, Auburn University, Auburn, AL, USA
| | - Troy J Kieran
- Department of Environmental Health Science, University of Georgia, Athens, GA, USA
| | - Travis C Glenn
- Department of Environmental Health Science, University of Georgia, Athens, GA, USA
| | - Jaime Gongora
- Sydney School of Veterinary Science, Faculty of Science, University of Sydney, Sydney, NSW, Australia
| | - David A Ray
- Department of Biological Sciences, Texas Tech University, Lubbock, TX, USA.
| |
Collapse
|
11
|
Wang L, Yang J, Xu Y, Piao X, Lv J. Domain-based Comparative Analysis of Bacterial Proteomes: Uniqueness, Interactions, and the Dark Matter. Curr Genomics 2019; 20:115-123. [PMID: 31555062 PMCID: PMC6728903 DOI: 10.2174/1389202920666190320134438] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2018] [Revised: 11/15/2018] [Accepted: 01/10/2019] [Indexed: 01/05/2023] Open
Abstract
Background Proteins may have none, single, double, or multiple domains, while a single domain may appear in multiple proteins. Their distribution patterns may have impacts on bacterial physi-ology and lifestyle.Objective: This study aims to understand how domains are distributed and duplicated in bacterial prote-omes, in order to better understand bacterial physiology and lifestyles. Methods In this study, we used 16712 Hidden Markov Models to screen 944 bacterial reference prote-omes versus a threshold E-value<0.001. The number of non-redundant domains and duplication rates of redundant domains for each species were calculated. The unique domains, if any, were also identified for each species. In addition, the properties of no-domain proteins were investigated in terms of physico-chemical properties. Results The increasing number of non-redundant domains for a bacterial proteome follows the trend of an asymptotic function. The domain duplication rate is positively correlated with proteome size and in-creases more rapidly. The high percentage of single-domain proteins is more associated with small pro-teome size. For each proteome, unique domains were also obtained. Moreover, no-domain proteins show differences with the other three groups for several physicochemical properties analysed in this study. Conclusion The study confirmed that a low domain duplication rate and a high percentage of single-domain proteins are more likely to be associated with bacterial host-dependent or restricted niche-adapted lifestyle. In addition, the unique lifestyle and physiology were revealed based on the analysis of species-specific domains and core domain interactions or co-occurrences.
Collapse
Affiliation(s)
- Liang Wang
- 1Department of Bioinformatics, School of Medical Informatics, Xuzhou Medical University, Xuzhou, Jiangsu, 221000, P.R. China; 2Key Laboratory of New Drug Research and Clinical Pharmacy of Jiangsu Province, Xuzhou Medical University, Xuzhou, Jiangsu, 221000, P.R. China; 3School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, Jiangsu, 221116, P.R. China
| | - Jianye Yang
- 1Department of Bioinformatics, School of Medical Informatics, Xuzhou Medical University, Xuzhou, Jiangsu, 221000, P.R. China; 2Key Laboratory of New Drug Research and Clinical Pharmacy of Jiangsu Province, Xuzhou Medical University, Xuzhou, Jiangsu, 221000, P.R. China; 3School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, Jiangsu, 221116, P.R. China
| | - Yaping Xu
- 1Department of Bioinformatics, School of Medical Informatics, Xuzhou Medical University, Xuzhou, Jiangsu, 221000, P.R. China; 2Key Laboratory of New Drug Research and Clinical Pharmacy of Jiangsu Province, Xuzhou Medical University, Xuzhou, Jiangsu, 221000, P.R. China; 3School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, Jiangsu, 221116, P.R. China
| | - Xue Piao
- 1Department of Bioinformatics, School of Medical Informatics, Xuzhou Medical University, Xuzhou, Jiangsu, 221000, P.R. China; 2Key Laboratory of New Drug Research and Clinical Pharmacy of Jiangsu Province, Xuzhou Medical University, Xuzhou, Jiangsu, 221000, P.R. China; 3School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, Jiangsu, 221116, P.R. China
| | - Jichang Lv
- 1Department of Bioinformatics, School of Medical Informatics, Xuzhou Medical University, Xuzhou, Jiangsu, 221000, P.R. China; 2Key Laboratory of New Drug Research and Clinical Pharmacy of Jiangsu Province, Xuzhou Medical University, Xuzhou, Jiangsu, 221000, P.R. China; 3School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, Jiangsu, 221116, P.R. China
| |
Collapse
|
12
|
OrthoList 2: A New Comparative Genomic Analysis of Human and Caenorhabditis elegans Genes. Genetics 2018; 210:445-461. [PMID: 30120140 DOI: 10.1534/genetics.118.301307] [Citation(s) in RCA: 184] [Impact Index Per Article: 30.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2018] [Accepted: 08/15/2018] [Indexed: 11/18/2022] Open
Abstract
OrthoList, a compendium of Caenorhabditis elegans genes with human orthologs compiled in 2011 by a meta-analysis of four orthology-prediction methods, has been a popular tool for identifying conserved genes for research into biological and disease mechanisms. However, the efficacy of orthology prediction depends on the accuracy of gene-model predictions, an ongoing process, and orthology-prediction algorithms have also been updated over time. Here we present OrthoList 2 (OL2), a new comparative genomic analysis between C. elegans and humans, and the first assessment of how changes over time affect the landscape of predicted orthologs between two species. Although we find that updates to the orthology-prediction methods significantly changed the landscape of C. elegans-human orthologs predicted by individual programs and-unexpectedly-reduced agreement among them, we also show that our meta-analysis approach "buffered" against changes in gene content. We show that adding results from more programs did not lead to many additions to the list and discuss reasons to avoid assigning "scores" based on support by individual orthology-prediction programs; the treatment of "legacy" genes no longer predicted by these programs; and the practical difficulties of updating due to encountering deprecated, changed, or retired gene identifiers. In addition, we consider what other criteria may support claims of orthology and alternative approaches to find potential orthologs that elude identification by these programs. Finally, we created a new web-based tool that allows for rapid searches of OL2 by gene identifiers, protein domains [InterPro and SMART (Simple Modular Architecture Research Tool], or human disease associations ([OMIM (Online Mendelian Inheritence in Man], and also includes available RNA-interference resources to facilitate potential translational cross-species studies.
Collapse
|
13
|
Villalba M, Fredericksen F, Otth C, Olavarría VH. Molecular characterization of the bovine IER3 gene: Down-regulation of IL-8 by blocking NF-κB activity mediated by IER3 overexpression in MDBK cells infected with bovine viral diarrhea virus-1. Mol Immunol 2017; 92:169-179. [DOI: 10.1016/j.molimm.2017.10.012] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2016] [Revised: 07/19/2017] [Accepted: 10/15/2017] [Indexed: 10/18/2022]
|
14
|
Ettensohn CA, Dey D. KirrelL, a member of the Ig-domain superfamily of adhesion proteins, is essential for fusion of primary mesenchyme cells in the sea urchin embryo. Dev Biol 2016; 421:258-270. [PMID: 27866905 DOI: 10.1016/j.ydbio.2016.11.006] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2016] [Revised: 11/01/2016] [Accepted: 11/01/2016] [Indexed: 11/25/2022]
Abstract
In the sea urchin embryo, primary mesenchyme cells (PMCs) adhere to one another and fuse via filopodia, forming cable-like structures within which skeletal rods are deposited. Although this process was first described more than a century ago, molecules that participate in PMC adhesion and fusion have not been identified. Here we show that KirrelL, a PMC-specific, Ig domain-containing transmembrane protein, is essential for PMC fusion, probably by mediating filopodial adhesions that are a pre-requisite for subsequent membrane fusion. We show that KirrelL is not required for PMC specification, migration, or for direct filopodial contacts between PMCs. In the absence of KirrelL, however, filopodial contacts do not result in fusion. kirrelL is a member of a family of closely related, intronless genes that likely arose through an echinoid-specific gene expansion, possibly via retrotransposition. Our findings are significant in that they establish a direct linkage between the transcriptional network deployed in the PMC lineage and an effector molecule required for a critically important PMC morphogenetic process. In addition, our results point to a conserved role for Ig domain-containing adhesion proteins in facilitating cell fusion in both muscle and non-muscle cell lineages during animal development.
Collapse
Affiliation(s)
- Charles A Ettensohn
- Department of Biological Sciences, Carnegie Mellon University, 4400 Fifth Avenue, Pittsburgh, PA 15213, United States.
| | - Debleena Dey
- Department of Biological Sciences, Carnegie Mellon University, 4400 Fifth Avenue, Pittsburgh, PA 15213, United States
| |
Collapse
|
15
|
Wang W, Wu Y, Messing J. Genome-wide analysis of pentatricopeptide-repeat proteins of an aquatic plant. PLANTA 2016; 244:893-899. [PMID: 27306450 DOI: 10.1007/s00425-016-2555-x] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/16/2016] [Accepted: 06/07/2016] [Indexed: 06/06/2023]
Abstract
A large proportion of genes in plant genomes are organized as gene families. Whereas most gene families in the aquative plant Spirodela are reduced in their copy number, the PPR gene family is expanded, which match the RNA editing sites in organelles, providing us with new insights in the evolution of flowering plants. Pentatricopeptide-repeat proteins (PPRs) are nuclear-encoded proteins that are targeted to mitochondria and plastids to stabilize and edit mRNA transcribed from organellar genomes. They have been described for many terrestrial plant species from a diverse spectrum of sequenced genomes. To further increase our understanding of the evolution of this gene family across angiosperms, we analyzed the PPR genes in the aquatic species Spirodela polyrhiza in the order of the Alismatales (monocotyledonous plants). Because we had generated next generation sequencing data from transcripts and had sequenced the genome of Spirodela polyrhiza, we were able to identify its PPR genes and determine the level of their expression. In total, we could identify 556 PPR proteins, of which 238 members belong to the P (P motif) subfamily that is mainly involved in RNA stabilization and 318 ones to the PLS (P, Longer P, shorter P motif) subfamily responsible for RNA editing. Compared to other angiosperms, this is a large increase in the copy number of the PLS-PPRs subfamily and the expansion correlates with the increase of the number of RNA editing sites of organellar transcripts. Expression of PPR was generally stable even during growing and dormant stages, indicating that their function was critical throughout development. However, PPRs, especially those of the PLS subfamily, were expressed at relatively low levels, suggesting a delicate fine-tuning of its trans-acting function in the post-transcriptional regulation of gene expression. Thus, understanding PPR evolution and expression will help decipher the PPR code for their binding sites, which could genetically engineer RNA-binding proteins toward desired sequence.
Collapse
Affiliation(s)
- Wenqin Wang
- School of Agriculture and Biology, Shanghai Jiaotong University, 800 Dong Chuan Road, Shanghai, 200240, China
| | - Yongrui Wu
- Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, 300 Fenglin Road, Shanghai, 200032, China
| | - Joachim Messing
- Waksman Institute of Microbiology, Rutgers, The State University of New Jersey, Piscataway, NJ, USA.
| |
Collapse
|
16
|
Zhong Y, Cheng ZMM. A unique RPW8-encoding class of genes that originated in early land plants and evolved through domain fission, fusion, and duplication. Sci Rep 2016; 6:32923. [PMID: 27678195 PMCID: PMC5039405 DOI: 10.1038/srep32923] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2015] [Accepted: 08/16/2016] [Indexed: 01/17/2023] Open
Abstract
Duplication, lateral gene transfer, domain fusion/fission and de novo domain creation play a key role in formation of initial common ancestral protein. Abundant protein diversities are produced by domain rearrangements, including fusions, fissions, duplications, and terminal domain losses. In this report, we explored the origin of the RPW8 domain and examined the domain rearrangements that have driven the evolution of RPW8-encoding genes in land plants. The RPW8 domain first emerged in the early land plant, Physcomitrella patens, and it likely originated de novo from a non-coding sequence or domain divergence after duplication. It was then incorporated into the NBS-LRR protein to create a main sub-class of RPW8-encoding genes, the RPW8-NBS-encoding genes. They evolved by a series of genetic events of domain fissions, fusions, and duplications. Many species-specific duplication events and tandemly duplicated clusters clearly demonstrated that species-specific and tandem duplications played important roles in expansion of RPW8-encoding genes, especially in gymnosperms and species of the Rosaceae. RPW8 domains with greater Ka/Ks values than those of the NBS domains indicated that they evolved faster than the NBS domains in RPW8-NBSs.
Collapse
Affiliation(s)
- Yan Zhong
- College of Horticulture, Nanjing Agricultural University, Nanjing, 210095, China
| | - Zong-Ming Max Cheng
- College of Horticulture, Nanjing Agricultural University, Nanjing, 210095, China.,Department of Plant Science, University of Tennessee, Knoxville, 37996, USA
| |
Collapse
|
17
|
Karaz S, Courgeon M, Lepetit H, Bruno E, Pannone R, Tarallo A, Thouzé F, Kerner P, Vervoort M, Causeret F, Pierani A, D'Onofrio G. Neuronal fate specification by the Dbx1 transcription factor is linked to the evolutionary acquisition of a novel functional domain. EvoDevo 2016; 7:18. [PMID: 27525057 PMCID: PMC4983035 DOI: 10.1186/s13227-016-0055-5] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2016] [Accepted: 07/27/2016] [Indexed: 12/18/2022] Open
Abstract
Background Dbx1 is a homeodomain transcription factor involved in neuronal fate specification belonging to a widely conserved family among bilaterians. In mammals, Dbx1 was proposed to act as a transcriptional repressor by interacting with the Groucho corepressors to allow the specification of neurons involved in essential biological functions such as locomotion or breathing. Results Sequence alignments of Dbx1 proteins from different species allowed us to identify two conserved domains related to the Groucho-dependent Engrailed repressor domain (RD), as well as a newly described domain composed of clusterized acidic residues at the C-terminus (Cter) which is present in tetrapods but also several invertebrates. Using a heterologous luciferase assay, we showed that the two putative repressor domains behave as such in a Groucho-dependent manner, whereas the Cter does not bear any intrinsic transcriptional activity. Consistently with in vitro data, we found that both RDs are involved in cell fate specification using in vivo electroporation experiments in the chick spinal cord. Surprisingly, we show that the Cter domain is required for Dbx1 function in vivo, acting as a modulator of its repressive activity and/or imparting specificity. Conclusion Our results strongly suggest that the presence of a Cter domain among tetrapods is essential for Dbx1 to regulate neuronal diversity and, in turn, nervous system complexity. Electronic supplementary material The online version of this article (doi:10.1186/s13227-016-0055-5) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Sonia Karaz
- Institut Jacques Monod, CNRS UMR 7592, Université Paris Diderot, Sorbonne Paris Cité, 75205 Paris Cedex, France
| | - Maximilien Courgeon
- Institut Jacques Monod, CNRS UMR 7592, Université Paris Diderot, Sorbonne Paris Cité, 75205 Paris Cedex, France
| | - Hélène Lepetit
- Institut Jacques Monod, CNRS UMR 7592, Université Paris Diderot, Sorbonne Paris Cité, 75205 Paris Cedex, France
| | - Eugenia Bruno
- Dept. BEOM, Stazione Zoologica A. Dohrn, Villa Comunale, 80121 Naples, Italy
| | - Raimondo Pannone
- Dept. BEOM, Stazione Zoologica A. Dohrn, Villa Comunale, 80121 Naples, Italy
| | - Andrea Tarallo
- Dept. BEOM, Stazione Zoologica A. Dohrn, Villa Comunale, 80121 Naples, Italy
| | - France Thouzé
- Institut Jacques Monod, CNRS UMR 7592, Université Paris Diderot, Sorbonne Paris Cité, 75205 Paris Cedex, France
| | - Pierre Kerner
- Institut Jacques Monod, CNRS UMR 7592, Université Paris Diderot, Sorbonne Paris Cité, 75205 Paris Cedex, France
| | - Michel Vervoort
- Institut Jacques Monod, CNRS UMR 7592, Université Paris Diderot, Sorbonne Paris Cité, 75205 Paris Cedex, France
| | - Frédéric Causeret
- Institut Jacques Monod, CNRS UMR 7592, Université Paris Diderot, Sorbonne Paris Cité, 75205 Paris Cedex, France
| | - Alessandra Pierani
- Institut Jacques Monod, CNRS UMR 7592, Université Paris Diderot, Sorbonne Paris Cité, 75205 Paris Cedex, France
| | - Giuseppe D'Onofrio
- Dept. BEOM, Stazione Zoologica A. Dohrn, Villa Comunale, 80121 Naples, Italy
| |
Collapse
|
18
|
Kozlov AP. Expression of evolutionarily novel genes in tumors. Infect Agent Cancer 2016; 11:34. [PMID: 27437030 PMCID: PMC4949931 DOI: 10.1186/s13027-016-0077-6] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2016] [Accepted: 05/18/2016] [Indexed: 01/29/2023] Open
Abstract
The evolutionarily novel genes originated through different molecular mechanisms are expressed in tumors. Sometimes the expression of evolutionarily novel genes in tumors is highly specific. Moreover positive selection of many human tumor-related genes in primate lineage suggests their involvement in the origin of new functions beneficial to organisms. It is suggested to consider the expression of evolutionarily young or novel genes in tumors as a new biological phenomenon, a phenomenon of TSEEN (tumor specifically expressed, evolutionarily novel) genes.
Collapse
Affiliation(s)
- A. P. Kozlov
- The Biomedical Center and Peter the Great St. Petersburg Polytechnic University, St. Petersburg, Russia
| |
Collapse
|
19
|
Li SF, Zhang GJ, Zhang XJ, Yuan JH, Deng CL, Gu LF, Gao WJ. DPTEdb, an integrative database of transposable elements in dioecious plants. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2016; 2016:baw078. [PMID: 27173524 PMCID: PMC4865326 DOI: 10.1093/database/baw078] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/29/2015] [Accepted: 04/22/2016] [Indexed: 02/02/2023]
Abstract
Dioecious plants usually harbor ‘young’ sex chromosomes, providing an opportunity to study the early stages of sex chromosome evolution. Transposable elements (TEs) are mobile DNA elements frequently found in plants and are suggested to play important roles in plant sex chromosome evolution. The genomes of several dioecious plants have been sequenced, offering an opportunity to annotate and mine the TE data. However, comprehensive and unified annotation of TEs in these dioecious plants is still lacking. In this study, we constructed a dioecious plant transposable element database (DPTEdb). DPTEdb is a specific, comprehensive and unified relational database and web interface. We used a combination of de novo, structure-based and homology-based approaches to identify TEs from the genome assemblies of previously published data, as well as our own. The database currently integrates eight dioecious plant species and a total of 31 340 TEs along with classification information. DPTEdb provides user-friendly web interfaces to browse, search and download the TE sequences in the database. Users can also use tools, including BLAST, GetORF, HMMER, Cut sequence and JBrowse, to analyze TE data. Given the role of TEs in plant sex chromosome evolution, the database will contribute to the investigation of TEs in structural, functional and evolutionary dynamics of the genome of dioecious plants. In addition, the database will supplement the research of sex diversification and sex chromosome evolution of dioecious plants. Database URL: http://genedenovoweb.ticp.net:81/DPTEdb/index.php
Collapse
Affiliation(s)
- Shu-Fen Li
- College of Life Sciences, Henan Normal University, Xinxiang 453007, China
| | - Guo-Jun Zhang
- School of Basic Medical Sciences, Xinxiang Medical University, Xinxiang 453003, China
| | - Xue-Jin Zhang
- College of Life Sciences, Henan Normal University, Xinxiang 453007, China
| | - Jin-Hong Yuan
- College of Life Sciences, Henan Normal University, Xinxiang 453007, China
| | - Chuan-Liang Deng
- College of Life Sciences, Henan Normal University, Xinxiang 453007, China
| | - Lian-Feng Gu
- Basic Forestry and Proteomics Center, Haixia Institute of Science and Technology (HIST), Fujian Agriculture and Forestry University, Fuzhou 350002, China
| | - Wu-Jun Gao
- College of Life Sciences, Henan Normal University, Xinxiang 453007, China
| |
Collapse
|
20
|
Retrotransposon-associated long non-coding RNAs in mice and men. Pflugers Arch 2016; 468:1049-60. [DOI: 10.1007/s00424-016-1818-5] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2016] [Accepted: 03/28/2016] [Indexed: 01/01/2023]
|
21
|
Fredericksen F, Villalba M, Olavarría VH. Characterization of bovine A20 gene: Expression mediated by NF-κB pathway in MDBK cells infected with bovine viral diarrhea virus-1. Gene 2016; 581:117-29. [PMID: 26809100 DOI: 10.1016/j.gene.2016.01.030] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2015] [Revised: 12/03/2015] [Accepted: 01/17/2016] [Indexed: 02/06/2023]
Abstract
Cytokine production for immunological process is tightly regulated at the transcriptional and posttranscriptional levels. The NF-κB signaling pathway maintains immune homeostasis in the cell through the participation of molecules such as A20 (TNFAIP3), which is a key regulatory factor in the immune response, hematopoietic differentiation, and immunomodulation. Although A20 has been identified in mammals, and despite recent efforts to identify A20 members in other higher vertebrates, relatively little is known about the composition of this regulator in other classes of vertebrates, particularly for bovines. In this study, the genetic context of bovine A20 was explored and compared against homologous genes in the human, mouse, chicken, dog, and zebrafish chromosomes. Through in silico analysis, several regions of interest were found conserved between even phylogenetically distant species. Additionally, a protein-deduced sequence of bovine A20 evidenced many conserved domains in humans and mice. Furthermore, all potential amino acid residues implicated in the active site of A20 were conserved. Finally, bovine A20 mRNA expression as mediated by the bovine viral diarrhea virus and poly (I:C) was evaluated. These analyses evidenced a strong fold increase in A20 expression following virus exposure, a phenomenon blocked by a pharmacological NF-κB inhibitor (BAY 117085). Interestingly, A20 mRNA had a half-life of only 32min, likely due to adenylate- and uridylate-rich elements in the 3'-untranslated region. Collectively, these data identify bovine A20 as a regulator of immune marker expression. Finally, this is the first report to find the bovine viral diarrhea virus modulating bovine A20 activation through the NF-κB pathway.
Collapse
Affiliation(s)
- Fernanda Fredericksen
- Facultad de Ciencias, Instituto de Bioquímica y Microbiología, Universidad Austral de Chile, Campus Isla Teja S/N, Valdivia, Chile
| | - Melina Villalba
- Facultad de Ciencias, Instituto de Bioquímica y Microbiología, Universidad Austral de Chile, Campus Isla Teja S/N, Valdivia, Chile
| | - Víctor H Olavarría
- Facultad de Ciencias, Instituto de Bioquímica y Microbiología, Universidad Austral de Chile, Campus Isla Teja S/N, Valdivia, Chile.
| |
Collapse
|
22
|
Brinkmann K, Winterhoff M, Önel SF, Schultz J, Faix J, Bogdan S. WHAMY is a novel actin polymerase promoting myoblast fusion, macrophage cell motility and sensory organ development in Drosophila. J Cell Sci 2015; 129:604-20. [PMID: 26675239 DOI: 10.1242/jcs.179325] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2015] [Accepted: 12/09/2015] [Indexed: 01/06/2023] Open
Abstract
Wiskott-Aldrich syndrome proteins (WASPs) are nucleation-promoting factors (NPF) that differentially control the Arp2/3 complex. In Drosophila, three different family members, SCAR (also known as WAVE), WASP and WASH (also known as CG13176), have been analyzed so far. Here, we characterized WHAMY, the fourth Drosophila WASP family member. whamy originated from a wasp gene duplication and underwent a sub-neofunctionalization. Unlike WASP, we found that WHAMY specifically interacted with activated Rac1 through its two CRIB domains, which were sufficient for targeting WHAMY to lamellipodial and filopodial tips. Biochemical analyses showed that WHAMY promoted exceptionally fast actin filament elongation, although it did not activate the Arp2/3 complex. Loss- and gain-of-function studies revealed an important function of WHAMY in membrane protrusions and cell migration in macrophages. Genetic data further implied synergistic functions between WHAMY and WASP during morphogenesis. Double mutants were late-embryonic lethal and showed severe defects in myoblast fusion. Trans-heterozygous mutant animals showed strongly increased defects in sensory cell fate specification. Thus, WHAMY is a novel actin polymerase with an initial partitioning of ancestral WASP functions in development and subsequent acquisition of a new function in cell motility during evolution.
Collapse
Affiliation(s)
- Klaus Brinkmann
- Institut für Neurobiologie, Universität Münster, Badestr. 9, Münster 48149, Germany
| | - Moritz Winterhoff
- Institut für Biophysikalische Chemie, Medizinische Hochschule Hannover, Carl-Neuberg Strasse 1, Hannover 30625, Germany
| | - Susanne-Filiz Önel
- Fachbereich Biologie, Entwicklungsbiologie, Philipps-Universität Marburg, Karl-von-Frisch Str. 8, Marburg 35043, Germany
| | - Jörg Schultz
- Center for Computational and Theoretical Biology, Campus Nord and Bioinformatik, Biozentrum, Am Hubland, Universität Würzburg, Würzburg 97074, Germany
| | - Jan Faix
- Institut für Biophysikalische Chemie, Medizinische Hochschule Hannover, Carl-Neuberg Strasse 1, Hannover 30625, Germany
| | - Sven Bogdan
- Institut für Neurobiologie, Universität Münster, Badestr. 9, Münster 48149, Germany
| |
Collapse
|
23
|
Sofuku K, Parrish NF, Honda T, Tomonaga K. Transcription Profiling Demonstrates Epigenetic Control of Non-retroviral RNA Virus-Derived Elements in the Human Genome. Cell Rep 2015; 12:1548-54. [PMID: 26321645 DOI: 10.1016/j.celrep.2015.08.007] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2015] [Revised: 07/03/2015] [Accepted: 08/03/2015] [Indexed: 01/22/2023] Open
Abstract
Endogenous bornavirus-like nucleoprotein elements (EBLNs) are DNA sequences in vertebrate genomes formed by the retrotransposon-mediated integration of ancient bornavirus sequence. Thus, EBLNs evidence a mechanism of retrotransposon-mediated RNA-to-DNA information flow from environment to animals. Although EBLNs are non-transposable, they share some features with retrotransposons. Here, to test whether hosts control the expression of EBLNs similarly to retrotransposons, we profiled the transcription of all Homo sapiens EBLNs (hsEBLN-1 to hsEBLN-7). We could detect transcription of all hsEBLNs in at least one tissue. Among them, hsEBLN-1 is transcribed almost exclusively in the testis. In most tissues, expression from the hsEBLN-1 locus is silenced epigenetically. Finally, we showed the possibility that hsEBLN-1 integration at this locus affects the expression of a neighboring gene. Our results suggest that hosts regulate the expression of endogenous non-retroviral virus elements similarly to how they regulate the expression of retrotransposons, possibly contributing to new transcripts and regulatory complexity to the human genome.
Collapse
Affiliation(s)
- Kozue Sofuku
- Department of Viral Oncology, Institute for Virus Research, Kyoto University, 53 Kawahara-cho, Shogoin, Sakyo-ku, Kyoto 606-8507, Japan
| | - Nicholas F Parrish
- Department of Viral Oncology, Institute for Virus Research, Kyoto University, 53 Kawahara-cho, Shogoin, Sakyo-ku, Kyoto 606-8507, Japan
| | - Tomoyuki Honda
- Department of Viral Oncology, Institute for Virus Research, Kyoto University, 53 Kawahara-cho, Shogoin, Sakyo-ku, Kyoto 606-8507, Japan.
| | - Keizo Tomonaga
- Department of Viral Oncology, Institute for Virus Research, Kyoto University, 53 Kawahara-cho, Shogoin, Sakyo-ku, Kyoto 606-8507, Japan.
| |
Collapse
|
24
|
Hoffmann FG, McGuire LP, Counterman BA, Ray DA. Transposable elements and small RNAs: Genomic fuel for species diversity. Mob Genet Elements 2015; 5:63-66. [PMID: 26904375 DOI: 10.1080/2159256x.2015.1066919] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2015] [Revised: 05/20/2015] [Accepted: 06/23/2015] [Indexed: 12/15/2022] Open
Abstract
While transposable elements (TE) have long been suspected of involvement in species diversification, identifying specific roles has been difficult. We recently found evidence of TE-derived regulatory RNAs in a species-rich family of bats. The TE-derived small RNAs are temporally associated with the burst of species diversification, suggesting that they may have been involved in the processes that led to the diversification. In this commentary, we expand on the ideas that were briefly touched upon in that manuscript. Specifically, we suggest avenues of research that may help to identify the roles that TEs may play in perturbing regulatory pathways. Such research endeavors may serve to inform evolutionary biologists of the ways that TEs have influenced the genomic and taxonomic diversity around us.
Collapse
Affiliation(s)
- Federico G Hoffmann
- Department of Biochemistry, Molecular Biology, Entomology and Plant Pathology; Mississippi State University; Mississippi State, MS USA; Institute for Genomics, Biocomputing, and Biotechnology; Mississippi State University; Mississippi State, MS USA
| | - Liam P McGuire
- Department of Biological Sciences; Texas Tech University ; Lubbock, TX USA
| | - Brian A Counterman
- Department of Biological Sciences; Mississippi State University ; Mississippi State, MS USA
| | - David A Ray
- Department of Biological Sciences; Texas Tech University ; Lubbock, TX USA
| |
Collapse
|
25
|
Abstract
Historically pseudogenes were believed to represent nonfunctional genomic fossils; however, there is emerging evidence that many of them could be biologically active. This possibility has ignited interest in pseudogene loci and made the need for their high-quality annotation more pressing as an accurate knowledge of all pseudogenes in the human reference genome sequence facilitates confident functional analysis. GENCODE have undertaken the first genome-wide pseudogene assignment for protein-coding genes combining both large-scale manual annotation and computational pseudogene prediction pipelines. Multiple computational predictions provide an unbiased set of hints for manual annotators to investigate, both during first-pass annotation and as part of QC to identify any potential missing pseudogene loci. Where a pseudogene is identified, the extent of its homology to the parent locus is fully investigated by a manual annotator; a pseudogene model is built and assigned to one of eight pseudogene biotypes depending on the mechanism of creation and on the presence of locus-specific transcriptional or proteomic data. The high-quality, information-rich set of pseudogenes created has been integrated with ENCODE functional genomics data, specifically expression level, transcription factor and RNA polymerase II binding, and chromatin marks. In this way we have been able to identify some pseudogenes that possess conventional characteristics of functionality as well as others with interesting patterns of partial activity, which might suggest that putatively inactive loci could be gaining a novel function, for example as long noncoding RNAs. The activity data associated with every pseudogene is stored in the psiDR resource.
Collapse
|
26
|
Brosius J. The persistent contributions of RNA to eukaryotic gen(om)e architecture and cellular function. Cold Spring Harb Perspect Biol 2014; 6:a016089. [PMID: 25081515 DOI: 10.1101/cshperspect.a016089] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
Currently, the best scenario for earliest forms of life is based on RNA molecules as they have the proven ability to catalyze enzymatic reactions and harbor genetic information. Evolutionary principles valid today become apparent in such models already. Furthermore, many features of eukaryotic genome architecture might have their origins in an RNA or RNA/protein (RNP) world, including the onset of a further transition, when DNA replaced RNA as the genetic bookkeeper of the cell. Chromosome maintenance, splicing, and regulatory function via RNA may be deeply rooted in the RNA/RNP worlds. Mostly in eukaryotes, conversion from RNA to DNA is still ongoing, which greatly impacts the plasticity of extant genomes. Raw material for novel genes encoding protein or RNA, or parts of genes including regulatory elements that selection can act on, continues to enter the evolutionary lottery.
Collapse
Affiliation(s)
- Jürgen Brosius
- Institute of Experimental Pathology (ZMBE), University of Münster, D-48149 Münster, Germany
| |
Collapse
|
27
|
Accelerated Evolution of Fetuin Family Proteins inProtobothrops flavoviridis(Habu Snake) Serum and the Discovery of an L1-Like Genomic Element in the Intronic Sequence of a Fetuin-Encoding Gene. Biosci Biotechnol Biochem 2014; 77:582-90. [DOI: 10.1271/bbb.120829] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
28
|
Liu YH, Zhang M, Wu C, Huang JJ, Zhang HB. DNA is structured as a linear "jigsaw puzzle" in the genomes of Arabidopsis, rice, and budding yeast. Genome 2014; 57:9-19. [PMID: 24564211 DOI: 10.1139/gen-2013-0099] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Knowledge of how a genome is structured and organized from its constituent elements is crucial to understanding its biology and evolution. Here, we report the genome structuring and organization pattern as revealed by systems analysis of the sequences of three model species, Arabidopsis, rice and yeast, at the whole-genome and chromosome levels. We found that all fundamental function elements (FFE) constituting the genomes, including genes (GEN), DNA transposable elements (DTE), retrotransposable elements (RTE), simple sequence repeats (SSR), and (or) low complexity repeats (LCR), are structured in a nonrandom and correlative manner, thus leading to a hypothesis that the DNA of the species is structured as a linear "jigsaw puzzle". Furthermore, we showed that different FFE differ in their importance in the formation and evolution of the DNA jigsaw puzzle structure between species. DTE and RTE play more important roles than GEN, LCR, and SSR in Arabidopsis, whereas GEN and RTE play more important roles than LCR, SSR, and DTE in rice. The genes having multiple recognized functions play more important roles than those having single functions. These results provide useful knowledge necessary for better understanding genome biology and evolution of the species and for effective molecular breeding of rice.
Collapse
Affiliation(s)
- Yun-Hua Liu
- a Department of Soil and Crop Sciences, Texas A&M University, College Station, Texas 77843-2474, USA
| | | | | | | | | |
Collapse
|
29
|
Herniou EA, Huguet E, Thézé J, Bézier A, Periquet G, Drezen JM. When parasitic wasps hijacked viruses: genomic and functional evolution of polydnaviruses. Philos Trans R Soc Lond B Biol Sci 2013; 368:20130051. [PMID: 23938758 PMCID: PMC3758193 DOI: 10.1098/rstb.2013.0051] [Citation(s) in RCA: 107] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
The Polydnaviridae (PDV), including the Bracovirus (BV) and Ichnovirus genera, originated from the integration of unrelated viruses in the genomes of two parasitoid wasp lineages, in a remarkable example of convergent evolution. Functionally active PDVs represent the most compelling evolutionary success among endogenous viral elements (EVEs). BV evolved from the domestication by braconid wasps of a nudivirus 100 Ma. The nudivirus genome has become an EVE involved in BV particle production but is not encapsidated. Instead, BV genomes have co-opted virulence genes, used by the wasps to control the immunity and development of their hosts. Gene transfers and duplications have shaped BV genomes, now encoding hundreds of genes. Phylogenomic studies suggest that BVs contribute largely to wasp diversification and adaptation to their hosts. A genome evolution model explains how multidirectional wasp adaptation to different host species could have fostered PDV genome extension. Integrative studies linking ecological data on the wasp to genomic analyses should provide new insights into the adaptive role of particular BV genes. Forthcoming genomic advances should also indicate if the associations between endoparasitoid wasps and symbiotic viruses evolved because of their particularly intimate interactions with their hosts, or if similar domesticated EVEs could be uncovered in other parasites.
Collapse
Affiliation(s)
| | | | | | | | | | - Jean-Michel Drezen
- Institut de Recherche sur la Biologie de l'Insecte, CNRS UMR 7261, Université François-Rabelais, Parc de Grandmont, 37200 Tours, France
| |
Collapse
|
30
|
RNA-Mediated Gene Duplication and Retroposons: Retrogenes, LINEs, SINEs, and Sequence Specificity. INTERNATIONAL JOURNAL OF EVOLUTIONARY BIOLOGY 2013; 2013:424726. [PMID: 23984183 PMCID: PMC3747384 DOI: 10.1155/2013/424726] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/14/2013] [Accepted: 07/01/2013] [Indexed: 11/18/2022]
Abstract
A substantial number of “retrogenes” that are derived from the mRNA of various intron-containing genes have been reported. A class of mammalian retroposons, long interspersed element-1 (LINE1, L1), has been shown to be involved in the reverse transcription of retrogenes (or processed pseudogenes) and non-autonomous short interspersed elements (SINEs). The 3′-end sequences of various SINEs originated from a corresponding LINE. As the 3′-untranslated regions of several LINEs are essential for retroposition, these LINEs presumably require “stringent” recognition of the 3′-end sequence of the RNA template. However, the 3′-ends of mammalian L1s do not exhibit any similarity to SINEs, except for the presence of 3′-poly(A) repeats. Since the 3′-poly(A) repeats of L1 and Alu SINE are critical for their retroposition, L1 probably recognizes the poly(A) repeats, thereby mobilizing not only Alu SINE but also cytosolic mRNA. Many flowering plants only harbor L1-clade LINEs and a significant number of SINEs with poly(A) repeats, but no homology to the LINEs. Moreover, processed pseudogenes have also been found in flowering plants. I propose that the ancestral L1-clade LINE in the common ancestor of green plants may have recognized a specific RNA template, with stringent recognition then becoming relaxed during the course of plant evolution.
Collapse
|
31
|
Moore AD, Grath S, Schüler A, Huylmans AK, Bornberg-Bauer E. Quantification and functional analysis of modular protein evolution in a dense phylogenetic tree. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS 2013; 1834:898-907. [PMID: 23376183 DOI: 10.1016/j.bbapap.2013.01.007] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/19/2012] [Revised: 01/06/2013] [Accepted: 01/09/2013] [Indexed: 12/24/2022]
Abstract
Modularity is a hallmark of molecular evolution. Whether considering gene regulation, the components of metabolic pathways or signaling cascades, the ability to reuse autonomous modules in different molecular contexts can expedite evolutionary innovation. Similarly, protein domains are the modules of proteins, and modular domain rearrangements can create diversity with seemingly few operations in turn allowing for swift changes to an organism's functional repertoire. Here, we assess the patterns and functional effects of modular rearrangements at high resolution. Using a well resolved and diverse group of pancrustaceans, we illustrate arrangement diversity within closely related organisms, estimate arrangement turnover frequency and establish, for the first time, branch-specific rate estimates for fusion, fission, domain addition and terminal loss. Our results show that roughly 16 new arrangements arise per million years and that between 64% and 81% of these can be explained by simple, single-step modular rearrangement events. We find evidence that the frequencies of fission and terminal deletion events increase over time, and that modular rearrangements impact all levels of the cellular signaling apparatus and thus may have strong adaptive potential. Novel arrangements that cannot be explained by simple modular rearrangements contain a significant amount of repeat domains that occur in complex patterns which we term "supra-repeats". Furthermore, these arrangements are significantly longer than those with a single-step rearrangement solution, suggesting that such arrangements may result from multi-step events. In summary, our analysis provides an integrated view and initial quantification of the patterns and functional impact of modular protein evolution in a well resolved phylogenetic tree. This article is part of a Special Issue entitled: The emerging dynamic view of proteins: Protein plasticity in allostery, evolution and self-assembly.
Collapse
Affiliation(s)
- Andrew D Moore
- Institute for Evolution and Biodiversity, Münster, Germany
| | | | | | | | | |
Collapse
|
32
|
|
33
|
The macrodomain family: Rethinking an ancient domain from evolutionary perspectives. CHINESE SCIENCE BULLETIN = KEXUE TONGBAO 2013; 58:953-960. [PMID: 32214744 PMCID: PMC7088686 DOI: 10.1007/s11434-013-5674-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 08/30/2012] [Accepted: 09/28/2012] [Indexed: 11/18/2022]
Abstract
The reasons why certain domains evolve much slower than others is unclear. The notion that functionally more important genes evolve more slowly than less important genes is one of the few commonly believed principles of molecular evolution. The macro-domain (also known as the X domain) is an ancient, slowly evolving and highly conserved structural domain found in proteins throughout all of the kingdoms and was first discovered nearly two decades ago with the isolation and cloning of macroH2A1. Macrodomains, which are functionally promiscuous, have been studied intensively for the past decade due to their importance in the regulation of cellular responses to DNA damage, chromatin remodeling, transcription and tumorigenesis. Recent structural, phylogenetic and biological analyses, however, suggest the need for some reconsideration of the evolutionary advantage of concentrating such a plethora of diverse functions into the macrodomain and of how macrodomains could perform so many functions. In this article, we focus on macrodomains that are evolving slowly and broadly discuss the potential relationship between the biological evolution and functional diversity of macrodomains.
Collapse
|
34
|
Serbielle C, Dupas S, Perdereau E, Héricourt F, Dupuy C, Huguet E, Drezen JM. Evolutionary mechanisms driving the evolution of a large polydnavirus gene family coding for protein tyrosine phosphatases. BMC Evol Biol 2012; 12:253. [PMID: 23270369 PMCID: PMC3573978 DOI: 10.1186/1471-2148-12-253] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2012] [Accepted: 12/11/2012] [Indexed: 11/20/2022] Open
Abstract
Background Gene duplications have been proposed to be the main mechanism involved in genome evolution and in acquisition of new functions. Polydnaviruses (PDVs), symbiotic viruses associated with parasitoid wasps, are ideal model systems to study mechanisms of gene duplications given that PDV genomes consist of virulence genes organized into multigene families. In these systems the viral genome is integrated in a wasp chromosome as a provirus and virus particles containing circular double-stranded DNA are injected into the parasitoids’ hosts and are essential for parasitism success. The viral virulence factors, organized in gene families, are required collectively to induce host immune suppression and developmental arrest. The gene family which encodes protein tyrosine phosphatases (PTPs) has undergone spectacular expansion in several PDV genomes with up to 42 genes. Results Here, we present strong indications that PTP gene family expansion occurred via classical mechanisms: by duplication of large segments of the chromosomally integrated form of the virus sequences (segmental duplication), by tandem duplications within this form and by dispersed duplications. We also propose a novel duplication mechanism specific to PDVs that involves viral circle reintegration into the wasp genome. The PTP copies produced were shown to undergo conservative evolution along with episodes of adaptive evolution. In particular recently produced copies have undergone positive selection in sites most likely involved in defining substrate selectivity. Conclusion The results provide evidence about the dynamic nature of polydnavirus proviral genomes. Classical and PDV-specific duplication mechanisms have been involved in the production of new gene copies. Selection pressures associated with antagonistic interactions with parasitized hosts have shaped these genes used to manipulate lepidopteran physiology with evidence for positive selection involved in adaptation to host targets.
Collapse
Affiliation(s)
- Céline Serbielle
- Institut de Recherche sur la Biologie de l'Insecte, UMR CNRS 7261, Faculté des Sciences et Techniques, Université F. Rabelais, Parc de Grandmont, 37200, Tours, France
| | | | | | | | | | | | | |
Collapse
|
35
|
Li Y, Xiao J, Wu J, Duan J, Liu Y, Ye X, Zhang X, Guo X, Gu Y, Zhang L, Jia J, Kong X. A tandem segmental duplication (TSD) in green revolution gene Rht-D1b region underlies plant height variation. THE NEW PHYTOLOGIST 2012; 196:282-291. [PMID: 22849513 DOI: 10.1111/j.1469-8137.2012.04243.x] [Citation(s) in RCA: 53] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/02/2023]
Abstract
• Rht-D1c (Rht10) carried by Chinese wheat (Triticum aestivum) line Aibian 1 is an allele at the Rht-D1 locus. Among the Rht-1 alleles, little is known about Rht-D1c although it determines an extreme dwarf phenotype in wheat. • Here, we cloned and functionally characterized Rht-D1c using a combination of Southern blotting, target region sequencing, gene expression analysis and transgenic experiments. • We found that the Rht-D1c allele was generated through a tandem segmental duplication (TSD) of a > 1 Mb region, resulting in two copies of the Rht-D1b. Two copies of Rht-D1b in the TSD were three-fold more effective in reducing plant height than a single copy, and transformation with a segment containing the tandemly duplicated copy of Rht-D1b resulted in the same level of reduction of plant height as the original copy in Aibian 1. • Our results suggest that changes in gene copy number are one of the important sources of genetic diversity and some of these changes could be directly associated with important traits in crops.
Collapse
Affiliation(s)
- Yiyuan Li
- College of Biology Sciences, China Agricultural University, No. 2 Yuanmingyuan West Road, Haidian District, Beijing 100094, China
- Key Laboratory of Crop Gene Resources and Germplasm Enhancement, Ministry of Agriculture, National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop Science, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Jianhui Xiao
- Key Laboratory of Crop Gene Resources and Germplasm Enhancement, Ministry of Agriculture, National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop Science, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Jiajie Wu
- Key Laboratory of Crop Gene Resources and Germplasm Enhancement, Ministry of Agriculture, National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop Science, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Jialei Duan
- College of Biology Sciences, China Agricultural University, No. 2 Yuanmingyuan West Road, Haidian District, Beijing 100094, China
- Key Laboratory of Crop Gene Resources and Germplasm Enhancement, Ministry of Agriculture, National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop Science, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Yue Liu
- Key Laboratory of Crop Gene Resources and Germplasm Enhancement, Ministry of Agriculture, National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop Science, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Xingguo Ye
- Key Laboratory of Crop Gene Resources and Germplasm Enhancement, Ministry of Agriculture, National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop Science, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Xin Zhang
- Key Laboratory of Crop Gene Resources and Germplasm Enhancement, Ministry of Agriculture, National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop Science, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Xiuping Guo
- Key Laboratory of Crop Gene Resources and Germplasm Enhancement, Ministry of Agriculture, National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop Science, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Yongqiang Gu
- United States Department of Agriculture, Agricultural Research Service, Western Regional Research Center, 800 Buchanan Street, Albany, CA 94710, USA
| | - Lichao Zhang
- Key Laboratory of Crop Gene Resources and Germplasm Enhancement, Ministry of Agriculture, National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop Science, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Jizeng Jia
- Key Laboratory of Crop Gene Resources and Germplasm Enhancement, Ministry of Agriculture, National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop Science, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Xiuying Kong
- Key Laboratory of Crop Gene Resources and Germplasm Enhancement, Ministry of Agriculture, National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop Science, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| |
Collapse
|
36
|
Wang Z, Zarlenga D, Martin J, Abubucker S, Mitreva M. Exploring metazoan evolution through dynamic and holistic changes in protein families and domains. BMC Evol Biol 2012; 12:138. [PMID: 22862991 PMCID: PMC3483195 DOI: 10.1186/1471-2148-12-138] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2012] [Accepted: 07/19/2012] [Indexed: 11/18/2022] Open
Abstract
Background Proteins convey the majority of biochemical and cellular activities in organisms. Over the course of evolution, proteins undergo normal sequence mutations as well as large scale mutations involving domain duplication and/or domain shuffling. These events result in the generation of new proteins and protein families. Processes that affect proteome evolution drive species diversity and adaptation. Herein, change over the course of metazoan evolution, as defined by birth/death and duplication/deletion events within protein families and domains, was examined using the proteomes of 9 metazoan and two outgroup species. Results In studying members of the three major metazoan groups, the vertebrates, arthropods, and nematodes, we found that the number of protein families increased at the majority of lineages over the course of metazoan evolution where the magnitude of these increases was greatest at the lineages leading to mammals. In contrast, the number of protein domains decreased at most lineages and at all terminal lineages. This resulted in a weak correlation between protein family birth and domain birth; however, the correlation between domain birth and domain member duplication was quite strong. These data suggest that domain birth and protein family birth occur via different mechanisms, and that domain shuffling plays a role in the formation of protein families. The ratio of protein family birth to protein domain birth (domain shuffling index) suggests that shuffling had a more demonstrable effect on protein families in nematodes and arthropods than in vertebrates. Through the contrast of high and low domain shuffling indices at the lineages of Trichinella spiralis and Gallus gallus, we propose a link between protein redundancy and evolutionary changes controlled by domain shuffling; however, the speed of adaptation among the different lineages was relatively invariant. Evaluating the functions of protein families that appeared or disappeared at the last common ancestors (LCAs) of the three metazoan clades supports a correlation with organism adaptation. Furthermore, bursts of new protein families and domains in the LCAs of metazoans and vertebrates are consistent with whole genome duplications. Conclusion Metazoan speciation and adaptation were explored by birth/death and duplication/deletion events among protein families and domains. Our results provide insights into protein evolution and its bearing on metazoan evolution.
Collapse
Affiliation(s)
- Zhengyuan Wang
- The Genome Institute, Washington University School of Medicine, St. Louis, MO 63108, USA
| | | | | | | | | |
Collapse
|
37
|
Sabath N, Wagner A, Karlin D. Evolution of viral proteins originated de novo by overprinting. Mol Biol Evol 2012; 29:3767-80. [PMID: 22821011 PMCID: PMC3494269 DOI: 10.1093/molbev/mss179] [Citation(s) in RCA: 104] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023] Open
Abstract
New protein-coding genes can originate either through modification of existing genes or de novo. Recently, the importance of de novo origination has been recognized in eukaryotes, although eukaryotic genes originated de novo are relatively rare and difficult to identify. In contrast, viruses contain many de novo genes, namely those in which an existing gene has been “overprinted” by a new open reading frame, a process that generates a new protein-coding gene overlapping the ancestral gene. We analyzed the evolution of 12 experimentally validated viral genes that originated de novo and estimated their relative ages. We found that young de novo genes have a different codon usage from the rest of the genome. They evolve rapidly and are under positive or weak purifying selection. Thus, young de novo genes might have strain-specific functions, or no function, and would be difficult to detect using current genome annotation methods that rely on the sequence signature of purifying selection. In contrast to young de novo genes, older de novo genes have a codon usage that is similar to the rest of the genome. They evolve slowly and are under stronger purifying selection. Some of the oldest de novo genes evolve under stronger selection pressure than the ancestral gene they overlap, suggesting an evolutionary tug of war between the ancestral and the de novo gene.
Collapse
Affiliation(s)
- Niv Sabath
- Institute of Evolutionary Biology and Environmental Studies, University of Zurich, Zurich, Switzerland.
| | | | | |
Collapse
|
38
|
Becker K, Braune M, Benderska N, Buratti E, Baralle F, Villmann C, Stamm S, Eulenburg V, Becker CM. A retroelement modifies pre-mRNA splicing: the murine Glrb(spa) allele is a splicing signal polymorphism amplified by long interspersed nuclear element insertion. J Biol Chem 2012; 287:31185-94. [PMID: 22782896 DOI: 10.1074/jbc.m112.375691] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
The glycine receptor-deficient mutant mouse spastic carries a full-length long interspersed nuclear element (LINE1) retrotransposon in intron 6 of the glycine receptor β subunit gene, Glrb(spa). The mutation arose in the C57BL/6J strain and is associated with skipping of exon 6 or a combination of the exons 5 and 6, thus resulting in a translational frameshift within the coding regions of the GlyR β subunit. The effect of the Glrb(spa) LINE1 insertion on pre-mRNA splicing was studied using a minigene approach. Sequence comparison as well as motif prediction and mutational analysis revealed that in addition to the LINE1 insertion the inactivation of an exonic splicing enhancer (ESE) within exon 6 is required for skipping of exon 6. Reconstitution of the ESE by substitution of a single residue was sufficient to prevent exon skipping. In addition to the ESE, two regions within the 5' and 3' UTR of the LINE1 were shown to be critical determinants for exon skipping, indicating that LINE1 acts as efficient modifier of subtle endogenous splicing phenotypes. Thus, the spastic allele of the murine glycine receptor β subunit gene is a two-hit mutation, where the hypomorphic alteration in an ESE is amplified by the insertion of a LINE1 element in the adjacent intron. Conversely, the LINE1 effect on splicing may be modulated by individual polymorphisms, depending on the insertional environment within the host genome.
Collapse
Affiliation(s)
- Kristina Becker
- Institut für Biochemie, Emil Fischer Zentrum, Universität Erlangen-Nürnberg, 91054 Erlangen, Germany
| | | | | | | | | | | | | | | | | |
Collapse
|
39
|
Platt II RN, Ray DA. A non-LTR retroelement extinction in Spermophilus tridecemlineatus. Gene 2012; 500:47-53. [DOI: 10.1016/j.gene.2012.03.051] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2012] [Revised: 03/08/2012] [Accepted: 03/09/2012] [Indexed: 10/28/2022]
|
40
|
Lei L, Zhou SL, Ma H, Zhang LS. Expansion and diversification of the SET domain gene family following whole-genome duplications in Populus trichocarpa. BMC Evol Biol 2012; 12:51. [PMID: 22497662 PMCID: PMC3402991 DOI: 10.1186/1471-2148-12-51] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2011] [Accepted: 04/12/2012] [Indexed: 01/03/2023] Open
Abstract
Background Histone lysine methylation modifies chromatin structure and regulates eukaryotic gene transcription and a variety of developmental and physiological processes. SET domain proteins are lysine methyltransferases containing the evolutionarily-conserved SET domain, which is known to be the catalytic domain. Results We identified 59 SET genes in the Populus genome. Phylogenetic analyses of 106 SET genes from Populus and Arabidopsis supported the clustering of SET genes into six distinct subfamilies and identified 19 duplicated gene pairs in Populus. The chromosome locations of these gene pairs and the distribution of synonymous substitution rates showed that the expansion of the SET gene family might be caused by large-scale duplications in Populus. Comparison of gene structures and domain architectures of each duplicate pair indicated that divergence took place at the 3'- and 5'-terminal transcribed regions and at the N- and C-termini of the predicted proteins, respectively. Expression profile analysis of Populus SET genes suggested that most Populus SET genes were expressed widely, many with the highest expression in young leaves. In particular, the expression profiles of 12 of the 19 duplicated gene pairs fell into two types of expression patterns. Conclusions The 19 duplicated SET genes could have originated from whole genome duplication events. The differences in SET gene structure, domain architecture, and expression profiles in various tissues of Populus suggest that members of the SET gene family have a variety of developmental and physiological functions. Our study provides clues about the evolution of epigenetic regulation of chromatin structure and gene expression.
Collapse
Affiliation(s)
- Li Lei
- 1State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, the Chinese Academy of Sciences, Beijing 100093, China
| | | | | | | |
Collapse
|
41
|
Janicki M, Rooke R, Yang G. Bioinformatics and genomic analysis of transposable elements in eukaryotic genomes. Chromosome Res 2012; 19:787-808. [PMID: 21850457 DOI: 10.1007/s10577-011-9230-7] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
A major portion of most eukaryotic genomes are transposable elements (TEs). During evolution, TEs have introduced profound changes to genome size, structure, and function. As integral parts of genomes, the dynamic presence of TEs will continue to be a major force in reshaping genomes. Early computational analyses of TEs in genome sequences focused on filtering out "junk" sequences to facilitate gene annotation. When the high abundance and diversity of TEs in eukaryotic genomes were recognized, these early efforts transformed into the systematic genome-wide categorization and classification of TEs. The availability of genomic sequence data reversed the classical genetic approaches to discovering new TE families and superfamilies. Curated TE databases and their accurate annotation of genome sequences in turn facilitated the studies on TEs in a number of frontiers including: (1) TE-mediated changes of genome size and structure, (2) the influence of TEs on genome and gene functions, (3) TE regulation by host, (4) the evolution of TEs and their population dynamics, and (5) genomic scale studies of TE activity. Bioinformatics and genomic approaches have become an integral part of large-scale studies on TEs to extract information with pure in silico analyses or to assist wet lab experimental studies. The current revolution in genome sequencing technology facilitates further progress in the existing frontiers of research and emergence of new initiatives. The rapid generation of large-sequence datasets at record low costs on a routine basis is challenging the computing industry on storage capacity and manipulation speed and the bioinformatics community for improvement in algorithms and their implementations.
Collapse
Affiliation(s)
- Mateusz Janicki
- Department of Biology, University of Toronto at Mississauga, 3359 Mississauga Road, Mississauga, ON L5L1C6, Canada
| | | | | |
Collapse
|
42
|
Abstract
BACKGROUND Gene orthology has been well studied in the evolutionary area and is thought to be an important implication to functional genome annotations. As the accumulation of transcriptomic data, alternative splicing is taken into account in the assignments of gene orthologs and the orthology is suggested to be further considered at transcript level. Whether gene or transcript orthology, exons are the basic units that represent the whole gene structure; however, there is no any reported study on how to build exon level orthology in a whole genome scale. Therefore, it is essential to establish a gene-oriented exon orthology dataset. RESULTS Using a customized pipeline, we first build exon orthologous relationships from assigned gene orthologs pairs in two well-annotated genomes: human and mouse. More than 92% of non-overlapping exons have at least one ortholog between human and mouse and only a small portion of them own more than one ortholog. The exons located in the coding region are more conserved in terms of finding their ortholog counterparts. Within the untranslated region, the 5' UTR seems to have more diversity than the 3' UTR according to exon orthology designations. Interestingly, most exons located in the coding region are also conserved in length but this conservation phenomenon dramatically drops down in untranslated regions. In addition, we allowed multiple assignments in exon orthologs and a subset of exons with possible fusion/split events were defined here after a thorough analysis procedure. CONCLUSIONS Identification of orthologs at the exon level is essential to provide a detailed way to interrogate gene orthology and splicing analysis. It could be used to extend the genome annotation as well. Besides examining the one-to-one orthologous relationship, we manage the one-to-multi exon pairs to represent complicated exon generation behavior. Our results can be further applied in many research fields studying intron-exon structure and alternative/constitutive exons in functional genomic areas.
Collapse
Affiliation(s)
- Gloria C-L Fu
- Institute of of Biomedical Informatics, National Yang-Ming University, Taipei, Taiwan
| | | |
Collapse
|
43
|
Kersting AR, Bornberg-Bauer E, Moore AD, Grath S. Dynamics and adaptive benefits of protein domain emergence and arrangements during plant genome evolution. Genome Biol Evol 2012; 4:316-29. [PMID: 22250127 PMCID: PMC3318442 DOI: 10.1093/gbe/evs004] [Citation(s) in RCA: 50] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Plant genomes are generally very large, mostly paleopolyploid, and have numerous gene duplicates and complex genomic features such as repeats and transposable elements. Many of these features have been hypothesized to enable plants, which cannot easily escape environmental challenges, to rapidly adapt. Another mechanism, which has recently been well described as a major facilitator of rapid adaptation in bacteria, animals, and fungi but not yet for plants, is modular rearrangement of protein-coding genes. Due to the high precision of profile-based methods, rearrangements can be well captured at the protein level by characterizing the emergence, loss, and rearrangements of protein domains, their structural, functional, and evolutionary building blocks. Here, we study the dynamics of domain rearrangements and explore their adaptive benefit in 27 plant and 3 algal genomes. We use a phylogenomic approach by which we can explain the formation of 88% of all arrangements by single-step events, such as fusion, fission, and terminal loss of domains. We find many domains are lost along every lineage, but at least 500 domains are novel, that is, they are unique to green plants and emerged more or less recently. These novel domains duplicate and rearrange more readily within their genomes than ancient domains and are overproportionally involved in stress response and developmental innovations. Novel domains more often affect regulatory proteins and show a higher degree of structural disorder than ancient domains. Whereas a relatively large and well-conserved core set of single-domain proteins exists, long multi-domain arrangements tend to be species-specific. We find that duplicated genes are more often involved in rearrangements. Although fission events typically impact metabolic proteins, fusion events often create new signaling proteins essential for environmental sensing. Taken together, the high volatility of single domains and complex arrangements in plant genomes demonstrate the importance of modularity for environmental adaptability of plants.
Collapse
Affiliation(s)
- Anna R Kersting
- Evolutionary Bioinformatics Group, Institute for Evolution and Biodiversity, University of Muenster (WWU), Germany
| | | | | | | |
Collapse
|
44
|
Rodgers-Melnick E, Mane SP, Dharmawardhana P, Slavov GT, Crasta OR, Strauss SH, Brunner AM, Difazio SP. Contrasting patterns of evolution following whole genome versus tandem duplication events in Populus. Genome Res 2011; 22:95-105. [PMID: 21974993 DOI: 10.1101/gr.125146.111] [Citation(s) in RCA: 106] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Comparative analysis of multiple angiosperm genomes has implicated gene duplication in the expansion and diversification of many gene families. However, empirical data and theory suggest that whole-genome and small-scale duplication events differ with respect to the types of genes preserved as duplicate pairs. We compared gene duplicates resulting from a recent whole genome duplication to a set of tandemly duplicated genes in the model forest tree Populus trichocarpa. We used a combination of microarray expression analyses of a diverse set of tissues and functional annotation to assess factors related to the preservation of duplicate genes of both types. Whole genome duplicates are 700 bp longer and are expressed in 20% more tissues than tandem duplicates. Furthermore, certain functional categories are over-represented in each class of duplicates. In particular, disease resistance genes and receptor-like kinases commonly occur in tandem but are significantly under-retained following whole genome duplication, while whole genome duplicate pairs are enriched for members of signal transduction cascades and transcription factors. The shape of the distribution of expression divergence for duplicated pairs suggests that nearly half of the whole genome duplicates have diverged in expression by a random degeneration process. The remaining pairs have more conserved gene expression than expected by chance, consistent with a role for selection under the constraints of gene balance. We hypothesize that duplicate gene preservation in Populus is driven by a combination of subfunctionalization of duplicate pairs and purifying selection favoring retention of genes encoding proteins with large numbers of interactions.
Collapse
Affiliation(s)
- Eli Rodgers-Melnick
- Department of Biology, West Virginia University, Morgantown, West Virginia 26506, USA
| | | | | | | | | | | | | | | |
Collapse
|
45
|
Suvorova YM, Rudenko VM, Korotkov EV. Detection change points of triplet periodicity of gene. Gene 2011; 491:58-64. [PMID: 21982972 DOI: 10.1016/j.gene.2011.08.032] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2011] [Revised: 08/10/2011] [Accepted: 08/25/2011] [Indexed: 10/17/2022]
Abstract
The triplet periodicity (TP) is a distinguished property of protein coding sequences. There are complex genes with more than one TP type along their sequence. We say that these genes contain a triplet periodicity change point. The aim of the work is to find all genes that contain TP change point and attempt to compare the positions of change point in genes with known biological data. We have developed a mathematical method to identify triplet periodicity changes along a sequence. We have found 311,221 genes with the TP change point in the KEGG/Genes database (version 48). It is about 8% from the total database volume (4013150). We showed that the repetitive sequences are not the only cause of such events. We suppose that the TP change point may indicate a fusion of genes or domains. We performed BLAST analysis to find potential ancestral genes for the parts of genes with TP change point. As a result we found that in 131323 cases sequences with TP change point have proper similarities for one or both parts. The relationship between TP change point and the fusion events in genes is discussed. The program realization of the method is available by request to authors.
Collapse
Affiliation(s)
- Yulia M Suvorova
- Bioinfomatics Laboratory, Centre of Bioengineering, Russian Academy of Sciences, 117312, Moscow, Prospect 60-tya Oktyabrya, 7/1, Russia.
| | | | | |
Collapse
|
46
|
Zhang X, Wang L, Yuan Y, Tian D, Yang S. Rapid copy number expansion and recent recruitment of domains in S-receptor kinase-like genes contribute to the origin of self-incompatibility. FEBS J 2011; 278:4323-37. [DOI: 10.1111/j.1742-4658.2011.08349.x] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
47
|
Ezawa K, Ikeo K, Gojobori T, Saitou N. Evolutionary patterns of recently emerged animal duplogs. Genome Biol Evol 2011; 3:1119-35. [PMID: 21859807 PMCID: PMC3194840 DOI: 10.1093/gbe/evr074] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Duplogs, or intraspecies paralogs, constitute the important portion of eukaryote genomes and serve as a major source of functional innovation. We conducted detailed analyses of recently emerged animal duplogs. Genome data of three vertebrate species (Homo sapiens, Mus musculus, and Danio rerio), Caenorhabditis elegans, and two Drosophila species (Drosophila melanogaster and D. pseudoobscura) were used. Duplication events were divided into six age-groups according to the synonymous distance (dS) up to 0.6. Duplogs were classified into four equal-sized classes on physical distances and into three classes on relative orientations. We observed the following shared characteristics among intrachromosomal multiexon duplogs: 1) inverted duplogs account for 20-50%, and about a half of the physically most distant 25%; 2) except for C. elegans, the composition of physical distances, that of relative orientations, and the proportion of inverted duplogs in each physical distance category are more or less uniform; 3) except for C. elegans, the characteristics of the youngest (dS < 0.01) duplogs are similar to the overall characteristics of the entire set. These results suggest that intrachromosomal duplogs with fairly long physical distances were generated at once, rather than resulting from tandem duplications and subsequent genomic rearrangements. This is different from the three well-known modes of gene duplication: tandem duplication, retrotransposition, and genome duplication. We termed this new mode as "drift" duplication. The drift duplication has been producing duplicate copies at paces comparable with tandem duplications since the common ancestor of vertebrates, and it may have already operated in the common ancestor of bilateral animals.
Collapse
Affiliation(s)
- Kiyoshi Ezawa
- Division of Population Genetics, National Institute of Genetics, Mishima, Japan
| | | | | | | |
Collapse
|
48
|
Mészáros B, Tóth J, Vértessy BG, Dosztányi Z, Simon I. Proteins with complex architecture as potential targets for drug design: a case study of Mycobacterium tuberculosis. PLoS Comput Biol 2011; 7:e1002118. [PMID: 21814507 PMCID: PMC3140968 DOI: 10.1371/journal.pcbi.1002118] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2011] [Accepted: 05/24/2011] [Indexed: 02/04/2023] Open
Abstract
Lengthy co-evolution of Homo sapiens and Mycobacterium tuberculosis, the main causative agent of tuberculosis, resulted in a dramatically successful pathogen species that presents considerable challenge for modern medicine. The continuous and ever increasing appearance of multi-drug resistant mycobacteria necessitates the identification of novel drug targets and drugs with new mechanisms of action. However, further insights are needed to establish automated protocols for target selection based on the available complete genome sequences. In the present study, we perform complete proteome level comparisons between M. tuberculosis, mycobacteria, other prokaryotes and available eukaryotes based on protein domains, local sequence similarities and protein disorder. We show that the enrichment of certain domains in the genome can indicate an important function specific to M. tuberculosis. We identified two families, termed pkn and PE/PPE that stand out in this respect. The common property of these two protein families is a complex domain organization that combines species-specific regions, commonly occurring domains and disordered segments. Besides highlighting promising novel drug target candidates in M. tuberculosis, the presented analysis can also be viewed as a general protocol to identify proteins involved in species-specific functions in a given organism. We conclude that target selection protocols should be extended to include proteins with complex domain architectures instead of focusing on sequentially unique and essential proteins only.
Collapse
Affiliation(s)
- Bálint Mészáros
- Institute of Enzymology, Hungarian Academy of Sciences, Budapest, Hungary
| | - Judit Tóth
- Institute of Enzymology, Hungarian Academy of Sciences, Budapest, Hungary
| | - Beáta G. Vértessy
- Institute of Enzymology, Hungarian Academy of Sciences, Budapest, Hungary
- Department of Applied Biotechnology, Budapest University of Technology and Economics, Budapest, Hungary
| | - Zsuzsanna Dosztányi
- Institute of Enzymology, Hungarian Academy of Sciences, Budapest, Hungary
- * E-mail: (ZD); (IS)
| | - István Simon
- Institute of Enzymology, Hungarian Academy of Sciences, Budapest, Hungary
- * E-mail: (ZD); (IS)
| |
Collapse
|
49
|
Ray DA, Batzer MA. Reading TE leaves: new approaches to the identification of transposable element insertions. Genome Res 2011; 21:813-20. [PMID: 21632748 PMCID: PMC3106314 DOI: 10.1101/gr.110528.110] [Citation(s) in RCA: 49] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
Transposable elements (TEs) are a tremendous source of genome instability and genetic variation. Of particular interest to investigators of human biology and human evolution are retrotransposon insertions that are recent and/or polymorphic in the human population. As a consequence, the ability to assay large numbers of polymorphic TEs in a given genome is valuable. Five recent manuscripts each propose methods to scan whole human genomes to identify, map, and, in some cases, genotype polymorphic retrotransposon insertions in multiple human genomes simultaneously. These technologies promise to revolutionize our ability to analyze human genomes for TE-based variation important to studies of human variability and human disease. Furthermore, the approaches hold promise for researchers interested in nonhuman genomic variability. Herein, we explore the methods reported in the manuscripts and discuss their applications to aspects of human biology and the biology of other organisms.
Collapse
Affiliation(s)
- David A. Ray
- Department of Biochemistry and Molecular Biology, Mississippi State University, Mississippi State, Mississippi 39762, USA
| | - Mark A. Batzer
- Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana 70803, USA
| |
Collapse
|
50
|
Siegal-Gaskins D, Mejia-Guerra MK, Smith GD, Grotewold E. Emergence of switch-like behavior in a large family of simple biochemical networks. PLoS Comput Biol 2011; 7:e1002039. [PMID: 21589886 PMCID: PMC3093349 DOI: 10.1371/journal.pcbi.1002039] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2010] [Accepted: 03/21/2011] [Indexed: 01/13/2023] Open
Abstract
Bistability plays a central role in the gene regulatory networks (GRNs) controlling many essential biological functions, including cellular differentiation and cell cycle control. However, establishing the network topologies that can exhibit bistability remains a challenge, in part due to the exceedingly large variety of GRNs that exist for even a small number of components. We begin to address this problem by employing chemical reaction network theory in a comprehensive in silico survey to determine the capacity for bistability of more than 40,000 simple networks that can be formed by two transcription factor-coding genes and their associated proteins (assuming only the most elementary biochemical processes). We find that there exist reaction rate constants leading to bistability in ∼90% of these GRN models, including several circuits that do not contain any of the TF cooperativity commonly associated with bistable systems, and the majority of which could only be identified as bistable through an original subnetwork-based analysis. A topological sorting of the two-gene family of networks based on the presence or absence of biochemical reactions reveals eleven minimal bistable networks (i.e., bistable networks that do not contain within them a smaller bistable subnetwork). The large number of previously unknown bistable network topologies suggests that the capacity for switch-like behavior in GRNs arises with relative ease and is not easily lost through network evolution. To highlight the relevance of the systematic application of CRNT to bistable network identification in real biological systems, we integrated publicly available protein-protein interaction, protein-DNA interaction, and gene expression data from Saccharomyces cerevisiae, and identified several GRNs predicted to behave in a bistable fashion. Switch-like behavior is found across a wide range of biological systems, and as a result there is significant interest in identifying the various ways in which biochemical reactions can be combined to yield a switch-like response. In this work we use a set of mathematical tools from chemical reaction network theory that provide information about the steady-states of a reaction network irrespective of the values of network rate constants, to conduct a large computational study of a family of model networks consisting of only two protein-coding genes. We find that a large majority of these networks (∼90%) have (for some set of parameters) the mathematical property known as bistability and can behave in a switch-like manner. Interestingly, the capacity for switch-like behavior is often maintained as networks increase in size through the introduction of new reactions. We then demonstrate using published yeast data how theoretical parameter-free surveys such as this one can be used to discover possible switch-like circuits in real biological systems. Our results highlight the potential usefulness of parameter-free modeling for the characterization of complex networks and to the study of network evolution, and are suggestive of a role for it in the development of novel synthetic biological switches.
Collapse
Affiliation(s)
- Dan Siegal-Gaskins
- Mathematical Biosciences Institute, The Ohio State University, Columbus, Ohio, United States of America.
| | | | | | | |
Collapse
|