1
|
Shang M, Wang J, Dai G, Zheng J, Liao B, Wang J, Duan B. Comparative analysis of chloroplast genome and new insights into phylogenetic relationships of Ajuga and common adulterants. FRONTIERS IN PLANT SCIENCE 2023; 14:1251829. [PMID: 37954994 PMCID: PMC10634298 DOI: 10.3389/fpls.2023.1251829] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/02/2023] [Accepted: 10/11/2023] [Indexed: 11/14/2023]
Abstract
Introduction The potential contamination of herbal medicinal products poses a significant concern for consumer health. Given the limited availability of genetic information concerning Ajuga species, it becomes imperative to incorporate supplementary molecular markers to enhance and ensure accurate species identification. Methods In this study, the chloroplast (cp) genomes of seven species of the genus Ajuag were sequenced, de novo assembled and characterized. Results exhibiting lengths ranging from 150,342 bp to 150,472 bp, encompassing 86 - 88 protein-coding genes (PCGs), 35 - 37 transfer RNA, and eight ribosomal RNA. The repetitive sequences, codon uses, and cp genomes of seven species were highly conserved, and PCGs were the reliable molecular markers for investigating the phylogenetic relationship within the Ajuga genus. Moreover, four mutation hotspot regions (accD-psaI, atpH-atpI, ndhC-trnV(UAC), and ndhF-rpl23) were identified within cp genomes of Ajuga, which could help distinguish A. bracteosa and its contaminants. Based on cp genomes and PCGs, the phylogenetic tree preliminary confirmed the position of Ajuga within the Lamiaceae family. It strongly supported a sister relationship between Subsect. Genevense and Subsect. Biflorae, suggesting the merger of Subsect. Biflorae and Subsect. Genevenses into one group rather than maintaining separate categorizations. Additionally, molecular clock analysis estimated the divergence time of Ajuga to be around 7.78 million years ago. Discussion The species authentication, phylogeny, and evolution analyses of the Ajuga species may benefit from the above findings.
Collapse
Affiliation(s)
- Mingyue Shang
- College of Pharmaceutical Science, Dali University, Dali, China
| | - Jiale Wang
- College of Pharmaceutical Science, Dali University, Dali, China
| | - Guona Dai
- College of Pharmaceutical Science, Dali University, Dali, China
| | - Jiamei Zheng
- College of Pharmaceutical Science, Dali University, Dali, China
| | - Binbin Liao
- College of Pharmaceutical Science, Dali University, Dali, China
| | - Jing Wang
- College of Pharmaceutical Science, Dali University, Dali, China
- College of Life Science, Northeast Forestry University, Harbin, China
| | - Baozhong Duan
- College of Pharmaceutical Science, Dali University, Dali, China
| |
Collapse
|
2
|
Yang Z, Jiang Y, Gong J, Li Q, Dun B, Liu D, Yin F, Yuan L, Zhou X, Wang H, Wang J, Zhan Z, Shah N, Nwafor CC, Zhou Y, Chen P, Zhu L, Li S, Wang B, Xiang J, Zhou Y, Li Z, Piao Z, Yang Q, Zhang C. R gene triplication confers European fodder turnip with improved clubroot resistance. PLANT BIOTECHNOLOGY JOURNAL 2022; 20:1502-1517. [PMID: 35445530 PMCID: PMC9342621 DOI: 10.1111/pbi.13827] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/14/2021] [Revised: 03/30/2022] [Accepted: 04/10/2022] [Indexed: 05/08/2023]
Abstract
Clubroot is one of the most important diseases for many important cruciferous vegetables and oilseed crops worldwide. Different clubroot resistance (CR) loci have been identified from only limited species in Brassica, making it difficult to compare and utilize these loci. European fodder turnip ECD04 is considered one of the most valuable resources for CR breeding. To explore the genetic and evolutionary basis of CR in ECD04, we sequenced the genome of ECD04 using de novo assembly and identified 978 candidate R genes. Subsequently, the 28 published CR loci were physically mapped to 15 loci in the ECD04 genome, including 62 candidate CR genes. Among them, two CR genes, CRA3.7.1 and CRA8.2.4, were functionally validated. Phylogenetic analysis revealed that CRA3.7.1 and CRA8.2.4 originated from a common ancestor before the whole-genome triplication (WGT) event. In clubroot susceptible Brassica species, CR-gene homologues were affected by transposable element (TE) insertion, resulting in the loss of CR function. It can be concluded that the current functional CR genes in Brassica rapa and non-functional CR genes in other Brassica species were derived from a common ancestral gene before WGT. Finally, a hypothesis for CR gene evolution is proposed for further discussion.
Collapse
Affiliation(s)
- Zhiquan Yang
- Hubei Key Laboratory of Agricultural BioinformaticsCollege of InformaticsHuazhong Agricultural UniversityWuhanChina
| | - Yingfen Jiang
- Institute of Crop ScienceAnhui Academy of Agricultural ScienceHefeiChina
- National Key Laboratory of Crop Genetic Improvement and College of Plant Science and TechnologyHuazhong Agricultural UniversityWuhanChina
| | - Jianfang Gong
- National Key Laboratory of Crop Genetic Improvement and College of Plant Science and TechnologyHuazhong Agricultural UniversityWuhanChina
| | - Qian Li
- National Key Laboratory of Crop Genetic Improvement and College of Plant Science and TechnologyHuazhong Agricultural UniversityWuhanChina
| | - Bicheng Dun
- National Key Laboratory of Crop Genetic Improvement and College of Plant Science and TechnologyHuazhong Agricultural UniversityWuhanChina
- Yangtze River Rare Plant Research InstituteChina Three Gorges CorporationYichangChina
| | - Dongxu Liu
- Hubei Key Laboratory of Agricultural BioinformaticsCollege of InformaticsHuazhong Agricultural UniversityWuhanChina
| | - Feifan Yin
- Hubei Key Laboratory of Agricultural BioinformaticsCollege of InformaticsHuazhong Agricultural UniversityWuhanChina
| | - Lei Yuan
- National Key Laboratory of Crop Genetic Improvement and College of Plant Science and TechnologyHuazhong Agricultural UniversityWuhanChina
| | - Xueqing Zhou
- National Key Laboratory of Crop Genetic Improvement and College of Plant Science and TechnologyHuazhong Agricultural UniversityWuhanChina
| | - Huiying Wang
- National Key Laboratory of Crop Genetic Improvement and College of Plant Science and TechnologyHuazhong Agricultural UniversityWuhanChina
| | - Jing Wang
- Hubei Key Laboratory of Agricultural BioinformaticsCollege of InformaticsHuazhong Agricultural UniversityWuhanChina
| | - Zongxiang Zhan
- College of HorticultureShenyang Agricultural UniversityShenyangChina
| | - Nadil Shah
- National Key Laboratory of Crop Genetic Improvement and College of Plant Science and TechnologyHuazhong Agricultural UniversityWuhanChina
| | - Chinedu Charles Nwafor
- National Key Laboratory of Crop Genetic Improvement and College of Plant Science and TechnologyHuazhong Agricultural UniversityWuhanChina
| | - Yuanwei Zhou
- Yichang Academy of Agricultural ScienceYichangChina
| | - Peng Chen
- National Key Laboratory of Crop Genetic Improvement and College of Plant Science and TechnologyHuazhong Agricultural UniversityWuhanChina
| | - Li Zhu
- Collaborative Innovation Center for the Characteristic Resources Exploitation of Dabie Mountains and College of Biology and Agriculture ResourceHuanggang Normal UniversityHuanggangChina
| | - Shisheng Li
- Collaborative Innovation Center for the Characteristic Resources Exploitation of Dabie Mountains and College of Biology and Agriculture ResourceHuanggang Normal UniversityHuanggangChina
| | - Bingrui Wang
- National Key Laboratory of Crop Genetic Improvement and College of Plant Science and TechnologyHuazhong Agricultural UniversityWuhanChina
| | - Jun Xiang
- Collaborative Innovation Center for the Characteristic Resources Exploitation of Dabie Mountains and College of Biology and Agriculture ResourceHuanggang Normal UniversityHuanggangChina
| | - Yongming Zhou
- National Key Laboratory of Crop Genetic Improvement and College of Plant Science and TechnologyHuazhong Agricultural UniversityWuhanChina
| | - Zaiyun Li
- National Key Laboratory of Crop Genetic Improvement and College of Plant Science and TechnologyHuazhong Agricultural UniversityWuhanChina
| | - Zhongyun Piao
- College of HorticultureShenyang Agricultural UniversityShenyangChina
| | - Qingyong Yang
- Hubei Key Laboratory of Agricultural BioinformaticsCollege of InformaticsHuazhong Agricultural UniversityWuhanChina
| | - Chunyu Zhang
- National Key Laboratory of Crop Genetic Improvement and College of Plant Science and TechnologyHuazhong Agricultural UniversityWuhanChina
| |
Collapse
|
3
|
Guiguen Y, Pasquier J, Fostier A, Bobe J. Evolution of sex hormone binding globulins reveals early gene duplication at the root of vertebrates. Gen Comp Endocrinol 2021; 300:113646. [PMID: 33058907 DOI: 10.1016/j.ygcen.2020.113646] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/11/2020] [Revised: 07/02/2020] [Accepted: 10/06/2020] [Indexed: 10/23/2022]
Abstract
Sex hormone-binding globulin (Shbg) is an important vertebrate blood carrier protein synthetized in the liver and involved in the transport and local regulation of sex steroids in target tissues. A novel shbg gene (shbgb) with a predominant ovarian expression was recently characterized. Being initially found only in salmonids, this shbgb was originally thought to result from the Salmonid-specific whole genome duplication. Using updated transcriptomic and genomic resources we identified Shbgb orthologs in non-salmonid teleosts (European eel, arowana), holosteans (spotted gar, bowfin), polypteriformes (reedfish), agnatha (sea lamprey) and in amphibians, and found that the classical Shbg gene (Shbga) displays a predominant hepatic expression whereas Shbgb has a predominant gonadal expression. Together, these results indicate that these two Shgb genes most likely originate from a whole genome duplication event at the root of vertebrate evolution, followed by numerous and independent losses and by tissue expression specialization of Shbga and Shbgb paralogs.
Collapse
|
4
|
Premzl M. Comparative genomic analysis of eutherian fibroblast growth factor genes. BMC Genomics 2020; 21:542. [PMID: 32758140 PMCID: PMC7430813 DOI: 10.1186/s12864-020-06958-4] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2019] [Accepted: 07/29/2020] [Indexed: 12/14/2022] Open
Abstract
Background The eutherian fibroblast growth factors were implicated as key regulators in developmental processes. However, there were major disagreements in descriptions of comprehensive eutherian fibroblast growth factors gene data sets including either 18 or 22 homologues. The present analysis attempted to revise and update comprehensive eutherian fibroblast growth factor gene data sets, and address and resolve major discrepancies in their descriptions using eutherian comparative genomic analysis protocol and 35 public eutherian reference genomic sequence data sets. Results Among 577 potential coding sequences, the tests of reliability of eutherian public genomic sequences annotated most comprehensive curated eutherian third-party data gene data set of fibroblast growth factor genes including 267 complete coding sequences. The present study first described 8 superclusters including 22 eutherian fibroblast growth factor major gene clusters, proposing their updated classification and nomenclature. Conclusions The integrated gene annotations, phylogenetic analysis and protein molecular evolution analysis argued that comprehensive eutherian fibroblast growth factor gene data set classifications included 22 rather than 18 homologues.
Collapse
|
5
|
Suzuki N, Ochi H. Regeneration enhancers: A clue to reactivation of developmental genes. Dev Growth Differ 2020; 62:343-354. [PMID: 32096563 PMCID: PMC7383998 DOI: 10.1111/dgd.12654] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2019] [Revised: 01/17/2020] [Accepted: 01/20/2020] [Indexed: 12/18/2022]
Abstract
During tissue and organ regeneration, cells initially detect damage and then alter nuclear transcription in favor of tissue/organ reconstruction. Until recently, studies of tissue regeneration have focused on the identification of relevant genes. These studies show that many developmental genes are reused during regeneration. Concurrently, comparative genomics studies have shown that the total number of genes does not vastly differ among vertebrate taxa. Moreover, functional analyses of developmental genes using various knockout/knockdown techniques demonstrated that the functions of these genes are conserved among vertebrates. Despite these data, the ability to regenerate damaged body parts varies widely between animals. Thus, it is important to determine how regenerative transcriptional programs are triggered and why animals with low regenerative potential fail to express developmental genes after injury. Recently, we discovered relevant enhancers and named them regeneration signal-response enhancers (RSREs) after identifying their activation mechanisms in a Xenopus laevis transgenic system. In this review, we summarize recent studies of injury/regeneration-associated enhancers and then discuss their mechanisms of activation.
Collapse
Affiliation(s)
- Nanoka Suzuki
- Amphibian Research CenterHiroshima UniversityHigashi‐HiroshimaJapan
| | - Haruki Ochi
- Institute for Promotion of Medical Science ResearchFaculty of MedicineYamagata UniversityYamagataJapan
| |
Collapse
|
6
|
Abstract
The eutherian connexins were characterized as protein constituents of gap junctions implicated in cell-cell communications between adjoining cells in multiple cell types, regulation of major physiological processes and disease pathogeneses. However, conventional connexin gene and protein classifications could be regarded as unsuitable in descriptions of comprehensive eutherian connexin gene data sets, due to ambiguities and inconsistencies in connexin gene and protein nomenclatures. Using eutherian comparative genomic analysis protocol and 35 public eutherian reference genomic sequence data sets, the present analysis attempted to update and revise comprehensive eutherian connexin gene data sets, and address and resolve major discrepancies in their descriptions. Among 631 potential coding sequences, the tests of reliability of eutherian public genomic sequences annotated, in aggregate, 349 connexin complete coding sequences. The most comprehensive curated eutherian connexin gene data set described 21 major gene clusters, 4 of which included evidence of differential gene expansions. For example, the present gene annotations initially described human CXNK1 gene and annotated 22 human connexin genes. Phylogenetic tree calculations and calculations of pairwise nucleotide sequence identity patterns proposed revised and updated phylogenetic classification of eutherian connexin genes. Therefore, the present study integrating gene annotations, phylogenetic analysis and protein molecular evolution analysis proposed new nomenclature of eutherian connexin genes and proteins.
Collapse
|
7
|
Busby BP, Niktab E, Roberts CA, Sheridan JP, Coorey NV, Senanayake DS, Connor LM, Munkacsi AB, Atkinson PH. Genetic interaction networks mediate individual statin drug response in Saccharomyces cerevisiae. NPJ Syst Biol Appl 2019; 5:35. [PMID: 31602312 PMCID: PMC6776536 DOI: 10.1038/s41540-019-0112-5] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2018] [Accepted: 08/20/2019] [Indexed: 01/19/2023] Open
Abstract
Eukaryotic genetic interaction networks (GINs) are extensively described in the Saccharomyces cerevisiae S288C model using deletion libraries, yet being limited to this one genetic background, not informative to individual drug response. Here we created deletion libraries in three additional genetic backgrounds. Statin response was probed with five queries against four genetic backgrounds. The 20 resultant GINs representing drug-gene and gene-gene interactions were not conserved by functional enrichment, hierarchical clustering, and topology-based community partitioning. An unfolded protein response (UPR) community exhibited genetic background variation including different betweenness genes that were network bottlenecks, and we experimentally validated this UPR community via measurements of the UPR that were differentially activated and regulated in statin-resistant strains relative to the statin-sensitive S288C background. These network analyses by topology and function provide insight into the complexity of drug response influenced by genetic background.
Collapse
Affiliation(s)
- Bede P. Busby
- Centre for Biodiscovery, School of Biological Sciences, Victoria University of Wellington, Wellington, New Zealand
- European Molecular Biology Laboratory, Meyerhofstraße 1, 69117 Heidelberg, Germany
| | - Eliatan Niktab
- Centre for Biodiscovery, School of Biological Sciences, Victoria University of Wellington, Wellington, New Zealand
| | - Christina A. Roberts
- Centre for Biodiscovery, School of Biological Sciences, Victoria University of Wellington, Wellington, New Zealand
| | - Jeffrey P. Sheridan
- Centre for Biodiscovery, School of Biological Sciences, Victoria University of Wellington, Wellington, New Zealand
| | - Namal V. Coorey
- Centre for Biodiscovery, School of Biological Sciences, Victoria University of Wellington, Wellington, New Zealand
| | - Dinindu S. Senanayake
- Centre for Biodiscovery, School of Biological Sciences, Victoria University of Wellington, Wellington, New Zealand
| | - Lisa M. Connor
- Centre for Biodiscovery, School of Biological Sciences, Victoria University of Wellington, Wellington, New Zealand
| | - Andrew B. Munkacsi
- Centre for Biodiscovery, School of Biological Sciences, Victoria University of Wellington, Wellington, New Zealand
| | - Paul H. Atkinson
- Centre for Biodiscovery, School of Biological Sciences, Victoria University of Wellington, Wellington, New Zealand
| |
Collapse
|
8
|
Casillas S, Mulet R, Villegas-Mirón P, Hervas S, Sanz E, Velasco D, Bertranpetit J, Laayouni H, Barbadilla A. PopHuman: the human population genomics browser. Nucleic Acids Res 2019; 46:D1003-D1010. [PMID: 29059408 PMCID: PMC5753332 DOI: 10.1093/nar/gkx943] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2017] [Accepted: 10/04/2017] [Indexed: 12/17/2022] Open
Abstract
The 1000 Genomes Project (1000GP) represents the most comprehensive world-wide nucleotide variation data set so far in humans, providing the sequencing and analysis of 2504 genomes from 26 populations and reporting >84 million variants. The availability of this sequence data provides the human lineage with an invaluable resource for population genomics studies, allowing the testing of molecular population genetics hypotheses and eventually the understanding of the evolutionary dynamics of genetic variation in human populations. Here we present PopHuman, a new population genomics-oriented genome browser based on JBrowse that allows the interactive visualization and retrieval of an extensive inventory of population genetics metrics. Efficient and reliable parameter estimates have been computed using a novel pipeline that faces the unique features and limitations of the 1000GP data, and include a battery of nucleotide variation measures, divergence and linkage disequilibrium parameters, as well as different tests of neutrality, estimated in non-overlapping windows along the chromosomes and in annotated genes for all 26 populations of the 1000GP. PopHuman is open and freely available at http://pophuman.uab.cat.
Collapse
Affiliation(s)
- Sònia Casillas
- Institut de Biotecnologia i de Biomedicina and Department de Genètica i de Microbiologia, Universitat Autònoma de Barcelona, 08193 Bellaterra, Barcelona, Spain
- To whom correspondence should be addressed. Sònia Casillas. Tel: +34 93 5868958; Fax: +34 93 5812011; . Correspondence may also be addressed to Antonio Barbadilla.
| | - Roger Mulet
- Institut de Biotecnologia i de Biomedicina and Department de Genètica i de Microbiologia, Universitat Autònoma de Barcelona, 08193 Bellaterra, Barcelona, Spain
| | - Pablo Villegas-Mirón
- Institute of Evolutionary Biology (UPF-CSIC), Universitat Pompeu Fabra, Doctor Aiguader 88 (PRBB), 08003 Barcelona, Catalonia, Spain
| | - Sergi Hervas
- Institut de Biotecnologia i de Biomedicina and Department de Genètica i de Microbiologia, Universitat Autònoma de Barcelona, 08193 Bellaterra, Barcelona, Spain
| | - Esteve Sanz
- Servei de Genòmica i Bioinformàtica, Universitat Autònoma de Barcelona, 08193 Bellaterra, Barcelona, Spain
| | - Daniel Velasco
- Institut de Biotecnologia i de Biomedicina and Department de Genètica i de Microbiologia, Universitat Autònoma de Barcelona, 08193 Bellaterra, Barcelona, Spain
| | - Jaume Bertranpetit
- Institute of Evolutionary Biology (UPF-CSIC), Universitat Pompeu Fabra, Doctor Aiguader 88 (PRBB), 08003 Barcelona, Catalonia, Spain
| | - Hafid Laayouni
- Institute of Evolutionary Biology (UPF-CSIC), Universitat Pompeu Fabra, Doctor Aiguader 88 (PRBB), 08003 Barcelona, Catalonia, Spain
- Bioinformatics Studies, ESCI-UPF, Pg. Pujades 1, 08003 Barcelona, Spain
| | - Antonio Barbadilla
- Institut de Biotecnologia i de Biomedicina and Department de Genètica i de Microbiologia, Universitat Autònoma de Barcelona, 08193 Bellaterra, Barcelona, Spain
- Servei de Genòmica i Bioinformàtica, Universitat Autònoma de Barcelona, 08193 Bellaterra, Barcelona, Spain
- To whom correspondence should be addressed. Sònia Casillas. Tel: +34 93 5868958; Fax: +34 93 5812011; . Correspondence may also be addressed to Antonio Barbadilla.
| |
Collapse
|
9
|
D'Agostino N, Tamburino R, Cantarella C, De Carluccio V, Sannino L, Cozzolino S, Cardi T, Scotti N. The Complete Plastome Sequences of Eleven Capsicum Genotypes: Insights into DNA Variation and Molecular Evolution. Genes (Basel) 2018; 9:E503. [PMID: 30336638 PMCID: PMC6210379 DOI: 10.3390/genes9100503] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2018] [Revised: 10/11/2018] [Accepted: 10/11/2018] [Indexed: 11/16/2022] Open
Abstract
Members of the genus Capsicum are of great economic importance, including both wild forms and cultivars of peppers and chilies. The high number of potentially informative characteristics that can be identified through next-generation sequencing technologies gave a huge boost to evolutionary and comparative genomic research in higher plants. Here, we determined the complete nucleotide sequences of the plastomes of eight Capsicum species (eleven genotypes), representing the three main taxonomic groups in the genus and estimated molecular diversity. Comparative analyses highlighted a wide spectrum of variation, ranging from point mutations to small/medium size insertions/deletions (InDels), with accD, ndhB, rpl20, ycf1, and ycf2 being the most variable genes. The global pattern of sequence variation is consistent with the phylogenetic signal. Maximum-likelihood tree estimation revealed that Capsicum chacoense is sister to the baccatum complex. Divergence and positive selection analyses unveiled that protein-coding genes were generally well conserved, but we identified 25 positive signatures distributed in six genes involved in different essential plastid functions, suggesting positive selection during evolution of Capsicum plastomes. Finally, the identified sequence variation allowed us to develop simple PCR-based markers useful in future work to discriminate species belonging to different Capsicum complexes.
Collapse
Affiliation(s)
- Nunzio D'Agostino
- CREA Research Centre for Vegetable and Ornamental Crops, Via dei Cavalleggeri 25, 84098 Pontecagnano Faiano (SA), Italy.
| | - Rachele Tamburino
- CNR-IBBR, National Research Council of Italy, Institute of Biosciences and BioResources, Via Università 133, 80055 Portici (NA), Italy.
| | - Concita Cantarella
- CREA Research Centre for Vegetable and Ornamental Crops, Via dei Cavalleggeri 25, 84098 Pontecagnano Faiano (SA), Italy.
| | - Valentina De Carluccio
- CREA Research Centre for Vegetable and Ornamental Crops, Via dei Cavalleggeri 25, 84098 Pontecagnano Faiano (SA), Italy.
- Department of Biology, University of Naples Federico II, Via Cinthia, 80126 Naples, Italy.
| | - Lorenza Sannino
- CNR-IBBR, National Research Council of Italy, Institute of Biosciences and BioResources, Via Università 133, 80055 Portici (NA), Italy.
| | - Salvatore Cozzolino
- Department of Biology, University of Naples Federico II, Via Cinthia, 80126 Naples, Italy.
| | - Teodoro Cardi
- CREA Research Centre for Vegetable and Ornamental Crops, Via dei Cavalleggeri 25, 84098 Pontecagnano Faiano (SA), Italy.
| | - Nunzia Scotti
- CNR-IBBR, National Research Council of Italy, Institute of Biosciences and BioResources, Via Università 133, 80055 Portici (NA), Italy.
| |
Collapse
|
10
|
Lee NK, Azizan FL, Wong YS, Omar N. DeepFinder: An integration of feature-based and deep learning approach for DNA motif discovery. BIOTECHNOL BIOTEC EQ 2018. [DOI: 10.1080/13102818.2018.1438209] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022] Open
Affiliation(s)
- Nung Kion Lee
- Department of Cognitive Sciences, Faculty of Cognitive Sciences and Human Development, Universiti Malaysia Sarawak, Kota Samarahan, Sarawak, Malaysia
| | - Farah Liyana Azizan
- Centre For Pre-University Studies, Universiti Malaysia Sarawak, Kota Samarahan, Sarawak, Malaysia
| | - Yu Shiong Wong
- Department of Cognitive Sciences, Faculty of Cognitive Sciences and Human Development, Universiti Malaysia Sarawak, Kota Samarahan, Sarawak, Malaysia
| | - Norshafarina Omar
- Department of Cognitive Sciences, Faculty of Cognitive Sciences and Human Development, Universiti Malaysia Sarawak, Kota Samarahan, Sarawak, Malaysia
| |
Collapse
|
11
|
Jandrasits C, Dabrowski PW, Fuchs S, Renard BY. seq-seq-pan: building a computational pan-genome data structure on whole genome alignment. BMC Genomics 2018; 19:47. [PMID: 29334898 PMCID: PMC5769345 DOI: 10.1186/s12864-017-4401-3] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2017] [Accepted: 12/19/2017] [Indexed: 12/15/2022] Open
Abstract
Background The increasing application of next generation sequencing technologies has led to the availability of thousands of reference genomes, often providing multiple genomes for the same or closely related species. The current approach to represent a species or a population with a single reference sequence and a set of variations cannot represent their full diversity and introduces bias towards the chosen reference. There is a need for the representation of multiple sequences in a composite way that is compatible with existing data sources for annotation and suitable for established sequence analysis methods. At the same time, this representation needs to be easily accessible and extendable to account for the constant change of available genomes. Results We introduce seq-seq-pan, a framework that provides methods for adding or removing new genomes from a set of aligned genomes and uses these to construct a whole genome alignment. Throughout the sequential workflow the alignment is optimized for generating a representative linear presentation of the aligned set of genomes, that enables its usage for annotation and in downstream analyses. Conclusions By providing dynamic updates and optimized processing, our approach enables the usage of whole genome alignment in the field of pan-genomics. In addition, the sequential workflow can be used as a fast alternative to existing whole genome aligners for aligning closely related genomes. seq-seq-pan is freely available at https://gitlab.com/rki_bioinformatics Electronic supplementary material The online version of this article (doi:10.1186/s12864-017-4401-3) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
| | | | - Stephan Fuchs
- Robert Koch Institute, Wernigerode Branch, Burgstraße 37, Wernigerode, 38855, Germany
| | | |
Collapse
|
12
|
Chen F, Dong W, Zhang J, Guo X, Chen J, Wang Z, Lin Z, Tang H, Zhang L. The Sequenced Angiosperm Genomes and Genome Databases. FRONTIERS IN PLANT SCIENCE 2018; 9:418. [PMID: 29706973 PMCID: PMC5909171 DOI: 10.3389/fpls.2018.00418] [Citation(s) in RCA: 82] [Impact Index Per Article: 11.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/11/2017] [Accepted: 03/15/2018] [Indexed: 05/18/2023]
Abstract
Angiosperms, the flowering plants, provide the essential resources for human life, such as food, energy, oxygen, and materials. They also promoted the evolution of human, animals, and the planet earth. Despite the numerous advances in genome reports or sequencing technologies, no review covers all the released angiosperm genomes and the genome databases for data sharing. Based on the rapid advances and innovations in the database reconstruction in the last few years, here we provide a comprehensive review for three major types of angiosperm genome databases, including databases for a single species, for a specific angiosperm clade, and for multiple angiosperm species. The scope, tools, and data of each type of databases and their features are concisely discussed. The genome databases for a single species or a clade of species are especially popular for specific group of researchers, while a timely-updated comprehensive database is more powerful for address of major scientific mysteries at the genome scale. Considering the low coverage of flowering plants in any available database, we propose construction of a comprehensive database to facilitate large-scale comparative studies of angiosperm genomes and to promote the collaborative studies of important questions in plant biology.
Collapse
Affiliation(s)
- Fei Chen
- State Key Laboratory of Ecological Pest Control for Fujian and Taiwan Crops, College of Life Sciences, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Ministry of Education Key Laboratory of Genetics, Breeding and Multiple Utilization of Corps, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Wei Dong
- State Key Laboratory of Ecological Pest Control for Fujian and Taiwan Crops, College of Life Sciences, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Ministry of Education Key Laboratory of Genetics, Breeding and Multiple Utilization of Corps, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Jiawei Zhang
- State Key Laboratory of Ecological Pest Control for Fujian and Taiwan Crops, College of Life Sciences, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Ministry of Education Key Laboratory of Genetics, Breeding and Multiple Utilization of Corps, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Xinyue Guo
- State Key Laboratory of Ecological Pest Control for Fujian and Taiwan Crops, College of Life Sciences, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Ministry of Education Key Laboratory of Genetics, Breeding and Multiple Utilization of Corps, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Junhao Chen
- State Key Laboratory of Subtropical Silviculture, School of Forestry and Biotechnology, Zhejiang Agriculture and Forestry University, Hangzhou, China
| | - Zhengjia Wang
- State Key Laboratory of Subtropical Silviculture, School of Forestry and Biotechnology, Zhejiang Agriculture and Forestry University, Hangzhou, China
| | - Zhenguo Lin
- Department of Biology, Saint Louis University, St. Louis, MO, United States
| | - Haibao Tang
- State Key Laboratory of Ecological Pest Control for Fujian and Taiwan Crops, College of Life Sciences, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Ministry of Education Key Laboratory of Genetics, Breeding and Multiple Utilization of Corps, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Liangsheng Zhang
- State Key Laboratory of Ecological Pest Control for Fujian and Taiwan Crops, College of Life Sciences, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Ministry of Education Key Laboratory of Genetics, Breeding and Multiple Utilization of Corps, Fujian Agriculture and Forestry University, Fuzhou, China
- *Correspondence: Liangsheng Zhang
| |
Collapse
|
13
|
Martínez-Raudales I, De La Cruz-Rodríguez Y, Alvarado-Gutiérrez A, Vega-Arreguín J, Fraire-Mayorga A, Alvarado-Rodríguez M, Balderas-Hernández V, Fraire-Velázquez S. Draft genome sequence of Bacillus velezensis 2A-2B strain: a rhizospheric inhabitant of Sporobolus airoides (Torr.) Torr ., with antifungal activity against root rot causing phytopathogens. Stand Genomic Sci 2017; 12:73. [PMID: 29225729 PMCID: PMC5717847 DOI: 10.1186/s40793-017-0289-4] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2017] [Accepted: 11/24/2017] [Indexed: 12/13/2022] Open
Abstract
A Bacillus velezensis strain from the rhizosphere of Sporobolus airoides (Torr.) Torr., a grass in central-north México, was isolated during a biocontrol of phytopathogens scrutiny study. The 2A-2B strain exhibited at least 60% of growth inhibition of virulent isolates of phytopathogens causing root rot. These phytopathogens include Phytophthora capsici, Fusarium solani, Fusarium oxysporum and Rhizoctonia solani. Furthermore, the 2A-2B strain is an indolacetic acid producer, and a plant inducer of PR1, which is an induced systemic resistance related gene in chili pepper plantlets. Whole genome sequencing was performed to generate a draft genome assembly of 3.953 MB with 46.36% of GC content, and a N50 of 294,737. The genome contains 3713 protein coding genes and 89 RNA genes. Moreover, comparative genome analysis revealed that the 2A-2B strain had the greatest identity (98.4%) with Bacillus velezensis.
Collapse
Affiliation(s)
- Inés Martínez-Raudales
- Laboratorio Biología Integrativa de Plantas y Microorganismos, Unidad Académica de Ciencias Biológicas, Universidad Autónoma de Zacatecas, Av. Preparatoria s/n, Col. Agronómica, Zac. CP, -98067 Zacatecas, Mexico
| | - Yumiko De La Cruz-Rodríguez
- Laboratorio Biología Integrativa de Plantas y Microorganismos, Unidad Académica de Ciencias Biológicas, Universidad Autónoma de Zacatecas, Av. Preparatoria s/n, Col. Agronómica, Zac. CP, -98067 Zacatecas, Mexico
| | - Alejandro Alvarado-Gutiérrez
- Laboratorio Biología Integrativa de Plantas y Microorganismos, Unidad Académica de Ciencias Biológicas, Universidad Autónoma de Zacatecas, Av. Preparatoria s/n, Col. Agronómica, Zac. CP, -98067 Zacatecas, Mexico
| | | | - Ahuitz Fraire-Mayorga
- Laboratorio Biología Integrativa de Plantas y Microorganismos, Unidad Académica de Ciencias Biológicas, Universidad Autónoma de Zacatecas, Av. Preparatoria s/n, Col. Agronómica, Zac. CP, -98067 Zacatecas, Mexico
| | - Miguel Alvarado-Rodríguez
- Laboratorio Biología Integrativa de Plantas y Microorganismos, Unidad Académica de Ciencias Biológicas, Universidad Autónoma de Zacatecas, Av. Preparatoria s/n, Col. Agronómica, Zac. CP, -98067 Zacatecas, Mexico
| | - Victor Balderas-Hernández
- Laboratorio Biología Integrativa de Plantas y Microorganismos, Unidad Académica de Ciencias Biológicas, Universidad Autónoma de Zacatecas, Av. Preparatoria s/n, Col. Agronómica, Zac. CP, -98067 Zacatecas, Mexico
| | - Saúl Fraire-Velázquez
- Laboratorio Biología Integrativa de Plantas y Microorganismos, Unidad Académica de Ciencias Biológicas, Universidad Autónoma de Zacatecas, Av. Preparatoria s/n, Col. Agronómica, Zac. CP, -98067 Zacatecas, Mexico
| |
Collapse
|
14
|
Ivanova Z, Sablok G, Daskalova E, Zahmanova G, Apostolova E, Yahubyan G, Baev V. Chloroplast Genome Analysis of Resurrection Tertiary Relict Haberlea rhodopensis Highlights Genes Important for Desiccation Stress Response. FRONTIERS IN PLANT SCIENCE 2017; 8:204. [PMID: 28265281 PMCID: PMC5316520 DOI: 10.3389/fpls.2017.00204] [Citation(s) in RCA: 80] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/26/2016] [Accepted: 02/03/2017] [Indexed: 05/20/2023]
Abstract
Haberlea rhodopensis is a paleolithic tertiary relict species, best known as a resurrection plant with remarkable tolerance to desiccation. When exposed to severe drought stress, H. rhodopensis shows an ability to maintain the structural integrity of its photosynthetic apparatus, which re-activates easily upon rehydration. We present here the results from the assembly and annotation of the chloroplast (cp) genome of H. rhodopensis, which was further subjected to comparative analysis with the cp genomes of closely related species. H. rhodopensis showed a cp genome size of 153,099 bp, harboring a pair of inverted repeats (IR) of 25,415 bp separated by small and large copy regions (SSC and LSC) of 17,826 and 84,443 bp. The genome structure, gene order, GC content and codon usage are similar to those of the typical angiosperm cp genomes. The genome hosts 137 genes representing 70.66% of the plastome, which includes 86 protein-coding genes, 36 tRNAs, and 4 rRNAs. A comparative plastome analysis with other closely related Lamiales members revealed conserved gene order in the IR and LSC/SSC regions. A phylogenetic analysis based on protein-coding genes from 33 species defines this species as belonging to the Gesneriaceae family. From an evolutionary point of view, a site-specific selection analysis detected positively selected sites in 17 genes, most of which are involved in photosynthesis (e.g., rbcL, ndhF, accD, atpE, etc.). The observed codon substitutions may be interpreted as being a consequence of molecular adaptation to drought stress, which ensures an evolutionary advantage to H. rhodopensis.
Collapse
Affiliation(s)
- Zdravka Ivanova
- Department of Plant Physiology and Molecular Biology, University of PlovdivPlovdiv, Bulgaria
| | - Gaurav Sablok
- Plant Functional Biology and Climate Change Cluster, University of Technology at Sydney, SydneyNSW, Australia
| | - Evelina Daskalova
- Department of Plant Physiology and Molecular Biology, University of PlovdivPlovdiv, Bulgaria
| | - Gergana Zahmanova
- Department of Plant Physiology and Molecular Biology, University of PlovdivPlovdiv, Bulgaria
| | - Elena Apostolova
- Department of Plant Physiology and Molecular Biology, University of PlovdivPlovdiv, Bulgaria
| | - Galina Yahubyan
- Department of Plant Physiology and Molecular Biology, University of PlovdivPlovdiv, Bulgaria
| | - Vesselin Baev
- Department of Plant Physiology and Molecular Biology, University of PlovdivPlovdiv, Bulgaria
- *Correspondence: Vesselin Baev,
| |
Collapse
|
15
|
Herrero J, Muffato M, Beal K, Fitzgerald S, Gordon L, Pignatelli M, Vilella AJ, Searle SMJ, Amode R, Brent S, Spooner W, Kulesha E, Yates A, Flicek P. Ensembl comparative genomics resources. Database (Oxford) 2016; 2016:bav096. [PMID: 26896847 PMCID: PMC4761110 DOI: 10.1093/database/bav096] [Citation(s) in RCA: 211] [Impact Index Per Article: 23.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2015] [Revised: 08/10/2015] [Accepted: 09/04/2015] [Indexed: 01/08/2023]
Abstract
Evolution provides the unifying framework with which to understand biology. The coherent investigation of genic and genomic data often requires comparative genomics analyses based on whole-genome alignments, sets of homologous genes and other relevant datasets in order to evaluate and answer evolutionary-related questions. However, the complexity and computational requirements of producing such data are substantial: this has led to only a small number of reference resources that are used for most comparative analyses. The Ensembl comparative genomics resources are one such reference set that facilitates comprehensive and reproducible analysis of chordate genome data. Ensembl computes pairwise and multiple whole-genome alignments from which large-scale synteny, per-base conservation scores and constrained elements are obtained. Gene alignments are used to define Ensembl Protein Families, GeneTrees and homologies for both protein-coding and non-coding RNA genes. These resources are updated frequently and have a consistent informatics infrastructure and data presentation across all supported species. Specialized web-based visualizations are also available including synteny displays, collapsible gene tree plots, a gene family locator and different alignment views. The Ensembl comparative genomics infrastructure is extensively reused for the analysis of non-vertebrate species by other projects including Ensembl Genomes and Gramene and much of the information here is relevant to these projects. The consistency of the annotation across species and the focus on vertebrates makes Ensembl an ideal system to perform and support vertebrate comparative genomic analyses. We use robust software and pipelines to produce reference comparative data and make it freely available. Database URL: http://www.ensembl.org.
Collapse
Affiliation(s)
- Javier Herrero
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SD
- Bill Lyons Informatics Centre, UCL Cancer Institute, University College London, London WC1E 6DD
| | - Matthieu Muffato
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SD
| | - Kathryn Beal
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SD
| | - Stephen Fitzgerald
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SD
| | - Leo Gordon
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SD
| | - Miguel Pignatelli
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SD
| | - Albert J. Vilella
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SD
| | | | - Ridwan Amode
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SD
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SA
| | - Simon Brent
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SA
| | - William Spooner
- Eagle Genomics Ltd., Babraham Research Campus, Cambridge, CB22 3AT, UK, and
- Cold Spring Harbor Laboratory, 1 Bungtown Road, Cold Spring Harbor, NY 11724, USA
| | - Eugene Kulesha
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SD
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SA
| | - Andrew Yates
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SD
| | - Paul Flicek
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SD
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SA
| |
Collapse
|
16
|
Herrero J, Muffato M, Beal K, Fitzgerald S, Gordon L, Pignatelli M, Vilella AJ, Searle SMJ, Amode R, Brent S, Spooner W, Kulesha E, Yates A, Flicek P. Ensembl comparative genomics resources. Database (Oxford) 2016; 2016:bav096. [PMID: 26896847 PMCID: PMC4761110 DOI: 10.1093/database/bav096 10.1093/database/baw053] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2015] [Revised: 08/10/2015] [Accepted: 09/04/2015] [Indexed: 08/10/2024]
Abstract
Evolution provides the unifying framework with which to understand biology. The coherent investigation of genic and genomic data often requires comparative genomics analyses based on whole-genome alignments, sets of homologous genes and other relevant datasets in order to evaluate and answer evolutionary-related questions. However, the complexity and computational requirements of producing such data are substantial: this has led to only a small number of reference resources that are used for most comparative analyses. The Ensembl comparative genomics resources are one such reference set that facilitates comprehensive and reproducible analysis of chordate genome data. Ensembl computes pairwise and multiple whole-genome alignments from which large-scale synteny, per-base conservation scores and constrained elements are obtained. Gene alignments are used to define Ensembl Protein Families, GeneTrees and homologies for both protein-coding and non-coding RNA genes. These resources are updated frequently and have a consistent informatics infrastructure and data presentation across all supported species. Specialized web-based visualizations are also available including synteny displays, collapsible gene tree plots, a gene family locator and different alignment views. The Ensembl comparative genomics infrastructure is extensively reused for the analysis of non-vertebrate species by other projects including Ensembl Genomes and Gramene and much of the information here is relevant to these projects. The consistency of the annotation across species and the focus on vertebrates makes Ensembl an ideal system to perform and support vertebrate comparative genomic analyses. We use robust software and pipelines to produce reference comparative data and make it freely available. Database URL: http://www.ensembl.org.
Collapse
Affiliation(s)
- Javier Herrero
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SD
- Bill Lyons Informatics Centre, UCL Cancer Institute, University College London, London WC1E 6DD
| | - Matthieu Muffato
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SD
| | - Kathryn Beal
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SD
| | - Stephen Fitzgerald
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SD
| | - Leo Gordon
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SD
| | - Miguel Pignatelli
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SD
| | - Albert J. Vilella
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SD
| | | | - Ridwan Amode
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SD
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SA
| | - Simon Brent
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SA
| | - William Spooner
- Eagle Genomics Ltd., Babraham Research Campus, Cambridge, CB22 3AT, UK, and
- Cold Spring Harbor Laboratory, 1 Bungtown Road, Cold Spring Harbor, NY 11724, USA
| | - Eugene Kulesha
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SD
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SA
| | - Andrew Yates
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SD
| | - Paul Flicek
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SD
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SA
| |
Collapse
|
17
|
Kinetoplastid Phylogenomics Reveals the Evolutionary Innovations Associated with the Origins of Parasitism. Curr Biol 2015; 26:161-172. [PMID: 26725202 PMCID: PMC4728078 DOI: 10.1016/j.cub.2015.11.055] [Citation(s) in RCA: 117] [Impact Index Per Article: 11.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2015] [Revised: 11/17/2015] [Accepted: 11/19/2015] [Indexed: 01/01/2023]
Abstract
The evolution of parasitism is a recurrent event in the history of life and a core problem in evolutionary biology. Trypanosomatids are important parasites and include the human pathogens Trypanosoma brucei, Trypanosoma cruzi, and Leishmania spp., which in humans cause African trypanosomiasis, Chagas disease, and leishmaniasis, respectively. Genome comparison between trypanosomatids reveals that these parasites have evolved specialized cell-surface protein families, overlaid on a well-conserved cell template. Understanding how these features evolved and which ones are specifically associated with parasitism requires comparison with related non-parasites. We have produced genome sequences for Bodo saltans, the closest known non-parasitic relative of trypanosomatids, and a second bodonid, Trypanoplasma borreli. Here we show how genomic reduction and innovation contributed to the character of trypanosomatid genomes. We show that gene loss has “streamlined” trypanosomatid genomes, particularly with respect to macromolecular degradation and ion transport, but consistent with a widespread loss of functional redundancy, while adaptive radiations of gene families involved in membrane function provide the principal innovations in trypanosomatid evolution. Gene gain and loss continued during trypanosomatid diversification, resulting in the asymmetric assortment of ancestral characters such as peptidases between Trypanosoma and Leishmania, genomic differences that were subsequently amplified by lineage-specific innovations after divergence. Finally, we show how species-specific, cell-surface gene families (DGF-1 and PSA) with no apparent structural similarity are independent derivations of a common ancestral form, which we call “bodonin.” This new evidence defines the parasitic innovations of trypanosomatid genomes, revealing how a free-living phagotroph became adapted to exploiting hostile host environments. The Bodo saltans genome reveals evolutionary changes at the origin of parasitism Parasite genomes are streamlined, consistent with a loss of functional redundancy Expanded parasite transporter genes reflect a reorientation of membrane function Non-homologous, parasite cell-surface proteins evolved from a common ancestor
Collapse
|