1
|
Jayakodi M, Lu Q, Pidon H, Rabanus-Wallace MT, Bayer M, Lux T, Guo Y, Jaegle B, Badea A, Bekele W, Brar GS, Braune K, Bunk B, Chalmers KJ, Chapman B, Jørgensen ME, Feng JW, Feser M, Fiebig A, Gundlach H, Guo W, Haberer G, Hansson M, Himmelbach A, Hoffie I, Hoffie RE, Hu H, Isobe S, König P, Kale SM, Kamal N, Keeble-Gagnère G, Keller B, Knauft M, Koppolu R, Krattinger SG, Kumlehn J, Langridge P, Li C, Marone MP, Maurer A, Mayer KFX, Melzer M, Muehlbauer GJ, Murozuka E, Padmarasu S, Perovic D, Pillen K, Pin PA, Pozniak CJ, Ramsay L, Pedas PR, Rutten T, Sakuma S, Sato K, Schüler D, Schmutzer T, Scholz U, Schreiber M, Shirasawa K, Simpson C, Skadhauge B, Spannagl M, Steffenson BJ, Thomsen HC, Tibbits JF, Nielsen MTS, Trautewig C, Vequaud D, Voss C, Wang P, Waugh R, Westcott S, Rasmussen MW, Zhang R, Zhang XQ, Wicker T, Dockter C, Mascher M, Stein N. Structural variation in the pangenome of wild and domesticated barley. Nature 2024; 636:654-662. [PMID: 39537924 PMCID: PMC11655362 DOI: 10.1038/s41586-024-08187-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2023] [Accepted: 10/09/2024] [Indexed: 11/16/2024]
Abstract
Pangenomes are collections of annotated genome sequences of multiple individuals of a species1. The structural variants uncovered by these datasets are a major asset to genetic analysis in crop plants2. Here we report a pangenome of barley comprising long-read sequence assemblies of 76 wild and domesticated genomes and short-read sequence data of 1,315 genotypes. An expanded catalogue of sequence variation in the crop includes structurally complex loci that are rich in gene copy number variation. To demonstrate the utility of the pangenome, we focus on four loci involved in disease resistance, plant architecture, nutrient release and trichome development. Novel allelic variation at a powdery mildew resistance locus and population-specific copy number gains in a regulator of vegetative branching were found. Expansion of a family of starch-cleaving enzymes in elite malting barleys was linked to shifts in enzymatic activity in micro-malting trials. Deletion of an enhancer motif is likely to change the developmental trajectory of the hairy appendages on barley grains. Our findings indicate that allelic diversity at structurally complex loci may have helped crop plants to adapt to new selective regimes in agricultural ecosystems.
Collapse
Affiliation(s)
- Murukarthick Jayakodi
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
- Department of Soil and Crop Sciences, Texas A&M AgriLife Research-Dallas, Dallas, TX, USA
| | - Qiongxian Lu
- Carlsberg Research Laboratory, Copenhagen, Denmark
| | - Hélène Pidon
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
- IPSiM, University of Montpellier, CNRS, INRAE, Institut Agro, Montpellier, France
| | | | | | - Thomas Lux
- PGSB-Plant Genome and Systems Biology, Helmholtz Center Munich-German Research Center for Environmental Health, Neuherberg, Germany
| | - Yu Guo
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
| | - Benjamin Jaegle
- Department of Plant and Microbial Biology, University of Zurich, Zurich, Switzerland
| | - Ana Badea
- Brandon Research and Development Centre, Agriculture et Agri-Food Canada, Brandon, Manitoba, Canada
| | - Wubishet Bekele
- Ottawa Research and Development Centre, Agriculture et Agri-Food Canada, Ottawa, Ontario, Canada
| | - Gurcharn S Brar
- Faculty of Land and Food Systems, The University of British Columbia, Vancouver, British Columbia, Canada
- Faculty of Agricultural, Life and Environmental Sciences (ALES), University of Alberta, Edmonton, Alberta, Canada
| | | | - Boyke Bunk
- DSMZ-German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany
| | - Kenneth J Chalmers
- School of Agriculture, Food and Wine, University of Adelaide, Urrbrae, South Australia, Australia
| | - Brett Chapman
- Western Crop Genetics Alliance, Food Futures Institute/School of Agriculture, Murdoch University, Murdoch, Western Australia, Australia
| | | | - Jia-Wu Feng
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
| | - Manuel Feser
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
| | - Anne Fiebig
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
| | - Heidrun Gundlach
- PGSB-Plant Genome and Systems Biology, Helmholtz Center Munich-German Research Center for Environmental Health, Neuherberg, Germany
| | | | - Georg Haberer
- PGSB-Plant Genome and Systems Biology, Helmholtz Center Munich-German Research Center for Environmental Health, Neuherberg, Germany
| | - Mats Hansson
- Department of Biology, Lund University, Lund, Sweden
| | - Axel Himmelbach
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
| | - Iris Hoffie
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
| | - Robert E Hoffie
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
| | - Haifei Hu
- Western Crop Genetics Alliance, Food Futures Institute/School of Agriculture, Murdoch University, Murdoch, Western Australia, Australia
- Rice Research Institute, Guangdong Academy of Agricultural Sciences, Guangzhou, China
| | | | - Patrick König
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
| | - Sandip M Kale
- Carlsberg Research Laboratory, Copenhagen, Denmark
- Department of Agroecology, Aarhus University, Slagelse, Denmark
| | - Nadia Kamal
- PGSB-Plant Genome and Systems Biology, Helmholtz Center Munich-German Research Center for Environmental Health, Neuherberg, Germany
| | - Gabriel Keeble-Gagnère
- Agriculture Victoria, Department of Jobs, Precincts and Regions, Agribio, La Trobe University, Bundoora, Victoria, Australia
| | - Beat Keller
- Department of Plant and Microbial Biology, University of Zurich, Zurich, Switzerland
| | - Manuela Knauft
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
| | - Ravi Koppolu
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
| | - Simon G Krattinger
- Plant Science Program, Biological and Environmental Science and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia
| | - Jochen Kumlehn
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
| | - Peter Langridge
- School of Agriculture, Food and Wine, University of Adelaide, Urrbrae, South Australia, Australia
| | - Chengdao Li
- Western Crop Genetics Alliance, Food Futures Institute/School of Agriculture, Murdoch University, Murdoch, Western Australia, Australia
- Department of Primary Industry and Regional Development, Government of Western Australia, Perth, Western Australia, Australia
- College of Agriculture, Yangtze University, Jingzhou, China
| | - Marina P Marone
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
| | - Andreas Maurer
- Institute of Agricultural and Nutritional Sciences, Martin Luther University Halle-Wittenberg, Halle, Germany
| | - Klaus F X Mayer
- PGSB-Plant Genome and Systems Biology, Helmholtz Center Munich-German Research Center for Environmental Health, Neuherberg, Germany
- School of Life Sciences Weihenstephan, Technical University Munich, Freising, Germany
| | - Michael Melzer
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
| | - Gary J Muehlbauer
- Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, MN, USA
| | | | - Sudharsan Padmarasu
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
| | - Dragan Perovic
- Institute for Resistance Research and Stress Tolerance, Julius Kuehn-Institute (JKI), Federal Research Centre for Cultivated Plants, Quedlinburg, Germany
| | - Klaus Pillen
- Institute of Agricultural and Nutritional Sciences, Martin Luther University Halle-Wittenberg, Halle, Germany
| | | | - Curtis J Pozniak
- Department of Plant Sciences and Crop Development Centre, University of Saskatchewan, Saskatoon, Saskatchewan, Canada
| | | | | | - Twan Rutten
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
| | - Shun Sakuma
- Faculty of Agriculture, Tottori University, Tottori, Japan
| | - Kazuhiro Sato
- Kazusa DNA Research Institute, Kisarazu, Japan
- Institute of Plant Science and Resources, Okayama University, Kurashiki, Japan
| | - Danuta Schüler
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
| | - Thomas Schmutzer
- Institute of Agricultural and Nutritional Sciences, Martin Luther University Halle-Wittenberg, Halle, Germany
| | - Uwe Scholz
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
| | | | | | | | | | - Manuel Spannagl
- PGSB-Plant Genome and Systems Biology, Helmholtz Center Munich-German Research Center for Environmental Health, Neuherberg, Germany
| | - Brian J Steffenson
- Department of Plant Pathology, University of Minnesota, St. Paul, MN, USA
| | | | - Josquin F Tibbits
- Agriculture Victoria, Department of Jobs, Precincts and Regions, Agribio, La Trobe University, Bundoora, Victoria, Australia
| | | | - Corinna Trautewig
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
| | | | - Cynthia Voss
- Carlsberg Research Laboratory, Copenhagen, Denmark
| | - Penghao Wang
- Western Crop Genetics Alliance, Food Futures Institute/School of Agriculture, Murdoch University, Murdoch, Western Australia, Australia
| | - Robbie Waugh
- The James Hutton Institute, Dundee, UK
- School of Life Sciences, University of Dundee, Dundee, UK
| | - Sharon Westcott
- Western Crop Genetics Alliance, Food Futures Institute/School of Agriculture, Murdoch University, Murdoch, Western Australia, Australia
| | | | | | - Xiao-Qi Zhang
- Western Crop Genetics Alliance, Food Futures Institute/School of Agriculture, Murdoch University, Murdoch, Western Australia, Australia
| | - Thomas Wicker
- Department of Plant and Microbial Biology, University of Zurich, Zurich, Switzerland.
| | | | - Martin Mascher
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany.
- German Centre for Integrative Biodiversity Research (iDiv) Halle-Jena-Leipzig, Leipzig, Germany.
| | - Nils Stein
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany.
- Institute of Agricultural and Nutritional Sciences, Martin Luther University Halle-Wittenberg, Halle, Germany.
| |
Collapse
|
2
|
Zhao Z, Yang Y, Iqbal A, Wu Q, Zhou L. Biological Insights and Recent Advances in Plant Long Non-Coding RNA. Int J Mol Sci 2024; 25:11964. [PMID: 39596034 PMCID: PMC11593582 DOI: 10.3390/ijms252211964] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2024] [Revised: 11/03/2024] [Accepted: 11/04/2024] [Indexed: 11/28/2024] Open
Abstract
Long non-coding RNA (lncRNA) refers to an RNA molecule longer than 200 nucleotides (nt) that plays a significant role in regulating essential molecular and biological processes. It is commonly found in animals, plants, and viruses, and is characterized by features such as epigenetic markers, developmental stage-specific expression, and tissue-specific expression. Research has shown that lncRNA participates in anatomical processes like plant progression, while also playing a crucial role in plant disease resistance and adaptation mechanisms. In this review, we provide a concise overview of the formation mechanism, structural characteristics, and databases related to lncRNA in recent years. We primarily discuss the biological roles of lncRNA in plant progression as well as its involvement in response to biotic and abiotic stresses. Additionally, we examine the current challenges associated with lncRNA and explore its potential application in crop production and breeding. Studying plant lncRNAs is highly significant for multiple reasons: It reveals the regulatory mechanisms of plant growth and development, promotes agricultural production and food security, and drives research in plant genomics and epigenetics. Additionally, it facilitates ecological protection and biodiversity conservation.
Collapse
Affiliation(s)
- Zhihao Zhao
- National Key Laboratory for Tropical Crop Breeding, Chinese Academy of Tropical Agricultural Sciences, Haikou 571101, China; (Z.Z.); (Y.Y.); (Q.W.)
- Hainan Key Laboratory of Tropical Oil Crops Biology/Coconut Research Institute, Chinese Academy of Tropical Agricultural Sciences, Wenchang 571339, China;
- Industrial Development Department, Chinese Academy of Tropical Agricultural Sciences, Haikou 571101, China
| | - Yaodong Yang
- National Key Laboratory for Tropical Crop Breeding, Chinese Academy of Tropical Agricultural Sciences, Haikou 571101, China; (Z.Z.); (Y.Y.); (Q.W.)
- Hainan Key Laboratory of Tropical Oil Crops Biology/Coconut Research Institute, Chinese Academy of Tropical Agricultural Sciences, Wenchang 571339, China;
| | - Amjad Iqbal
- Hainan Key Laboratory of Tropical Oil Crops Biology/Coconut Research Institute, Chinese Academy of Tropical Agricultural Sciences, Wenchang 571339, China;
- Department of Food Science & Technology, Abdul Wali Khan University Mardan, Khyber Pakhtunkhwa 23200, Pakistan
| | - Qiufei Wu
- National Key Laboratory for Tropical Crop Breeding, Chinese Academy of Tropical Agricultural Sciences, Haikou 571101, China; (Z.Z.); (Y.Y.); (Q.W.)
- Hainan Key Laboratory of Tropical Oil Crops Biology/Coconut Research Institute, Chinese Academy of Tropical Agricultural Sciences, Wenchang 571339, China;
| | - Lixia Zhou
- National Key Laboratory for Tropical Crop Breeding, Chinese Academy of Tropical Agricultural Sciences, Haikou 571101, China; (Z.Z.); (Y.Y.); (Q.W.)
- Hainan Key Laboratory of Tropical Oil Crops Biology/Coconut Research Institute, Chinese Academy of Tropical Agricultural Sciences, Wenchang 571339, China;
| |
Collapse
|
3
|
Liu F, Wodajo B, Zhao K, Tang S, Xie Q, Xie P. Unravelling sorghum functional genomics and molecular breeding: past achievements and future prospects. J Genet Genomics 2024:S1673-8527(24)00194-2. [PMID: 39053846 DOI: 10.1016/j.jgg.2024.07.016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2024] [Revised: 07/15/2024] [Accepted: 07/16/2024] [Indexed: 07/27/2024]
Abstract
Sorghum, renowned for its substantial biomass production and remarkable tolerance to various stresses, possesses extensive gene resources and phenotypic variations. A comprehensive understanding of the genetic basis underlying complex agronomic traits is essential for unlocking the potential of sorghum in addressing food and feed security and utilizing marginal lands. In this context, we provide an overview of the major trends in genomic resource studies focusing on key agronomic traits over the past decade, accompanied by a summary of functional genomic platforms. We also delve into the molecular functions and regulatory networks of impactful genes for important agricultural traits. Lastly, we discuss and synthesize the current challenges and prospects for advancing molecular design breeding by gene-editing and polymerization of the excellent alleles, with the aim of accelerating the development of desired sorghum varieties.
Collapse
Affiliation(s)
- Fangyuan Liu
- School of Agriculture and Biotechnology, Sun Yat-sen University, Shenzhen, Guangdong 518107, China
| | - Baye Wodajo
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, The Innovative Academy of Seed Design, Chinese Academy of Sciences, Beijing 100101, China; College of Natural and Computational Science, Woldia University, Woldia, Po.box-400, Ethiopia.
| | - Kangxu Zhao
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, The Innovative Academy of Seed Design, Chinese Academy of Sciences, Beijing 100101, China
| | - Sanyuan Tang
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, The Innovative Academy of Seed Design, Chinese Academy of Sciences, Beijing 100101, China
| | - Qi Xie
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, The Innovative Academy of Seed Design, Chinese Academy of Sciences, Beijing 100101, China
| | - Peng Xie
- School of Agriculture and Biotechnology, Sun Yat-sen University, Shenzhen, Guangdong 518107, China.
| |
Collapse
|
4
|
Li X, Yu S, Cheng Z, Chang X, Yun Y, Jiang M, Chen X, Wen X, Li H, Zhu W, Xu S, Xu Y, Wang X, Zhang C, Wu Q, Hu J, Lin Z, Aury JM, Van de Peer Y, Wang Z, Zhou X, Wang J, Lü P, Zhang L. Origin and evolution of the triploid cultivated banana genome. Nat Genet 2024; 56:136-142. [PMID: 38082204 DOI: 10.1038/s41588-023-01589-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2022] [Accepted: 10/23/2023] [Indexed: 01/14/2024]
Abstract
Most fresh bananas belong to the Cavendish and Gros Michel subgroups. Here, we report chromosome-scale genome assemblies of Cavendish (1.48 Gb) and Gros Michel (1.33 Gb), defining three subgenomes, Ban, Dh and Ze, with Musa acuminata ssp. banksii, malaccensis and zebrina as their major ancestral contributors, respectively. The insertion of repeat sequences in the Fusarium oxysporum f. sp. cubense (Foc) tropical race 4 RGA2 (resistance gene analog 2) promoter was identified in most diploid and triploid bananas. We found that the receptor-like protein (RLP) locus, including Foc race 1-resistant genes, is absent in the Gros Michel Ze subgenome. We identified two NAP (NAC-like, activated by apetala3/pistillata) transcription factor homologs specifically and highly expressed in fruit that directly bind to the promoters of many fruit ripening genes and may be key regulators of fruit ripening. Our genome data should facilitate the breeding and super-domestication of bananas.
Collapse
Affiliation(s)
- Xiuxiu Li
- State Key Laboratory of Ecological Pest Control for Fujian and Taiwan Crops, Haixia Institute of Science and Technology, College of Horticulture, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Sheng Yu
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Zhihao Cheng
- Haikou Experimental Station, National Key Laboratory for Tropical Crop Breeding, Chinese Academy of Tropical Agricultural Sciences, Haikou, China
| | - Xiaojun Chang
- Laboratory of Medicinal Plant Biotechnology, School of Pharmaceutical Sciences, Zhejiang Chinese Medical University, Hangzhou, China
| | - Yingzi Yun
- State Key Laboratory of Ecological Pest Control for Fujian and Taiwan Crops, Haixia Institute of Science and Technology, College of Horticulture, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Mengwei Jiang
- State Key Laboratory of Ecological Pest Control for Fujian and Taiwan Crops, Haixia Institute of Science and Technology, College of Horticulture, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Xuequn Chen
- State Key Laboratory of Ecological Pest Control for Fujian and Taiwan Crops, Haixia Institute of Science and Technology, College of Horticulture, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Xiaohui Wen
- Zhejiang Provincial Key Laboratory of Horticultural Plant Integrative Biology, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou, China
- Hainan Institute of Zhejiang University, Sanya, China
| | - Hua Li
- State Key Laboratory of Ecological Pest Control for Fujian and Taiwan Crops, Haixia Institute of Science and Technology, College of Horticulture, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Wenjun Zhu
- State Key Laboratory of Ecological Pest Control for Fujian and Taiwan Crops, Haixia Institute of Science and Technology, College of Horticulture, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Shiyao Xu
- State Key Laboratory of Ecological Pest Control for Fujian and Taiwan Crops, Haixia Institute of Science and Technology, College of Horticulture, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Yanbing Xu
- State Key Laboratory of Ecological Pest Control for Fujian and Taiwan Crops, Haixia Institute of Science and Technology, College of Horticulture, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Xianjun Wang
- State Key Laboratory of Ecological Pest Control for Fujian and Taiwan Crops, Haixia Institute of Science and Technology, College of Horticulture, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Chen Zhang
- State Key Laboratory of Ecological Pest Control for Fujian and Taiwan Crops, Haixia Institute of Science and Technology, College of Horticulture, Fujian Agriculture and Forestry University, Fuzhou, China
- Fuzhou Institute of Oceanography, Minjiang University, Fuzhou, China
| | - Qiong Wu
- Haikou Experimental Station, National Key Laboratory for Tropical Crop Breeding, Chinese Academy of Tropical Agricultural Sciences, Haikou, China
| | - Jin Hu
- Zhejiang Provincial Key Laboratory of Horticultural Plant Integrative Biology, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou, China
- Hainan Institute of Zhejiang University, Sanya, China
| | - Zhenguo Lin
- Department of Biology, Saint Louis University, St. Louis, MO, USA
| | - Jean-Marc Aury
- Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ Evry, Université Paris-Saclay, Evry, France
| | - Yves Van de Peer
- Department of Plant Biotechnology and Bioinformatics, Ghent University and VIB Center for Plant Systems Biology, Ghent, Belgium.
- Centre for Microbial Ecology and Genomics, Department of Biochemistry, Genetics and Microbiology, University of Pretoria, Pretoria, South Africa.
- College of Horticulture, Academy for Advanced Interdisciplinary Studies, Nanjing Agricultural University, Nanjing, China.
| | - Zonghua Wang
- State Key Laboratory of Ecological Pest Control for Fujian and Taiwan Crops, Haixia Institute of Science and Technology, College of Horticulture, Fujian Agriculture and Forestry University, Fuzhou, China.
- Fuzhou Institute of Oceanography, Minjiang University, Fuzhou, China.
| | - Xiaofan Zhou
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangdong Laboratory for Lingnan Modern Agriculture, Guangdong Province Key Laboratory of Microbial Signals and Disease Control, Integrative Microbiology Research Center, South China Agricultural University, Guangzhou, China.
| | | | - Peitao Lü
- State Key Laboratory of Ecological Pest Control for Fujian and Taiwan Crops, Haixia Institute of Science and Technology, College of Horticulture, Fujian Agriculture and Forestry University, Fuzhou, China.
| | - Liangsheng Zhang
- Zhejiang Provincial Key Laboratory of Horticultural Plant Integrative Biology, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou, China.
- Hainan Institute of Zhejiang University, Sanya, China.
| |
Collapse
|
5
|
Wafula EK, Zhang H, Von Kuster G, Leebens-Mack JH, Honaas LA, dePamphilis CW. PlantTribes2: Tools for comparative gene family analysis in plant genomics. FRONTIERS IN PLANT SCIENCE 2023; 13:1011199. [PMID: 36798801 PMCID: PMC9928214 DOI: 10.3389/fpls.2022.1011199] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/03/2022] [Accepted: 12/02/2022] [Indexed: 05/12/2023]
Abstract
Plant genome-scale resources are being generated at an increasing rate as sequencing technologies continue to improve and raw data costs continue to fall; however, the cost of downstream analyses remains large. This has resulted in a considerable range of genome assembly and annotation qualities across plant genomes due to their varying sizes, complexity, and the technology used for the assembly and annotation. To effectively work across genomes, researchers increasingly rely on comparative genomic approaches that integrate across plant community resources and data types. Such efforts have aided the genome annotation process and yielded novel insights into the evolutionary history of genomes and gene families, including complex non-model organisms. The essential tools to achieve these insights rely on gene family analysis at a genome-scale, but they are not well integrated for rapid analysis of new data, and the learning curve can be steep. Here we present PlantTribes2, a scalable, easily accessible, highly customizable, and broadly applicable gene family analysis framework with multiple entry points including user provided data. It uses objective classifications of annotated protein sequences from existing, high-quality plant genomes for comparative and evolutionary studies. PlantTribes2 can improve transcript models and then sort them, either genome-scale annotations or individual gene coding sequences, into pre-computed orthologous gene family clusters with rich functional annotation information. Then, for gene families of interest, PlantTribes2 performs downstream analyses and customizable visualizations including, (1) multiple sequence alignment, (2) gene family phylogeny, (3) estimation of synonymous and non-synonymous substitution rates among homologous sequences, and (4) inference of large-scale duplication events. We give examples of PlantTribes2 applications in functional genomic studies of economically important plant families, namely transcriptomics in the weedy Orobanchaceae and a core orthogroup analysis (CROG) in Rosaceae. PlantTribes2 is freely available for use within the main public Galaxy instance and can be downloaded from GitHub or Bioconda. Importantly, PlantTribes2 can be readily adapted for use with genomic and transcriptomic data from any kind of organism.
Collapse
Affiliation(s)
- Eric K Wafula
- Department of Biology, The Pennsylvania State University, University Park, PA, United States
| | - Huiting Zhang
- Tree Fruit Research Laboratory, United States Department of Agriculture (USDA), Agricultural Research Service (ARS), Wenatchee, WA, United States
- Department of Horticulture, Washington State University, Pullman, WA, United States
| | - Gregory Von Kuster
- Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA, United States
| | | | - Loren A Honaas
- Tree Fruit Research Laboratory, United States Department of Agriculture (USDA), Agricultural Research Service (ARS), Wenatchee, WA, United States
| | - Claude W dePamphilis
- Department of Biology, The Pennsylvania State University, University Park, PA, United States
- Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA, United States
| |
Collapse
|
6
|
Orozco-Arias S, Humberto Lopez-Murillo L, Candamil-Cortés MS, Arias M, Jaimes PA, Rossi Paschoal A, Tabares-Soto R, Isaza G, Guyot R. Inpactor2: a software based on deep learning to identify and classify LTR-retrotransposons in plant genomes. Brief Bioinform 2022; 24:6887110. [PMID: 36502372 PMCID: PMC9851300 DOI: 10.1093/bib/bbac511] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2022] [Revised: 10/13/2022] [Accepted: 10/26/2022] [Indexed: 12/14/2022] Open
Abstract
LTR-retrotransposons are the most abundant repeat sequences in plant genomes and play an important role in evolution and biodiversity. Their characterization is of great importance to understand their dynamics. However, the identification and classification of these elements remains a challenge today. Moreover, current software can be relatively slow (from hours to days), sometimes involve a lot of manual work and do not reach satisfactory levels in terms of precision and sensitivity. Here we present Inpactor2, an accurate and fast application that creates LTR-retrotransposon reference libraries in a very short time. Inpactor2 takes an assembled genome as input and follows a hybrid approach (deep learning and structure-based) to detect elements, filter partial sequences and finally classify intact sequences into superfamilies and, as very few tools do, into lineages. This tool takes advantage of multi-core and GPU architectures to decrease execution times. Using the rice genome, Inpactor2 showed a run time of 5 minutes (faster than other tools) and has the best accuracy and F1-Score of the tools tested here, also having the second best accuracy and specificity only surpassed by EDTA, but achieving 28% higher sensitivity. For large genomes, Inpactor2 is up to seven times faster than other available bioinformatics tools.
Collapse
Affiliation(s)
- Simon Orozco-Arias
- Corresponding authors. Simon Orozco-Arias, Computer Science Department, Universidad Autónoma de Manizales, Antigua Estación del Ferrocarrill, Manizalez, Colombia. Tel.: +57(606)8727272 - 8727709 Ext 102; E-mail: ; Alexandre Rossi Paschoal, Department of Computer Science, Bioinformatics and Pattern Recognition Group, Graduation Program in Bioinformatics, Federal University of Technology - Paraná, UTFPR, Cornélio Procópio, Paraná, 86300-000, Brazil. Tel.: +433133-3790; E-mail: ; Gustavo Isaza, Systems and Informatics Department, Center for Technology Development - Bioprocess and Agro-industry Plant, Universidad de Caldas, St 65 #26-10, Manizales, Colombia. Tel.: +57(606)8781500 ext 13146; E-mail: , Romain Guyot, IRD, 911 Av. Agropolis, 34394 Montpellier, France. Tel.: +334674160000; E-mail:
| | | | | | - Maradey Arias
- Department of Computer Science, Universidad Autónoma de Manizales, 170001, Caldas, Colombia
| | - Paula A Jaimes
- Department of Computer Science, Universidad Autónoma de Manizales, 170001, Caldas, Colombia
| | - Alexandre Rossi Paschoal
- Corresponding authors. Simon Orozco-Arias, Computer Science Department, Universidad Autónoma de Manizales, Antigua Estación del Ferrocarrill, Manizalez, Colombia. Tel.: +57(606)8727272 - 8727709 Ext 102; E-mail: ; Alexandre Rossi Paschoal, Department of Computer Science, Bioinformatics and Pattern Recognition Group, Graduation Program in Bioinformatics, Federal University of Technology - Paraná, UTFPR, Cornélio Procópio, Paraná, 86300-000, Brazil. Tel.: +433133-3790; E-mail: ; Gustavo Isaza, Systems and Informatics Department, Center for Technology Development - Bioprocess and Agro-industry Plant, Universidad de Caldas, St 65 #26-10, Manizales, Colombia. Tel.: +57(606)8781500 ext 13146; E-mail: , Romain Guyot, IRD, 911 Av. Agropolis, 34394 Montpellier, France. Tel.: +334674160000; E-mail:
| | - Reinel Tabares-Soto
- Department of Electronics and Automation, Universidad Autónoma de Manizales, 170001, Caldas, Colombia
| | - Gustavo Isaza
- Corresponding authors. Simon Orozco-Arias, Computer Science Department, Universidad Autónoma de Manizales, Antigua Estación del Ferrocarrill, Manizalez, Colombia. Tel.: +57(606)8727272 - 8727709 Ext 102; E-mail: ; Alexandre Rossi Paschoal, Department of Computer Science, Bioinformatics and Pattern Recognition Group, Graduation Program in Bioinformatics, Federal University of Technology - Paraná, UTFPR, Cornélio Procópio, Paraná, 86300-000, Brazil. Tel.: +433133-3790; E-mail: ; Gustavo Isaza, Systems and Informatics Department, Center for Technology Development - Bioprocess and Agro-industry Plant, Universidad de Caldas, St 65 #26-10, Manizales, Colombia. Tel.: +57(606)8781500 ext 13146; E-mail: , Romain Guyot, IRD, 911 Av. Agropolis, 34394 Montpellier, France. Tel.: +334674160000; E-mail:
| | - Romain Guyot
- Corresponding authors. Simon Orozco-Arias, Computer Science Department, Universidad Autónoma de Manizales, Antigua Estación del Ferrocarrill, Manizalez, Colombia. Tel.: +57(606)8727272 - 8727709 Ext 102; E-mail: ; Alexandre Rossi Paschoal, Department of Computer Science, Bioinformatics and Pattern Recognition Group, Graduation Program in Bioinformatics, Federal University of Technology - Paraná, UTFPR, Cornélio Procópio, Paraná, 86300-000, Brazil. Tel.: +433133-3790; E-mail: ; Gustavo Isaza, Systems and Informatics Department, Center for Technology Development - Bioprocess and Agro-industry Plant, Universidad de Caldas, St 65 #26-10, Manizales, Colombia. Tel.: +57(606)8781500 ext 13146; E-mail: , Romain Guyot, IRD, 911 Av. Agropolis, 34394 Montpellier, France. Tel.: +334674160000; E-mail:
| |
Collapse
|
7
|
Han X, Zhang J, Han S, Chong SL, Meng G, Song M, Wang Y, Zhou S, Liu C, Lou L, Lou X, Cheng L, Lin E, Huang H, Yang Q, Tong Z. The chromosome-scale genome of Phoebe bournei reveals contrasting fates of terpene synthase (TPS)-a and TPS-b subfamilies. PLANT COMMUNICATIONS 2022; 3:100410. [PMID: 35841151 PMCID: PMC9700126 DOI: 10.1016/j.xplc.2022.100410] [Citation(s) in RCA: 33] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/16/2022] [Revised: 07/05/2022] [Accepted: 07/08/2022] [Indexed: 05/15/2023]
Abstract
Terpenoids, including aromatic volatile monoterpenoids and sesquiterpenoids, function in defense against pathogens and herbivores. Phoebe trees are remarkable for their scented wood and decay resistance. Unlike other Lauraceae species investigated to date, Phoebe species predominantly accumulate sesquiterpenoids instead of monoterpenoids. Limited genomic data restrict the elucidation of terpenoid variation and functions. Here, we present a chromosome-scale genome assembly of a Lauraceae tree, Phoebe bournei, and identify 72 full-length terpene synthase (TPS) genes. Genome-level comparison shows pervasive lineage-specific duplication and contraction of TPS subfamilies, which have contributed to the extreme terpenoid variation within Lauraceae species. Although the TPS-a and TPS-b subfamilies were both expanded via tandem duplication in P. bournei, more TPS-a copies were retained and constitutively expressed, whereas more TPS-b copies were lost. The TPS-a genes on chromosome 8 functionally diverged to synthesize eight highly accumulated sesquiterpenes in P. bournei. The essential oil of P. bournei and its main component, β-caryophyllene, exhibited antifungal activities against the three most widespread canker pathogens of trees. The TPS-a and TPS-b subfamilies have experienced contrasting fates over the evolution of P. bournei. The abundant sesquiterpenoids produced by TPS-a proteins contribute to the excellent pathogen resistance of P. bournei trees. Overall, this study sheds light on the evolution and adaptation of terpenoids in Lauraceae and provides valuable resources for boosting plant immunity against pathogens in various trees and crops.
Collapse
Affiliation(s)
- Xiao Han
- State Key Laboratory of Subtropical Silviculture, College of Forestry and Biotechnology, Zhejiang A&F University, Hangzhou, Zhejiang 311300, China
| | - Junhong Zhang
- State Key Laboratory of Subtropical Silviculture, College of Forestry and Biotechnology, Zhejiang A&F University, Hangzhou, Zhejiang 311300, China
| | - Shuang Han
- State Key Laboratory of Subtropical Silviculture, College of Forestry and Biotechnology, Zhejiang A&F University, Hangzhou, Zhejiang 311300, China
| | - Sun Li Chong
- State Key Laboratory of Subtropical Silviculture, College of Forestry and Biotechnology, Zhejiang A&F University, Hangzhou, Zhejiang 311300, China
| | | | - Minyan Song
- State Key Laboratory of Subtropical Silviculture, College of Forestry and Biotechnology, Zhejiang A&F University, Hangzhou, Zhejiang 311300, China
| | - Yang Wang
- State Key Laboratory of Subtropical Silviculture, College of Forestry and Biotechnology, Zhejiang A&F University, Hangzhou, Zhejiang 311300, China
| | - Shengcai Zhou
- Experimental Forest Farm of Qingyuan County, Qingyuan, Zhejiang 323800, China
| | - Chengcheng Liu
- State Key Laboratory of Subtropical Silviculture, College of Forestry and Biotechnology, Zhejiang A&F University, Hangzhou, Zhejiang 311300, China
| | - Luhuan Lou
- State Key Laboratory of Subtropical Silviculture, College of Forestry and Biotechnology, Zhejiang A&F University, Hangzhou, Zhejiang 311300, China
| | - Xiongzhen Lou
- State Key Laboratory of Subtropical Silviculture, College of Forestry and Biotechnology, Zhejiang A&F University, Hangzhou, Zhejiang 311300, China
| | - Longjun Cheng
- State Key Laboratory of Subtropical Silviculture, College of Forestry and Biotechnology, Zhejiang A&F University, Hangzhou, Zhejiang 311300, China
| | - Erpei Lin
- State Key Laboratory of Subtropical Silviculture, College of Forestry and Biotechnology, Zhejiang A&F University, Hangzhou, Zhejiang 311300, China
| | - Huahong Huang
- State Key Laboratory of Subtropical Silviculture, College of Forestry and Biotechnology, Zhejiang A&F University, Hangzhou, Zhejiang 311300, China
| | - Qi Yang
- State Key Laboratory of Subtropical Silviculture, College of Forestry and Biotechnology, Zhejiang A&F University, Hangzhou, Zhejiang 311300, China.
| | - Zaikang Tong
- State Key Laboratory of Subtropical Silviculture, College of Forestry and Biotechnology, Zhejiang A&F University, Hangzhou, Zhejiang 311300, China.
| |
Collapse
|
8
|
Abdullah-Zawawi MR, Govender N, Harun S, Muhammad NAN, Zainal Z, Mohamed-Hussein ZA. Multi-Omics Approaches and Resources for Systems-Level Gene Function Prediction in the Plant Kingdom. PLANTS (BASEL, SWITZERLAND) 2022; 11:2614. [PMID: 36235479 PMCID: PMC9573505 DOI: 10.3390/plants11192614] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/29/2022] [Revised: 09/05/2022] [Accepted: 09/13/2022] [Indexed: 06/16/2023]
Abstract
In higher plants, the complexity of a system and the components within and among species are rapidly dissected by omics technologies. Multi-omics datasets are integrated to infer and enable a comprehensive understanding of the life processes of organisms of interest. Further, growing open-source datasets coupled with the emergence of high-performance computing and development of computational tools for biological sciences have assisted in silico functional prediction of unknown genes, proteins and metabolites, otherwise known as uncharacterized. The systems biology approach includes data collection and filtration, system modelling, experimentation and the establishment of new hypotheses for experimental validation. Informatics technologies add meaningful sense to the output generated by complex bioinformatics algorithms, which are now freely available in a user-friendly graphical user interface. These resources accentuate gene function prediction at a relatively minimal cost and effort. Herein, we present a comprehensive view of relevant approaches available for system-level gene function prediction in the plant kingdom. Together, the most recent applications and sought-after principles for gene mining are discussed to benefit the plant research community. A realistic tabulation of plant genomic resources is included for a less laborious and accurate candidate gene discovery in basic plant research and improvement strategies.
Collapse
Affiliation(s)
- Muhammad-Redha Abdullah-Zawawi
- UKM Medical Molecular Biology Institute (UMBI), Universiti Kebangsaan Malaysia, Kuala Lumpur 56000, Malaysia
- Institute of System Biology (INBIOSIS), Universiti Kebangsaan Malaysia (UKM), Bangi 43600, Malaysia
| | - Nisha Govender
- Institute of System Biology (INBIOSIS), Universiti Kebangsaan Malaysia (UKM), Bangi 43600, Malaysia
| | - Sarahani Harun
- Institute of System Biology (INBIOSIS), Universiti Kebangsaan Malaysia (UKM), Bangi 43600, Malaysia
| | - Nor Azlan Nor Muhammad
- Institute of System Biology (INBIOSIS), Universiti Kebangsaan Malaysia (UKM), Bangi 43600, Malaysia
| | - Zamri Zainal
- Institute of System Biology (INBIOSIS), Universiti Kebangsaan Malaysia (UKM), Bangi 43600, Malaysia
- Faculty of Science and Technology, Universiti Kebangsaan Malaysia (UKM), Bangi 43600, Malaysia
| | - Zeti-Azura Mohamed-Hussein
- Institute of System Biology (INBIOSIS), Universiti Kebangsaan Malaysia (UKM), Bangi 43600, Malaysia
- Faculty of Science and Technology, Universiti Kebangsaan Malaysia (UKM), Bangi 43600, Malaysia
| |
Collapse
|
9
|
Zangishei Z, Annacondia ML, Gundlach H, Didriksen A, Bruckmüller J, Salari H, Krause K, Martinez G. Parasitic plant small RNA analyses unveil parasite-specific signatures of microRNA retention, loss, and gain. PLANT PHYSIOLOGY 2022; 190:1242-1259. [PMID: 35861439 PMCID: PMC9516757 DOI: 10.1093/plphys/kiac331] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/24/2022] [Accepted: 06/12/2022] [Indexed: 05/29/2023]
Abstract
Parasitism is a successful life strategy that has evolved independently in several families of vascular plants. The genera Cuscuta and Orobanche represent examples of the two profoundly different groups of parasites: one parasitizing host shoots and the other infecting host roots. In this study, we sequenced and described the overall repertoire of small RNAs from Cuscuta campestris and Orobanche aegyptiaca. We showed that C. campestris contains a number of novel microRNAs (miRNAs) in addition to a conspicuous retention of miRNAs that are typically lacking in other Solanales, while several typically conserved miRNAs seem to have become obsolete in the parasite. One new miRNA appears to be derived from a horizontal gene transfer event. The exploratory analysis of the miRNA population (exploratory due to the absence of a full genomic sequence for reference) from the root parasitic O. aegyptiaca also revealed a loss of a number of miRNAs compared to photosynthetic species from the same order. In summary, our study shows partly similar evolutionary signatures in the RNA silencing machinery in both parasites. Our data bear proof for the dynamism of this regulatory mechanism in parasitic plants.
Collapse
Affiliation(s)
| | | | - Heidrun Gundlach
- Helmholtz Zentrum München (HMGU), Plant Genome and Systems Biology (PGSB), Neuherberg 85764, Germany
| | - Alena Didriksen
- Department of Arctic and Marine Biology, Faculty of Biosciences, Fisheries and Economics, UiT The Arctic University of Norway, Tromsø 9019, Norway
| | | | - Hooman Salari
- Department of Production Engineering and Plant Genetics, Faculty of Science and Agricultural Engineering, Razi University, Kermanshah 67155, Iran
| | | | | |
Collapse
|
10
|
Orozco-Arias S, Candamil-Cortes MS, Jaimes PA, Valencia-Castrillon E, Tabares-Soto R, Isaza G, Guyot R. Automatic curation of LTR retrotransposon libraries from plant genomes through machine learning. J Integr Bioinform 2022; 19:jib-2021-0036. [PMID: 35822734 PMCID: PMC9521825 DOI: 10.1515/jib-2021-0036] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2021] [Accepted: 06/10/2022] [Indexed: 11/19/2022] Open
Abstract
Transposable elements are mobile sequences that can move and insert themselves into chromosomes, activating under internal or external stimuli, giving the organism the ability to adapt to the environment. Annotating transposable elements in genomic data is currently considered a crucial task to understand key aspects of organisms such as phenotype variability, species evolution, and genome size, among others. Because of the way they replicate, LTR retrotransposons are the most common transposable elements in plants, accounting in some cases for up to 80% of all DNA information. To annotate these elements, a reference library is usually created, a curation process is performed, eliminating TE fragments and false positives and then annotated in the genome using the homology method. However, the curation process can take weeks, requires extensive manual work and the execution of multiple time-consuming bioinformatics software. Here, we propose a machine learning-based approach to perform this process automatically on plant genomes, obtaining up to 91.18% F1-score. This approach was tested with four plant species, obtaining up to 93.6% F1-score (Oryza granulata) in only 22.61 s, where bioinformatics methods took approximately 6 h. This acceleration demonstrates that the ML-based approach is efficient and could be used in massive sequencing projects.
Collapse
Affiliation(s)
- Simon Orozco-Arias
- Department of Computer Science, Universidad Autónoma de Manizales, Manizales, Colombia.,Department of Systems and Informatics, Universidad de Caldas, Manizales, Colombia
| | | | - Paula A Jaimes
- Department of Computer Science, Universidad Autónoma de Manizales, Manizales, Colombia
| | | | - Reinel Tabares-Soto
- Department of Electronics and Automation, Universidad Autónoma de Manizales, Manizales, Colombia
| | - Gustavo Isaza
- Department of Systems and Informatics, Universidad de Caldas, Manizales, Colombia
| | - Romain Guyot
- Department of Electronics and Automation, Universidad Autónoma de Manizales, Manizales, Colombia.,Institut de Recherche pour le Développement, CIRAD, Univ. Montpellier, Montpellier, France
| |
Collapse
|
11
|
Nagy I, Veeckman E, Liu C, Bel MV, Vandepoele K, Jensen CS, Ruttink T, Asp T. Chromosome-scale assembly and annotation of the perennial ryegrass genome. BMC Genomics 2022; 23:505. [PMID: 35831814 PMCID: PMC9281035 DOI: 10.1186/s12864-022-08697-0] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2021] [Accepted: 06/14/2022] [Indexed: 12/29/2022] Open
Abstract
BACKGROUND The availability of chromosome-scale genome assemblies is fundamentally important to advance genetics and breeding in crops, as well as for evolutionary and comparative genomics. The improvement of long-read sequencing technologies and the advent of optical mapping and chromosome conformation capture technologies in the last few years, significantly promoted the development of chromosome-scale genome assemblies of model plants and crop species. In grasses, chromosome-scale genome assemblies recently became available for cultivated and wild species of the Triticeae subfamily. Development of state-of-the-art genomic resources in species of the Poeae subfamily, which includes important crops like fescues and ryegrasses, is lagging behind the progress in the cereal species. RESULTS Here, we report a new chromosome-scale genome sequence assembly for perennial ryegrass, obtained by combining PacBio long-read sequencing, Illumina short-read polishing, BioNano optical mapping and Hi-C scaffolding. More than 90% of the total genome size of perennial ryegrass (approximately 2.55 Gb) is covered by seven pseudo-chromosomes that show high levels of collinearity to the orthologous chromosomes of Triticeae species. The transposon fraction of perennial ryegrass was found to be relatively low, approximately 35% of the total genome content, which is less than half of the genome repeat content of cultivated cereal species. We predicted 54,629 high-confidence gene models, 10,287 long non-coding RNAs and a total of 8,393 short non-coding RNAs in the perennial ryegrass genome. CONCLUSIONS The new reference genome sequence and annotation presented here are valuable resources for comparative genomic studies in grasses, as well as for breeding applications and will expedite the development of productive varieties in perennial ryegrass and related species.
Collapse
Affiliation(s)
- Istvan Nagy
- Center for Quantitative Genetics and Genomics, Aarhus University, Forsøgsvej 1, Slagelse, DK-4200 Denmark
| | - Elisabeth Veeckman
- Flanders Research Institute for Agriculture, Fisheries and Food (ILVO), Plant Sciences Unit, Caritasstraat 39, Melle, B-9090 Belgium
- Bioinformatics Institute Ghent, Ghent University, Technologiepark 71, Ghent, B-9052 Belgium
- Present address: DLF Seeds A/S, Denmark, Højerupvej 31, Store Heddinge, DK-4660 Denmark
| | - Chang Liu
- Zentrum für Molekularbiologie der Pflanzen (ZMBP), Eberhard Karls Universität, Auf der Morgenstelle 32, Tübingen, 72076 Germany
- Present address: Institut für Biologie, Universität Hohenheim, Garbenstr. 30, Stuttgart, 70599 Germany
| | - Michiel Van Bel
- Bioinformatics Institute Ghent, Ghent University, Technologiepark 71, Ghent, B-9052 Belgium
- VIB Center for Plant Systems Biology, Technologiepark 71, Ghent, B-9052 Belgium
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 71, Ghent, B-9052 Belgium
| | - Klaas Vandepoele
- Bioinformatics Institute Ghent, Ghent University, Technologiepark 71, Ghent, B-9052 Belgium
- VIB Center for Plant Systems Biology, Technologiepark 71, Ghent, B-9052 Belgium
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 71, Ghent, B-9052 Belgium
| | | | - Tom Ruttink
- Flanders Research Institute for Agriculture, Fisheries and Food (ILVO), Plant Sciences Unit, Caritasstraat 39, Melle, B-9090 Belgium
| | - Torben Asp
- Center for Quantitative Genetics and Genomics, Aarhus University, Forsøgsvej 1, Slagelse, DK-4200 Denmark
| |
Collapse
|
12
|
Li LF, Zhang ZB, Wang ZH, Li N, Sha Y, Wang XF, Ding N, Li Y, Zhao J, Wu Y, Gong L, Mafessoni F, Levy AA, Liu B. Genome sequences of five Sitopsis species of Aegilops and the origin of polyploid wheat B subgenome. MOLECULAR PLANT 2022; 15:488-503. [PMID: 34979290 DOI: 10.1016/j.molp.2021.12.019] [Citation(s) in RCA: 82] [Impact Index Per Article: 27.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/05/2021] [Revised: 12/11/2021] [Accepted: 12/28/2021] [Indexed: 05/23/2023]
Abstract
Common wheat (Triticum aestivum, BBAADD) is a major staple food crop worldwide. The diploid progenitors of the A and D subgenomes have been unequivocally identified; that of B, however, remains ambiguous and controversial but is suspected to be related to species of Aegilops, section Sitopsis. Here, we report the assembly of chromosome-level genome sequences of all five Sitopsis species, namely Aegilops bicornis, Ae. longissima, Ae. searsii, Ae. sharonensis, and Ae. speltoides, as well as the partial assembly of the Amblyopyrum muticum (synonym Aegilops mutica) genome for phylogenetic analysis. Our results reveal that the donor of the common wheat B subgenome is a distinct, and most probably extinct, diploid species that diverged from an ancestral progenitor of the B lineage to which the still extant Ae. speltoides and Am. muticum belong. In addition, we identified interspecific genetic introgressions throughout the evolution of the Triticum/Aegilops species complex. The five Sitopsis species have various assembled genome sizes (4.11-5.89 Gb) with high proportions of repetitive sequences (85.99%-89.81%); nonetheless, they retain high collinearity with other genomes or subgenomes of species in the Triticum/Aegilops complex. Differences in genome size were primarily due to independent post-speciation amplification of transposons. We also identified a set of Sitopsis genes pertinent to important agronomic traits that can be harnessed for wheat breeding. These newly assembled genome resources provide a new roadmap for evolutionary and genetic studies of the Triticum/Aegilops complex, as well as for wheat improvement.
Collapse
Affiliation(s)
- Lin-Feng Li
- Key Laboratory of Molecular Epigenetics of the Ministry of Education (MOE), Northeast Normal University, Changchun 130024, China; Ministry of Education Key Laboratory for Biodiversity Science and Ecological Engineering, School of Life Sciences, Fudan University, Shanghai 200438, China.
| | - Zhi-Bin Zhang
- Key Laboratory of Molecular Epigenetics of the Ministry of Education (MOE), Northeast Normal University, Changchun 130024, China; Department of Plant and Environmental Sciences, The Weizmann Institute of Science, 76100 Rehovot, Israel
| | - Zhen-Hui Wang
- Faculty of Agronomy, Jilin Agricultural University, Changchun 130118, China
| | - Ning Li
- Key Laboratory of Molecular Epigenetics of the Ministry of Education (MOE), Northeast Normal University, Changchun 130024, China
| | - Yan Sha
- Key Laboratory of Molecular Epigenetics of the Ministry of Education (MOE), Northeast Normal University, Changchun 130024, China
| | - Xin-Feng Wang
- Ministry of Education Key Laboratory for Biodiversity Science and Ecological Engineering, School of Life Sciences, Fudan University, Shanghai 200438, China
| | - Ning Ding
- Ministry of Education Key Laboratory for Biodiversity Science and Ecological Engineering, School of Life Sciences, Fudan University, Shanghai 200438, China
| | - Yang Li
- Key Laboratory of Molecular Epigenetics of the Ministry of Education (MOE), Northeast Normal University, Changchun 130024, China
| | - Jing Zhao
- Key Laboratory of Molecular Epigenetics of the Ministry of Education (MOE), Northeast Normal University, Changchun 130024, China
| | - Ying Wu
- Key Laboratory of Molecular Epigenetics of the Ministry of Education (MOE), Northeast Normal University, Changchun 130024, China
| | - Lei Gong
- Key Laboratory of Molecular Epigenetics of the Ministry of Education (MOE), Northeast Normal University, Changchun 130024, China
| | - Fabrizio Mafessoni
- Department of Plant and Environmental Sciences, The Weizmann Institute of Science, 76100 Rehovot, Israel
| | - Avraham A Levy
- Department of Plant and Environmental Sciences, The Weizmann Institute of Science, 76100 Rehovot, Israel.
| | - Bao Liu
- Key Laboratory of Molecular Epigenetics of the Ministry of Education (MOE), Northeast Normal University, Changchun 130024, China.
| |
Collapse
|
13
|
Finding and Characterizing Repeats in Plant Genomes. METHODS IN MOLECULAR BIOLOGY (CLIFTON, N.J.) 2022; 2443:327-385. [PMID: 35037215 DOI: 10.1007/978-1-0716-2067-0_18] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
Plant genomes contain a particularly high proportion of repeated structures of various types. This chapter proposes a guided tour of the available software that can help biologists to scan automatically for these repeats in sequence data or check hypothetical models intended to characterize their structures. Since transposable elements (TEs) are a major source of repeats in plants, many methods have been used or developed for this broad class of sequences. They are representative of the range of tools available for other classes of repeats and we have provided two sections on this topic (for the analysis of genomes or directly of sequenced reads), as well as a selection of the main existing software. It may be hard to keep up with the profusion of proposals in this dynamic field and the rest of the chapter is devoted to the foundations of an efficient search for repeats and more complex patterns. We first introduce the key concepts of the art of indexing and mapping or querying sequences. We end the chapter with the more prospective issue of building models of repeat families. We present the Machine Learning approach first, seeking to build predictors automatically for some families of ET, from a set of sequences known to belong to this family. A second approach, the linguistic (or syntactic) approach, allows biologists to describe themselves and check the validity of models of their favorite repeat family.
Collapse
|
14
|
Oliveira LS, Patera AC, Domingues DS, Sanches DS, Lopes FM, Bugatti PH, Saito PTM, Maracaja-Coutinho V, Durham AM, Paschoal AR. Computational Analysis of Transposable Elements and CircRNAs in Plants. METHODS IN MOLECULAR BIOLOGY (CLIFTON, N.J.) 2022; 2362:147-172. [PMID: 34195962 DOI: 10.1007/978-1-0716-1645-1_9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
This chapter provides two main contributions: (1) a description of computational tools and databases used to identify and analyze transposable elements (TEs) and circRNAs in plants; and (2) data analysis on public TE and circRNA data. Our goal is to highlight the primary information available in the literature on circular noncoding RNAs and transposable elements in plants. The exploratory analysis performed on publicly available circRNA and TEs data help discuss four sequence features. Finally, we investigate the association on circRNAs:TE in plants in the model organism Arabidopsis thaliana.
Collapse
Affiliation(s)
- Liliane Santana Oliveira
- Department of Computer Science, Federal University of Technology-Paraná (UTFPR), Cornélio Procópio, PR, Brazil. .,Embrapa Soja, Londrina, Paraná, Brazil.
| | - Andressa Caroline Patera
- Department of Computer Science, Federal University of Technology-Paraná (UTFPR), Cornélio Procópio, PR, Brazil
| | - Douglas Silva Domingues
- Department of Computer Science, Federal University of Technology-Paraná (UTFPR), Cornélio Procópio, PR, Brazil.,Group of Genomics and Transcriptomes in Plants, Instituto de Biociências de Rio Claro, Universidade Estadual Paulista (UNESP), Rio Claro, SP, Brazil
| | - Danilo Sipoli Sanches
- Department of Computer Science, Federal University of Technology-Paraná (UTFPR), Cornélio Procópio, PR, Brazil
| | - Fabricio Martins Lopes
- Department of Computer Science, Federal University of Technology-Paraná (UTFPR), Cornélio Procópio, PR, Brazil
| | - Pedro Henrique Bugatti
- Department of Computer Science, Federal University of Technology-Paraná (UTFPR), Cornélio Procópio, PR, Brazil
| | - Priscila Tiemi Maeda Saito
- Department of Computer Science, Federal University of Technology-Paraná (UTFPR), Cornélio Procópio, PR, Brazil
| | - Vinicius Maracaja-Coutinho
- Centro de Modelamiento Molecular, Biofísica y Bioinformática-CM2B2, Facultad de Ciencias Quimicas y Farmaceuticas, Universidad de Chile, Santiago, Chile
| | - Alan Mitchell Durham
- Department of Computer Science, Instituto de Matemática e Estatística, Universidade de São Paulo (USP), Cidade Universitária, SP, Brazil
| | - Alexandre Rossi Paschoal
- Department of Computer Science, Federal University of Technology-Paraná (UTFPR), Cornélio Procópio, PR, Brazil.
| |
Collapse
|
15
|
Kamal N, Lux T, Jayakodi M, Haberer G, Gundlach H, Mayer KFX, Mascher M, Spannagl M. The Barley and Wheat Pan-Genomes. Methods Mol Biol 2022; 2443:147-159. [PMID: 35037204 DOI: 10.1007/978-1-0716-2067-0_7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
To unlock the genetic potential in crops, multi-genome comparisons are an essential tool. Decreasing costs and improved sequencing technologies have democratized plant genome sequencing and led to a vast increase in the amount of available reference sequences on the one hand and enabled the assembly of even the largest and most complex and repetitive crops genomes such as wheat and barley. These developments have led to the era of pan-genomics in recent years. Pan-genome projects enable the definition of the core and dispensable genome for various crop species as well as the analysis of structural and functional variation and hence offer unprecedented opportunities for exploring and utilizing the genetic basis of natural variation in crops. Comparing, analyzing, and visualizing these multiple reference genomes and their diversity requires powerful and specialized computational strategies and tools.
Collapse
Affiliation(s)
- Nadia Kamal
- Helmholtz Zentrum München-German Research Center for Environmental Health, Neuherberg, Germany
| | - Thomas Lux
- Helmholtz Zentrum München-German Research Center for Environmental Health, Neuherberg, Germany
| | - Murukarthick Jayakodi
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Gatersleben, Seeland, Germany
| | - Georg Haberer
- Helmholtz Zentrum München-German Research Center for Environmental Health, Neuherberg, Germany
| | - Heidrun Gundlach
- Helmholtz Zentrum München-German Research Center for Environmental Health, Neuherberg, Germany
| | - Klaus F X Mayer
- Helmholtz Zentrum München-German Research Center for Environmental Health, Neuherberg, Germany
| | - Martin Mascher
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Gatersleben, Seeland, Germany
| | - Manuel Spannagl
- Helmholtz Zentrum München-German Research Center for Environmental Health, Neuherberg, Germany.
| |
Collapse
|
16
|
Li F, Han Z, Qiao W, Wang J, Song Y, Cui Y, Li J, Ge J, Lou D, Fan W, Li D, Nong B, Zhang Z, Cheng Y, Zhang L, Zheng X, Yang Q. High-Quality Genomes and High-Density Genetic Map Facilitate the Identification of Genes From a Weedy Rice. FRONTIERS IN PLANT SCIENCE 2021; 12:775051. [PMID: 34868173 PMCID: PMC8639688 DOI: 10.3389/fpls.2021.775051] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/13/2021] [Accepted: 10/27/2021] [Indexed: 06/13/2023]
Abstract
Genes have been lost or weakened from cultivated rice during rice domestication and breeding. Weedy rice (Oryza sativa f. spontanea) is usually recognized as the progeny between cultivated rice and wild rice and is also known to harbor an gene pool for rice breeding. Therefore, identifying genes from weedy rice germplasms is an important way to break the bottleneck of rice breeding. To discover genes from weedy rice germplasms, we constructed a genetic map based on w-hole-genome sequencing of a F2 population derived from the cross between LM8 and a cultivated rice variety. We further identified 31 QTLs associated with 12 important agronomic traits and revealed that ORUFILM03g000095 gene may play an important role in grain length regulation and participate in grain formation. To clarify the genomic characteristics from weedy rice germplasms of LM8, we generated a high-quality genome assembly using single-molecule sequencing, Bionano optical mapping, and Hi-C technologies. The genome harbored a total size of 375.8 Mb, a scaffold N50 of 24.1 Mb, and originated approximately 0.32 million years ago (Mya) and was more closely related to Oryza sativa ssp. japonica. and contained 672 unique genes. It is related to the formation of grain shape, heading date and tillering. This study generated a high-quality reference genome of weedy rice and high-density genetic map that would benefit the analysis of genome evolution for related species and suggested an effective way to identify genes related to important agronomic traits for further rice breeding.
Collapse
Affiliation(s)
- Fei Li
- National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Zhenyun Han
- National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Weihua Qiao
- National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Junrui Wang
- National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
- Guangxi Key Laboratory for Polysaccharide Materials and Modifications, School of Marine Sciences and Biotechnology, Guangxi University for Nationalities, Nanning, China
| | - Yue Song
- National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Yongxia Cui
- National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
- School of Clinical Medicine, Southwest Medical University, Luzhou, China
| | - Jiaqi Li
- National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
- Little Berry Research Room, Liaoning Institute of Fruit Science, Yingkou, China
| | - Jinyue Ge
- National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Danjing Lou
- National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Weiya Fan
- National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Danting Li
- Guangxi Key Laboratory of Rice Genetics and Breeding, Rice Research Institute, Guangxi Academy of Agricultural Sciences, Nanning, China
| | - Baoxuan Nong
- Guangxi Key Laboratory of Rice Genetics and Breeding, Rice Research Institute, Guangxi Academy of Agricultural Sciences, Nanning, China
| | - Zongqiong Zhang
- Guangxi Key Laboratory of Rice Genetics and Breeding, Rice Research Institute, Guangxi Academy of Agricultural Sciences, Nanning, China
| | - Yunlian Cheng
- National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Lifang Zhang
- National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Xiaoming Zheng
- National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Qingwen Yang
- National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| |
Collapse
|
17
|
Wang Y, Li X, Wang C, Gao L, Wu Y, Ni X, Sun J, Jiang J. Unveiling the transcriptomic complexity of Miscanthus sinensis using a combination of PacBio long read- and Illumina short read sequencing platforms. BMC Genomics 2021; 22:690. [PMID: 34551715 PMCID: PMC8459517 DOI: 10.1186/s12864-021-07971-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2020] [Accepted: 09/03/2021] [Indexed: 11/10/2022] Open
Abstract
Background Miscanthus sinensis Andersson is a perennial grass that exhibits remarkable lignocellulose characteristics suitable for sustainable bioenergy production. However, knowledge of the genetic resources of this species is relatively limited, which considerably hampers further work on its biology and genetic improvement. Results In this study, through analyzing the transcriptome of mixed samples of leaves and stems using the latest PacBio Iso-Seq sequencing technology combined with Illumina HiSeq, we report the first full-length transcriptome dataset of M. sinensis with a total of 58.21 Gb clean data. An average of 15.75 Gb clean reads of each sample were obtained from the PacBio Iso-Seq system, which doubled the data size (6.68 Gb) obtained from the Illumina HiSeq platform. The integrated analyses of PacBio- and Illumina-based transcriptomic data uncovered 408,801 non-redundant transcripts with an average length of 1,685 bp. Of those, 189,406 transcripts were commonly identified by both methods, 169,149 transcripts with an average length of 619 bp were uniquely identified by Illumina HiSeq, and 51,246 transcripts with an average length of 2,535 bp were uniquely identified by PacBio Iso-Seq. Approximately 96 % of the final combined transcripts were mapped back to the Miscanthus genome, reflecting the high quality and coverage of our sequencing results. When comparing our data with genomes of four species of Andropogoneae, M. sinensis showed the closest relationship with sugarcane with up to 93 % mapping ratios, followed by sorghum with up to 80 % mapping ratios, indicating a high conservation of orthologs in these three genomes. Furthermore, 306,228 transcripts were successfully annotated against public databases including cell wall related genes and transcript factor families, thus providing many new insights into gene functions. The PacBio Iso-Seq data also helped identify 3,898 alternative splicing events and 2,963 annotated AS isoforms within 10 function categories. Conclusions Taken together, the present study provides a rich data set of full-length transcripts that greatly enriches our understanding of M. sinensis transcriptomic resources, thus facilitating further genetic improvement and molecular studies of the Miscanthus species. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-021-07971-x.
Collapse
Affiliation(s)
- Yongli Wang
- Biofuels Institute, School of the Environment and Safety Engineering, Jiangsu University, 212013, Zhenjiang, Jiangsu, China
| | - Xia Li
- Biofuels Institute, School of the Environment and Safety Engineering, Jiangsu University, 212013, Zhenjiang, Jiangsu, China
| | - Congsheng Wang
- Biofuels Institute, School of the Environment and Safety Engineering, Jiangsu University, 212013, Zhenjiang, Jiangsu, China
| | - Lu Gao
- Biofuels Institute, School of the Environment and Safety Engineering, Jiangsu University, 212013, Zhenjiang, Jiangsu, China
| | - Yanfang Wu
- Biofuels Institute, School of the Environment and Safety Engineering, Jiangsu University, 212013, Zhenjiang, Jiangsu, China
| | - Xingnan Ni
- Biofuels Institute, School of the Environment and Safety Engineering, Jiangsu University, 212013, Zhenjiang, Jiangsu, China
| | - Jianzhong Sun
- Biofuels Institute, School of the Environment and Safety Engineering, Jiangsu University, 212013, Zhenjiang, Jiangsu, China.
| | - Jianxiong Jiang
- Biofuels Institute, School of the Environment and Safety Engineering, Jiangsu University, 212013, Zhenjiang, Jiangsu, China.
| |
Collapse
|
18
|
Mayer G, Müller W, Schork K, Uszkoreit J, Weidemann A, Wittig U, Rey M, Quast C, Felden J, Glöckner FO, Lange M, Arend D, Beier S, Junker A, Scholz U, Schüler D, Kestler HA, Wibberg D, Pühler A, Twardziok S, Eils J, Eils R, Hoffmann S, Eisenacher M, Turewicz M. Implementing FAIR data management within the German Network for Bioinformatics Infrastructure (de.NBI) exemplified by selected use cases. Brief Bioinform 2021; 22:bbab010. [PMID: 33589928 PMCID: PMC8425304 DOI: 10.1093/bib/bbab010] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2020] [Revised: 12/21/2020] [Accepted: 01/06/2021] [Indexed: 12/21/2022] Open
Abstract
This article describes some use case studies and self-assessments of FAIR status of de.NBI services to illustrate the challenges and requirements for the definition of the needs of adhering to the FAIR (findable, accessible, interoperable and reusable) data principles in a large distributed bioinformatics infrastructure. We address the challenge of heterogeneity of wet lab technologies, data, metadata, software, computational workflows and the levels of implementation and monitoring of FAIR principles within the different bioinformatics sub-disciplines joint in de.NBI. On the one hand, this broad service landscape and the excellent network of experts are a strong basis for the development of useful research data management plans. On the other hand, the large number of tools and techniques maintained by distributed teams renders FAIR compliance challenging.
Collapse
Affiliation(s)
- Gerhard Mayer
- Ruhr University Bochum, Faculty of Medicine, Medizinisches Proteom-Center, Bochum, Germany
- Ruhr University Bochum, Center for Protein Diagnostics (ProDi), Medical Proteome Analysis, Bochum, Germany
- Ulm University, Institute of Medical Systems Biology, Ulm, Germany
| | - Wolfgang Müller
- Heidelberg Institute for Theoretical Studies (HITS gGmbH), Scientific Databases and Visualization Group, Heidelberg, Germany
| | - Karin Schork
- Ruhr University Bochum, Faculty of Medicine, Medizinisches Proteom-Center, Bochum, Germany
- Ruhr University Bochum, Center for Protein Diagnostics (ProDi), Medical Proteome Analysis, Bochum, Germany
| | - Julian Uszkoreit
- Ruhr University Bochum, Faculty of Medicine, Medizinisches Proteom-Center, Bochum, Germany
- Ruhr University Bochum, Center for Protein Diagnostics (ProDi), Medical Proteome Analysis, Bochum, Germany
| | - Andreas Weidemann
- Heidelberg Institute for Theoretical Studies (HITS gGmbH), Scientific Databases and Visualization Group, Heidelberg, Germany
| | - Ulrike Wittig
- Heidelberg Institute for Theoretical Studies (HITS gGmbH), Scientific Databases and Visualization Group, Heidelberg, Germany
| | - Maja Rey
- Heidelberg Institute for Theoretical Studies (HITS gGmbH), Scientific Databases and Visualization Group, Heidelberg, Germany
| | | | - Janine Felden
- Jacobs University Bremen gGmbH, Bremen, Germany
- University of Bremen, MARUM - Center for Marine Environmental Sciences, Bremen, Germany
| | - Frank Oliver Glöckner
- Jacobs University Bremen gGmbH, Bremen, Germany
- University of Bremen, MARUM - Center for Marine Environmental Sciences, Bremen, Germany
- Alfred Wegener Institute - Helmholtz Center for Polar- and Marine Research, Bremerhaven, Germany
| | - Matthias Lange
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
| | - Daniel Arend
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
| | - Sebastian Beier
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
| | - Astrid Junker
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
| | - Uwe Scholz
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
| | - Danuta Schüler
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
| | - Hans A Kestler
- Ulm University, Institute of Medical Systems Biology, Ulm, Germany
- Leibniz Institute on Ageing - Fritz Lipmann Institute, Jena
| | - Daniel Wibberg
- Bielefeld University, Center for Biotechnology (CeBiTec), Bielefeld, Germany
| | - Alfred Pühler
- Bielefeld University, Center for Biotechnology (CeBiTec), Bielefeld, Germany
| | - Sven Twardziok
- Charité - Universitätsmedizin Berlin, corporate member of Freie Universität Berlin, Humboldt-Universität zu Berlin, and Berlin Institute of Health (BIH), Center for Digital Health, Berlin, Germany
| | - Jürgen Eils
- Charité - Universitätsmedizin Berlin, corporate member of Freie Universität Berlin, Humboldt-Universität zu Berlin, and Berlin Institute of Health (BIH), Center for Digital Health, Berlin, Germany
| | - Roland Eils
- Charité - Universitätsmedizin Berlin, corporate member of Freie Universität Berlin, Humboldt-Universität zu Berlin, and Berlin Institute of Health (BIH), Center for Digital Health, Berlin, Germany
- Heidelberg University Hospital and BioQuant, Health Data Science Unit, Heidelberg, Germany
| | - Steve Hoffmann
- Leibniz Institute on Ageing - Fritz Lipmann Institute, Jena
| | - Martin Eisenacher
- Ruhr University Bochum, Faculty of Medicine, Medizinisches Proteom-Center, Bochum, Germany
- Ruhr University Bochum, Center for Protein Diagnostics (ProDi), Medical Proteome Analysis, Bochum, Germany
| | - Michael Turewicz
- Ruhr University Bochum, Faculty of Medicine, Medizinisches Proteom-Center, Bochum, Germany
- Ruhr University Bochum, Center for Protein Diagnostics (ProDi), Medical Proteome Analysis, Bochum, Germany
| |
Collapse
|
19
|
Liu Y, Wang Z, Wu X, Zhu J, Luo H, Tian D, Li C, Luo J, Zhao W, Hao H, Jing HC. SorGSD: updating and expanding the sorghum genome science database with new contents and tools. BIOTECHNOLOGY FOR BIOFUELS 2021; 14:165. [PMID: 34344425 PMCID: PMC8336335 DOI: 10.1186/s13068-021-02016-7] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/05/2021] [Accepted: 07/24/2021] [Indexed: 05/17/2023]
Abstract
BACKGROUND As the fifth major cereal crop originated from Africa, sorghum (Sorghum bicolor) has become a key C4 model organism for energy plant research. With the development of high-throughput detection technologies for various omics data, much multi-dimensional and multi-omics information has been accumulated for sorghum. Integrating this information may accelerate genetic research and improve molecular breeding for sorghum agronomic traits. RESULTS We updated the Sorghum Genome SNP Database (SorGSD) by adding new data, new features and renamed it to Sorghum Genome Science Database (SorGSD). In comparison with the original version SorGSD, which contains SNPs from 48 sorghum accessions mapped to the reference genome BTx623 (v2.1), the new version was expanded to 289 sorghum lines with both single nucleotide polymorphisms (SNPs) and small insertions/deletions (INDELs), which were aligned to the newly assembled and annotated sorghum genome BTx623 (v3.1). Moreover, phenotypic data and panicle pictures of critical accessions were provided in the new version. We implemented new tools including ID Conversion, Homologue Search and Genome Browser for analysis and updated the general information related to sorghum research, such as online sorghum resources and literature references. In addition, we deployed a new database infrastructure and redesigned a new user interface as one of the Genome Variation Map databases. The new version SorGSD is freely accessible online at http://ngdc.cncb.ac.cn/sorgsd/ . CONCLUSIONS SorGSD is a comprehensive integration with large-scale genomic variation, phenotypic information and incorporates online data analysis tools for data mining, genome navigation and analysis. We hope that SorGSD could provide a valuable resource for sorghum researchers to find variations they are interested in and generate customized high-throughput datasets for further analysis.
Collapse
Affiliation(s)
- Yuanming Liu
- Key Laboratory of Plant Resources, Institute of Botany, Chinese Academy of Sciences, Beijing, 100093 China
- University of Chinese Academy of Sciences, Beijing, 100049 China
| | - Zhonghuang Wang
- University of Chinese Academy of Sciences, Beijing, 100049 China
- China National Center for Bioinformation, Beijing, 100101 China
- National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, 100101 China
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, 100101 China
| | - Xiaoyuan Wu
- Key Laboratory of Plant Resources, Institute of Botany, Chinese Academy of Sciences, Beijing, 100093 China
| | - Junwei Zhu
- China National Center for Bioinformation, Beijing, 100101 China
- National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, 100101 China
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, 100101 China
| | - Hong Luo
- Key Laboratory of Plant Resources, Institute of Botany, Chinese Academy of Sciences, Beijing, 100093 China
| | - Dongmei Tian
- China National Center for Bioinformation, Beijing, 100101 China
- National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, 100101 China
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, 100101 China
| | - Cuiping Li
- China National Center for Bioinformation, Beijing, 100101 China
- National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, 100101 China
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, 100101 China
| | - Jingchu Luo
- College of Life Sciences and Center for Bioinformatics, Peking University, Beijing, 100871 China
| | - Wenming Zhao
- University of Chinese Academy of Sciences, Beijing, 100049 China
- China National Center for Bioinformation, Beijing, 100101 China
- National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, 100101 China
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, 100101 China
| | - Huaiqing Hao
- Key Laboratory of Plant Resources, Institute of Botany, Chinese Academy of Sciences, Beijing, 100093 China
| | - Hai-Chun Jing
- Key Laboratory of Plant Resources, Institute of Botany, Chinese Academy of Sciences, Beijing, 100093 China
- Engineering Laboratory for Grass-Based Livestock Husbandry, Institute of Botany, Chinese Academy of Sciences, Beijing, 100093 China
- University of Chinese Academy of Sciences, Beijing, 100049 China
| |
Collapse
|
20
|
Sorghum's Whole-Plant Transcriptome and Proteome Responses to Drought Stress: A Review. Life (Basel) 2021; 11:life11070704. [PMID: 34357076 PMCID: PMC8305457 DOI: 10.3390/life11070704] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2021] [Revised: 07/12/2021] [Accepted: 07/15/2021] [Indexed: 12/29/2022] Open
Abstract
Sorghum is a cereal crop with key agronomic traits of drought and heat stress tolerance, making it an ideal food and industrial commodity for hotter and more arid climates. These stress tolerances also present a useful scientific resource for studying the molecular basis for environmental resilience. Here we provide an extensive review of current transcriptome and proteome works conducted with laboratory, greenhouse, or field-grown sorghum plants exposed to drought, osmotic stress, or treated with the drought stress-regulatory phytohormone, abscisic acid. Large datasets from these studies reveal changes in gene/protein expression across diverse signaling and metabolic pathways. Together, the emerging patterns from these datasets reveal that the overall functional classes of stress-responsive genes/proteins within sorghum are similar to those observed in equivalent studies of other drought-sensitive model species. This highlights a monumental challenge of distinguishing key regulatory genes/proteins, with a primary role in sorghum adaptation to drought, from genes/proteins that change in expression because of stress. Finally, we discuss possible options for taking the research forward. Successful exploitation of sorghum research for implementation in other crops may be critical in establishing climate-resilient agriculture for future food security.
Collapse
|
21
|
Fernie AR, Alseekh S, Liu J, Yan J. Using precision phenotyping to inform de novo domestication. PLANT PHYSIOLOGY 2021; 186:1397-1411. [PMID: 33848336 PMCID: PMC8260140 DOI: 10.1093/plphys/kiab160] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/02/2021] [Accepted: 03/22/2021] [Indexed: 05/09/2023]
Abstract
An update on the use of precision phenotyping to assess the potential of lesser cultivated species as candidates for de novo domestication or similar development for future agriculture.
Collapse
Affiliation(s)
- Alisdair R Fernie
- Max Planck Institute for Molecular Plant Physiology, Am Mühlenberg 1, 14476 Potsdam-Golm, Germany
- Centre of Plant Systems Biology and Biotechnology, 4000 Plovdiv, Bulgaria
| | - Saleh Alseekh
- Max Planck Institute for Molecular Plant Physiology, Am Mühlenberg 1, 14476 Potsdam-Golm, Germany
- Centre of Plant Systems Biology and Biotechnology, 4000 Plovdiv, Bulgaria
| | - Jie Liu
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, 430070 Wuhan, Hubei, China
| | - Jianbing Yan
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, 430070 Wuhan, Hubei, China
| |
Collapse
|
22
|
Orozco-Arias S, Candamil-Cortés MS, Jaimes PA, Piña JS, Tabares-Soto R, Guyot R, Isaza G. K-mer-based machine learning method to classify LTR-retrotransposons in plant genomes. PeerJ 2021; 9:e11456. [PMID: 34055489 PMCID: PMC8140598 DOI: 10.7717/peerj.11456] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2021] [Accepted: 04/24/2021] [Indexed: 12/15/2022] Open
Abstract
Every day more plant genomes are available in public databases and additional massive sequencing projects (i.e., that aim to sequence thousands of individuals) are formulated and released. Nevertheless, there are not enough automatic tools to analyze this large amount of genomic information. LTR retrotransposons are the most frequent repetitive sequences in plant genomes; however, their detection and classification are commonly performed using semi-automatic and time-consuming programs. Despite the availability of several bioinformatic tools that follow different approaches to detect and classify them, none of these tools can individually obtain accurate results. Here, we used Machine Learning algorithms based on k-mer counts to classify LTR retrotransposons from other genomic sequences and into lineages/families with an F1-Score of 95%, contributing to develop a free-alignment and automatic method to analyze these sequences.
Collapse
Affiliation(s)
- Simon Orozco-Arias
- Department of Computer Science, Universidad Autónoma de Manizales, Manizales, Caldas, Colombia.,Department of Systems and Informatics, Universidad de Caldas, Manizales, Caldas, Colombia
| | | | - Paula A Jaimes
- Department of Computer Science, Universidad Autónoma de Manizales, Manizales, Caldas, Colombia
| | - Johan S Piña
- Department of Computer Science, Universidad Autónoma de Manizales, Manizales, Caldas, Colombia
| | - Reinel Tabares-Soto
- Department of Electronics and Automation, Universidad Autónoma de Manizales, Manizales, Caldas, Colombia
| | - Romain Guyot
- Department of Electronics and Automation, Universidad Autónoma de Manizales, Manizales, Caldas, Colombia.,Institut de Recherche pour le Développement, CIRAD, Univ. Montpellier, Montpellier, France
| | - Gustavo Isaza
- Department of Systems and Informatics, Universidad de Caldas, Manizales, Caldas, Colombia
| |
Collapse
|
23
|
Panta M, Mishra A, Hoque MT, Atallah J. ClassifyTE: A stacking based prediction of hierarchical classification of transposable elements. Bioinformatics 2021; 37:2529-2536. [PMID: 33682878 DOI: 10.1093/bioinformatics/btab146] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2020] [Revised: 02/10/2021] [Accepted: 03/01/2021] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION Transposable Elements (TEs) or jumping genes are DNA sequences that have an intrinsic capability to move within a host genome from one genomic location to another. Studies show that the presence of a TE within or adjacent to a functional gene may alter its expression. TEs can also cause an increase in the rate of mutation and can even mediate duplications and large insertions and deletions in the genome, promoting gross genetic rearrangements. The proper classification of identified jumping genes is important for analyzing their genetic and evolutionary effects. An effective classifier, which can explain the role of TEs in germline and somatic evolution more accurately, is needed. In this study, we examine the performance of a variety of machine learning (ML) techniques and propose a robust method, ClassifyTE, for the hierarchical classification of TEs with high accuracy, using a stacking-based ML method. RESULTS We propose a stacking-based approach for the hierarchical classification of TEs. When trained on three different benchmark datasets, our proposed system achieved 4%, 10.68%, and 10.13% average percentage improvement (using the hF measure) compared to several state-of-the-art methods. We developed an end-to-end automated hierarchical classification tool based on the proposed approach, ClassifyTE, to classify TEs up to the super-family level. We further evaluated our method on a new TE library generated by a homology-based classification method and found relatively high concordance at higher taxonomic levels. Thus, ClassifyTE paves the way for a more accurate analysis of the role of TEs. AVAILABILITY The source code and data are available at https://github.com/manisa/ClassifyTE. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Manisha Panta
- Department of Computer Science, University of New Orleans, New Orleans, LA 70148, USA
| | - Avdesh Mishra
- Department of Electrical Engineering and Computer Science, Texas A&M University-Kingsville, Kingsville, TX, 78363, USA
| | - Md Tamjidul Hoque
- Department of Computer Science, University of New Orleans, New Orleans, LA 70148, USA
| | - Joel Atallah
- Department of Biological Sciences, University of New Orleans, New Orleans, LA 70148, USA
| |
Collapse
|
24
|
Orozco-Arias S, Jaimes PA, Candamil MS, Jiménez-Varón CF, Tabares-Soto R, Isaza G, Guyot R. InpactorDB: A Classified Lineage-Level Plant LTR Retrotransposon Reference Library for Free-Alignment Methods Based on Machine Learning. Genes (Basel) 2021; 12:genes12020190. [PMID: 33525408 PMCID: PMC7910972 DOI: 10.3390/genes12020190] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2020] [Revised: 01/21/2021] [Accepted: 01/22/2021] [Indexed: 12/04/2022] Open
Abstract
Long terminal repeat (LTR) retrotransposons are mobile elements that constitute the major fraction of most plant genomes. The identification and annotation of these elements via bioinformatics approaches represent a major challenge in the era of massive plant genome sequencing. In addition to their involvement in genome size variation, LTR retrotransposons are also associated with the function and structure of different chromosomal regions and can alter the function of coding regions, among others. Several sequence databases of plant LTR retrotransposons are available for public access, such as PGSB and RepetDB, or restricted access such as Repbase. Although these databases are useful to identify LTR-RTs in new genomes by similarity, the elements of these databases are not fully classified to the lineage (also called family) level. Here, we present InpactorDB, a semi-curated dataset composed of 130,439 elements from 195 plant genomes (belonging to 108 plant species) classified to the lineage level. This dataset has been used to train two deep neural networks (i.e., one fully connected and one convolutional) for the rapid classification of these elements. In lineage-level classification approaches, we obtain up to 98% performance, indicated by the F1-score, precision and recall scores.
Collapse
Affiliation(s)
- Simon Orozco-Arias
- Department of Computer Science, Universidad Autónoma de Manizales, 170002 Manizales, Colombia; (P.A.J.); (M.S.C.)
- Department of Systems and Informatics, Universidad de Caldas, 170002 Manizales, Colombia;
- Correspondence: (S.O.-A.); (R.G.)
| | - Paula A. Jaimes
- Department of Computer Science, Universidad Autónoma de Manizales, 170002 Manizales, Colombia; (P.A.J.); (M.S.C.)
| | - Mariana S. Candamil
- Department of Computer Science, Universidad Autónoma de Manizales, 170002 Manizales, Colombia; (P.A.J.); (M.S.C.)
| | | | - Reinel Tabares-Soto
- Department of Electronics and Automation, Universidad Autónoma de Manizales, 170002 Manizales, Colombia;
| | - Gustavo Isaza
- Department of Systems and Informatics, Universidad de Caldas, 170002 Manizales, Colombia;
| | - Romain Guyot
- Department of Electronics and Automation, Universidad Autónoma de Manizales, 170002 Manizales, Colombia;
- Institut de Recherche pour le Développement, CIRAD, University of Montpellier, 34394 Montpellier, France
- Correspondence: (S.O.-A.); (R.G.)
| |
Collapse
|
25
|
Jayakodi M, Padmarasu S, Haberer G, Bonthala VS, Gundlach H, Monat C, Lux T, Kamal N, Lang D, Himmelbach A, Ens J, Zhang XQ, Angessa TT, Zhou G, Tan C, Hill C, Wang P, Schreiber M, Boston LB, Plott C, Jenkins J, Guo Y, Fiebig A, Budak H, Xu D, Zhang J, Wang C, Grimwood J, Schmutz J, Guo G, Zhang G, Mochida K, Hirayama T, Sato K, Chalmers KJ, Langridge P, Waugh R, Pozniak CJ, Scholz U, Mayer KFX, Spannagl M, Li C, Mascher M, Stein N. The barley pan-genome reveals the hidden legacy of mutation breeding. Nature 2020; 588:284-289. [PMID: 33239781 PMCID: PMC7759462 DOI: 10.1038/s41586-020-2947-8] [Citation(s) in RCA: 293] [Impact Index Per Article: 58.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2020] [Accepted: 09/09/2020] [Indexed: 12/25/2022]
Abstract
Genetic diversity is key to crop improvement. Owing to pervasive genomic structural variation, a single reference genome assembly cannot capture the full complement of sequence diversity of a crop species (known as the 'pan-genome'1). Multiple high-quality sequence assemblies are an indispensable component of a pan-genome infrastructure. Barley (Hordeum vulgare L.) is an important cereal crop with a long history of cultivation that is adapted to a wide range of agro-climatic conditions2. Here we report the construction of chromosome-scale sequence assemblies for the genotypes of 20 varieties of barley-comprising landraces, cultivars and a wild barley-that were selected as representatives of global barley diversity. We catalogued genomic presence/absence variants and explored the use of structural variants for quantitative genetic analysis through whole-genome shotgun sequencing of 300 gene bank accessions. We discovered abundant large inversion polymorphisms and analysed in detail two inversions that are frequently found in current elite barley germplasm; one is probably the product of mutation breeding and the other is tightly linked to a locus that is involved in the expansion of geographical range. This first-generation barley pan-genome makes previously hidden genetic variation accessible to genetic studies and breeding.
Collapse
Affiliation(s)
- Murukarthick Jayakodi
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
| | - Sudharsan Padmarasu
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
| | - Georg Haberer
- Plant Genome and Systems Biology (PGSB), Helmholtz Center Munich, German Research Center for Environmental Health, Neuherberg, Germany
| | - Venkata Suresh Bonthala
- Plant Genome and Systems Biology (PGSB), Helmholtz Center Munich, German Research Center for Environmental Health, Neuherberg, Germany
| | - Heidrun Gundlach
- Plant Genome and Systems Biology (PGSB), Helmholtz Center Munich, German Research Center for Environmental Health, Neuherberg, Germany
| | - Cécile Monat
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
| | - Thomas Lux
- Plant Genome and Systems Biology (PGSB), Helmholtz Center Munich, German Research Center for Environmental Health, Neuherberg, Germany
| | - Nadia Kamal
- Plant Genome and Systems Biology (PGSB), Helmholtz Center Munich, German Research Center for Environmental Health, Neuherberg, Germany
| | - Daniel Lang
- Plant Genome and Systems Biology (PGSB), Helmholtz Center Munich, German Research Center for Environmental Health, Neuherberg, Germany
| | - Axel Himmelbach
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
| | - Jennifer Ens
- Department of Plant Sciences, University of Saskatchewan, Saskatoon, Saskatchewan, Canada
| | - Xiao-Qi Zhang
- Western Barley Genetics Alliance, State Agricultural Biotechnology Centre, College of Science, Health, Engineering and Education, Murdoch University, Murdoch, Western Australia, Australia
| | - Tefera T Angessa
- Western Barley Genetics Alliance, State Agricultural Biotechnology Centre, College of Science, Health, Engineering and Education, Murdoch University, Murdoch, Western Australia, Australia
| | - Gaofeng Zhou
- Western Barley Genetics Alliance, State Agricultural Biotechnology Centre, College of Science, Health, Engineering and Education, Murdoch University, Murdoch, Western Australia, Australia
- Agriculture and Food, Department of Primary Industries and Regional Development, South Perth, Western Australia, Australia
| | - Cong Tan
- Western Barley Genetics Alliance, State Agricultural Biotechnology Centre, College of Science, Health, Engineering and Education, Murdoch University, Murdoch, Western Australia, Australia
| | - Camilla Hill
- Western Barley Genetics Alliance, State Agricultural Biotechnology Centre, College of Science, Health, Engineering and Education, Murdoch University, Murdoch, Western Australia, Australia
| | - Penghao Wang
- Western Barley Genetics Alliance, State Agricultural Biotechnology Centre, College of Science, Health, Engineering and Education, Murdoch University, Murdoch, Western Australia, Australia
| | | | - Lori B Boston
- HudsonAlpha, Institute for Biotechnology, Huntsville, AL, USA
| | | | - Jerry Jenkins
- HudsonAlpha, Institute for Biotechnology, Huntsville, AL, USA
| | - Yu Guo
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
| | - Anne Fiebig
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
| | | | - Dongdong Xu
- Institute of Crop Sciences, Chinese Academy of Agricultural Sciences (ICS-CAAS), Beijing, China
| | - Jing Zhang
- Institute of Crop Sciences, Chinese Academy of Agricultural Sciences (ICS-CAAS), Beijing, China
| | - Chunchao Wang
- Institute of Crop Sciences, Chinese Academy of Agricultural Sciences (ICS-CAAS), Beijing, China
| | - Jane Grimwood
- HudsonAlpha, Institute for Biotechnology, Huntsville, AL, USA
| | - Jeremy Schmutz
- HudsonAlpha, Institute for Biotechnology, Huntsville, AL, USA
| | - Ganggang Guo
- Institute of Crop Sciences, Chinese Academy of Agricultural Sciences (ICS-CAAS), Beijing, China
| | - Guoping Zhang
- College of Agriculture and Biotechnology, Zhejiang University, Hangzhou, China
| | - Keiichi Mochida
- Bioproductivity Informatics Research Team, RIKEN Center for Sustainable Resource Science, Yokohama, Japan
- Kihara Institute for Biological Research, Yokohama City University, Yokohama, Japan
- Institute of Plant Science and Resources, Okayama University, Kurashiki, Japan
| | - Takashi Hirayama
- Institute of Plant Science and Resources, Okayama University, Kurashiki, Japan
| | - Kazuhiro Sato
- Institute of Plant Science and Resources, Okayama University, Kurashiki, Japan
| | - Kenneth J Chalmers
- School of Agriculture, Food and Wine, University of Adelaide, Glen Osmond, South Australia, Australia
| | - Peter Langridge
- School of Agriculture, Food and Wine, University of Adelaide, Glen Osmond, South Australia, Australia
| | - Robbie Waugh
- The James Hutton Institute, Dundee, UK
- School of Agriculture, Food and Wine, University of Adelaide, Glen Osmond, South Australia, Australia
- School of Life Sciences, University of Dundee, Dundee, UK
| | - Curtis J Pozniak
- Department of Plant Sciences, University of Saskatchewan, Saskatoon, Saskatchewan, Canada
| | - Uwe Scholz
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
| | - Klaus F X Mayer
- Plant Genome and Systems Biology (PGSB), Helmholtz Center Munich, German Research Center for Environmental Health, Neuherberg, Germany
- School of Life Sciences Weihenstephan, Technical University of Munich, Freising, Germany
| | - Manuel Spannagl
- Plant Genome and Systems Biology (PGSB), Helmholtz Center Munich, German Research Center for Environmental Health, Neuherberg, Germany
| | - Chengdao Li
- Western Barley Genetics Alliance, State Agricultural Biotechnology Centre, College of Science, Health, Engineering and Education, Murdoch University, Murdoch, Western Australia, Australia.
- Agriculture and Food, Department of Primary Industries and Regional Development, South Perth, Western Australia, Australia.
- Hubei Collaborative Innovation Centre for Grain Industry, Yangtze University, Jingzhou, China.
| | - Martin Mascher
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany.
- German Centre for Integrative Biodiversity Research (iDiv) Halle-Jena-Leipzig, Leipzig, Germany.
| | - Nils Stein
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany.
- Center for Integrated Breeding Research (CiBreed), Georg-August-University Göttingen, Göttingen, Germany.
| |
Collapse
|
26
|
Multiple wheat genomes reveal global variation in modern breeding. Nature 2020; 588:277-283. [PMID: 33239791 PMCID: PMC7759465 DOI: 10.1038/s41586-020-2961-x] [Citation(s) in RCA: 463] [Impact Index Per Article: 92.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2020] [Accepted: 09/09/2020] [Indexed: 12/24/2022]
Abstract
Advances in genomics have expedited the improvement of several agriculturally important crops but similar efforts in wheat (Triticum spp.) have been more challenging. This is largely owing to the size and complexity of the wheat genome1, and the lack of genome-assembly data for multiple wheat lines2,3. Here we generated ten chromosome pseudomolecule and five scaffold assemblies of hexaploid wheat to explore the genomic diversity among wheat lines from global breeding programs. Comparative analysis revealed extensive structural rearrangements, introgressions from wild relatives and differences in gene content resulting from complex breeding histories aimed at improving adaptation to diverse environments, grain yield and quality, and resistance to stresses4,5. We provide examples outlining the utility of these genomes, including a detailed multi-genome-derived nucleotide-binding leucine-rich repeat protein repertoire involved in disease resistance and the characterization of Sm16, a gene associated with insect resistance. These genome assemblies will provide a basis for functional gene discovery and breeding to deliver the next generation of modern wheat cultivars. Comparison of multiple genome assemblies from wheat reveals extensive diversity that results from the complex breeding history of wheat and provides a basis for further potential improvements to this important food crop.
Collapse
|
27
|
da Cruz MHP, Domingues DS, Saito PTM, Paschoal AR, Bugatti PH. TERL: classification of transposable elements by convolutional neural networks. Brief Bioinform 2020; 22:5900933. [PMID: 34020551 DOI: 10.1093/bib/bbaa185] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2020] [Revised: 07/07/2020] [Accepted: 07/20/2020] [Indexed: 11/12/2022] Open
Abstract
Transposable elements (TEs) are the most represented sequences occurring in eukaryotic genomes. Few methods provide the classification of these sequences into deeper levels, such as superfamily level, which could provide useful and detailed information about these sequences. Most methods that classify TE sequences use handcrafted features such as k-mers and homology-based search, which could be inefficient for classifying non-homologous sequences. Here we propose an approach, called transposable elements pepresentation learner (TERL), that preprocesses and transforms one-dimensional sequences into two-dimensional space data (i.e., image-like data of the sequences) and apply it to deep convolutional neural networks. This classification method tries to learn the best representation of the input data to classify it correctly. We have conducted six experiments to test the performance of TERL against other methods. Our approach obtained macro mean accuracies and F1-score of 96.4% and 85.8% for superfamilies and 95.7% and 91.5% for the order sequences from RepBase, respectively. We have also obtained macro mean accuracies and F1-score of 95.0% and 70.6% for sequences from seven databases into superfamily level and 89.3% and 73.9% for the order level, respectively. We surpassed accuracy, recall and specificity obtained by other methods on the experiment with the classification of order level sequences from seven databases and surpassed by far the time elapsed of any other method for all experiments. Therefore, TERL can learn how to predict any hierarchical level of the TEs classification system and is about 20 times and three orders of magnitude faster than TEclass and PASTEC, respectively https://github.com/muriloHoracio/TERL. Contact:murilocruz@alunos.utfpr.edu.br.
Collapse
Affiliation(s)
- Murilo Horacio Pereira da Cruz
- Federal University of Technology - Parana (UTFPR), Brazil.,Bioinformatics Graduation Program (PPGBIOINFO), Department of Computer Science, Federal University of Technology - Parana (UTFPR), Brazil
| | - Douglas Silva Domingues
- São Paulo State University at Botucatu, Brazil.,University of São Paulo, Brazil.,Department of Biodiversity, São Paulo State University at Rio Claro, Brazil
| | - Priscila Tiemi Maeda Saito
- Euripides Soares da Rocha University of Marilia, Brazil.,University of São Paulo (ICMC-USP), Brazil.,University of Campinas (IC-UNICAMP), Brazil.,Department of Computing, Federal University of Technology - Parana (UTFPR), Brazil
| | | | - Pedro Henrique Bugatti
- Euripides Soares da Rocha University of Marilia, Brazil.,University of São Paulo (ICMC-USP), Brazil.,Department of Computing, Federal University of Technology - Parana (UTFPR), Brazil
| |
Collapse
|
28
|
A chromosome-level genome assembly of the wild rice Oryza rufipogon facilitates tracing the origins of Asian cultivated rice. SCIENCE CHINA-LIFE SCIENCES 2020; 64:282-293. [PMID: 32737856 DOI: 10.1007/s11427-020-1738-x] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/05/2020] [Accepted: 05/19/2020] [Indexed: 01/15/2023]
Abstract
Oryza rufipogon Griff. is a wild progenitor of the Asian cultivated rice Oryza sativa. To better understand the genomic diversity of the wild rice, high-quality reference genomes of O. rufipogon populations are needed, which also facilitate utilization of the wild genetic resources in rice breeding. In this study, we generated a chromosome-level genome assembly of O. rufipogon using a combination of short-read sequencing, single-molecule sequencing, BioNano and Hi-C platforms. The genome sequence (399.8 Mb) was assembled into 46 scaffolds on the 12 chromosomes, with contig N50 and scaffold N50 of 13.2 Mb and 20.3 Mb, respectively. The genome contains 36,520 protein-coding genes, and 49.37% of the genome consists of repetitive elements. The genome has strong synteny with those of the O. sativa subspecies indica and japonica, but containing some large structural variations. Evolutionary analysis unveiled the polyphyletic origins of O. sativa, in which the japonica and indica genome formations involved different divergent O. rufipogon (including O. nivara) lineages, accompanied by introgression of genomic regions between japonica and indica. This high-quality reference genome provides insight on the genome evolution of the wild rice and the origins of the O. sativa subspecies, and valuable information for basic research and rice breeding.
Collapse
|
29
|
European maize genomes highlight intraspecies variation in repeat and gene content. Nat Genet 2020; 52:950-957. [PMID: 32719517 PMCID: PMC7467862 DOI: 10.1038/s41588-020-0671-9] [Citation(s) in RCA: 64] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2019] [Accepted: 06/25/2020] [Indexed: 12/22/2022]
Abstract
The diversity of maize (Zea mays) is the backbone of modern heterotic patterns and hybrid breeding. Historically, US farmers exploited this variability to establish today’s highly productive Corn Belt inbred lines from blends of dent and flint germplasm pools. Here, we report de novo genome sequences of four European flint lines assembled to pseudomolecules with scaffold N50 ranging from 6.1 to 10.4 Mb. Comparative analyses with two US Corn Belt lines explains the pronounced differences between both germplasms. While overall syntenic order and consolidated gene annotations reveal only moderate pangenomic differences, whole-genome alignments delineating the core and dispensable genome, and the analysis of heterochromatic knobs and orthologous long terminal repeat retrotransposons unveil the dynamics of the maize genome. The high-quality genome sequences of the flint pool complement the maize pangenome and provide an important tool to study maize improvement at a genome scale and to enhance modern hybrid breeding. De novo genome assemblies of four European flint maize lines and comparison with two US Corn Belt genomes provide insights into the dynamics of intraspecies variation in repeat and gene content in maize genomes.
Collapse
|
30
|
Zhang S, Shen S, Peng J, Zhou X, Kong X, Ren P, Liu F, Han L, Zhan S, Huang Y, Zhang A, Zhang Z. Chromosome‐level genome assembly of an important pine defoliator,
Dendrolimus punctatus
(Lepidoptera; Lasiocampidae). Mol Ecol Resour 2020; 20:1023-1037. [DOI: 10.1111/1755-0998.13169] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2019] [Revised: 02/29/2020] [Accepted: 04/09/2020] [Indexed: 12/22/2022]
Affiliation(s)
- Sufang Zhang
- Key Laboratory of Forest Protection of State Forestry and Grassland Administration, Research Institute of Forest Ecology, Environment and Protection Chinese Academy of Forestry Beijing China
| | - Sifan Shen
- Key Laboratory of Forest Protection of State Forestry and Grassland Administration, Research Institute of Forest Ecology, Environment and Protection Chinese Academy of Forestry Beijing China
| | - Jiong Peng
- Nextomics Biosciences Institute Wuhan China
| | - Xin Zhou
- Beijing Advanced Innovation Center for Food Nutrition and Human Health College of Plant Protection China Agricultural University Beijing China
| | - Xiangbo Kong
- Key Laboratory of Forest Protection of State Forestry and Grassland Administration, Research Institute of Forest Ecology, Environment and Protection Chinese Academy of Forestry Beijing China
| | | | - Fu Liu
- Key Laboratory of Forest Protection of State Forestry and Grassland Administration, Research Institute of Forest Ecology, Environment and Protection Chinese Academy of Forestry Beijing China
| | | | - Shuai Zhan
- CAS Key Laboratory of Insect Developmental and Evolutionary Biology, CAS Center for Excellence in Molecular Plant Sciences, Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences Chinese Academy of Sciences Shanghai China
| | - Yongping Huang
- CAS Key Laboratory of Insect Developmental and Evolutionary Biology, CAS Center for Excellence in Molecular Plant Sciences, Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences Chinese Academy of Sciences Shanghai China
| | - Aibing Zhang
- College of Life Sciences Capital Normal University Beijing China
| | - Zhen Zhang
- Key Laboratory of Forest Protection of State Forestry and Grassland Administration, Research Institute of Forest Ecology, Environment and Protection Chinese Academy of Forestry Beijing China
| |
Collapse
|
31
|
Rotasperti L, Sansoni F, Mizzotti C, Tadini L, Pesaresi P. Barley's Second Spring as A Model Organism for Chloroplast Research. PLANTS 2020; 9:plants9070803. [PMID: 32604986 PMCID: PMC7411767 DOI: 10.3390/plants9070803] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/10/2020] [Revised: 06/22/2020] [Accepted: 06/24/2020] [Indexed: 12/19/2022]
Abstract
Barley (Hordeum vulgare) has been widely used as a model crop for studying molecular and physiological processes such as chloroplast development and photosynthesis. During the second half of the 20th century, mutants such as albostrians led to the discovery of the nuclear-encoded, plastid-localized RNA polymerase and the retrograde (chloroplast-to-nucleus) signalling communication pathway, while chlorina-f2 and xantha mutants helped to shed light on the chlorophyll biosynthetic pathway, on the light-harvesting proteins and on the organization of the photosynthetic apparatus. However, during the last 30 years, a large fraction of chloroplast research has switched to the more “user-friendly” model species Arabidopsis thaliana, the first plant species whose genome was sequenced and published at the end of 2000. Despite its many advantages, Arabidopsis has some important limitations compared to barley, including the lack of a real canopy and the absence of the proplastid-to-chloroplast developmental gradient across the leaf blade. These features, together with the availability of large collections of natural genetic diversity and mutant populations for barley, a complete genome assembly and protocols for genetic transformation and gene editing, have relaunched barley as an ideal model species for chloroplast research. In this review, we provide an update on the genomics tools now available for barley, and review the biotechnological strategies reported to increase photosynthesis efficiency in model species, which deserve to be validated in barley.
Collapse
|
32
|
Measuring Performance Metrics of Machine Learning Algorithms for Detecting and Classifying Transposable Elements. Processes (Basel) 2020. [DOI: 10.3390/pr8060638] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Because of the promising results obtained by machine learning (ML) approaches in several fields, every day is more common, the utilization of ML to solve problems in bioinformatics. In genomics, a current issue is to detect and classify transposable elements (TEs) because of the tedious tasks involved in bioinformatics methods. Thus, ML was recently evaluated for TE datasets, demonstrating better results than bioinformatics applications. A crucial step for ML approaches is the selection of metrics that measure the realistic performance of algorithms. Each metric has specific characteristics and measures properties that may be different from the predicted results. Although the most commonly used way to compare measures is by using empirical analysis, a non-result-based methodology has been proposed, called measure invariance properties. These properties are calculated on the basis of whether a given measure changes its value under certain modifications in the confusion matrix, giving comparative parameters independent of the datasets. Measure invariance properties make metrics more or less informative, particularly on unbalanced, monomodal, or multimodal negative class datasets and for real or simulated datasets. Although several studies applied ML to detect and classify TEs, there are no works evaluating performance metrics in TE tasks. Here, we analyzed 26 different metrics utilized in binary, multiclass, and hierarchical classifications, through bibliographic sources, and their invariance properties. Then, we corroborated our findings utilizing freely available TE datasets and commonly used ML algorithms. Based on our analysis, the most suitable metrics for TE tasks must be stable, even using highly unbalanced datasets, multimodal negative class, and training datasets with errors or outliers. Based on these parameters, we conclude that the F1-score and the area under the precision-recall curve are the most informative metrics since they are calculated based on other metrics, providing insight into the development of an ML application.
Collapse
|
33
|
Yan H, Bombarely A, Li S. DeepTE: a computational method for de novo classification of transposons with convolutional neural network. Bioinformatics 2020; 36:4269-4275. [DOI: 10.1093/bioinformatics/btaa519] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2020] [Revised: 04/12/2020] [Accepted: 05/12/2020] [Indexed: 01/23/2023] Open
Abstract
Abstract
Motivation
Transposable elements (TEs) classification is an essential step to decode their roles in genome evolution. With a large number of genomes from non-model species becoming available, accurate and efficient TE classification has emerged as a new challenge in genomic sequence analysis.
Results
We developed a novel tool, DeepTE, which classifies unknown TEs using convolutional neural networks (CNNs). DeepTE transferred sequences into input vectors based on k-mer counts. A tree structured classification process was used where eight models were trained to classify TEs into super families and orders. DeepTE also detected domains inside TEs to correct false classification. An additional model was trained to distinguish between non-TEs and TEs in plants. Given unclassified TEs of different species, DeepTE can classify TEs into seven orders, which include 15, 24 and 16 super families in plants, metazoans and fungi, respectively. In several benchmarking tests, DeepTE outperformed other existing tools for TE classification. In conclusion, DeepTE successfully leverages CNN for TE classification, and can be used to precisely classify TEs in newly sequenced eukaryotic genomes.
Availability and implementation
DeepTE is accessible at https://github.com/LiLabAtVT/DeepTE.
Supplementary information
Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Haidong Yan
- School of Plant and Environmental Sciences (SPES), Virginia Tech, Blacksburg, VA 24061, USA
| | - Aureliano Bombarely
- School of Plant and Environmental Sciences (SPES), Virginia Tech, Blacksburg, VA 24061, USA
- Department of Life Sciences, University of Milan, Milan 20122, Italy
| | - Song Li
- School of Plant and Environmental Sciences (SPES), Virginia Tech, Blacksburg, VA 24061, USA
- Graduate Program in Genetics, Bioinformatics and Computational Biology (GBCB), Virginia Tech, Blacksburg, VA 24061, USA
| |
Collapse
|
34
|
De novo Genome Assembly of the indica Rice Variety IR64 Using Linked-Read Sequencing and Nanopore Sequencing. G3-GENES GENOMES GENETICS 2020; 10:1495-1501. [PMID: 32184372 PMCID: PMC7202035 DOI: 10.1534/g3.119.400871] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
IR64 is a rice variety with high-yield that has been widely cultivated around the world. IR64 has been replaced by modern varieties in most growing areas. Given that modern varieties are mostly progenies or relatives of IR64, genetic analysis of IR64 is valuable for rice functional genomics. However, chromosome-level genome sequences of IR64 have not been available previously. Here, we sequenced the IR64 genome using synthetic long reads obtained by linked-read sequencing and ultra-long reads obtained by nanopore sequencing. We integrated these data and generated the de novo assembly of the IR64 genome of 367 Mb, equivalent to 99% of the estimated size. Continuity of the IR64 genome assembly was improved compared with that of a publicly available IR64 genome assembly generated by short reads only. We annotated 41,458 protein-coding genes, including 657 IR64-specific genes, that are missing in other high-quality rice genome assemblies IRGSP-1.0 of japonica cultivar Nipponbare or R498 of indica cultivar Shuhui498. The IR64 genome assembly will serve as a genome resource for rice functional genomics as well as genomics-driven and/or molecular breeding.
Collapse
|
35
|
Chowdhury MR, Basak J, Bahadur RP. Elucidating the Functional Role of Predicted miRNAs in Post- Transcriptional Gene Regulation Along with Symbiosis in Medicago truncatula. Curr Bioinform 2020. [DOI: 10.2174/1574893614666191003114202] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
Background:
microRNAs are small non-coding RNAs which inhibit translational and
post-transcriptional processes whereas long non-coding RNAs are found to regulate both
transcriptional and post-transcriptional gene expression. Medicago truncatula is a well-known
model plant for studying legume biology and is also used as a forage crop. In spite of its
importance in nitrogen fixation and soil fertility improvement, little information is available about
Medicago non-coding RNAs that play important role in symbiosis.
Objective:
In this study we have tried to understand the role of Medicago ncRNAs in symbiosis
and regulation of transcription factors.
Methods:
We have identified novel miRNAs by computational methods considering various
parameters like length, MFEI, AU content, SSR signatures and tried to establish an interaction
model with their targets obtained through psRNATarget server.
Results:
149 novel miRNAs are predicted along with their 770 target proteins. We have also
shown that 51 of these novel miRNAs are targeting 282 lncRNAs.
Conclusion:
In this study role of Medicago miRNAs in the regulation of various transcription
factors are elucidated. Knowledge gained from this study will have a positive impact on the
nitrogen fixing ability of this important model plant, which in turn will improve the soil fertility.
Collapse
Affiliation(s)
- Moumita Roy Chowdhury
- Computational Structural Biology Lab, Department of Biotechnology, Indian Institute of Technology Kharagpur, Kharagpur-721302, India
| | - Jolly Basak
- Laboratory of Plant Stress Biology, Department of Biotechnology, Visva-Bharati, Santiniketan-731235, India
| | - Ranjit Prasad Bahadur
- Computational Structural Biology Lab, Department of Biotechnology, Indian Institute of Technology Kharagpur, Kharagpur-721302, India
| |
Collapse
|
36
|
Lansing H, Doering L, Fischer K, Baune MC, Schaewen AV. Analysis of potential redundancy among Arabidopsis 6-phosphogluconolactonase isoforms in peroxisomes. JOURNAL OF EXPERIMENTAL BOTANY 2020; 71:823-836. [PMID: 31641750 DOI: 10.1093/jxb/erz473] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/03/2019] [Accepted: 10/10/2019] [Indexed: 05/21/2023]
Abstract
Recent work revealed that PGD2, an Arabidopsis 6-phosphogluconate dehydrogenase (6-PGD) catalysing the third step of the oxidative pentose-phosphate pathway (OPPP) in peroxisomes, is essential during fertilization. Earlier studies on the second step, catalysed by PGL3, a dually targeted Arabidopsis 6-phosphogluconolactonase (6-PGL), reported the importance of OPPP reactions in plastids but their irrelevance in peroxisomes. Assuming redundancy of 6-PGL activity in peroxisomes, we examined the sequences of other higher plant enzymes. In tomato, there exist two 6-PGL isoforms with the strong PTS1 motif SKL. However, their analysis revealed problems regarding peroxisomal targeting: reporter-PGL detection in peroxisomes required construct modification, which was also applied to the Arabidopsis isoforms. The relative contribution of PGL3 versus PGL5 during fertilization was assessed by mutant crosses. Reduced transmission ratios were found for pgl3-1 (T-DNA-eliminated PTS1) and also for knock-out allele pgl5-2. The prominent role of PGL3 showed as compromised growth of pgl3-1 seedlings on sucrose and higher activity of mutant PGL3-1 versus PGL5 using purified recombinant proteins. Evidence for PTS1-independent uptake was found for PGL3-1 and other Arabidopsis PGL isoforms, indicating that peroxisome import may be supported by a piggybacking mechanism. Thus, multiple redundancy at the level of the second OPPP step in peroxisomes explains the occurrence of pgl3-1 mutant plants.
Collapse
Affiliation(s)
- Hannes Lansing
- Molekulare Physiologie der Pflanzen, Institut für Biologie und Biotechnologie der Pflanzen, Westfälische Wilhelms-Universität Münster, Schlossplatz 7, D-48149 Münster, Germany
| | - Lennart Doering
- Molekulare Physiologie der Pflanzen, Institut für Biologie und Biotechnologie der Pflanzen, Westfälische Wilhelms-Universität Münster, Schlossplatz 7, D-48149 Münster, Germany
| | - Kerstin Fischer
- Molekulare Physiologie der Pflanzen, Institut für Biologie und Biotechnologie der Pflanzen, Westfälische Wilhelms-Universität Münster, Schlossplatz 7, D-48149 Münster, Germany
| | - Marie-Christin Baune
- Molekulare Physiologie der Pflanzen, Institut für Biologie und Biotechnologie der Pflanzen, Westfälische Wilhelms-Universität Münster, Schlossplatz 7, D-48149 Münster, Germany
| | - Antje Von Schaewen
- Molekulare Physiologie der Pflanzen, Institut für Biologie und Biotechnologie der Pflanzen, Westfälische Wilhelms-Universität Münster, Schlossplatz 7, D-48149 Münster, Germany
| |
Collapse
|
37
|
Monat C, Padmarasu S, Lux T, Wicker T, Gundlach H, Himmelbach A, Ens J, Li C, Muehlbauer GJ, Schulman AH, Waugh R, Braumann I, Pozniak C, Scholz U, Mayer KFX, Spannagl M, Stein N, Mascher M. TRITEX: chromosome-scale sequence assembly of Triticeae genomes with open-source tools. Genome Biol 2019; 20:284. [PMID: 31849336 DOI: 10.1101/631648] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2019] [Accepted: 11/25/2019] [Indexed: 05/29/2023] Open
Abstract
Chromosome-scale genome sequence assemblies underpin pan-genomic studies. Recent genome assembly efforts in the large-genome Triticeae crops wheat and barley have relied on the commercial closed-source assembly algorithm DeNovoMagic. We present TRITEX, an open-source computational workflow that combines paired-end, mate-pair, 10X Genomics linked-read with chromosome conformation capture sequencing data to construct sequence scaffolds with megabase-scale contiguity ordered into chromosomal pseudomolecules. We evaluate the performance of TRITEX on publicly available sequence data of tetraploid wild emmer and hexaploid bread wheat, and construct an improved annotated reference genome sequence assembly of the barley cultivar Morex as a community resource.
Collapse
Affiliation(s)
- Cécile Monat
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
| | - Sudharsan Padmarasu
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
| | - Thomas Lux
- PGSB - Plant Genome and Systems Biology, Helmholtz Center Munich - German Research Center for Environmental Health, Neuherberg, Germany
| | - Thomas Wicker
- Department of Plant and Microbial Biology, University of Zurich, Zurich, Switzerland
| | - Heidrun Gundlach
- PGSB - Plant Genome and Systems Biology, Helmholtz Center Munich - German Research Center for Environmental Health, Neuherberg, Germany
| | - Axel Himmelbach
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
| | - Jennifer Ens
- Department of Plant Sciences, University of Saskatchewan, Saskatoon, Canada
| | - Chengdao Li
- Western Barley Genetics Alliance, School of Veterinary and Life Sciences (VLS), Murdoch University, Murdoch, WA, Australia
- Hubei Collaborative Innovation Center for Grain Industry/School of Agriculture, Yangtze University, Jingzhou, China
| | - Gary J Muehlbauer
- Department of Agronomy and Plant Genetics & Department of Plant and Microbial Biology, University of Minnesota, St. Paul, MN, USA
| | - Alan H Schulman
- Green Technology, Natural Resources Institute (Luke), Viikki Plant Science Centre, and Institute of Biotechnology, University of Helsinki, Helsinki, Finland
| | - Robbie Waugh
- The James Hutton Institute, Dundee, UK
- School of Life Sciences, University of Dundee, Dundee, UK
| | | | - Curtis Pozniak
- Department of Plant Sciences, University of Saskatchewan, Saskatoon, Canada
| | - Uwe Scholz
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
| | - Klaus F X Mayer
- PGSB - Plant Genome and Systems Biology, Helmholtz Center Munich - German Research Center for Environmental Health, Neuherberg, Germany
- School of Life Sciences Weihenstephan, Technical University of Munich, Munich, Germany
| | - Manuel Spannagl
- PGSB - Plant Genome and Systems Biology, Helmholtz Center Munich - German Research Center for Environmental Health, Neuherberg, Germany
| | - Nils Stein
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany.
- Department of Crop Sciences, Center for Integrated Breeding Research (CiBreed), Georg-August-University Göttingen, Göttingen, Germany.
| | - Martin Mascher
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany.
- German Centre for Integrative Biodiversity Research (iDiv) Halle-Jena-Leipzig, Leipzig, Germany.
| |
Collapse
|
38
|
Monat C, Padmarasu S, Lux T, Wicker T, Gundlach H, Himmelbach A, Ens J, Li C, Muehlbauer GJ, Schulman AH, Waugh R, Braumann I, Pozniak C, Scholz U, Mayer KFX, Spannagl M, Stein N, Mascher M. TRITEX: chromosome-scale sequence assembly of Triticeae genomes with open-source tools. Genome Biol 2019; 20:284. [PMID: 31849336 PMCID: PMC6918601 DOI: 10.1186/s13059-019-1899-5] [Citation(s) in RCA: 140] [Impact Index Per Article: 23.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2019] [Accepted: 11/25/2019] [Indexed: 11/24/2022] Open
Abstract
Chromosome-scale genome sequence assemblies underpin pan-genomic studies. Recent genome assembly efforts in the large-genome Triticeae crops wheat and barley have relied on the commercial closed-source assembly algorithm DeNovoMagic. We present TRITEX, an open-source computational workflow that combines paired-end, mate-pair, 10X Genomics linked-read with chromosome conformation capture sequencing data to construct sequence scaffolds with megabase-scale contiguity ordered into chromosomal pseudomolecules. We evaluate the performance of TRITEX on publicly available sequence data of tetraploid wild emmer and hexaploid bread wheat, and construct an improved annotated reference genome sequence assembly of the barley cultivar Morex as a community resource.
Collapse
Affiliation(s)
- Cécile Monat
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
| | - Sudharsan Padmarasu
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
| | - Thomas Lux
- PGSB - Plant Genome and Systems Biology, Helmholtz Center Munich - German Research Center for Environmental Health, Neuherberg, Germany
| | - Thomas Wicker
- Department of Plant and Microbial Biology, University of Zurich, Zurich, Switzerland
| | - Heidrun Gundlach
- PGSB - Plant Genome and Systems Biology, Helmholtz Center Munich - German Research Center for Environmental Health, Neuherberg, Germany
| | - Axel Himmelbach
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
| | - Jennifer Ens
- Department of Plant Sciences, University of Saskatchewan, Saskatoon, Canada
| | - Chengdao Li
- Western Barley Genetics Alliance, School of Veterinary and Life Sciences (VLS), Murdoch University, Murdoch, WA, Australia
- Hubei Collaborative Innovation Center for Grain Industry/School of Agriculture, Yangtze University, Jingzhou, China
| | - Gary J Muehlbauer
- Department of Agronomy and Plant Genetics & Department of Plant and Microbial Biology, University of Minnesota, St. Paul, MN, USA
| | - Alan H Schulman
- Green Technology, Natural Resources Institute (Luke), Viikki Plant Science Centre, and Institute of Biotechnology, University of Helsinki, Helsinki, Finland
| | - Robbie Waugh
- The James Hutton Institute, Dundee, UK
- School of Life Sciences, University of Dundee, Dundee, UK
| | | | - Curtis Pozniak
- Department of Plant Sciences, University of Saskatchewan, Saskatoon, Canada
| | - Uwe Scholz
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
| | - Klaus F X Mayer
- PGSB - Plant Genome and Systems Biology, Helmholtz Center Munich - German Research Center for Environmental Health, Neuherberg, Germany
- School of Life Sciences Weihenstephan, Technical University of Munich, Munich, Germany
| | - Manuel Spannagl
- PGSB - Plant Genome and Systems Biology, Helmholtz Center Munich - German Research Center for Environmental Health, Neuherberg, Germany
| | - Nils Stein
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany.
- Department of Crop Sciences, Center for Integrated Breeding Research (CiBreed), Georg-August-University Göttingen, Göttingen, Germany.
| | - Martin Mascher
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany.
- German Centre for Integrative Biodiversity Research (iDiv) Halle-Jena-Leipzig, Leipzig, Germany.
| |
Collapse
|
39
|
da Cruz MHP, Saito PTM, Paschoal AR, Bugatti PH. Classification of Transposable Elements by Convolutional Neural Networks. ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING 2019. [DOI: 10.1007/978-3-030-20915-5_15] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
|
40
|
|
41
|
Alaux M, Rogers J, Letellier T, Flores R, Alfama F, Pommier C, Mohellibi N, Durand S, Kimmel E, Michotey C, Guerche C, Loaec M, Lainé M, Steinbach D, Choulet F, Rimbert H, Leroy P, Guilhot N, Salse J, Feuillet C, Paux E, Eversole K, Adam-Blondon AF, Quesneville H. Linking the International Wheat Genome Sequencing Consortium bread wheat reference genome sequence to wheat genetic and phenomic data. Genome Biol 2018; 19:111. [PMID: 30115101 PMCID: PMC6097284 DOI: 10.1186/s13059-018-1491-4] [Citation(s) in RCA: 136] [Impact Index Per Article: 19.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2018] [Accepted: 07/23/2018] [Indexed: 01/24/2023] Open
Abstract
The Wheat@URGI portal has been developed to provide the international community of researchers and breeders with access to the bread wheat reference genome sequence produced by the International Wheat Genome Sequencing Consortium. Genome browsers, BLAST, and InterMine tools have been established for in-depth exploration of the genome sequence together with additional linked datasets including physical maps, sequence variations, gene expression, and genetic and phenomic data from other international collaborative projects already stored in the GnpIS information system. The portal provides enhanced search and browser features that will facilitate the deployment of the latest genomics resources in wheat improvement.
Collapse
Affiliation(s)
- Michael Alaux
- URGI, INRA, Université Paris-Saclay, 78026, Versailles, France.
| | - Jane Rogers
- International Wheat Genome Sequencing Consortium (IWGSC), 18 High Street, Little Eversden, Cambridge, CB23 1HE, UK
| | | | - Raphaël Flores
- URGI, INRA, Université Paris-Saclay, 78026, Versailles, France
| | | | - Cyril Pommier
- URGI, INRA, Université Paris-Saclay, 78026, Versailles, France
| | - Nacer Mohellibi
- URGI, INRA, Université Paris-Saclay, 78026, Versailles, France
| | - Sophie Durand
- URGI, INRA, Université Paris-Saclay, 78026, Versailles, France
| | - Erik Kimmel
- URGI, INRA, Université Paris-Saclay, 78026, Versailles, France
| | - Célia Michotey
- URGI, INRA, Université Paris-Saclay, 78026, Versailles, France
| | - Claire Guerche
- URGI, INRA, Université Paris-Saclay, 78026, Versailles, France
| | - Mikaël Loaec
- URGI, INRA, Université Paris-Saclay, 78026, Versailles, France
| | - Mathilde Lainé
- URGI, INRA, Université Paris-Saclay, 78026, Versailles, France
| | - Delphine Steinbach
- URGI, INRA, Université Paris-Saclay, 78026, Versailles, France
- Present address: GQE-Le Moulon UMR 320, INRA, Université Paris-Sud, Université Paris-Saclay, CNRS, AgroParisTech, Ferme du Moulon, 91190, Gif-sur-Yvette, France
| | - Frédéric Choulet
- GDEC, INRA, Université Clermont Auvergne, 63000, Clermont-Ferrand, France
| | - Hélène Rimbert
- GDEC, INRA, Université Clermont Auvergne, 63000, Clermont-Ferrand, France
| | - Philippe Leroy
- GDEC, INRA, Université Clermont Auvergne, 63000, Clermont-Ferrand, France
| | - Nicolas Guilhot
- GDEC, INRA, Université Clermont Auvergne, 63000, Clermont-Ferrand, France
| | - Jérôme Salse
- GDEC, INRA, Université Clermont Auvergne, 63000, Clermont-Ferrand, France
| | - Catherine Feuillet
- GDEC, INRA, Université Clermont Auvergne, 63000, Clermont-Ferrand, France
- Present address: Inari Agriculture, 200 Sydney Street, Cambridge, MA, 02139, USA
| | - Etienne Paux
- GDEC, INRA, Université Clermont Auvergne, 63000, Clermont-Ferrand, France
| | - Kellye Eversole
- International Wheat Genome Sequencing Consortium (IWGSC), 5207 Wyoming Road, Bethesda, Maryland, 20816, USA
| | | | | |
Collapse
|
42
|
Hussain M, Iqbal MA, Till BJ, Rahman MU. Identification of induced mutations in hexaploid wheat genome using exome capture assay. PLoS One 2018; 13:e0201918. [PMID: 30102729 PMCID: PMC6089429 DOI: 10.1371/journal.pone.0201918] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2018] [Accepted: 07/24/2018] [Indexed: 12/02/2022] Open
Abstract
Wheat is a staple food crop of many countries. Improving resilience to biotic and abiotic stresses remain key breeding targets. Among these, rust diseases are the most detrimental in terms of depressing wheat production. In the present study, chemical mutagenesis was used to induce mutations in the wheat variety NN-Gandum-1. This cultivar is moderately resistant to leaf and yellow rust. The aim of mutagenesis was to improve resistance to the disease as well as to study function of genes conferring resistance to the disease. In the present investigation, a 0.8% EMS dose was found optimum for supporting 45-55% germination of NN-Gandum-1. A total of 3,634 M2 fertile plants were produced from each of the M1 plant. Out of these, 33 (0.91%) and 20 plants (0.55%) showed absolute resistance to leaf and yellow rust, respectively. While 126 (3.46%) and 127 plants (3.49%) exhibited high susceptibility to the leaf and yellow rust, respectively. In the M4 generation, a total of 11 M4 lines (nine absolute resistant and two highly susceptible) and one wild type were selected for NGS-based exome capture assay. A total of 104,779 SNPs were identified that were randomly distributed throughout the wheat sub genomes (A, B and D). Induced mutations in intronic sequences predominated. The highest total number of SNPs detected in this assay were mapped to chr.2B (14,273 SNPs), which contains the highest number of targeted base pairs in the assay. The average mutation density across all regions interrogated was estimated to be one mutation per 20.91 Mb. The highest mutation frequency was found in chr.2D (1/11.7 kb) and the lowest in chr.7D (1/353.4 kb). Out of the detected mutations, 101 SNPs were filtered using analysis criteria aimed to enrich for mutations that may affect gene function. Out of these, one putative SNP detected in Lr21 were selected for further analysis. The SNP identified in chimeric allele (Lr21) of a resistant mutant (N1-252) was located in a NBS domain of chr.1BS at 3.4 Mb position. Through computational analysis, it was demonstrated that this identified SNP causes a substitution of glutamic acid with alanine, resulting in a predicted altered protein structure. This mutation, therefore, is a candidate for contributing to the resistance phenotype in the mutant line. Based on this work, we conclude that the wheat mutant resource developed is useful as a source of novel genetic variation for forward-genetic screens and also as a useful tool for gaining insights into the important biological circuits of different traits of complex genomes like wheat.
Collapse
Affiliation(s)
- Momina Hussain
- Plant Genomics & Mol. Breeding Lab, National Institute for Biotechnology & Genetic Engineering (NIBGE), Faisalabad, Pakistan
- Department of Biotechnology, Pakistan Institute of Engineering and Applied Sciences (PIEAS), Nilore, Islamabad, Pakistan
| | - Muhammad Atif Iqbal
- Plant Genomics & Mol. Breeding Lab, National Institute for Biotechnology & Genetic Engineering (NIBGE), Faisalabad, Pakistan
- Department of Biotechnology, Pakistan Institute of Engineering and Applied Sciences (PIEAS), Nilore, Islamabad, Pakistan
| | - Bradley J. Till
- University of Vienna, Department fürChromosomenbiologie, Vienna, Austria
| | - Mehboob-ur- Rahman
- Plant Genomics & Mol. Breeding Lab, National Institute for Biotechnology & Genetic Engineering (NIBGE), Faisalabad, Pakistan
- Department of Biotechnology, Pakistan Institute of Engineering and Applied Sciences (PIEAS), Nilore, Islamabad, Pakistan
| |
Collapse
|
43
|
Vogel A, Schwacke R, Denton AK, Usadel B, Hollmann J, Fischer K, Bolger A, Schmidt MHW, Bolger ME, Gundlach H, Mayer KFX, Weiss-Schneeweiss H, Temsch EM, Krause K. Footprints of parasitism in the genome of the parasitic flowering plant Cuscuta campestris. Nat Commun 2018; 9:2515. [PMID: 29955043 PMCID: PMC6023873 DOI: 10.1038/s41467-018-04344-z] [Citation(s) in RCA: 113] [Impact Index Per Article: 16.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2017] [Accepted: 04/23/2018] [Indexed: 11/09/2022] Open
Abstract
A parasitic lifestyle, where plants procure some or all of their nutrients from other living plants, has evolved independently in many dicotyledonous plant families and is a major threat for agriculture globally. Nevertheless, no genome sequence of a parasitic plant has been reported to date. Here we describe the genome sequence of the parasitic field dodder, Cuscuta campestris. The genome contains signatures of a fairly recent whole-genome duplication and lacks genes for pathways superfluous to a parasitic lifestyle. Specifically, genes needed for high photosynthetic activity are lost, explaining the low photosynthesis rates displayed by the parasite. Moreover, several genes involved in nutrient uptake processes from the soil are lost. On the other hand, evidence for horizontal gene transfer by way of genomic DNA integration from the parasite's hosts is found. We conclude that the parasitic lifestyle has left characteristic footprints in the C. campestris genome.
Collapse
Affiliation(s)
- Alexander Vogel
- Institute for Botany and Molecular Genetics, BioEconomy Science Center, Worringer Weg 3, RWTH Aachen University, Aachen, 52074, Germany
| | - Rainer Schwacke
- Institute for Bio- and Geosciences (IBG-2: Plant Sciences), Forschungszentrum Jülich, Wilhelm Johnen Straße, Jülich, 52428, Germany
| | - Alisandra K Denton
- Institute for Botany and Molecular Genetics, BioEconomy Science Center, Worringer Weg 3, RWTH Aachen University, Aachen, 52074, Germany.,Institute of Plant Biochemistry, Heinrich Heine University Düsseldorf, Universitätsstraße 1, Düsseldorf, 40225, Germany
| | - Björn Usadel
- Institute for Botany and Molecular Genetics, BioEconomy Science Center, Worringer Weg 3, RWTH Aachen University, Aachen, 52074, Germany.,Institute for Bio- and Geosciences (IBG-2: Plant Sciences), Forschungszentrum Jülich, Wilhelm Johnen Straße, Jülich, 52428, Germany
| | - Julien Hollmann
- Department of Arctic and Marine Biology, UiT The Arctic University of Norway, Biologibygget, Framstredet 39, Tromsø, 9037, Norway
| | - Karsten Fischer
- Department of Arctic and Marine Biology, UiT The Arctic University of Norway, Biologibygget, Framstredet 39, Tromsø, 9037, Norway
| | - Anthony Bolger
- Institute for Botany and Molecular Genetics, BioEconomy Science Center, Worringer Weg 3, RWTH Aachen University, Aachen, 52074, Germany
| | - Maximilian H-W Schmidt
- Institute for Botany and Molecular Genetics, BioEconomy Science Center, Worringer Weg 3, RWTH Aachen University, Aachen, 52074, Germany
| | - Marie E Bolger
- Institute for Bio- and Geosciences (IBG-2: Plant Sciences), Forschungszentrum Jülich, Wilhelm Johnen Straße, Jülich, 52428, Germany
| | - Heidrun Gundlach
- Helmholtz Zentrum München (HMGU), Plant Genome and Systems Biology (PGSB), Ingolstädter Landstraße 1, Neuherberg, 85764, Germany
| | - Klaus F X Mayer
- Helmholtz Zentrum München (HMGU), Plant Genome and Systems Biology (PGSB), Ingolstädter Landstraße 1, Neuherberg, 85764, Germany.,Technical University Munich, School of Life Sciences Weihenstephan, Alte Akademie 8, Freising, 85354, Germany
| | - Hanna Weiss-Schneeweiss
- Department of Botany and Biodiversity Research, Faculty Center Biodiversity, University of Vienna, Rennweg 14, Vienna, 1030, Austria
| | - Eva M Temsch
- Department of Botany and Biodiversity Research, Faculty Center Biodiversity, University of Vienna, Rennweg 14, Vienna, 1030, Austria
| | - Kirsten Krause
- Department of Arctic and Marine Biology, UiT The Arctic University of Norway, Biologibygget, Framstredet 39, Tromsø, 9037, Norway.
| |
Collapse
|
44
|
Liao P, Li S, Cui X, Zheng Y. A comprehensive review of web-based resources of non-coding RNAs for plant science research. Int J Biol Sci 2018; 14:819-832. [PMID: 29989090 PMCID: PMC6036741 DOI: 10.7150/ijbs.24593] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2017] [Accepted: 03/14/2018] [Indexed: 01/06/2023] Open
Abstract
Non-coding RNAs (ncRNAs) are transcribed from genome but not translated into proteins. Many ncRNAs are key regulators of plants growth and development, metabolism and stress tolerance. In order to make the web-based ncRNA resources for plant science research be more easily accessible and understandable, we made a comprehensive review for 83 web-based resources of three types, including genome databases containing ncRNA data, microRNA (miRNA) databases and long non-coding RNA (lncRNA) databases. To facilitate effective usage of these resources, we also suggested some preferred resources of miRNAs and lncRNAs for performing meaningful analysis.
Collapse
Affiliation(s)
- Peiran Liao
- Faculty of Life Science and Technology, Kunming University of Science and Technology, Kunming, Yunnan, 650500,China
| | - Shipeng Li
- Faculty of Life Science and Technology, Kunming University of Science and Technology, Kunming, Yunnan, 650500,China
| | - Xiuming Cui
- Faculty of Life Science and Technology, Kunming University of Science and Technology, Kunming, Yunnan, 650500,China
- Yunnan key laboratory of Panax notoginseng, Kunming, Yunnan, 650500, China
| | - Yun Zheng
- Yunnan Key Laboratory of Primate Biomedical Research, Institute of Primate Translational Medicine, Kunming University of Science and Technology, Kunming, Yunnan, 650500, China
| |
Collapse
|
45
|
Lang D, Ullrich KK, Murat F, Fuchs J, Jenkins J, Haas FB, Piednoel M, Gundlach H, Van Bel M, Meyberg R, Vives C, Morata J, Symeonidi A, Hiss M, Muchero W, Kamisugi Y, Saleh O, Blanc G, Decker EL, van Gessel N, Grimwood J, Hayes RD, Graham SW, Gunter LE, McDaniel SF, Hoernstein SNW, Larsson A, Li FW, Perroud PF, Phillips J, Ranjan P, Rokshar DS, Rothfels CJ, Schneider L, Shu S, Stevenson DW, Thümmler F, Tillich M, Villarreal Aguilar JC, Widiez T, Wong GKS, Wymore A, Zhang Y, Zimmer AD, Quatrano RS, Mayer KFX, Goodstein D, Casacuberta JM, Vandepoele K, Reski R, Cuming AC, Tuskan GA, Maumus F, Salse J, Schmutz J, Rensing SA. The Physcomitrella patens chromosome-scale assembly reveals moss genome structure and evolution. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2018; 93:515-533. [PMID: 29237241 DOI: 10.1111/tpj.13801] [Citation(s) in RCA: 279] [Impact Index Per Article: 39.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/11/2017] [Revised: 11/20/2017] [Accepted: 11/24/2017] [Indexed: 05/18/2023]
Abstract
The draft genome of the moss model, Physcomitrella patens, comprised approximately 2000 unordered scaffolds. In order to enable analyses of genome structure and evolution we generated a chromosome-scale genome assembly using genetic linkage as well as (end) sequencing of long DNA fragments. We find that 57% of the genome comprises transposable elements (TEs), some of which may be actively transposing during the life cycle. Unlike in flowering plant genomes, gene- and TE-rich regions show an overall even distribution along the chromosomes. However, the chromosomes are mono-centric with peaks of a class of Copia elements potentially coinciding with centromeres. Gene body methylation is evident in 5.7% of the protein-coding genes, typically coinciding with low GC and low expression. Some giant virus insertions are transcriptionally active and might protect gametes from viral infection via siRNA mediated silencing. Structure-based detection methods show that the genome evolved via two rounds of whole genome duplications (WGDs), apparently common in mosses but not in liverworts and hornworts. Several hundred genes are present in colinear regions conserved since the last common ancestor of plants. These syntenic regions are enriched for functions related to plant-specific cell growth and tissue organization. The P. patens genome lacks the TE-rich pericentromeric and gene-rich distal regions typical for most flowering plant genomes. More non-seed plant genomes are needed to unravel how plant genomes evolve, and to understand whether the P. patens genome structure is typical for mosses or bryophytes.
Collapse
Affiliation(s)
- Daniel Lang
- Plant Biotechnology, Faculty of Biology, University of Freiburg, Schaenzlestr. 1, 79104, Freiburg, Germany
- Plant Genome and Systems Biology, Helmholtz Center Munich, 85764, Neuherberg, Germany
| | - Kristian K Ullrich
- Plant Cell Biology, Faculty of Biology, University of Marburg, Marburg, Germany
| | - Florent Murat
- INRA, UMR 1095 Genetics, Diversity and Ecophysiology of Cereals (GDEC), 5 Chemin de Beaulieu, 63100, Clermont-Ferrand, France
| | - Jörg Fuchs
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Corrensstrasse 3, OT Gatersleben, D-06466, Stadt Seeland, Germany
| | - Jerry Jenkins
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, USA
| | - Fabian B Haas
- Plant Cell Biology, Faculty of Biology, University of Marburg, Marburg, Germany
| | - Mathieu Piednoel
- Department of Plant Developmental Biology, Max Planck Institute for Plant Breeding Research, Carl-von-Linné Weg 10, D-50829, Cologne, Germany
| | - Heidrun Gundlach
- Plant Genome and Systems Biology, Helmholtz Center Munich, 85764, Neuherberg, Germany
| | - Michiel Van Bel
- VIB Center for Plant Systems Biology, Technologiepark 927, 9052, Ghent, Belgium
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 927, B-9052, Gent, Belgium
| | - Rabea Meyberg
- Plant Cell Biology, Faculty of Biology, University of Marburg, Marburg, Germany
| | - Cristina Vives
- Center for Research in Agricultural Genomics, CRAG (CSIC-IRTA-UAB-UB), Campus UAB, Bellaterra, Cerdanyola del Vallès, 08193, Barcelona, Spain
| | - Jordi Morata
- Center for Research in Agricultural Genomics, CRAG (CSIC-IRTA-UAB-UB), Campus UAB, Bellaterra, Cerdanyola del Vallès, 08193, Barcelona, Spain
| | | | - Manuel Hiss
- Plant Cell Biology, Faculty of Biology, University of Marburg, Marburg, Germany
| | - Wellington Muchero
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN, 37831, USA
| | - Yasuko Kamisugi
- Centre for Plant Sciences, Faculty of Biological Sciences, University of Leeds, Leeds, LS2 9JT, UK
| | - Omar Saleh
- Plant Biotechnology, Faculty of Biology, University of Freiburg, Schaenzlestr. 1, 79104, Freiburg, Germany
| | - Guillaume Blanc
- Structural and Genomic Information Laboratory (IGS), Aix-Marseille Université, CNRS, UMR 7256 (IMM FR 3479), Marseille, France
| | - Eva L Decker
- Plant Biotechnology, Faculty of Biology, University of Freiburg, Schaenzlestr. 1, 79104, Freiburg, Germany
| | - Nico van Gessel
- Plant Biotechnology, Faculty of Biology, University of Freiburg, Schaenzlestr. 1, 79104, Freiburg, Germany
| | - Jane Grimwood
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, USA
- DOE Joint Genome Institute, Walnut Creek, CA, 94598, USA
| | | | - Sean W Graham
- Department of Botany, University of British Columbia, Vancouver, BC, V6T 1Z4, Canada
| | - Lee E Gunter
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN, 37831, USA
| | - Stuart F McDaniel
- Department of Biology, University of Florida, Gainesville, FL, 32611, USA
| | - Sebastian N W Hoernstein
- Plant Biotechnology, Faculty of Biology, University of Freiburg, Schaenzlestr. 1, 79104, Freiburg, Germany
| | - Anders Larsson
- Department of Organismal Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
| | - Fay-Wei Li
- Boyce Thompson Institute, Ithaca, NY, 14853, USA
| | | | | | - Priya Ranjan
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN, 37831, USA
| | - Daniel S Rokshar
- DOE Joint Genome Institute, Walnut Creek, CA, 94598, USA
- Department of Molecular and Cell Biology, University of California, Berkeley, CA, 94720, USA
| | - Carl J Rothfels
- University Herbarium and Department of Integrative Biology, University of California, Berkeley, CA, 94720-2465, USA
| | - Lucas Schneider
- Plant Cell Biology, Faculty of Biology, University of Marburg, Marburg, Germany
| | - Shengqiang Shu
- DOE Joint Genome Institute, Walnut Creek, CA, 94598, USA
| | | | - Fritz Thümmler
- Vertis Biotechnologie AG, Lise-Meitner-Str. 30, 85354, Freising, Germany
| | - Michael Tillich
- Max Planck Institute of Molecular Plant Physiology, Am Muehlenberg 1, 14476, Potsdam-Golm, Germany
| | | | - Thomas Widiez
- Department of Plant Biology, University of Geneva, Sciences III, Geneva 4, CH-1211, Switzerland
- Department of Plant Biology & Pathology Rutgers, The State University of New Jersey, New Brunswick, NJ, 08901, USA
| | - Gane Ka-Shu Wong
- Department of Biological Sciences, University of Alberta, Edmonton, AB, T6G 2E9, Canada
- Department of Medicine, University of Alberta, Edmonton, AB, T6G 2E1, Canada
- BGI-Shenzhen, Beishan Industrial Zone, Yantian District, Shenzhen, 518083, China
| | - Ann Wymore
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN, 37831, USA
| | - Yong Zhang
- Shenzhen Huahan Gene Life Technology Co. Ltd, Shenzhen, China
| | - Andreas D Zimmer
- Plant Biotechnology, Faculty of Biology, University of Freiburg, Schaenzlestr. 1, 79104, Freiburg, Germany
| | - Ralph S Quatrano
- Department of Biology, Washington University, St. Louis, MO, USA
| | - Klaus F X Mayer
- Plant Genome and Systems Biology, Helmholtz Center Munich, 85764, Neuherberg, Germany
- WZW, Technical University Munich, Munich, Germany
| | | | - Josep M Casacuberta
- Center for Research in Agricultural Genomics, CRAG (CSIC-IRTA-UAB-UB), Campus UAB, Bellaterra, Cerdanyola del Vallès, 08193, Barcelona, Spain
| | - Klaas Vandepoele
- VIB Center for Plant Systems Biology, Technologiepark 927, 9052, Ghent, Belgium
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 927, B-9052, Gent, Belgium
| | - Ralf Reski
- Plant Biotechnology, Faculty of Biology, University of Freiburg, Schaenzlestr. 1, 79104, Freiburg, Germany
- BIOSS Centre for Biological Signalling Studies, University of Freiburg, Schaenzlestr. 18, 79104, Freiburg, Germany
| | - Andrew C Cuming
- Centre for Plant Sciences, Faculty of Biological Sciences, University of Leeds, Leeds, LS2 9JT, UK
| | - Gerald A Tuskan
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN, 37831, USA
| | - Florian Maumus
- URGI, INRA, Université Paris-Saclay, 78026, Versailles, France
| | - Jérome Salse
- INRA, UMR 1095 Genetics, Diversity and Ecophysiology of Cereals (GDEC), 5 Chemin de Beaulieu, 63100, Clermont-Ferrand, France
| | - Jeremy Schmutz
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, USA
- DOE Joint Genome Institute, Walnut Creek, CA, 94598, USA
| | - Stefan A Rensing
- Plant Cell Biology, Faculty of Biology, University of Marburg, Marburg, Germany
- BIOSS Centre for Biological Signalling Studies, University of Freiburg, Schaenzlestr. 18, 79104, Freiburg, Germany
| |
Collapse
|
46
|
Avni R, Nave M, Barad O, Baruch K, Twardziok SO, Gundlach H, Hale I, Mascher M, Spannagl M, Wiebe K, Jordan KW, Golan G, Deek J, Ben-Zvi B, Ben-Zvi G, Himmelbach A, MacLachlan RP, Sharpe AG, Fritz A, Ben-David R, Budak H, Fahima T, Korol A, Faris JD, Hernandez A, Mikel MA, Levy AA, Steffenson B, Maccaferri M, Tuberosa R, Cattivelli L, Faccioli P, Ceriotti A, Kashkush K, Pourkheirandish M, Komatsuda T, Eilam T, Sela H, Sharon A, Ohad N, Chamovitz DA, Mayer KFX, Stein N, Ronen G, Peleg Z, Pozniak CJ, Akhunov ED, Distelfeld A. Wild emmer genome architecture and diversity elucidate wheat evolution and domestication. Science 2018; 357:93-97. [PMID: 28684525 DOI: 10.1126/science.aan0032] [Citation(s) in RCA: 502] [Impact Index Per Article: 71.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2017] [Accepted: 06/08/2017] [Indexed: 01/07/2023]
Abstract
Wheat (Triticum spp.) is one of the founder crops that likely drove the Neolithic transition to sedentary agrarian societies in the Fertile Crescent more than 10,000 years ago. Identifying genetic modifications underlying wheat's domestication requires knowledge about the genome of its allo-tetraploid progenitor, wild emmer (T. turgidum ssp. dicoccoides). We report a 10.1-gigabase assembly of the 14 chromosomes of wild tetraploid wheat, as well as analyses of gene content, genome architecture, and genetic diversity. With this fully assembled polyploid wheat genome, we identified the causal mutations in Brittle Rachis 1 (TtBtr1) genes controlling shattering, a key domestication trait. A study of genomic diversity among wild and domesticated accessions revealed genomic regions bearing the signature of selection under domestication. This reference assembly will serve as a resource for accelerating the genome-assisted improvement of modern wheat varieties.
Collapse
Affiliation(s)
- Raz Avni
- School of Plant Sciences and Food Security, Tel Aviv University, Tel Aviv, Israel
| | - Moran Nave
- School of Plant Sciences and Food Security, Tel Aviv University, Tel Aviv, Israel
| | | | | | - Sven O Twardziok
- Helmholtz Zentrum München, Plant Genome and Systems Biology, Neuherberg, Germany
| | - Heidrun Gundlach
- Helmholtz Zentrum München, Plant Genome and Systems Biology, Neuherberg, Germany
| | - Iago Hale
- University of New Hampshire, Durham, NH, USA
| | - Martin Mascher
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), OT Gatersleben, Stadt Seeland, Germany. .,German Centre for Integrative Biodiversity Research (iDiv) Halle-Jena-Leipzig, Leipzig, Germany
| | - Manuel Spannagl
- Helmholtz Zentrum München, Plant Genome and Systems Biology, Neuherberg, Germany
| | | | | | - Guy Golan
- The Hebrew University of Jerusalem, Rehovot, Israel
| | - Jasline Deek
- School of Plant Sciences and Food Security, Tel Aviv University, Tel Aviv, Israel
| | - Batsheva Ben-Zvi
- School of Plant Sciences and Food Security, Tel Aviv University, Tel Aviv, Israel
| | | | - Axel Himmelbach
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), OT Gatersleben, Stadt Seeland, Germany
| | | | - Andrew G Sharpe
- University of Saskatchewan, Saskatoon, Canada.,Global Institute for Food Security, Saskatoon, SK, Canada
| | | | - Roi Ben-David
- Agricultural Research Organization (ARO), Bet Dagan, Israel
| | | | | | | | - Justin D Faris
- U.S. Department of Agriculture (USDA)-Agricultural Research Service, Fargo, ND, USA
| | | | | | | | | | | | | | - Luigi Cattivelli
- Council of Agricultural Research and Economics-Research Centre for Genomics and Bioinformatics, Fiorenzuola d'Arda, Italy
| | - Primetta Faccioli
- Council of Agricultural Research and Economics-Research Centre for Genomics and Bioinformatics, Fiorenzuola d'Arda, Italy
| | - Aldo Ceriotti
- CNR-National Research Council, Institute of Agricultural Biology and Biotechnology, Milano, Italy
| | | | | | - Takao Komatsuda
- National Institute of Agrobiological Sciences, Tsukuba, Japan
| | - Tamar Eilam
- The Institute for Cereal Crops Improvement, Tel Aviv University, Tel Aviv, Israel
| | - Hanan Sela
- The Institute for Cereal Crops Improvement, Tel Aviv University, Tel Aviv, Israel
| | - Amir Sharon
- School of Plant Sciences and Food Security, Tel Aviv University, Tel Aviv, Israel.,The Institute for Cereal Crops Improvement, Tel Aviv University, Tel Aviv, Israel
| | - Nir Ohad
- School of Plant Sciences and Food Security, Tel Aviv University, Tel Aviv, Israel
| | - Daniel A Chamovitz
- School of Plant Sciences and Food Security, Tel Aviv University, Tel Aviv, Israel
| | - Klaus F X Mayer
- Helmholtz Zentrum München, Plant Genome and Systems Biology, Neuherberg, Germany.,School of Life Sciences Weihenstephan, Technical University of Munich, Germany
| | - Nils Stein
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), OT Gatersleben, Stadt Seeland, Germany
| | | | - Zvi Peleg
- The Hebrew University of Jerusalem, Rehovot, Israel
| | | | | | - Assaf Distelfeld
- School of Plant Sciences and Food Security, Tel Aviv University, Tel Aviv, Israel.,The Institute for Cereal Crops Improvement, Tel Aviv University, Tel Aviv, Israel
| |
Collapse
|
47
|
Chen F, Dong W, Zhang J, Guo X, Chen J, Wang Z, Lin Z, Tang H, Zhang L. The Sequenced Angiosperm Genomes and Genome Databases. FRONTIERS IN PLANT SCIENCE 2018; 9:418. [PMID: 29706973 PMCID: PMC5909171 DOI: 10.3389/fpls.2018.00418] [Citation(s) in RCA: 82] [Impact Index Per Article: 11.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/11/2017] [Accepted: 03/15/2018] [Indexed: 05/18/2023]
Abstract
Angiosperms, the flowering plants, provide the essential resources for human life, such as food, energy, oxygen, and materials. They also promoted the evolution of human, animals, and the planet earth. Despite the numerous advances in genome reports or sequencing technologies, no review covers all the released angiosperm genomes and the genome databases for data sharing. Based on the rapid advances and innovations in the database reconstruction in the last few years, here we provide a comprehensive review for three major types of angiosperm genome databases, including databases for a single species, for a specific angiosperm clade, and for multiple angiosperm species. The scope, tools, and data of each type of databases and their features are concisely discussed. The genome databases for a single species or a clade of species are especially popular for specific group of researchers, while a timely-updated comprehensive database is more powerful for address of major scientific mysteries at the genome scale. Considering the low coverage of flowering plants in any available database, we propose construction of a comprehensive database to facilitate large-scale comparative studies of angiosperm genomes and to promote the collaborative studies of important questions in plant biology.
Collapse
Affiliation(s)
- Fei Chen
- State Key Laboratory of Ecological Pest Control for Fujian and Taiwan Crops, College of Life Sciences, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Ministry of Education Key Laboratory of Genetics, Breeding and Multiple Utilization of Corps, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Wei Dong
- State Key Laboratory of Ecological Pest Control for Fujian and Taiwan Crops, College of Life Sciences, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Ministry of Education Key Laboratory of Genetics, Breeding and Multiple Utilization of Corps, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Jiawei Zhang
- State Key Laboratory of Ecological Pest Control for Fujian and Taiwan Crops, College of Life Sciences, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Ministry of Education Key Laboratory of Genetics, Breeding and Multiple Utilization of Corps, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Xinyue Guo
- State Key Laboratory of Ecological Pest Control for Fujian and Taiwan Crops, College of Life Sciences, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Ministry of Education Key Laboratory of Genetics, Breeding and Multiple Utilization of Corps, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Junhao Chen
- State Key Laboratory of Subtropical Silviculture, School of Forestry and Biotechnology, Zhejiang Agriculture and Forestry University, Hangzhou, China
| | - Zhengjia Wang
- State Key Laboratory of Subtropical Silviculture, School of Forestry and Biotechnology, Zhejiang Agriculture and Forestry University, Hangzhou, China
| | - Zhenguo Lin
- Department of Biology, Saint Louis University, St. Louis, MO, United States
| | - Haibao Tang
- State Key Laboratory of Ecological Pest Control for Fujian and Taiwan Crops, College of Life Sciences, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Ministry of Education Key Laboratory of Genetics, Breeding and Multiple Utilization of Corps, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Liangsheng Zhang
- State Key Laboratory of Ecological Pest Control for Fujian and Taiwan Crops, College of Life Sciences, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Ministry of Education Key Laboratory of Genetics, Breeding and Multiple Utilization of Corps, Fujian Agriculture and Forestry University, Fuzhou, China
- *Correspondence: Liangsheng Zhang
| |
Collapse
|
48
|
Schmutzer T, Bolger ME, Rudd S, Chen J, Gundlach H, Arend D, Oppermann M, Weise S, Lange M, Spannagl M, Usadel B, Mayer KFX, Scholz U. Bioinformatics in the plant genomic and phenomic domain: The German contribution to resources, services and perspectives. J Biotechnol 2017; 261:37-45. [PMID: 28698099 DOI: 10.1016/j.jbiotec.2017.07.006] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2017] [Revised: 06/30/2017] [Accepted: 07/04/2017] [Indexed: 10/19/2022]
Abstract
Plant genetic resources are a substantial opportunity for plant breeding, preservation and maintenance of biological diversity. As part of the German Network for Bioinformatics Infrastructure (de.NBI) the German Crop BioGreenformatics Network (GCBN) focuses mainly on crop plants and provides both data and software infrastructure which are tailored to the needs of the plant research community. Our mission and key objectives include: (1) provision of transparent access to germplasm seeds, (2) the delivery of improved workflows for plant gene annotation, and (3) implementation of bioinformatics services that link genotypes and phenotypes. This review introduces the GCBN's spectrum of web-services and integrated data resources that address common research problems in the plant genomics community.
Collapse
Affiliation(s)
- Thomas Schmutzer
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Corrensstraße 3, 06466 Seeland, Germany
| | - Marie E Bolger
- Forschungszentrum Jülich (FZJ), Institute of Bio- and Geosciences (IBG-2) Plant Sciences, Wilhelm-Johnen-Straße, 52425 Jülich, Germany
| | - Stephen Rudd
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Corrensstraße 3, 06466 Seeland, Germany
| | - Jinbo Chen
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Corrensstraße 3, 06466 Seeland, Germany
| | - Heidrun Gundlach
- Helmholtz Zentrum München (HMGU), Plant Genome and Systems Biology (PGSB), Ingolstädter Landstraße 1, 85764 Neuherberg, Germany
| | - Daniel Arend
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Corrensstraße 3, 06466 Seeland, Germany
| | - Markus Oppermann
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Corrensstraße 3, 06466 Seeland, Germany
| | - Stephan Weise
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Corrensstraße 3, 06466 Seeland, Germany
| | - Matthias Lange
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Corrensstraße 3, 06466 Seeland, Germany
| | - Manuel Spannagl
- Helmholtz Zentrum München (HMGU), Plant Genome and Systems Biology (PGSB), Ingolstädter Landstraße 1, 85764 Neuherberg, Germany
| | - Björn Usadel
- Forschungszentrum Jülich (FZJ), Institute of Bio- and Geosciences (IBG-2) Plant Sciences, Wilhelm-Johnen-Straße, 52425 Jülich, Germany
| | - Klaus F X Mayer
- Helmholtz Zentrum München (HMGU), Plant Genome and Systems Biology (PGSB), Ingolstädter Landstraße 1, 85764 Neuherberg, Germany; School of Life Sciences Weihenstephan, Technical University of Munich, Alte Akademie 8, 85354 Freising, Germany
| | - Uwe Scholz
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Corrensstraße 3, 06466 Seeland, Germany.
| |
Collapse
|
49
|
Mathur S, Umakanth AV, Tonapi VA, Sharma R, Sharma MK. Sweet sorghum as biofuel feedstock: recent advances and available resources. BIOTECHNOLOGY FOR BIOFUELS 2017; 10:146. [PMID: 28603553 PMCID: PMC5465577 DOI: 10.1186/s13068-017-0834-9] [Citation(s) in RCA: 68] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/13/2016] [Accepted: 05/30/2017] [Indexed: 05/08/2023]
Abstract
Sweet sorghum is a promising target for biofuel production. It is a C4 crop with low input requirements and accumulates high levels of sugars in its stalks. However, large-scale planting on marginal lands would require improved varieties with optimized biofuel-related traits and tolerance to biotic and abiotic stresses. Considering this, many studies have been carried out to generate genetic and genomic resources for sweet sorghum. In this review, we discuss various attributes of sweet sorghum that make it an ideal candidate for biofuel feedstock, and provide an overview of genetic diversity, tools, and resources available for engineering and/or marker-assisting breeding of sweet sorghum. Finally, the progress made so far, in identification of genes/quantitative trait loci (QTLs) important for agronomic traits and ongoing molecular breeding efforts to generate improved varieties, has been discussed.
Collapse
Affiliation(s)
- Supriya Mathur
- Crop Genetics & Informatics Group, School of Biotechnology, Jawaharlal Nehru University, New Delhi, India
| | - A. V. Umakanth
- Indian Council of Agricultural Research-Indian Institute of Millets Research, Hyderabad, India
| | - V. A. Tonapi
- Indian Council of Agricultural Research-Indian Institute of Millets Research, Hyderabad, India
| | - Rita Sharma
- Crop Genetics & Informatics Group, School of Computational and Integrative Sciences, Jawaharlal Nehru University, New Delhi, India
| | - Manoj K. Sharma
- Crop Genetics & Informatics Group, School of Biotechnology, Jawaharlal Nehru University, New Delhi, India
| |
Collapse
|
50
|
Massive expansion and differential evolution of small heat shock proteins with wheat (Triticum aestivum L.) polyploidization. Sci Rep 2017; 7:2581. [PMID: 28566710 PMCID: PMC5451465 DOI: 10.1038/s41598-017-01857-3] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2016] [Accepted: 04/03/2017] [Indexed: 12/13/2022] Open
Abstract
Wheat (Triticum aestivum), one of the world’s most important crops, is facing unprecedented challenges due to global warming. To evaluate the gene resources for heat adaptation in hexaploid wheat, small heat shock proteins (sHSPs), the key plant heat protection genes, were comprehensively analysed in wheat and related species. We found that the sHSPs of hexaploid wheat were massively expanded in A and B subgenomes with intrachromosomal duplications during polyploidization. These expanded sHSPs were under similar purifying selection and kept the expressional patterns with the original copies. Generally, a strong purifying selection acted on the α-crystallin domain (ACD) and theoretically constrain conserved function. Meanwhile, weaker purifying selection and strong positive selection acted on the N-terminal region, which conferred sHSP flexibility, allowing adjustments to a wider range of substrates in response to genomic and environmental changes. Notably, in CI, CV, ER, MI and MII subfamilies, gene duplications, expression variations and functional divergence occurred before wheat polyploidization. Our results indicate the massive expansion of active sHSPs in hexaploid wheat may also provide more raw materials for evolving functional novelties and generating genetic diversity to face future global climate changes, and highlight the expansion of stress response genes with wheat polyploidization.
Collapse
|