1
|
Sanderson LA, Caron CT, Tan RL, Bett KE. A PostgreSQL Tripal solution for large-scale genotypic and phenotypic data. Database (Oxford) 2021; 2021:baab051. [PMID: 34389844 PMCID: PMC8363843 DOI: 10.1093/database/baab051] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2021] [Revised: 05/11/2021] [Accepted: 08/03/2021] [Indexed: 11/15/2022]
Abstract
Researchers are seeking cost-effective solutions for management and analysis of large-scale genotypic and phenotypic data. Open-source software is uniquely positioned to fill this need through user-focused, crowd-sourced development. Tripal, an open-source toolkit for developing biological data web portals, uses the GMOD Chado database schema to achieve flexible, ontology-driven storage in PostgreSQL. Tripal also aids research-focused web portals in providing data according to findable, accessible, interoperable, reusable (FAIR) principles. We describe here a fully relational PostgreSQL solution to handle large-scale genotypic and phenotypic data that is implemented as a collection of freely available, open-source modules. These Tripal extension modules provide a holistic approach for importing, storage, display and analysis within a relational database schema. Furthermore, they embody the Tripal approach to FAIR data by providing multiple search tools and ensuring metadata is fully described and interoperable. Our solution focuses on data integrity, as well as optimizing performance to provide a fully functional system that is currently being used in the production of Tripal portals for crop species. We fully describe the implementation of our solution and discuss why a PostgreSQL-powered web portal provides an efficient environment for researcher-driven genotypic and phenotypic data analysis.
Collapse
Affiliation(s)
- Lacey-Anne Sanderson
- Department of Plant Sciences, University of Saskatchewan, 51 Campus Drive, Saskatoon SK S7N 5A8, Canada
| | - Carolyn T Caron
- Department of Plant Sciences, University of Saskatchewan, 51 Campus Drive, Saskatoon SK S7N 5A8, Canada
| | - Reynold L Tan
- Department of Plant Sciences, University of Saskatchewan, 51 Campus Drive, Saskatoon SK S7N 5A8, Canada
| | - Kirstin E Bett
- Department of Plant Sciences, University of Saskatchewan, 51 Campus Drive, Saskatoon SK S7N 5A8, Canada
| |
Collapse
|
2
|
Staton M, Cannon E, Sanderson LA, Wegrzyn J, Anderson T, Buehler S, Cobo-Simón I, Faaberg K, Grau E, Guignon V, Gunoskey J, Inderski B, Jung S, Lager K, Main D, Poelchau M, Ramnath R, Richter P, West J, Ficklin S. Tripal, a community update after 10 years of supporting open source, standards-based genetic, genomic and breeding databases. Brief Bioinform 2021; 22:6318561. [PMID: 34251419 PMCID: PMC8574961 DOI: 10.1093/bib/bbab238] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2021] [Revised: 05/28/2021] [Accepted: 06/01/2021] [Indexed: 12/01/2022] Open
Abstract
Online, open access databases for biological knowledge serve as central repositories for research communities to store, find and analyze integrated, multi-disciplinary datasets. With increasing volumes, complexity and the need to integrate genomic, transcriptomic, metabolomic, proteomic, phenomic and environmental data, community databases face tremendous challenges in ongoing maintenance, expansion and upgrades. A common infrastructure framework using community standards shared by many databases can reduce development burden, provide interoperability, ensure use of common standards and support long-term sustainability. Tripal is a mature, open source platform built to meet this need. With ongoing improvement since its first release in 2009, Tripal provides full functionality for searching, browsing, loading and curating numerous types of data and is a primary technology powering at least 31 publicly available databases spanning plants, animals and human data, primarily storing genomics, genetics and breeding data. Tripal software development is managed by a shared, inclusive governance structure including both project management and advisory teams. Here, we report on the most important and innovative aspects of Tripal after 11 years development, including integration of diverse types of biological data, successful collaborative projects across member databases, and support for implementing FAIR principles.
Collapse
Affiliation(s)
| | - Ethalinda Cannon
- USDA-ARS, Corn Insects and Crop Genetics Research Unit, Ames, IA USA
| | | | | | | | | | | | - Kay Faaberg
- USDA-ARS, National Animal Disease Center, Ames, IA, USA
| | - Emily Grau
- University of Connecticut, Storrs, CT USA
| | | | | | | | - Sook Jung
- Washington State University, Pullman, WA USA
| | - Kelly Lager
- USDA-ARS, National Animal Disease Center, Ames, IA, USA
| | - Dorrie Main
- Washington State University, Pullman, WA USA
| | - Monica Poelchau
- USDA-ARS, National Agricultural Library, Beltsville, MD, USA
| | | | | | - Joe West
- University of Tennessee, Knoxville, TN USA
| | | |
Collapse
|
3
|
Alexander AC, Levenstein B, Sanderson LA, Blukacz-Richards EA, Chambers PA. How does climate variability affect water quality dynamics in Canada's oil sands region? Sci Total Environ 2020; 732:139062. [PMID: 32417553 DOI: 10.1016/j.scitotenv.2020.139062] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/21/2020] [Revised: 04/24/2020] [Accepted: 04/26/2020] [Indexed: 05/05/2023]
Abstract
In Canada's oil sands region, classic boreal hydrology (i.e., winter low flow followed by peaks during spring freshet and then summer flow recession) combined with erosion of both natural and anthropogenically-exposed bitumen results in seasonal and inter-annual variability in stream water chemistry. Using data collected from all seasons over three years (2012-2015), we investigated the mechanisms driving spatial and temporal change in the concentration of 26 water quality parameters for six rivers draining Canada's oil sands region. Mantel tests showed a strong spatial aggregation of climatic drivers (average daily precipitation, accumulated precipitation, snow water equivalent) associated with west versus east discharge patterns. Wavelet analysis highlighted unique watershed attributes, in particular the importance of developed area in lowering responsiveness to seasonal precipitation. Concentrations of most chemical parameters (20 of 23) showed distinct temporal patterns that were correlated with seasonal changes in hydrology which, in turn, were related to changes in weather. Comparison of concentrations observed in this study with those reported in the scientific literature for the same watersheds showed 81% of comparisons differed significantly. This was likely due to the short duration of previous field campaigns and thus the sampling of a very narrow window of the annual streamflow regime.
Collapse
Affiliation(s)
- A C Alexander
- Environment and Climate Change Canada, Fredericton, NB, Canada; Department of Biology and Canadian Rivers Institute, 10 Bailey Drive, PO Box 4400, University of New Brunswick, Fredericton, NB E3B 5A3, Canada.
| | - B Levenstein
- Department of Biology and Canadian Rivers Institute, 10 Bailey Drive, PO Box 4400, University of New Brunswick, Fredericton, NB E3B 5A3, Canada
| | - L A Sanderson
- Department of Biology and Canadian Rivers Institute, 10 Bailey Drive, PO Box 4400, University of New Brunswick, Fredericton, NB E3B 5A3, Canada
| | - E A Blukacz-Richards
- Environment and Climate Change Canada, Canada Centre for Inland Waters, 867 Lakeshore Road, PO Box 5050, Burlington, ON L7S 1A1, Canada
| | - P A Chambers
- Environment and Climate Change Canada, Canada Centre for Inland Waters, 867 Lakeshore Road, PO Box 5050, Burlington, ON L7S 1A1, Canada
| |
Collapse
|
4
|
Spoor S, Cheng CH, Sanderson LA, Condon B, Almsaeed A, Chen M, Bretaudeau A, Rasche H, Jung S, Main D, Bett K, Staton M, Wegrzyn JL, Feltus FA, Ficklin SP. Tripal v3: an ontology-based toolkit for construction of FAIR biological community databases. Database (Oxford) 2020; 2019:5532788. [PMID: 31328773 PMCID: PMC6643302 DOI: 10.1093/database/baz077] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/08/2019] [Revised: 05/12/2019] [Accepted: 05/22/2019] [Indexed: 12/20/2022]
Abstract
Community biological databases provide an important online resource for both public and private data, analysis tools and community engagement. These sites house genomic, transcriptomic, genetic, breeding and ancillary data for specific species, families or clades. Due to the complexity and increasing quantities of these data, construction of online resources is increasingly difficult especially with limited funding and access to technical expertise. Furthermore, online repositories are expected to promote FAIR data principles (findable, accessible, interoperable and reusable) that presents additional challenges. The open-source Tripal database toolkit seeks to mitigate these challenges by creating both the software and an interactive community of developers for construction of online community databases. Additionally, through coordinated, distributed co-development, Tripal sites encourage community-wide sustainability. Here, we report the release of Tripal version 3 that improves data accessibility and data sharing through systematic use of controlled vocabularies (CVs). Tripal uses the community-developed Chado database as a default data store, but now provides tools to support other data stores, while ensuring that CVs remain the central organizational structure for the data. A new site developer can use Tripal to develop a basic site with little to no programming, with the ability to integrate other data types using extension modules and the Tripal application programming interface. A thorough online User’s Guide and Developer’s Handbook are available at http://tripal.info, providing download, installation and step-by-step setup instructions.
Collapse
Affiliation(s)
- Shawna Spoor
- Department of Horticulture, Washington State University, Pullman, WA, USA
| | - Chun-Huai Cheng
- Department of Horticulture, Washington State University, Pullman, WA, USA
| | | | - Bradford Condon
- Department of Entomology and Plant Pathology, University of Tennessee, Knoxville, TN, USA
| | - Abdullah Almsaeed
- Department of Entomology and Plant Pathology, University of Tennessee, Knoxville, TN, USA
| | - Ming Chen
- Department of Entomology and Plant Pathology, University of Tennessee, Knoxville, TN, USA
| | - Anthony Bretaudeau
- INRA, UMR IGEPP, BIPAA/GenOuest, INRIA/Irisa - Campus de Beaulieu, Rennes Cedex, France
| | - Helena Rasche
- Bioinformatics Group, Department of Computer Science, University of Freiburg, Freiburg im Breisgau, Germany
| | - Sook Jung
- Department of Horticulture, Washington State University, Pullman, WA, USA
| | - Dorrie Main
- Department of Horticulture, Washington State University, Pullman, WA, USA
| | - Kirstin Bett
- Department of Plant Sciences, University of Saskatchewan, Saskatoon, SK, Canada
| | - Margaret Staton
- Department of Entomology and Plant Pathology, University of Tennessee, Knoxville, TN, USA
| | - Jill L Wegrzyn
- Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, CT, USA.,Computational Biology Core, Institute for Systems Genomics, University of Connecticut, Storrs, CT, USA
| | - F Alex Feltus
- Dept. of Genetics and Biochemistry, Clemson University, Clemson, USA
| | - Stephen P Ficklin
- Department of Horticulture, Washington State University, Pullman, WA, USA
| |
Collapse
|
5
|
Sanderson LA, Caron CT, Tan R, Shen Y, Liu R, Bett KE. KnowPulse: A Web-Resource Focused on Diversity Data for Pulse Crop Improvement. Front Plant Sci 2019; 10:965. [PMID: 31428111 PMCID: PMC6690010 DOI: 10.3389/fpls.2019.00965] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/03/2019] [Accepted: 07/10/2019] [Indexed: 05/25/2023]
Abstract
KnowPulse (https://knowpulse.usask.ca) is a breeder-focused web portal for pulse breeders and geneticists. With a focus on diversity data, KnowPulse provides information on genetic markers, sequence variants, phenotypic traits and germplasm for chickpea, common bean, field pea, faba bean, and lentil. Genotypic data is accessible through the genotype matrix tool, displayed as a marker-by-germplasm table of genotype calls specific to germplasm chosen by the researcher. It is also summarized on genetic marker and sequence variant pages. Phenotypic data is visualized in trait distribution plots: violin plots for quantitative data and histograms for qualitative data. These plots are accessible through trait, germplasm, and experiment pages, as well as through a single page search tool. KnowPulse is built using the open-source Tripal toolkit and utilizes open-source tools including, but not limited to, species-specific JBrowse instances, a BLAST interface, and whole-genome CViTjs visualizations. KnowPulse is constantly evolving with data and tools added as they become available. Full integration of genetic maps and quantitative trait loci is imminent, and development of tools exploring structural variation is being explored.
Collapse
Affiliation(s)
| | | | | | | | | | - Kirstin E. Bett
- Department of Plant Sciences, University of Saskatchewan, Saskatoon, SK, Canada
| |
Collapse
|
6
|
Harper L, Campbell J, Cannon EKS, Jung S, Poelchau M, Walls R, Andorf C, Arnaud E, Berardini TZ, Birkett C, Cannon S, Carson J, Condon B, Cooper L, Dunn N, Elsik CG, Farmer A, Ficklin SP, Grant D, Grau E, Herndon N, Hu ZL, Humann J, Jaiswal P, Jonquet C, Laporte MA, Larmande P, Lazo G, McCarthy F, Menda N, Mungall CJ, Munoz-Torres MC, Naithani S, Nelson R, Nesdill D, Park C, Reecy J, Reiser L, Sanderson LA, Sen TZ, Staton M, Subramaniam S, Tello-Ruiz MK, Unda V, Unni D, Wang L, Ware D, Wegrzyn J, Williams J, Woodhouse M, Yu J, Main D. AgBioData consortium recommendations for sustainable genomics and genetics databases for agriculture. Database (Oxford) 2018; 2018:5096675. [PMID: 30239679 PMCID: PMC6146126 DOI: 10.1093/database/bay088] [Citation(s) in RCA: 38] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2018] [Revised: 07/19/2018] [Accepted: 07/30/2018] [Indexed: 01/07/2023]
Abstract
The future of agricultural research depends on data. The sheer volume of agricultural biological data being produced today makes excellent data management essential. Governmental agencies, publishers and science funders require data management plans for publicly funded research. Furthermore, the value of data increases exponentially when they are properly stored, described, integrated and shared, so that they can be easily utilized in future analyses. AgBioData (https://www.agbiodata.org) is a consortium of people working at agricultural biological databases, data archives and knowledgbases who strive to identify common issues in database development, curation and management, with the goal of creating database products that are more Findable, Accessible, Interoperable and Reusable. We strive to promote authentic, detailed, accurate and explicit communication between all parties involved in scientific data. As a step toward this goal, we present the current state of biocuration, ontologies, metadata and persistence, database platforms, programmatic (machine) access to data, communication and sustainability with regard to data curation. Each section describes challenges and opportunities for these topics, along with recommendations and best practices.
Collapse
Affiliation(s)
- Lisa Harper
- Corn Insects and Crop Genetics Research Unit, USDA-ARS, Ames, IA, USA
| | | | - Ethalinda K S Cannon
- Corn Insects and Crop Genetics Research Unit, USDA-ARS, Ames, IA, USA
- Computer Science, Iowa State University, Ames, IA, USA
| | - Sook Jung
- Horticulture, Washington State University, Pullman, WA, USA
| | - Monica Poelchau
- National Agricultural Library, USDA Agricultural Research Service, Beltsville, MD, USA
| | | | - Carson Andorf
- Corn Insects and Crop Genetics Research Unit, USDA-ARS, Ames, IA, USA
- Computer Science, Iowa State University, Ames, IA, USA
| | - Elizabeth Arnaud
- Bioversity International, Informatics Unit, Conservation and Availability Programme, Parc Scientifique Agropolis II, Montpellier, France
| | - Tanya Z Berardini
- The Arabidopsis Information Resource, Phoenix Bioinformatics, Fremont, CA, USA
| | | | - Steve Cannon
- Corn Insects and Crop Genetics Research Unit, USDA-ARS, Ames, IA, USA
| | - James Carson
- Texas Advanced Computing Center, The University of Texas at Austin, Austin, TX, USA
| | - Bradford Condon
- Entomology and Plant Pathology, University of Tennessee Knoxville, Knoxville, TN, USA
| | - Laurel Cooper
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR, USA
| | - Nathan Dunn
- Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Christine G Elsik
- Division of Animal Sciences and Division of Plant Sciences, University of Missouri, Columbia, MO, USA
| | - Andrew Farmer
- National Center for Genome Resources, Santa Fe, NM, USA
| | | | - David Grant
- Corn Insects and Crop Genetics Research Unit, USDA-ARS, Ames, IA, USA
| | - Emily Grau
- National Center for Genome Resources, Santa Fe, NM, USA
| | - Nic Herndon
- Ecology and Evolutionary Biology, University of Connecticut, Storrs, CT, USA
| | - Zhi-Liang Hu
- Animal Science, Iowa State University, Ames, USA
| | - Jodi Humann
- Horticulture, Washington State University, Pullman, WA, USA
| | - Pankaj Jaiswal
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR, USA
| | - Clement Jonquet
- Laboratory of Informatics, Robotics, Microelectronics of Montpellier, University of Montpellier & CNRS, Montpellier, France
| | - Marie-Angélique Laporte
- Bioversity International, Informatics Unit, Conservation and Availability Programme, Parc Scientifique Agropolis II, Montpellier, France
| | | | - Gerard Lazo
- Crop Improvement and Genetics Research Unit, USDA-ARS, Albany, CA, USA
| | - Fiona McCarthy
- School of Animal and Comparative Biomedical Sciences, University of Arizona, Tucson, AZ, USA
| | | | | | | | - Sushma Naithani
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR, USA
| | - Rex Nelson
- Corn Insects and Crop Genetics Research Unit, USDA-ARS, Ames, IA, USA
| | - Daureen Nesdill
- Marriott Library, University of Utah, Salt Lake City, UT, USA
| | - Carissa Park
- Animal Science, Iowa State University, Ames, USA
| | - James Reecy
- Animal Science, Iowa State University, Ames, USA
| | - Leonore Reiser
- The Arabidopsis Information Resource, Phoenix Bioinformatics, Fremont, CA, USA
| | | | - Taner Z Sen
- Crop Improvement and Genetics Research Unit, USDA-ARS, Albany, CA, USA
| | - Margaret Staton
- Entomology and Plant Pathology, University of Tennessee Knoxville, Knoxville, TN, USA
| | | | | | - Victor Unda
- Horticulture, Washington State University, Pullman, WA, USA
| | - Deepak Unni
- Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Liya Wang
- Plant Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | - Doreen Ware
- USDA, Plant, Soil and Nutrition Research, Ithaca, NY, USA
- Plant Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | - Jill Wegrzyn
- Ecology and Evolutionary Biology, University of Connecticut, Storrs, CT, USA
| | - Jason Williams
- Cold Spring Harbor Laboratory, DNA Learning Center, Cold Spring Harbor, NY, USA
| | - Margaret Woodhouse
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA, USA
| | - Jing Yu
- Horticulture, Washington State University, Pullman, WA, USA
| | - Doreen Main
- Horticulture, Washington State University, Pullman, WA, USA
| |
Collapse
|
7
|
Gujaria-Verma N, Ramsay L, Sharpe AG, Sanderson LA, Debouck DG, Tar'an B, Bett KE. Gene-based SNP discovery in tepary bean (Phaseolus acutifolius) and common bean (P. vulgaris) for diversity analysis and comparative mapping. BMC Genomics 2016; 17:239. [PMID: 26979462 PMCID: PMC4793507 DOI: 10.1186/s12864-016-2499-3] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2015] [Accepted: 02/18/2016] [Indexed: 11/10/2022] Open
Abstract
Background Common bean (Phaseolus vulgaris) is an important grain legume and there has been a recent resurgence in interest in its relative, tepary bean (P. acutifolius), owing to this species’ ability to better withstand abiotic stresses. Genomic resources are scarce for this minor crop species and a better knowledge of the genome-level relationship between these two species would facilitate improvement in both. High-throughput genotyping has facilitated large-scale single nucleotide polymorphism (SNP) identification leading to the development of molecular markers with associated sequence information that can be used to place them in the context of a full genome assembly. Results Transcript-based SNPs were identified from six common bean and two tepary bean accessions and a subset were used to generate a 768-SNP Illumina GoldenGate assay for each species. The tepary bean assay was used to assess diversity in wild and cultivated tepary bean and to generate the first gene-based map of the tepary bean genome. Genotypic analyses of the diversity panel showed a clear separation between domesticated and cultivated tepary beans, two distinct groups within the domesticated types, and P. parvifolius was confirmed to be distinct. The genetic map of tepary bean was compared to the common bean genome assembly to demonstrate high levels of collinearity between the two species with differences limited to a few intra-chromosomal rearrangements. Conclusions The development of the first set of genomic resources specifically for tepary bean has allowed for greater insight into the structure of this species and its relationship to its agriculturally more prominent relative, common bean. These resources will be helpful in the development of efficient breeding strategies for both species and will facilitate the introgression of agriculturally important traits from one crop into the other. Electronic supplementary material The online version of this article (doi:10.1186/s12864-016-2499-3) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Neha Gujaria-Verma
- Department of Plant Sciences, University of Saskatchewan, 51 Campus Dr., Saskatoon, SK, S7N 5A8, Canada
| | - Larissa Ramsay
- Department of Plant Sciences, University of Saskatchewan, 51 Campus Dr., Saskatoon, SK, S7N 5A8, Canada
| | - Andrew G Sharpe
- National Research Council Canada, 110 Gymnasium Place, Saskatoon, SK, S7N 0W9, Canada
| | - Lacey-Anne Sanderson
- Department of Plant Sciences, University of Saskatchewan, 51 Campus Dr., Saskatoon, SK, S7N 5A8, Canada
| | - Daniel G Debouck
- Genetic Resources Program, International Center for Tropical Agriculture, Km 17 recta a Palmira, AA6713, Cali, Colombia
| | - Bunyamin Tar'an
- Department of Plant Sciences, University of Saskatchewan, 51 Campus Dr., Saskatoon, SK, S7N 5A8, Canada
| | - Kirstin E Bett
- Department of Plant Sciences, University of Saskatchewan, 51 Campus Dr., Saskatoon, SK, S7N 5A8, Canada.
| |
Collapse
|
8
|
Sindhu A, Ramsay L, Sanderson LA, Stonehouse R, Li R, Condie J, Shunmugam ASK, Liu Y, Jha AB, Diapari M, Burstin J, Aubert G, Tar’an B, Bett KE, Warkentin TD, Sharpe AG. Gene-based SNP discovery and genetic mapping in pea. Theor Appl Genet 2014; 127:2225-41. [PMID: 25119872 PMCID: PMC4180032 DOI: 10.1007/s00122-014-2375-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/15/2014] [Accepted: 07/29/2014] [Indexed: 05/07/2023]
Abstract
KEY MESSAGE Gene-based SNPs were identified and mapped in pea using five recombinant inbred line populations segregating for traits of agronomic importance. Pea (Pisum sativum L.) is one of the world's oldest domesticated crops and has been a model system in plant biology and genetics since the work of Gregor Mendel. Pea is the second most widely grown pulse crop in the world following common bean. The importance of pea as a food crop is growing due to its combination of moderate protein concentration, slowly digestible starch, high dietary fiber concentration, and its richness in micronutrients; however, pea has lagged behind other major crops in harnessing recent advances in molecular biology, genomics and bioinformatics, partly due to its large genome size with a large proportion of repetitive sequence, and to the relatively limited investment in research in this crop globally. The objective of this research was the development of a genome-wide transcriptome-based pea single-nucleotide polymorphism (SNP) marker platform using next-generation sequencing technology. A total of 1,536 polymorphic SNP loci selected from over 20,000 non-redundant SNPs identified using deep transcriptome sequencing of eight diverse Pisum accessions were used for genotyping in five RIL populations using an Illumina GoldenGate assay. The first high-density pea SNP map defining all seven linkage groups was generated by integrating with previously published anchor markers. Syntenic relationships of this map with the model legume Medicago truncatula and lentil (Lens culinaris Medik.) maps were established. The genic SNP map establishes a foundation for future molecular breeding efforts by enabling both the identification and tracking of introgression of genomic regions harbouring QTLs related to agronomic and seed quality traits.
Collapse
Affiliation(s)
- Anoop Sindhu
- Department of Plant Sciences, Crop Development Centre, University of Saskatchewan, 51 Campus Drive, Saskatoon, SK S7N 5A8 Canada
| | - Larissa Ramsay
- National Research Council Canada, 110 Gymnasium Place, Saskatoon, SK S7N 0W9 Canada
- Present Address: Department of Plant Sciences, Crop Development Centre, University of Saskatchewan, 51 Campus Drive, Saskatoon, SK S7N 5A8 Canada
| | - Lacey-Anne Sanderson
- Department of Plant Sciences, Crop Development Centre, University of Saskatchewan, 51 Campus Drive, Saskatoon, SK S7N 5A8 Canada
| | - Robert Stonehouse
- Department of Plant Sciences, Crop Development Centre, University of Saskatchewan, 51 Campus Drive, Saskatoon, SK S7N 5A8 Canada
| | - Rong Li
- National Research Council Canada, 110 Gymnasium Place, Saskatoon, SK S7N 0W9 Canada
| | - Janet Condie
- National Research Council Canada, 110 Gymnasium Place, Saskatoon, SK S7N 0W9 Canada
| | - Arun S. K. Shunmugam
- Department of Plant Sciences, Crop Development Centre, University of Saskatchewan, 51 Campus Drive, Saskatoon, SK S7N 5A8 Canada
| | - Yong Liu
- Department of Plant Sciences, Crop Development Centre, University of Saskatchewan, 51 Campus Drive, Saskatoon, SK S7N 5A8 Canada
| | - Ambuj B. Jha
- Department of Plant Sciences, Crop Development Centre, University of Saskatchewan, 51 Campus Drive, Saskatoon, SK S7N 5A8 Canada
| | - Marwan Diapari
- Department of Plant Sciences, Crop Development Centre, University of Saskatchewan, 51 Campus Drive, Saskatoon, SK S7N 5A8 Canada
| | - Judith Burstin
- UMR1347 Agroecology, INRA, 17 rue de Sully, 21065 Dijon Cedex, France
| | - Gregoire Aubert
- UMR1347 Agroecology, INRA, 17 rue de Sully, 21065 Dijon Cedex, France
| | - Bunyamin Tar’an
- Department of Plant Sciences, Crop Development Centre, University of Saskatchewan, 51 Campus Drive, Saskatoon, SK S7N 5A8 Canada
| | - Kirstin E. Bett
- Department of Plant Sciences, Crop Development Centre, University of Saskatchewan, 51 Campus Drive, Saskatoon, SK S7N 5A8 Canada
| | - Thomas D. Warkentin
- Department of Plant Sciences, Crop Development Centre, University of Saskatchewan, 51 Campus Drive, Saskatoon, SK S7N 5A8 Canada
| | - Andrew G. Sharpe
- National Research Council Canada, 110 Gymnasium Place, Saskatoon, SK S7N 0W9 Canada
| |
Collapse
|
9
|
Sanderson LA, Ficklin SP, Cheng CH, Jung S, Feltus FA, Bett KE, Main D. Tripal v1.1: a standards-based toolkit for construction of online genetic and genomic databases. Database (Oxford) 2013; 2013:bat075. [PMID: 24163125 PMCID: PMC3808541 DOI: 10.1093/database/bat075] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
Tripal is an open-source freely available toolkit for construction of online genomic and genetic databases. It aims to facilitate development of community-driven biological websites by integrating the GMOD Chado database schema with Drupal, a popular website creation and content management software. Tripal provides a suite of tools for interaction with a Chado database and display of content therein. The tools are designed to be generic to support the various ways in which data may be stored in Chado. Previous releases of Tripal have supported organisms, genomic libraries, biological stocks, stock collections and genomic features, their alignments and annotations. Also, Tripal and its extension modules provided loaders for commonly used file formats such as FASTA, GFF, OBO, GAF, BLAST XML, KEGG heir files and InterProScan XML. Default generic templates were provided for common views of biological data, which could be customized using an open Application Programming Interface to change the way data are displayed. Here, we report additional tools and functionality that are part of release v1.1 of Tripal. These include (i) a new bulk loader that allows a site curator to import data stored in a custom tab delimited format; (ii) full support of every Chado table for Drupal Views (a powerful tool allowing site developers to construct novel displays and search pages); (iii) new modules including ‘Feature Map’, ‘Genetic’, ‘Publication’, ‘Project’, ‘Contact’ and the ‘Natural Diversity’ modules. Tutorials, mailing lists, download and set-up instructions, extension modules and other documentation can be found at the Tripal website located at http://tripal.info. Database URL: http://tripal.info/
Collapse
Affiliation(s)
- Lacey-Anne Sanderson
- Department of Plant Sciences, University of Saskatchewan. Saskatoon, SK Canada, Department of Horticulture, Washington State University. Pullman, WA, USA and Department of Genetics and Biochemistry, Clemson University. Clemson, SC, USA
| | | | | | | | | | | | | |
Collapse
|
10
|
Sharpe AG, Ramsay L, Sanderson LA, Fedoruk MJ, Clarke WE, Li R, Kagale S, Vijayan P, Vandenberg A, Bett KE. Ancient orphan crop joins modern era: gene-based SNP discovery and mapping in lentil. BMC Genomics 2013; 14:192. [PMID: 23506258 PMCID: PMC3635939 DOI: 10.1186/1471-2164-14-192] [Citation(s) in RCA: 100] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2012] [Accepted: 02/22/2013] [Indexed: 12/22/2022] Open
Abstract
Background The genus Lens comprises a range of closely related species within the galegoid clade of the Papilionoideae family. The clade includes other important crops (e.g. chickpea and pea) as well as a sequenced model legume (Medicago truncatula). Lentil is a global food crop increasing in importance in the Indian sub-continent and elsewhere due to its nutritional value and quick cooking time. Despite this importance there has been a dearth of genetic and genomic resources for the crop and this has limited the application of marker-assisted selection strategies in breeding. Results We describe here the development of a deep and diverse transcriptome resource for lentil using next generation sequencing technology. The generation of data in multiple cultivated (L. culinaris) and wild (L. ervoides) genotypes together with the utilization of a bioinformatics workflow enabled the identification of a large collection of SNPs and the subsequent development of a genotyping platform that was used to establish the first comprehensive genetic map of the L. culinaris genome. Extensive collinearity with M. truncatula was evident on the basis of sequence homology between mapped markers and the model genome and large translocations and inversions relative to M. truncatula were identified. An estimate for the time divergence of L. culinaris from L. ervoides and of both from M. truncatula was also calculated. Conclusions The availability of the genomic and derived molecular marker resources presented here will help change lentil breeding strategies and lead to increased genetic gain in the future.
Collapse
Affiliation(s)
- Andrew G Sharpe
- National Research Council Canada, 110 Gymnasium Place, Saskatoon, SK, S7N 0W9, Canada.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
11
|
Jung S, Menda N, Redmond S, Buels RM, Friesen M, Bendana Y, Sanderson LA, Lapp H, Lee T, MacCallum B, Bett KE, Cain S, Clements D, Mueller LA, Main D. The Chado Natural Diversity module: a new generic database schema for large-scale phenotyping and genotyping data. Database (Oxford) 2011; 2011:bar051. [PMID: 22120662 PMCID: PMC3225077 DOI: 10.1093/database/bar051] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Abstract
Linking phenotypic with genotypic diversity has become a major requirement for basic and applied genome-centric biological research. To meet this need, a comprehensive database backend for efficiently storing, querying and analyzing large experimental data sets is necessary. Chado, a generic, modular, community-based database schema is widely used in the biological community to store information associated with genome sequence data. To meet the need to also accommodate large-scale phenotyping and genotyping projects, a new Chado module called Natural Diversity has been developed. The module strictly adheres to the Chado remit of being generic and ontology driven. The flexibility of the new module is demonstrated in its capacity to store any type of experiment that either uses or generates specimens or stock organisms. Experiments may be grouped or structured hierarchically, whereas any kind of biological entity can be stored as the observed unit, from a specimen to be used in genotyping or phenotyping experiments, to a group of species collected in the field that will undergo further lab analysis. We describe details of the Natural Diversity module, including the design approach, the relational schema and use cases implemented in several databases.
Collapse
Affiliation(s)
- Sook Jung
- Department of Horticulture and Landscape, Washington State University, Pullman, WA 99164, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
12
|
Ficklin SP, Sanderson LA, Cheng CH, Staton ME, Lee T, Cho IH, Jung S, Bett KE, Main D. Tripal: a construction toolkit for online genome databases. Database (Oxford) 2011; 2011:bar044. [PMID: 21959868 PMCID: PMC3263599 DOI: 10.1093/database/bar044] [Citation(s) in RCA: 54] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
As the availability, affordability and magnitude of genomics and genetics research increases so does the need to provide online access to resulting data and analyses. Availability of a tailored online database is the desire for many investigators or research communities; however, managing the Information Technology infrastructure needed to create such a database can be an undesired distraction from primary research or potentially cost prohibitive. Tripal provides simplified site development by merging the power of Drupal, a popular web Content Management System with that of Chado, a community-derived database schema for storage of genomic, genetic and other related biological data. Tripal provides an interface that extends the content management features of Drupal to the data housed in Chado. Furthermore, Tripal provides a web-based Chado installer, genomic data loaders, web-based editing of data for organisms, genomic features, biological libraries, controlled vocabularies and stock collections. Also available are Tripal extensions that support loading and visualizations of NCBI BLAST, InterPro, Kyoto Encyclopedia of Genes and Genomes and Gene Ontology analyses, as well as an extension that provides integration of Tripal with GBrowse, a popular GMOD tool. An Application Programming Interface is available to allow creation of custom extensions by site developers, and the look-and-feel of the site is completely customizable through Drupal-based PHP template files. Addition of non-biological content and user-management is afforded through Drupal. Tripal is an open source and freely available software package found at http://tripal.sourceforge.net
Collapse
Affiliation(s)
- Stephen P Ficklin
- Department of Horticulture and Landscape Architecture, Washington State University, Pullman, WA 99164, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
13
|
Sanderson LA, Wright PA, Robinson JW, Ballantyne JS, Bernier NJ. Inhibition of glutamine synthetase during ammonia exposure in rainbow trout indicates a high reserve capacity to prevent brain ammonia toxicity. ACTA ACUST UNITED AC 2010; 213:2343-53. [PMID: 20543133 DOI: 10.1242/jeb.039156] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]
Abstract
Glutamine synthetase (GSase), the enzyme that catalyses the conversion of glutamate and ammonia to glutamine, is present at high levels in vertebrate brain tissue and is thought to protect the brain from elevated ammonia concentrations. We tested the hypothesis that high brain GSase activity is critical in preventing accumulation of brain ammonia and glutamate during ammonia loading in the ammonia-intolerant rainbow trout. Trout pre-injected with saline or the GSase inhibitor methionine sulfoximine (MSOX, 6 mg kg(-1)), were exposed to 0, 670 or 1000 micromol l(-1) NH(4)Cl in the water for 24 and 96 h. Brain ammonia levels were 3- to 6-fold higher in ammonia-exposed fish relative to control fish and MSOX treatment did not alter this. Brain GSase activity was unaffected by ammonia exposure, while MSOX inhibited GSase activity by approximately 75%. Brain glutamate levels were lower and glutamine levels were higher in fish exposed to ammonia relative to controls. While MSOX treatment had little impact on brain glutamate, glutamine levels were significantly reduced by 96 h. With ammonia treatment, significant changes in the concentration of multiple other brain amino acids occurred and these changes were mostly reversed or eliminated with MSOX. Overall the changes in amino acid levels suggest that multiple enzymatic pathways can supply glutamate for the production of glutamine via GSase during ammonia exposure and that alternative transaminase pathways can be recruited for ammonia detoxification. Plasma cortisol levels increased 7- to 15-fold at 24 h in response to ammonia and MSOX did not exacerbate this stress response. These findings indicate that rainbow trout possess a relatively large reserve capacity for ammonia detoxification and for preventing glutamate accumulation during hyperammonaemic conditions.
Collapse
Affiliation(s)
- L A Sanderson
- Department of Integrative Biology, University of Guelph, 488 Gordon Street, Guelph, ON, Canada, N1G 2W1
| | | | | | | | | |
Collapse
|