1
|
Deng CH, Naithani S, Kumari S, Cobo-Simón I, Quezada-Rodríguez EH, Skrabisova M, Gladman N, Correll MJ, Sikiru AB, Afuwape OO, Marrano A, Rebollo I, Zhang W, Jung S. Genotype and phenotype data standardization, utilization and integration in the big data era for agricultural sciences. Database (Oxford) 2023; 2023:baad088. [PMID: 38079567 PMCID: PMC10712715 DOI: 10.1093/database/baad088] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2023] [Revised: 10/17/2023] [Accepted: 11/28/2023] [Indexed: 12/18/2023]
Abstract
Large-scale genotype and phenotype data have been increasingly generated to identify genetic markers, understand gene function and evolution and facilitate genomic selection. These datasets hold immense value for both current and future studies, as they are vital for crop breeding, yield improvement and overall agricultural sustainability. However, integrating these datasets from heterogeneous sources presents significant challenges and hinders their effective utilization. We established the Genotype-Phenotype Working Group in November 2021 as a part of the AgBioData Consortium (https://www.agbiodata.org) to review current data types and resources that support archiving, analysis and visualization of genotype and phenotype data to understand the needs and challenges of the plant genomic research community. For 2021-22, we identified different types of datasets and examined metadata annotations related to experimental design/methods/sample collection, etc. Furthermore, we thoroughly reviewed publicly funded repositories for raw and processed data as well as secondary databases and knowledgebases that enable the integration of heterogeneous data in the context of the genome browser, pathway networks and tissue-specific gene expression. Based on our survey, we recommend a need for (i) additional infrastructural support for archiving many new data types, (ii) development of community standards for data annotation and formatting, (iii) resources for biocuration and (iv) analysis and visualization tools to connect genotype data with phenotype data to enhance knowledge synthesis and to foster translational research. Although this paper only covers the data and resources relevant to the plant research community, we expect that similar issues and needs are shared by researchers working on animals. Database URL: https://www.agbiodata.org.
Collapse
Affiliation(s)
- Cecilia H Deng
- Molecular and Digital Breeding, New Cultivar Innovation, The New Zealand Institute for Plant and Food Research Limited, 120 Mt Albert Road, Auckland 1025, New Zealand
| | - Sushma Naithani
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR 97331, USA
| | - Sunita Kumari
- Cold Spring Harbor Laboratory, 1 Bungtown Rd, Cold Spring Harbor, New York, NY 11724, USA
| | - Irene Cobo-Simón
- Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, CT, USA
- Institute of Forest Science (ICIFOR-INIA, CSIC), Madrid, Spain
| | - Elsa H Quezada-Rodríguez
- Departamento de Producción Agrícola y Animal, Universidad Autónoma Metropolitana-Xochimilco, Ciudad de México, México
- Centro de Ciencias de la Complejidad, Universidad Nacional Autónoma de México, Ciudad de México, México
| | - Maria Skrabisova
- Department of Biochemistry, Faculty of Science, Palacky University, Olomouc, Czech Republic
| | - Nick Gladman
- Cold Spring Harbor Laboratory, 1 Bungtown Rd, Cold Spring Harbor, New York, NY 11724, USA
- U.S. Department of Agriculture-Agricultural Research Service, NEA Robert W. Holley Center for Agriculture and Health, Cornell University, Ithaca, NY 14853, USA
| | - Melanie J Correll
- Agricultural and Biological Engineering Department, University of Florida, 1741 Museum Rd, Gainesville, FL 32611, USA
| | | | | | - Annarita Marrano
- Phoenix Bioinformatics, 39899 Balentine Drive, Suite 200, Newark, CA 94560, USA
| | | | - Wentao Zhang
- National Research Council Canada, 110 Gymnasium Pl, Saskatoon, Saskatchewan S7N 0W9, Canada
| | - Sook Jung
- Department of Horticulture, Washington State University, 303c Plant Sciences Building, Pullman, WA 99164-6414, USA
| |
Collapse
|
2
|
López-Ruiz BA, Quezada-Rodríguez EH, Piñeyro-Nelson A, Tovar H, García-Ponce B, Sánchez MDLP, Álvarez-Buylla ER, Garay-Arroyo A. Combined Approach of GWAS and Phylogenetic Analyses to Identify New Candidate Genes That Participate in Arabidopsis thaliana Primary Root Development Using Cellular Measurements and Primary Root Length. Plants (Basel) 2022; 11:3162. [PMID: 36432890 PMCID: PMC9697774 DOI: 10.3390/plants11223162] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/20/2022] [Revised: 11/13/2022] [Accepted: 11/15/2022] [Indexed: 06/16/2023]
Abstract
Genome-wide association studies (GWAS) have allowed the identification of different loci associated with primary root (PR) growth, and Arabidopsis is an excellent model for these studies. The PR length is controlled by cell proliferation, elongation, and differentiation; however, the specific contribution of proliferation and differentiation in the control of PR growth is still poorly studied. To this end, we analyzed 124 accessions and used a GWAS approach to identify potential causal genomic regions related to four traits: PR length, growth rate, cell proliferation and cell differentiation. Twenty-three genes and five statistically significant SNPs were identified. The SNP with the highest score mapped to the fifth exon of NAC048 and this change makes a missense variant in only 33.3% of the accessions with a large PR, compared with the accessions with a short PR length. Moreover, we detected five more SNPs in this gene and in NAC3 that allow us to discover closely related accessions according to the phylogenetic tree analysis. We also found that the association between genetic variants among the 18 genes with the highest scores in our GWAS and the phenotypic classes into which we divided our accessions are not straightforward and likely follow historical patterns.
Collapse
Affiliation(s)
- Brenda Anabel López-Ruiz
- Laboratorio de Genética Molecular, Desarrollo y Evolución de Plantas, Departamento de Ecología Funcional, Instituto de Ecología, Universidad Nacional Autónoma de México, Ciudad de México 04510, Mexico
| | - Elsa H. Quezada-Rodríguez
- Departamento de Producción Agrícola y Animal, Universidad Autónoma Metropolitana-Xochimilco, Ciudad de México 04510, Mexico
| | - Alma Piñeyro-Nelson
- Departamento de Producción Agrícola y Animal, Universidad Autónoma Metropolitana-Xochimilco, Ciudad de México 04510, Mexico
- Centro de Ciencias de la Complejidad, Universidad Nacional Autónoma de México, Ciudad de México 04510, Mexico
| | - Hugo Tovar
- División de Genómica Computacional, Instituto Nacional de Medicina Genómica (INMEGEN), Ciudad de México 14610, Mexico
| | - Berenice García-Ponce
- Laboratorio de Genética Molecular, Desarrollo y Evolución de Plantas, Departamento de Ecología Funcional, Instituto de Ecología, Universidad Nacional Autónoma de México, Ciudad de México 04510, Mexico
| | - María de la Paz Sánchez
- Laboratorio de Genética Molecular, Desarrollo y Evolución de Plantas, Departamento de Ecología Funcional, Instituto de Ecología, Universidad Nacional Autónoma de México, Ciudad de México 04510, Mexico
| | - Elena R. Álvarez-Buylla
- Laboratorio de Genética Molecular, Desarrollo y Evolución de Plantas, Departamento de Ecología Funcional, Instituto de Ecología, Universidad Nacional Autónoma de México, Ciudad de México 04510, Mexico
- Centro de Ciencias de la Complejidad, Universidad Nacional Autónoma de México, Ciudad de México 04510, Mexico
| | - Adriana Garay-Arroyo
- Laboratorio de Genética Molecular, Desarrollo y Evolución de Plantas, Departamento de Ecología Funcional, Instituto de Ecología, Universidad Nacional Autónoma de México, Ciudad de México 04510, Mexico
- Centro de Ciencias de la Complejidad, Universidad Nacional Autónoma de México, Ciudad de México 04510, Mexico
| |
Collapse
|
3
|
Jha SG, Borowsky AT, Cole BJ, Fahlgren N, Farmer A, Huang SSC, Karia P, Libault M, Provart NJ, Rice SL, Saura-Sanchez M, Agarwal P, Ahkami AH, Anderton CR, Briggs SP, Brophy JAN, Denolf P, Di Costanzo LF, Exposito-Alonso M, Giacomello S, Gomez-Cano F, Kaufmann K, Ko DK, Kumar S, Malkovskiy AV, Nakayama N, Obata T, Otegui MS, Palfalvi G, Quezada-Rodríguez EH, Singh R, Uhrig RG, Waese J, Van Wijk K, Wright RC, Ehrhardt DW, Birnbaum KD, Rhee SY. Vision, challenges and opportunities for a Plant Cell Atlas. eLife 2021; 10:e66877. [PMID: 34491200 PMCID: PMC8423441 DOI: 10.7554/elife.66877] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2021] [Accepted: 08/26/2021] [Indexed: 02/06/2023] Open
Abstract
With growing populations and pressing environmental problems, future economies will be increasingly plant-based. Now is the time to reimagine plant science as a critical component of fundamental science, agriculture, environmental stewardship, energy, technology and healthcare. This effort requires a conceptual and technological framework to identify and map all cell types, and to comprehensively annotate the localization and organization of molecules at cellular and tissue levels. This framework, called the Plant Cell Atlas (PCA), will be critical for understanding and engineering plant development, physiology and environmental responses. A workshop was convened to discuss the purpose and utility of such an initiative, resulting in a roadmap that acknowledges the current knowledge gaps and technical challenges, and underscores how the PCA initiative can help to overcome them.
Collapse
Affiliation(s)
- Suryatapa Ghosh Jha
- Department of Plant Biology, Carnegie Institution for ScienceStanfordUnited States
| | - Alexander T Borowsky
- Department of Botany and Plant Sciences, University of California, RiversideRiversideUnited States
| | - Benjamin J Cole
- Joint Genome Institute, Lawrence Berkeley National LaboratoryWalnut CreekUnited States
| | - Noah Fahlgren
- Donald Danforth Plant Science CenterSt. LouisUnited States
| | - Andrew Farmer
- National Center for Genome ResourcesSanta FeUnited States
| | | | - Purva Karia
- Department of Plant Biology, Carnegie Institution for ScienceStanfordUnited States
- Department of Cell and Systems Biology, University of TorontoTorontoCanada
| | - Marc Libault
- Department of Agronomy and Horticulture, University of Nebraska-LincolnLincolnUnited States
| | - Nicholas J Provart
- Department of Cell and Systems Biology and the Centre for the Analysis of Genome Evolution and Function, University of TorontoTorontoCanada
| | - Selena L Rice
- Department of Plant Biology, Carnegie Institution for ScienceStanfordUnited States
| | - Maite Saura-Sanchez
- Consejo Nacional de Investigaciones Científicas y Técnicas, Instituto de Investigaciones Fisiológicas y Ecológicas Vinculadas a la Agricultura, Facultad de Agronomía, Universidad de Buenos AiresBuenos AiresArgentina
| | - Pinky Agarwal
- National Institute of Plant Genome ResearchNew DelhiIndia
| | - Amir H Ahkami
- Environmental Molecular Sciences Division, Pacific Northwest National LaboratoryRichlandUnited States
| | - Christopher R Anderton
- Environmental Molecular Sciences Division, Pacific Northwest National LaboratoryRichlandUnited States
| | - Steven P Briggs
- Department of Biological Sciences, University of California, San DiegoSan DiegoUnited States
| | | | | | - Luigi F Di Costanzo
- Department of Agricultural Sciences, University of Naples Federico IINapoliItaly
| | - Moises Exposito-Alonso
- Department of Plant Biology, Carnegie Institution for ScienceStanfordUnited States
- Department of Plant Biology, Carnegie Institution for ScienceTübingenGermany
| | | | - Fabio Gomez-Cano
- Department of Biochemistry and Molecular Biology, Michigan State UniversityEast LansingUnited States
| | - Kerstin Kaufmann
- Department for Plant Cell and Molecular Biology, Institute for Biology, Humboldt-Universitaet zu BerlinBerlinGermany
| | - Dae Kwan Ko
- Great Lakes Bioenergy Research Center, Michigan State UniversityEast LansingUnited States
| | - Sagar Kumar
- Department of Plant Breeding & Genetics, Mata Gujri College, Fatehgarh Sahib, Punjabi UniversityPatialaIndia
| | - Andrey V Malkovskiy
- Department of Plant Biology, Carnegie Institution for ScienceStanfordUnited States
| | - Naomi Nakayama
- Department of Bioengineering, Imperial College LondonLondonUnited Kingdom
| | - Toshihiro Obata
- Department of Biochemistry, University of Nebraska-LincolnMadisonUnited States
| | - Marisa S Otegui
- Department of Botany, University of Wisconsin-MadisonMadisonUnited States
| | - Gergo Palfalvi
- Division of Evolutionary Biology, National Institute for Basic BiologyOkazakiJapan
| | - Elsa H Quezada-Rodríguez
- Ciencias Agrogenómicas, Escuela Nacional de Estudios Superiores Unidad León, Universidad Nacional Autónoma de MéxicoLeónMexico
| | - Rajveer Singh
- School of Agricultural Biotechnology, Punjab Agricultural UniversityLudhianaIndia
| | - R Glen Uhrig
- Department of Science, University of AlbertaEdmontonCanada
| | - Jamie Waese
- Department of Cell and Systems Biology/Centre for the Analysis of Genome Evolution and Function, University of TorontoTorontoCanada
| | - Klaas Van Wijk
- School of Integrated Plant Science, Plant Biology Section, Cornell UniversityIthacaUnited States
| | - R Clay Wright
- Department of Biological Systems Engineering, Virginia TechBlacksburgUnited States
| | - David W Ehrhardt
- Department of Plant Biology, Carnegie Institution for ScienceStanfordUnited States
| | - Kenneth D Birnbaum
- Center for Genomics and Systems Biology, New York UniversityNew YorkUnited States
| | - Seung Y Rhee
- Department of Plant Biology, Carnegie Institution for ScienceStanfordUnited States
| |
Collapse
|