1
|
Singhal R, Izquierdo P, Ranaweera T, Segura Abá K, Brown BN, Lehti-Shiu MD, Shiu SH. Using supervised machine-learning approaches to understand abiotic stress tolerance and design resilient crops. Philos Trans R Soc Lond B Biol Sci 2025; 380:20240252. [PMID: 40439305 PMCID: PMC12121380 DOI: 10.1098/rstb.2024.0252] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2024] [Revised: 01/14/2025] [Accepted: 01/15/2025] [Indexed: 06/02/2025] Open
Abstract
Abiotic stresses such as drought, heat, cold, salinity and flooding significantly impact plant growth, development and productivity. As the planet has warmed, these abiotic stresses have increased in frequency and intensity, affecting the global food supply and making it imperative to develop stress-resilient crops. In the past 20 years, the development of omics technologies has contributed to the growth of datasets for plants grown under a wide range of abiotic environments. Integration of these rapidly growing data using machine-learning (ML) approaches can complement existing breeding efforts by providing insights into the mechanisms underlying plant responses to stressful conditions, which can be used to guide the design of resilient crops. In this review, we introduce ML approaches and provide examples of how researchers use these approaches to predict molecular activities, gene functions and genotype responses under stressful conditions. Finally, we consider the potential and challenges of using such approaches to enable the design of crops that are better suited to a changing environment.This article is part of the theme issue 'Crops under stress: can we mitigate the impacts of climate change on agriculture and launch the 'Resilience Revolution'?'.
Collapse
Affiliation(s)
- Rajneesh Singhal
- Department of Plant Biology, Michigan State University, East Lansing, MI48824, USA
| | - Paulo Izquierdo
- Department of Plant Biology, Michigan State University, East Lansing, MI48824, USA
- DOE Great Lakes Bioenergy Research Center, Michigan State University, East Lansing, MI48824, USA
| | - Thilanka Ranaweera
- Department of Plant Biology, Michigan State University, East Lansing, MI48824, USA
- DOE Great Lakes Bioenergy Research Center, Michigan State University, East Lansing, MI48824, USA
| | - Kenia Segura Abá
- DOE Great Lakes Bioenergy Research Center, Michigan State University, East Lansing, MI48824, USA
- Genetics and Genome Sciences Program, Michigan State University, East Lansing, MI48824, USA
| | - Brianna N.I. Brown
- Department of Plant Biology, Michigan State University, East Lansing, MI48824, USA
| | | | - Shin-Han Shiu
- Department of Plant Biology, Michigan State University, East Lansing, MI48824, USA
- DOE Great Lakes Bioenergy Research Center, Michigan State University, East Lansing, MI48824, USA
- Genetics and Genome Sciences Program, Michigan State University, East Lansing, MI48824, USA
- Department of Computational Mathematics, Science, and Engineering, Michigan State University, East Lansing, MI48824, USA
| |
Collapse
|
2
|
Zhang R, Wang Y, Yang W, Wen J, Liu W, Zhi S, Li G, Chai N, Huang J, Xie Y, Xie X, Chen L, Gu M, Liu YG, Zhu Q. PlantGPT: An Arabidopsis-Based Intelligent Agent that Answers Questions about Plant Functional Genomics. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2025:e03926. [PMID: 40397417 DOI: 10.1002/advs.202503926] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/02/2025] [Revised: 04/15/2025] [Indexed: 05/22/2025]
Abstract
Research into plant gene function is crucial for developing strategies to increase crop yields. The recent introduction of large language models (LLMs) offers a means to aggregate large amounts of data into a queryable format, but the output can contain inaccurate or false claims known as hallucinations. To minimize such hallucinations and produce high-quality knowledge-based outputs, the abstracts of over 60 000 plant research articles are compiled into a Chroma database for retrieval-augmented generation (RAG). Then linguistic data are used from 13 993 Arabidopsis (Arabidopsis thaliana) phenotypes and 23 323 gene functions to fine-tune the LLM Llama3-8B, producing PlantGPT, a virtual expert in Arabidopsis phenotype-gene research. By evaluating answers to test questions, it is demonstrated that PlantGPT outperforms general LLMs in answering specialized questions. The findings provide a blueprint for functional genomics research in food crops and demonstrate the potential for developing LLMs for plant research modalities. To provide broader access and facilitate adoption, the online tool http://www.plantgpt.icu is developed, which will allow researchers to use PlantGPT in their scientific investigations.
Collapse
Affiliation(s)
- Ruixiang Zhang
- Guangdong Basic Research Center of Excellence for Precise Breeding of Future Crops, Guangdong Laboratory for Lingnan Modern Agriculture, State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, College of Agriculture, College of Life Science, South China Agricultural University, Guangzhou, 510642, China
| | - Yu Wang
- School of Life Sciences, Institute for Immunology, State Key Laboratory of Membrane Biology, China Ministry of Education Key Laboratory of Protein Sciences, Tsinghua University, Beijing, 100084, China
| | - Weiyang Yang
- Department of Automation, Tsinghua University, Beijing, 100084, China
| | - Jun Wen
- Guangdong Basic Research Center of Excellence for Precise Breeding of Future Crops, Guangdong Laboratory for Lingnan Modern Agriculture, State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, College of Agriculture, College of Life Science, South China Agricultural University, Guangzhou, 510642, China
| | - Weizhi Liu
- Guangdong Basic Research Center of Excellence for Precise Breeding of Future Crops, Guangdong Laboratory for Lingnan Modern Agriculture, State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, College of Agriculture, College of Life Science, South China Agricultural University, Guangzhou, 510642, China
| | - Shipeng Zhi
- Department of Medicine, Tsinghua University, Beijing, 100084, China
| | - Guangzhou Li
- Guangdong Basic Research Center of Excellence for Precise Breeding of Future Crops, Guangdong Laboratory for Lingnan Modern Agriculture, State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, College of Agriculture, College of Life Science, South China Agricultural University, Guangzhou, 510642, China
| | - Nan Chai
- Guangdong Basic Research Center of Excellence for Precise Breeding of Future Crops, Guangdong Laboratory for Lingnan Modern Agriculture, State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, College of Agriculture, College of Life Science, South China Agricultural University, Guangzhou, 510642, China
| | - Jiaqi Huang
- Engineering Research Center of Protection and Utilization of Plant Resources, College of Bioscience and Biotechnology, Shenyang Agricultural University, Shenyang, 110866, China
| | - Yongyao Xie
- Guangdong Basic Research Center of Excellence for Precise Breeding of Future Crops, Guangdong Laboratory for Lingnan Modern Agriculture, State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, College of Agriculture, College of Life Science, South China Agricultural University, Guangzhou, 510642, China
| | - Xianrong Xie
- Guangdong Basic Research Center of Excellence for Precise Breeding of Future Crops, Guangdong Laboratory for Lingnan Modern Agriculture, State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, College of Agriculture, College of Life Science, South China Agricultural University, Guangzhou, 510642, China
| | - Letian Chen
- Guangdong Basic Research Center of Excellence for Precise Breeding of Future Crops, Guangdong Laboratory for Lingnan Modern Agriculture, State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, College of Agriculture, College of Life Science, South China Agricultural University, Guangzhou, 510642, China
| | - Miao Gu
- Department of Automation, Tsinghua University, Beijing, 100084, China
| | - Yao-Guang Liu
- Guangdong Basic Research Center of Excellence for Precise Breeding of Future Crops, Guangdong Laboratory for Lingnan Modern Agriculture, State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, College of Agriculture, College of Life Science, South China Agricultural University, Guangzhou, 510642, China
| | - Qinlong Zhu
- Guangdong Basic Research Center of Excellence for Precise Breeding of Future Crops, Guangdong Laboratory for Lingnan Modern Agriculture, State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, College of Agriculture, College of Life Science, South China Agricultural University, Guangzhou, 510642, China
| |
Collapse
|
3
|
Che Y, Zhang C, Xing J, Xi Q, Shao Y, Zhao L, Guo S, Zuo Y. Machine Learning-Based identification of resistance genes associated with sunflower broomrape. PLANT METHODS 2025; 21:62. [PMID: 40380306 PMCID: PMC12082884 DOI: 10.1186/s13007-025-01383-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/18/2025] [Accepted: 04/29/2025] [Indexed: 05/19/2025]
Abstract
BACKGROUND Sunflowers (Helianthus annuus L.), a vital oil crop, are facing a severe challenge from broomrape (Orobanche cumana), a parasitic plant that seriously jeopardizes the growth and development of sunflowers, limits global production and leads to substantial economic losses, which urges the development of resistant sunflower varieties. RESULTS This study aims to identify resistance genes from a comprehensive transcriptomic profile of 103 sunflower varieties based on gene expression data and then constructs predictive models with the key resistant genes. The least absolute shrinkage and selection operator (LASSO) regression and random forest feature importance ranking method were used to identify resistance genes. These genes were considered as biomarkers in constructing machine learning models with Support Vector Machine (SVM), K-Nearest Neighbours (KNN), Logistic Regression (LR), and Gaussian Naive Bayes (GaussianNB). The SVM model constructed with the 24 key genes selected by the LASSO method demonstrated high classification accuracy (0.9514) and a robust AUC value (0.9865), effectively distinguishing between resistant and susceptible varieties based on gene expression data. Furthermore, we discovered a correlation between key genes and differential metabolites, particularly jasmonic acid (JA). CONCLUSION Our study highlights a novel perspective on screening sunflower varieties for broomrape resistance, which is anticipated to guide future biological research and breeding strategies.
Collapse
Affiliation(s)
- Yingxue Che
- The State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of Life Sciences, Inner Mongolia University, Hohhot, 010020, China
| | - Congzi Zhang
- The State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of Life Sciences, Inner Mongolia University, Hohhot, 010020, China
| | - Jixiang Xing
- The State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of Life Sciences, Inner Mongolia University, Hohhot, 010020, China
| | - Qilemuge Xi
- The State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of Life Sciences, Inner Mongolia University, Hohhot, 010020, China
| | - Ying Shao
- Inner Mongolia Academy of Agricultural and Animal Husbandry Sciences, Hohhot, 010000, China
| | - Lingmin Zhao
- Inner Mongolia Academy of Agricultural and Animal Husbandry Sciences, Hohhot, 010000, China
| | - Shuchun Guo
- Inner Mongolia Academy of Agricultural and Animal Husbandry Sciences, Hohhot, 010000, China.
| | - Yongchun Zuo
- The State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of Life Sciences, Inner Mongolia University, Hohhot, 010020, China.
| |
Collapse
|
4
|
Shohag MJI, Yang Q, He Z, Mostafa I, Chen S, Yang X. Multi-omics integration uncovers the zinc metabolic regulatory network in the hyperaccumulating ecotype of Sedum alfredii Hance. JOURNAL OF HAZARDOUS MATERIALS 2025; 494:138523. [PMID: 40373399 DOI: 10.1016/j.jhazmat.2025.138523] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/03/2025] [Revised: 05/03/2025] [Accepted: 05/05/2025] [Indexed: 05/17/2025]
Abstract
Plants require a fine balance of zinc (Zn) for proper growth and development. This fine-tuning of Zn metabolism is tightly regulated and often challenging task for plants. Hyperaccumulating ecotype of Sedum alfredii Hance, a Zn hyperaccumulator form the Crassulaceae family, offers a unique model to study Zn homeostasis. To date, their complex molecular mechanisms underlying Zn regulation remain largely unknown. Here, we present a large-scale comparative investigation of Zn homeostasis networks in Zn hyperaccumulating and non-hyperaccumulating ecotypes of S. alfredii. By integrating transcriptomics, proteomics, metabolomics, and ionomics, we uncovered that transcriptional and translational changes play critical roles in maintaining Zn homeostasis. These adaptations include enhanced photosynthetic efficiency, improved Zn ion binding in shoots, and increased antioxidative capacities. Additionally, carbon and sulfur metabolic pathways were found to respond significantly to Zn treatment. Key components of the tricarboxylic acid (TCA) cycle, along with stress-related amino acids, fatty acids, sugars, antioxidants, and Zn-binding phenolics, were coordinately modulated under Zn exposure. This multi-omics integration provides novel insights into the functional genomics and metabolic adaptations of the Zn hyperaccumulator S. alfredii and will facilitate biotechnological applications of Zn hyperaccumulation traits for biofortification, phytoremediation and food crop safety.
Collapse
Affiliation(s)
- M J I Shohag
- Ministry of Education (MOE) Key Laboratory of Environmental Remediation and Ecosystem Health, College of Environmental and Resources Science, Zhejiang University, Hangzhou 310058, People's Republic of China; Department of Soil, Water and Ecosystem Sciences, Indian River Research and Education Center, Institute of Food and Agricultural Sciences, University of Florida, Fort Pierce, FL 34945, United States of America; Department of Horticultural Sciences, Institute of Food and Agricultural Sciences, University of Florida, Gainesville, FL 32611, United States of America; Department of Agriculture, Faculty of Agricultural Sciences, Bangabandhu Sheikh Mujibur Rahman Science and Technology University, Gopalganj 8100, Bangladesh
| | - Qianying Yang
- Ministry of Education (MOE) Key Laboratory of Environmental Remediation and Ecosystem Health, College of Environmental and Resources Science, Zhejiang University, Hangzhou 310058, People's Republic of China; State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangdong Basic Research Center of Excellence for Precise Breeding of Future Crops, Key Laboratory for Enhancing Resource Use Efficiency of Crops in South China, Ministry of Agriculture and Rural Affairs, Guangdong Provincial Key Laboratory of Plant Molecular Breeding, College of Agriculture, South China Agricultural University, Guangzhou 510642, China
| | - Zhenli He
- Ministry of Education (MOE) Key Laboratory of Environmental Remediation and Ecosystem Health, College of Environmental and Resources Science, Zhejiang University, Hangzhou 310058, People's Republic of China; Department of Soil, Water and Ecosystem Sciences, Indian River Research and Education Center, Institute of Food and Agricultural Sciences, University of Florida, Fort Pierce, FL 34945, United States of America
| | - Islam Mostafa
- Department of Pharmacognosy, Faculty of Pharmacy, Zagazig University, Zagazig 44519, Egypt; Department of Chemistry, University of Florida, Gainesville, FL 32603, United States of America
| | - Sixue Chen
- Department of Biology, University of Mississippi, Oxford, MS 38677, United States of America
| | - Xiaoe Yang
- Ministry of Education (MOE) Key Laboratory of Environmental Remediation and Ecosystem Health, College of Environmental and Resources Science, Zhejiang University, Hangzhou 310058, People's Republic of China; Department of Soil, Water and Ecosystem Sciences, Indian River Research and Education Center, Institute of Food and Agricultural Sciences, University of Florida, Fort Pierce, FL 34945, United States of America.
| |
Collapse
|