1
|
Zhu W, Li W, Zhang H, Li L. Big data and artificial intelligence-aided crop breeding: Progress and prospects. JOURNAL OF INTEGRATIVE PLANT BIOLOGY 2025; 67:722-739. [PMID: 39467106 PMCID: PMC11951406 DOI: 10.1111/jipb.13791] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/30/2024] [Revised: 08/25/2024] [Accepted: 09/10/2024] [Indexed: 10/30/2024]
Abstract
The past decade has witnessed rapid developments in gene discovery, biological big data (BBD), artificial intelligence (AI)-aided technologies, and molecular breeding. These advancements are expected to accelerate crop breeding under the pressure of increasing demands for food. Here, we first summarize current breeding methods and discuss the need for new ways to support breeding efforts. Then, we review how to combine BBD and AI technologies for genetic dissection, exploring functional genes, predicting regulatory elements and functional domains, and phenotypic prediction. Finally, we propose the concept of intelligent precision design breeding (IPDB) driven by AI technology and offer ideas about how to implement IPDB. We hope that IPDB will enhance the predictability, efficiency, and cost of crop breeding compared with current technologies. As an example of IPDB, we explore the possibilities offered by CropGPT, which combines biological techniques, bioinformatics, and breeding art from breeders, and presents an open, shareable, and cooperative breeding system. IPDB provides integrated services and communication platforms for biologists, bioinformatics experts, germplasm resource specialists, breeders, dealers, and farmers, and should be well suited for future breeding.
Collapse
Affiliation(s)
- Wanchao Zhu
- Key Laboratory of Biology and Genetic Improvement of Maize in Arid Area of Northwest Region, College of AgronomyNorthwest A&F UniversityYangling712100China
- National Key Laboratory of Crop Genetic ImprovementHuazhong Agricultural UniversityWuhan430070China
| | - Weifu Li
- College of InformaticsHuazhong Agricultural UniversityWuhan430070China
- Engineering Research Center of Intelligent Technology for AgricultureMinistry of EducationWuhan430070China
| | - Hongwei Zhang
- State Key Laboratory of Crop Gene Resources and Breeding, National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop SciencesChinese Academy of Agricultural SciencesBeijing100081China
| | - Lin Li
- National Key Laboratory of Crop Genetic ImprovementHuazhong Agricultural UniversityWuhan430070China
| |
Collapse
|
2
|
Keerthi I, Shukla V, Kalluru S, Mohammad LA, Kumari PL, Ramireddy E, Vemireddy LR. Prioritization of candidate genes for major QTLs governing yield traits employing integrated multi-omics approach in rice (Oryza sativa L.). Brief Funct Genomics 2024; 23:843-857. [PMID: 39228011 DOI: 10.1093/bfgp/elae035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2024] [Revised: 08/10/2024] [Accepted: 08/19/2024] [Indexed: 09/05/2024] Open
Abstract
Rapidly identifying candidate genes underlying major QTLs is crucial for improving rice (Oryza sativa L.). In this study, we developed a workflow to rapidly prioritize candidate genes underpinning 99 major QTLs governing yield component traits. This workflow integrates multiomics databases, including sequence variation, gene expression, gene ontology, co-expression analysis, and protein-protein interaction. We predicted 206 candidate genes for 99 reported QTLs governing ten economically important yield-contributing traits using this approach. Among these, transcription factors belonging to families of MADS-box, WRKY, helix-loop-helix, TCP, MYB, GRAS, auxin response factor, and nuclear transcription factor Y subunit were promising. Validation of key prioritized candidate genes in contrasting rice genotypes for sequence variation and differential expression identified Leucine-Rich Repeat family protein (LOC_Os03g28270) and cytochrome P450 (LOC_Os02g57290) as candidate genes for the major QTLs GL1 and pl2.1, which govern grain length and panicle length, respectively. In conclusion, this study demonstrates that our workflow can significantly narrow down a large number of annotated genes in a QTL to a very small number of the most probable candidates, achieving approximately a 21-fold reduction. These candidate genes have potential implications for enhancing rice yield.
Collapse
Affiliation(s)
- Issa Keerthi
- Department of Molecular Biology and Biotechnology, S.V. Agricultural College, Acharya NG Ranga Agricultural University (ANGRAU), Tirupati, Andhra Pradesh 517502, India
| | - Vishnu Shukla
- Department of Biology, Indian Institute of Science Education and Research Tirupati (IISER) Tirupati, Andhra Pradesh 517507, India
| | - Sudhamani Kalluru
- Department of Genetics and Plant Breeding, S.V. Agricultural College, Acharya NG Ranga Agricultural University (ANGRAU), Tirupati, Andhra Pradesh 517502, India
| | - Lal Ahamed Mohammad
- Department of Genetics and Plant Breeding, Agricultural College, Acharya NG Ranga Agricultural University (ANGRAU), Bapatla, Guntur, Andhra Pradesh 522101, India
| | - P Lavanya Kumari
- Department of Statistics and Computer Applications, S.V. Agricultural College, Acharya NG Ranga Agricultural University (ANGRAU), Tirupati, Andhra Pradesh 517502, India
| | - Eswarayya Ramireddy
- Department of Biology, Indian Institute of Science Education and Research Tirupati (IISER) Tirupati, Andhra Pradesh 517507, India
| | - Lakshminarayana R Vemireddy
- Department of Molecular Biology and Biotechnology, S.V. Agricultural College, Acharya NG Ranga Agricultural University (ANGRAU), Tirupati, Andhra Pradesh 517502, India
| |
Collapse
|
3
|
Kaňovská I, Biová J, Škrabišová M. New perspectives of post-GWAS analyses: From markers to causal genes for more precise crop breeding. CURRENT OPINION IN PLANT BIOLOGY 2024; 82:102658. [PMID: 39549685 DOI: 10.1016/j.pbi.2024.102658] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/03/2024] [Revised: 10/08/2024] [Accepted: 10/19/2024] [Indexed: 11/18/2024]
Abstract
Crop breeding advancement is hindered by the imperfection of methods to reveal genes underlying key traits. Genome-wide Association Study (GWAS) is one such method, identifying genomic regions linked to phenotypes. Post-GWAS analyses predict candidate genes and assist in causative mutation (CM) recognition. Here, we assess post-GWAS approaches, address limitations in omics data integration and stress the importance of evaluating associated variants within a broader context of publicly available datasets. Recent advances in bioinformatics tools and genomic strategies for CM identification and allelic variation exploration are reviewed. We discuss the role of markers and marker panel development for more precise breeding. Finally, we highlight the perspectives and challenges of GWAS-based CM prediction for complex quantitative traits.
Collapse
Affiliation(s)
- Ivana Kaňovská
- Department of Biochemistry, Faculty of Science, Palacký University in Olomouc, Šlechtitelů 27, Olomouc 77900, Czech Republic
| | - Jana Biová
- Department of Biochemistry, Faculty of Science, Palacký University in Olomouc, Šlechtitelů 27, Olomouc 77900, Czech Republic
| | - Mária Škrabišová
- Department of Biochemistry, Faculty of Science, Palacký University in Olomouc, Šlechtitelů 27, Olomouc 77900, Czech Republic.
| |
Collapse
|
4
|
Farooq MA, Gao S, Hassan MA, Huang Z, Rasheed A, Hearne S, Prasanna B, Li X, Li H. Artificial intelligence in plant breeding. Trends Genet 2024; 40:891-908. [PMID: 39117482 DOI: 10.1016/j.tig.2024.07.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2024] [Revised: 07/06/2024] [Accepted: 07/12/2024] [Indexed: 08/10/2024]
Abstract
Harnessing cutting-edge technologies to enhance crop productivity is a pivotal goal in modern plant breeding. Artificial intelligence (AI) is renowned for its prowess in big data analysis and pattern recognition, and is revolutionizing numerous scientific domains including plant breeding. We explore the wider potential of AI tools in various facets of breeding, including data collection, unlocking genetic diversity within genebanks, and bridging the genotype-phenotype gap to facilitate crop breeding. This will enable the development of crop cultivars tailored to the projected future environments. Moreover, AI tools also hold promise for refining crop traits by improving the precision of gene-editing systems and predicting the potential effects of gene variants on plant phenotypes. Leveraging AI-enabled precision breeding can augment the efficiency of breeding programs and holds promise for optimizing cropping systems at the grassroots level. This entails identifying optimal inter-cropping and crop-rotation models to enhance agricultural sustainability and productivity in the field.
Collapse
Affiliation(s)
- Muhammad Amjad Farooq
- State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences (CAAS), International Maize and Wheat Improvement Center (CIMMYT) China office, Beijing 100081, China; Nanfan Research Institute, CAAS, Sanya, Hainan 572024, China
| | - Shang Gao
- State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences (CAAS), International Maize and Wheat Improvement Center (CIMMYT) China office, Beijing 100081, China; Nanfan Research Institute, CAAS, Sanya, Hainan 572024, China
| | - Muhammad Adeel Hassan
- Adaptive Cropping Systems Laboratory, Beltsville Agricultural Research Center, US Department of Agriculture, Beltsville, MD 20705, USA; Oak Ridge Institute for Science and Education, Oak Ridge, TN 37830, USA
| | - Zhangping Huang
- State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences (CAAS), International Maize and Wheat Improvement Center (CIMMYT) China office, Beijing 100081, China; Nanfan Research Institute, CAAS, Sanya, Hainan 572024, China
| | - Awais Rasheed
- Department of Plant Sciences, Quaid-i-Azam University, Islamabad 45320, Pakistan
| | - Sarah Hearne
- CIMMYT, KM 45 Carretera Mexico-Veracruz, El Batan, Texcoco 56237, Mexico
| | - Boddupalli Prasanna
- CIMMYT, International Centre for Research in Agroforestry (ICRAF) House, Nairobi 00100, Kenya
| | - Xinhai Li
- State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences (CAAS), International Maize and Wheat Improvement Center (CIMMYT) China office, Beijing 100081, China
| | - Huihui Li
- State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences (CAAS), International Maize and Wheat Improvement Center (CIMMYT) China office, Beijing 100081, China; Nanfan Research Institute, CAAS, Sanya, Hainan 572024, China.
| |
Collapse
|
5
|
Wang XY, Ren CX, Fan QW, Xu YP, Wang LW, Mao ZL, Cai XZ. Integrated Assays of Genome-Wide Association Study, Multi-Omics Co-Localization, and Machine Learning Associated Calcium Signaling Genes with Oilseed Rape Resistance to Sclerotinia sclerotiorum. Int J Mol Sci 2024; 25:6932. [PMID: 39000053 PMCID: PMC11240920 DOI: 10.3390/ijms25136932] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2024] [Revised: 06/20/2024] [Accepted: 06/20/2024] [Indexed: 07/14/2024] Open
Abstract
Sclerotinia sclerotiorum (Ss) is one of the most devastating fungal pathogens, causing huge yield loss in multiple economically important crops including oilseed rape. Plant resistance to Ss pertains to quantitative disease resistance (QDR) controlled by multiple minor genes. Genome-wide identification of genes involved in QDR to Ss is yet to be conducted. In this study, we integrated several assays including genome-wide association study (GWAS), multi-omics co-localization, and machine learning prediction to identify, on a genome-wide scale, genes involved in the oilseed rape QDR to Ss. Employing GWAS and multi-omics co-localization, we identified seven resistance-associated loci (RALs) associated with oilseed rape resistance to Ss. Furthermore, we developed a machine learning algorithm and named it Integrative Multi-Omics Analysis and Machine Learning for Target Gene Prediction (iMAP), which integrates multi-omics data to rapidly predict disease resistance-related genes within a broad chromosomal region. Through iMAP based on the identified RALs, we revealed multiple calcium signaling genes related to the QDR to Ss. Population-level analysis of selective sweeps and haplotypes of variants confirmed the positive selection of the predicted calcium signaling genes during evolution. Overall, this study has developed an algorithm that integrates multi-omics data and machine learning methods, providing a powerful tool for predicting target genes associated with specific traits. Furthermore, it makes a basis for further understanding the role and mechanisms of calcium signaling genes in the QDR to Ss.
Collapse
Affiliation(s)
- Xin-Yao Wang
- Key Laboratory of Biology and Ecological Control of Crop Pathogens and Insects of Zhejiang Province, Institute of Biotechnology, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou 310058, China; (X.-Y.W.); (C.-X.R.); (Q.-W.F.); (L.-W.W.); (Z.-L.M.)
| | - Chun-Xiu Ren
- Key Laboratory of Biology and Ecological Control of Crop Pathogens and Insects of Zhejiang Province, Institute of Biotechnology, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou 310058, China; (X.-Y.W.); (C.-X.R.); (Q.-W.F.); (L.-W.W.); (Z.-L.M.)
| | - Qing-Wen Fan
- Key Laboratory of Biology and Ecological Control of Crop Pathogens and Insects of Zhejiang Province, Institute of Biotechnology, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou 310058, China; (X.-Y.W.); (C.-X.R.); (Q.-W.F.); (L.-W.W.); (Z.-L.M.)
| | - You-Ping Xu
- Centre of Analysis and Measurement, Zhejiang University, 866 Yu Hang Tang Road, Hangzhou 310058, China;
| | - Lu-Wen Wang
- Key Laboratory of Biology and Ecological Control of Crop Pathogens and Insects of Zhejiang Province, Institute of Biotechnology, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou 310058, China; (X.-Y.W.); (C.-X.R.); (Q.-W.F.); (L.-W.W.); (Z.-L.M.)
| | - Zhou-Lu Mao
- Key Laboratory of Biology and Ecological Control of Crop Pathogens and Insects of Zhejiang Province, Institute of Biotechnology, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou 310058, China; (X.-Y.W.); (C.-X.R.); (Q.-W.F.); (L.-W.W.); (Z.-L.M.)
| | - Xin-Zhong Cai
- Key Laboratory of Biology and Ecological Control of Crop Pathogens and Insects of Zhejiang Province, Institute of Biotechnology, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou 310058, China; (X.-Y.W.); (C.-X.R.); (Q.-W.F.); (L.-W.W.); (Z.-L.M.)
- Hainan Institute, Zhejiang University, Sanya 572025, China
| |
Collapse
|
6
|
Chen L, Liu G, Zhang T. Integrating machine learning and genome editing for crop improvement. ABIOTECH 2024; 5:262-277. [PMID: 38974863 PMCID: PMC11224061 DOI: 10.1007/s42994-023-00133-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/26/2023] [Accepted: 12/18/2023] [Indexed: 07/09/2024]
Abstract
Genome editing is a promising technique that has been broadly utilized for basic gene function studies and trait improvements. Simultaneously, the exponential growth of computational power and big data now promote the application of machine learning for biological research. In this regard, machine learning shows great potential in the refinement of genome editing systems and crop improvement. Here, we review the advances of machine learning to genome editing optimization, with emphasis placed on editing efficiency and specificity enhancement. Additionally, we demonstrate how machine learning bridges genome editing and crop breeding, by accurate key site detection and guide RNA design. Finally, we discuss the current challenges and prospects of these two techniques in crop improvement. By integrating advanced genome editing techniques with machine learning, progress in crop breeding will be further accelerated in the future.
Collapse
Affiliation(s)
- Long Chen
- Jiangsu Key Laboratory of Crop Genomics and Molecular Breeding/Zhongshan Biological Breeding Laboratory/Key Laboratory of Plant Functional Genomics of the Ministry of Education, Agricultural College of Yangzhou University, Yangzhou, 225009 China
- Jiangsu Co-Innovation Center for Modern Production Technology of Grain Crops/Jiangsu Key Laboratory of Crop Genetics and Physiology, Yangzhou University, Yangzhou, 225009 China
| | - Guanqing Liu
- Jiangsu Key Laboratory of Crop Genomics and Molecular Breeding/Zhongshan Biological Breeding Laboratory/Key Laboratory of Plant Functional Genomics of the Ministry of Education, Agricultural College of Yangzhou University, Yangzhou, 225009 China
- Jiangsu Co-Innovation Center for Modern Production Technology of Grain Crops/Jiangsu Key Laboratory of Crop Genetics and Physiology, Yangzhou University, Yangzhou, 225009 China
| | - Tao Zhang
- Jiangsu Key Laboratory of Crop Genomics and Molecular Breeding/Zhongshan Biological Breeding Laboratory/Key Laboratory of Plant Functional Genomics of the Ministry of Education, Agricultural College of Yangzhou University, Yangzhou, 225009 China
- Jiangsu Co-Innovation Center for Modern Production Technology of Grain Crops/Jiangsu Key Laboratory of Crop Genetics and Physiology, Yangzhou University, Yangzhou, 225009 China
| |
Collapse
|
7
|
Matuszkiewicz M, Grądzielewska A, Święcicka M, Ozturk A, Mokrzycka M, Igbari Aramide D, Song J, Kilian A, Rakoczy-Trojanowska M. Identification of quantitative trait loci associated with leaf rust resistance in rye by precision mapping. BMC PLANT BIOLOGY 2024; 24:291. [PMID: 38632518 PMCID: PMC11022434 DOI: 10.1186/s12870-024-04960-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/05/2024] [Accepted: 03/29/2024] [Indexed: 04/19/2024]
Abstract
BACKGROUND Leaf rust (LR) is among the most destructive fungal diseases of rye (Secale cereale L.). Despite intensive research using various analytical and methodological approaches, such as quantitative trait locus (QTL) mapping, candidate gene expression analysis, and transcriptome sequencing, the genetic basis of the rye immune response to LR remains unclear. RESULTS A genome-wide association study was employed to detect QTLs controlling the immune response to LR of rye. A mapping population, G38A, was constructed by crossing two inbred lines: 723 (susceptible to LR) and JKI-NIL-Pr3 (a donor of the LR resistance gene Pr3). For genotyping, SNP-DArT and silico-DArT markers were used. Resistance phenotyping was conducted by visual assessment of the infection severity in detached leaf segments inoculated with two isolates of Puccinia recondita f. sp. secalis, namely, 60/17/2.1 (isolate S) in the main experiment and 86/n/2.1_5x (isolate N) in the validation experiment, at 10 and 17 days post-infection (dpi), respectively. In total, 42,773 SNP-DArT and 105,866 silico-DArT markers were included in the main analysis including isolate S, of which 129 and 140 SNP-DArTs and 767 and 776 silico-DArTs were significantly associated (p ≤ 0.001; - log10(p) ≥ 3.0) with the immune response to LR at 10 and 17 dpi, respectively. Most significant markers were mapped to chromosome 1R. The number of common markers from both systems and at both time points occupying common chromosomal positions was 37, of which 21 were positioned in genes, comprising 18 markers located in exons and three in introns. This gene pool included genes encoding proteins with a known function in response to LR (e.g., a NBS-LRR disease resistance protein-like protein and carboxyl-terminal peptidase). CONCLUSION This study has expanded and supplemented existing knowledge of the genetic basis of rye resistance to LR by (1) detecting two QTLs associated with the LR immune response of rye, of which one located on the long arm of chromosome 1R is newly detected, (2) assigning hundreds of markers significantly associated with the immune response to LR to genes in the 'Lo7' genome, and (3) predicting the potential translational effects of polymorphisms of SNP-DArT markers located within protein-coding genes.
Collapse
Affiliation(s)
- Mateusz Matuszkiewicz
- Department of Plant Genetics, Breeding and Biotechnology, Institute of Biology, Warsaw, University of Life Sciences, Warsaw, Poland
| | | | - Magdalena Święcicka
- Department of Plant Genetics, Breeding and Biotechnology, Institute of Biology, Warsaw, University of Life Sciences, Warsaw, Poland
| | - Alperen Ozturk
- Graduate School of Science and Technology, Niigata University, Niigata, Japan
| | - Monika Mokrzycka
- Department of Biometry and Bioinformatics, Institute of Plant Genetics Polish Academy of Sciences, Poznań, Poland
| | - Dolapo Igbari Aramide
- Department of Plant Genetics, Breeding and Biotechnology, Institute of Biology, Warsaw, University of Life Sciences, Warsaw, Poland
- Department of Botany, Faculty of Science, University of Lagos, Akoka, Lagos, Yaba, Nigeria
| | - Jie Song
- Diversity Arrays Technology, University of Canberra, Monana Street, Bruce, ACT, 2617, Australia
| | - Andrzej Kilian
- Diversity Arrays Technology, University of Canberra, Monana Street, Bruce, ACT, 2617, Australia
| | - Monika Rakoczy-Trojanowska
- Department of Plant Genetics, Breeding and Biotechnology, Institute of Biology, Warsaw, University of Life Sciences, Warsaw, Poland.
| |
Collapse
|
8
|
Sharma N, Raman H, Wheeler D, Kalenahalli Y, Sharma R. Data-driven approaches to improve water-use efficiency and drought resistance in crop plants. PLANT SCIENCE : AN INTERNATIONAL JOURNAL OF EXPERIMENTAL PLANT BIOLOGY 2023; 336:111852. [PMID: 37659733 DOI: 10.1016/j.plantsci.2023.111852] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/11/2022] [Revised: 08/23/2023] [Accepted: 08/29/2023] [Indexed: 09/04/2023]
Abstract
With the increasing population, there lies a pressing demand for food, feed and fibre, while the changing climatic conditions pose severe challenges for agricultural production worldwide. Water is the lifeline for crop production; thus, enhancing crop water-use efficiency (WUE) and improving drought resistance in crop varieties are crucial for overcoming these challenges. Genetically-driven improvements in yield, WUE and drought tolerance traits can buffer the worst effects of climate change on crop production in dry areas. While traditional crop breeding approaches have delivered impressive results in increasing yield, the methods remain time-consuming and are often limited by the existing allelic variation present in the germplasm. Significant advances in breeding and high-throughput omics technologies in parallel with smart agriculture practices have created avenues to dramatically speed up the process of trait improvement by leveraging the vast volumes of genomic and phenotypic data. For example, individual genome and pan-genome assemblies, along with transcriptomic, metabolomic and proteomic data from germplasm collections, characterised at phenotypic levels, could be utilised to identify marker-trait associations and superior haplotypes for crop genetic improvement. In addition, these omics approaches enable the identification of genes involved in pathways leading to the expression of a trait, thereby providing an understanding of the genetic, physiological and biochemical basis of trait variation. These data-driven gene discoveries and validation approaches are essential for crop improvement pipelines, including genomic breeding, speed breeding and gene editing. Herein, we provide an overview of prospects presented using big data-driven approaches (including artificial intelligence and machine learning) to harness new genetic gains for breeding programs and develop drought-tolerant crop varieties with favourable WUE and high-yield potential traits.
Collapse
Affiliation(s)
- Niharika Sharma
- NSW Department of Primary Industries, Orange Agricultural Institute, Orange, NSW 2800, Australia.
| | - Harsh Raman
- NSW Department of Primary Industries, Wagga Wagga Agricultural Institute, Wagga Wagga, NSW 2650, Australia
| | - David Wheeler
- NSW Department of Primary Industries, Orange Agricultural Institute, Orange, NSW 2800, Australia
| | - Yogendra Kalenahalli
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Patancheru, Hyderabad, Telangana 502324, India
| | - Rita Sharma
- Department of Biological Sciences, BITS Pilani, Pilani Campus, Rajasthan 333031, India
| |
Collapse
|
9
|
Ferrão LFV, Dhakal R, Dias R, Tieman D, Whitaker V, Gore MA, Messina C, Resende MFR. Machine learning applications to improve flavor and nutritional content of horticultural crops through breeding and genetics. Curr Opin Biotechnol 2023; 83:102968. [PMID: 37515935 DOI: 10.1016/j.copbio.2023.102968] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2023] [Revised: 06/19/2023] [Accepted: 06/21/2023] [Indexed: 07/31/2023]
Abstract
Over the last decades, significant strides were made in understanding the biochemical factors influencing the nutritional content and flavor profile of fruits and vegetables. Product differentiation in the produce aisle is the natural consequence of increasing consumer power in the food industry. Cotton-candy grapes, specialty tomatoes, and pineapple-flavored white strawberries provide a few examples. Given the increased demand for flavorful varieties, and pressing need to reduce micronutrient malnutrition, we expect breeding to increase its prioritization toward these traits. Reaching this goal will, in part, necessitate knowledge of the genetic architecture controlling these traits, as well as the development of breeding methods that maximize their genetic gain. Can artificial intelligence (AI) help predict flavor preferences, and can such insights be leveraged by breeding programs? In this Perspective, we outline both the opportunities and challenges for the development of more flavorful and nutritious crops, and how AI can support these breeding initiatives.
Collapse
Affiliation(s)
- Luís Felipe V Ferrão
- Horticultural Sciences Department, University of Florida, Gainesville, FL, United States
| | - Rakshya Dhakal
- Plant Breeding Graduate Program, University of Florida, Gainesville, FL, United States
| | - Raquel Dias
- Microbiology and Cell Science Department, University of Florida, Gainesville, FL, United States
| | - Denise Tieman
- Horticultural Sciences Department, University of Florida, Gainesville, FL, United States
| | - Vance Whitaker
- Horticultural Sciences Department, University of Florida, Gainesville, FL, United States; Plant Breeding Graduate Program, University of Florida, Gainesville, FL, United States
| | - Michael A Gore
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, United States
| | - Carlos Messina
- Horticultural Sciences Department, University of Florida, Gainesville, FL, United States; Plant Breeding Graduate Program, University of Florida, Gainesville, FL, United States
| | - Márcio F R Resende
- Horticultural Sciences Department, University of Florida, Gainesville, FL, United States; Plant Breeding Graduate Program, University of Florida, Gainesville, FL, United States.
| |
Collapse
|
10
|
Depuydt T, De Rybel B, Vandepoele K. Charting plant gene functions in the multi-omics and single-cell era. TRENDS IN PLANT SCIENCE 2023; 28:283-296. [PMID: 36307271 DOI: 10.1016/j.tplants.2022.09.008] [Citation(s) in RCA: 22] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/20/2022] [Revised: 09/09/2022] [Accepted: 09/30/2022] [Indexed: 06/16/2023]
Abstract
Despite the increased access to high-quality plant genome sequences, the set of genes with a known function remains far from complete. With the advent of novel bulk and single-cell omics profiling methods, we are entering a new era where advanced and highly integrative functional annotation strategies are being developed to elucidate the functions of all plant genes. Here, we review different multi-omics approaches to improve functional and regulatory gene characterization and highlight the power of machine learning and network biology to fully exploit the complementary information embedded in different omics layers. Finally, we discuss the potential of emerging single-cell methods and algorithms to further increase the resolution, allowing generation of functional insights about plant biology.
Collapse
Affiliation(s)
- Thomas Depuydt
- Ghent University, Department of Plant Biotechnology and Bioinformatics, Ghent, Belgium; Vlaams Instituut voor Biotechnologie, Center for Plant Systems Biology, Ghent, Belgium
| | - Bert De Rybel
- Ghent University, Department of Plant Biotechnology and Bioinformatics, Ghent, Belgium; Vlaams Instituut voor Biotechnologie, Center for Plant Systems Biology, Ghent, Belgium
| | - Klaas Vandepoele
- Ghent University, Department of Plant Biotechnology and Bioinformatics, Ghent, Belgium; Vlaams Instituut voor Biotechnologie, Center for Plant Systems Biology, Ghent, Belgium; Ghent University, Bioinformatics Institute Ghent, Ghent, Belgium.
| |
Collapse
|
11
|
Yan J, Wang X. Machine learning bridges omics sciences and plant breeding. TRENDS IN PLANT SCIENCE 2023; 28:199-210. [PMID: 36153276 DOI: 10.1016/j.tplants.2022.08.018] [Citation(s) in RCA: 32] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/08/2022] [Revised: 08/15/2022] [Accepted: 08/23/2022] [Indexed: 06/16/2023]
Abstract
Some of the biological knowledge obtained from fundamental research will be implemented in applied plant breeding. To bridge basic research and breeding practice, machine learning (ML) holds great promise to translate biological knowledge and omics data into precision-designed plant breeding. Here, we review ML for multi-omics analysis in plants, including data dimensionality reduction, inference of gene-regulation networks, and gene discovery and prioritization. These applications will facilitate understanding trait regulation mechanisms and identifying target genes potentially applicable to knowledge-driven molecular design breeding. We also highlight applications of deep learning in plant phenomics and ML in genomic selection-assisted breeding, such as various ML algorithms that model the correlations among genotypes (genes), phenotypes (traits), and environments, to ultimately achieve data-driven genomic design breeding.
Collapse
Affiliation(s)
- Jun Yan
- National Maize Improvement Center, College of Agronomy and Biotechnology, China Agricultural University, Beijing 100094, China; Frontiers Science Center for Molecular Design Breeding, China Agricultural University, Beijing 100094, China
| | - Xiangfeng Wang
- National Maize Improvement Center, College of Agronomy and Biotechnology, China Agricultural University, Beijing 100094, China; Frontiers Science Center for Molecular Design Breeding, China Agricultural University, Beijing 100094, China.
| |
Collapse
|
12
|
Naqvi RZ, Siddiqui HA, Mahmood MA, Najeebullah S, Ehsan A, Azhar M, Farooq M, Amin I, Asad S, Mukhtar Z, Mansoor S, Asif M. Smart breeding approaches in post-genomics era for developing climate-resilient food crops. FRONTIERS IN PLANT SCIENCE 2022; 13:972164. [PMID: 36186056 PMCID: PMC9523482 DOI: 10.3389/fpls.2022.972164] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/17/2022] [Accepted: 08/15/2022] [Indexed: 06/16/2023]
Abstract
Improving the crop traits is highly required for the development of superior crop varieties to deal with climate change and the associated abiotic and biotic stress challenges. Climate change-driven global warming can trigger higher insect pest pressures and plant diseases thus affecting crop production sternly. The traits controlling genes for stress or disease tolerance are economically imperative in crop plants. In this scenario, the extensive exploration of available wild, resistant or susceptible germplasms and unraveling the genetic diversity remains vital for breeding programs. The dawn of next-generation sequencing technologies and omics approaches has accelerated plant breeding by providing the genome sequences and transcriptomes of several plants. The availability of decoded plant genomes offers an opportunity at a glance to identify candidate genes, quantitative trait loci (QTLs), molecular markers, and genome-wide association studies that can potentially aid in high throughput marker-assisted breeding. In recent years genomics is coupled with marker-assisted breeding to unravel the mechanisms to harness better better crop yield and quality. In this review, we discuss the aspects of marker-assisted breeding and recent perspectives of breeding approaches in the era of genomics, bioinformatics, high-tech phonemics, genome editing, and new plant breeding technologies for crop improvement. In nutshell, the smart breeding toolkit in the post-genomics era can steadily help in developing climate-smart future food crops.
Collapse
|
13
|
Demenkov PS, Oshchepkova ЕА, Demenkov PS, Ivanisenko TV, Ivanisenko VA. Prioritization of biological processes based on the reconstruction and analysis of associative gene networks describing the response of plants to adverse environmental factors. Vavilovskii Zhurnal Genet Selektsii 2021; 25:580-592. [PMID: 34723066 PMCID: PMC8543060 DOI: 10.18699/vj21.065] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2020] [Revised: 06/21/2021] [Accepted: 06/21/2021] [Indexed: 11/23/2022] Open
Abstract
Methods for prioritizing or ranking candidate genes according to their importance based on specif ic criteria
via the analysis of gene networks are widely used in biomedicine to search for genes associated with diseases and to
predict biomarkers, pharmacological targets and other clinically relevant molecules. These methods have also been
used in other f ields, particularly in crop production. This is largely due to the development of technologies to solve
problems in marker-oriented and genomic selection, which requires knowledge of the molecular genetic mechanisms
underlying the formation of agriculturally valuable traits. A new direction for the study of molecular genetic mechanisms
is the prioritization of biological processes based on the analysis of associative gene networks. Associative gene
networks are heterogeneous networks whose vertices can depict both molecular genetic objects (genes, proteins, metabolites,
etc.) and the higher-level factors (biological processes, diseases, external environmental factors, etc.) related
to regulatory, physicochemical or associative interactions. Using a previously developed method, biological processes
involved in plant responses to increased cadmium content, saline stress and drought conditions were prioritized according
to their degree of connection with the gene networks in the SOLANUM TUBEROSUM knowledge base. The
prioritization results indicate that fundamental processes, such as gene expression, post-translational modif ications,
protein degradation, programmed cell death, photosynthesis, signal transmission and stress response play important
roles in the common molecular genetic mechanisms for plant response to various adverse factors. On the other hand, a
group of processes related to the development of seeds (“seeding development”) was revealed to be drought specif ic,
while processes associated with ion transport (“ion transport”) were included in the list of responses specif ic to salt
stress and processes associated with the metabolism of lipids were found to be involved specif ically in the response to
cadmium.
Collapse
Affiliation(s)
- P S Demenkov
- Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia Novosibirsk State University, Novosibirsk, Russia
| | - Е А Oshchepkova
- Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| | - P S Demenkov
- Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia Novosibirsk State University, Novosibirsk, Russia
| | - T V Ivanisenko
- Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| | - V A Ivanisenko
- Novosibirsk State University, Novosibirsk, Russiavosibirsk, Russia Kurchatov Genomic Center of ICG SB RAS, Novosibirsk, Russia
| |
Collapse
|
14
|
Genome-wide association study and gene network analyses reveal potential candidate genes for high night temperature tolerance in rice. Sci Rep 2021; 11:6747. [PMID: 33762605 PMCID: PMC7991035 DOI: 10.1038/s41598-021-85921-z] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2020] [Accepted: 03/08/2021] [Indexed: 12/13/2022] Open
Abstract
High night temperatures (HNT) are shown to significantly reduce rice (Oryza sativa L.) yield and quality. A better understanding of the genetic architecture of HNT tolerance will help rice breeders to develop varieties adapted to future warmer climates. In this study, a diverse indica rice panel displayed a wide range of phenotypic variability in yield and quality traits under control night (24 °C) and higher night (29 °C) temperatures. Genome-wide association analysis revealed 38 genetic loci associated across treatments (18 for control and 20 for HNT). Nineteen loci were detected with the relative changes in the traits between control and HNT. Positive phenotypic correlations and co-located genetic loci with previously cloned grain size genes revealed common genetic regulation between control and HNT, particularly grain size. Network-based predictive models prioritized 20 causal genes at the genetic loci based on known gene/s expression under HNT in rice. Our study provides important insights for future candidate gene validation and molecular marker development to enhance HNT tolerance in rice. Integrated physiological, genomic, and gene network-informed approaches indicate that the candidate genes for stay-green trait may be relevant to minimizing HNT-induced yield and quality losses during grain filling in rice by optimizing source-sink relationships.
Collapse
|
15
|
Abstract
Technological developments have revolutionized measurements on plant genotypes and phenotypes, leading to routine production of large, complex data sets. This has led to increased efforts to extract meaning from these measurements and to integrate various data sets. Concurrently, machine learning has rapidly evolved and is now widely applied in science in general and in plant genotyping and phenotyping in particular. Here, we review the application of machine learning in the context of plant science and plant breeding. We focus on analyses at different phenotype levels, from biochemical to yield, and in connecting genotypes to these. In this way, we illustrate how machine learning offers a suite of methods that enable researchers to find meaningful patterns in relevant plant data.
Collapse
Affiliation(s)
- Aalt Dirk Jan van Dijk
- Bioinformatics Group, Department of Plant Sciences, Wageningen University and Research, Wageningen 6708 PB, the Netherlands
- Biometris, Department of Plant Sciences, Wageningen University and Research, Wageningen 6708 PB, the Netherlands
| | - Gert Kootstra
- Farm Technology, Department of Plant Sciences, Wageningen University and Research, Wageningen 6708 PB, the Netherlands
| | - Willem Kruijer
- Biometris, Department of Plant Sciences, Wageningen University and Research, Wageningen 6708 PB, the Netherlands
| | - Dick de Ridder
- Bioinformatics Group, Department of Plant Sciences, Wageningen University and Research, Wageningen 6708 PB, the Netherlands
| |
Collapse
|
16
|
Fu Y, Xu J, Tang Z, Wang L, Yin D, Fan Y, Zhang D, Deng F, Zhang Y, Zhang H, Wang H, Xing W, Yin L, Zhu S, Zhu M, Yu M, Li X, Liu X, Yuan X, Zhao S. A gene prioritization method based on a swine multi-omics knowledgebase and a deep learning model. Commun Biol 2020; 3:502. [PMID: 32913254 PMCID: PMC7483748 DOI: 10.1038/s42003-020-01233-4] [Citation(s) in RCA: 34] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2020] [Accepted: 08/07/2020] [Indexed: 12/27/2022] Open
Abstract
The analyses of multi-omics data have revealed candidate genes for objective traits. However, they are integrated poorly, especially in non-model organisms, and they pose a great challenge for prioritizing candidate genes for follow-up experimental verification. Here, we present a general convolutional neural network model that integrates multi-omics information to prioritize the candidate genes of objective traits. By applying this model to Sus scrofa, which is a non-model organism, but one of the most important livestock animals, the model precision was 72.9%, recall 73.5%, and F1-Measure 73.4%, demonstrating a good prediction performance compared with previous studies in Arabidopsis thaliana and Oryza sativa. Additionally, to facilitate the use of the model, we present ISwine (http://iswine.iomics.pro/), which is an online comprehensive knowledgebase in which we incorporated almost all the published swine multi-omics data. Overall, the results suggest that the deep learning strategy will greatly facilitate analyses of multi-omics integration in the future. Yuhua Fu et al. develop a CNN model that integrates multi-omics information to prioritize candidate genes of objective traits. Their model performs well when applied to important livestock non-model animals like Sus scrofa. Finally, the authors present ISwine, an online comprehensive knowledgebase which includes all published swine omics data to facilitate the integration of heterogeneous data.
Collapse
Affiliation(s)
- Yuhua Fu
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction, Ministry of Education, Key Laboratory of Swine Genetics and Breeding, Ministry of Agriculture, College of Animal Science and Technology, Huazhong Agricultural University, 430070, Wuhan, Hubei, P.R. China.,School of Computer Science and Technology, Wuhan University of Technology, 430070, Wuhan, Hubei, P.R. China
| | - Jingya Xu
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction, Ministry of Education, Key Laboratory of Swine Genetics and Breeding, Ministry of Agriculture, College of Animal Science and Technology, Huazhong Agricultural University, 430070, Wuhan, Hubei, P.R. China
| | - Zhenshuang Tang
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction, Ministry of Education, Key Laboratory of Swine Genetics and Breeding, Ministry of Agriculture, College of Animal Science and Technology, Huazhong Agricultural University, 430070, Wuhan, Hubei, P.R. China
| | - Lu Wang
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction, Ministry of Education, Key Laboratory of Swine Genetics and Breeding, Ministry of Agriculture, College of Animal Science and Technology, Huazhong Agricultural University, 430070, Wuhan, Hubei, P.R. China
| | - Dong Yin
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction, Ministry of Education, Key Laboratory of Swine Genetics and Breeding, Ministry of Agriculture, College of Animal Science and Technology, Huazhong Agricultural University, 430070, Wuhan, Hubei, P.R. China
| | - Yu Fan
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction, Ministry of Education, Key Laboratory of Swine Genetics and Breeding, Ministry of Agriculture, College of Animal Science and Technology, Huazhong Agricultural University, 430070, Wuhan, Hubei, P.R. China
| | - Dongdong Zhang
- School of Computer Science and Technology, Wuhan University of Technology, 430070, Wuhan, Hubei, P.R. China
| | - Fei Deng
- School of Computer Science and Technology, Wuhan University of Technology, 430070, Wuhan, Hubei, P.R. China
| | - Yanping Zhang
- School of Computer Science and Technology, Wuhan University of Technology, 430070, Wuhan, Hubei, P.R. China
| | - Haohao Zhang
- School of Computer Science and Technology, Wuhan University of Technology, 430070, Wuhan, Hubei, P.R. China
| | - Haiyan Wang
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction, Ministry of Education, Key Laboratory of Swine Genetics and Breeding, Ministry of Agriculture, College of Animal Science and Technology, Huazhong Agricultural University, 430070, Wuhan, Hubei, P.R. China
| | - Wenhui Xing
- School of Computer Science and Technology, Wuhan University of Technology, 430070, Wuhan, Hubei, P.R. China
| | - Lilin Yin
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction, Ministry of Education, Key Laboratory of Swine Genetics and Breeding, Ministry of Agriculture, College of Animal Science and Technology, Huazhong Agricultural University, 430070, Wuhan, Hubei, P.R. China
| | - Shilin Zhu
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction, Ministry of Education, Key Laboratory of Swine Genetics and Breeding, Ministry of Agriculture, College of Animal Science and Technology, Huazhong Agricultural University, 430070, Wuhan, Hubei, P.R. China
| | - Mengjin Zhu
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction, Ministry of Education, Key Laboratory of Swine Genetics and Breeding, Ministry of Agriculture, College of Animal Science and Technology, Huazhong Agricultural University, 430070, Wuhan, Hubei, P.R. China
| | - Mei Yu
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction, Ministry of Education, Key Laboratory of Swine Genetics and Breeding, Ministry of Agriculture, College of Animal Science and Technology, Huazhong Agricultural University, 430070, Wuhan, Hubei, P.R. China
| | - Xinyun Li
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction, Ministry of Education, Key Laboratory of Swine Genetics and Breeding, Ministry of Agriculture, College of Animal Science and Technology, Huazhong Agricultural University, 430070, Wuhan, Hubei, P.R. China
| | - Xiaolei Liu
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction, Ministry of Education, Key Laboratory of Swine Genetics and Breeding, Ministry of Agriculture, College of Animal Science and Technology, Huazhong Agricultural University, 430070, Wuhan, Hubei, P.R. China.
| | - Xiaohui Yuan
- School of Computer Science and Technology, Wuhan University of Technology, 430070, Wuhan, Hubei, P.R. China.
| | - Shuhong Zhao
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction, Ministry of Education, Key Laboratory of Swine Genetics and Breeding, Ministry of Agriculture, College of Animal Science and Technology, Huazhong Agricultural University, 430070, Wuhan, Hubei, P.R. China.
| |
Collapse
|
17
|
QTG-Finder2: A Generalized Machine-Learning Algorithm for Prioritizing QTL Causal Genes in Plants. G3-GENES GENOMES GENETICS 2020; 10:2411-2421. [PMID: 32430305 PMCID: PMC7341141 DOI: 10.1534/g3.120.401122] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]
Abstract
Linkage mapping has been widely used to identify quantitative trait loci (QTL) in many plants and usually requires a time-consuming and labor-intensive fine mapping process to find the causal gene underlying the QTL. Previously, we described QTG-Finder, a machine-learning algorithm to rationally prioritize candidate causal genes in QTLs. While it showed good performance, QTG-Finder could only be used in Arabidopsis and rice because of the limited number of known causal genes in other species. Here we tested the feasibility of enabling QTG-Finder to work on species that have few or no known causal genes by using orthologs of known causal genes as the training set. The model trained with orthologs could recall about 64% of Arabidopsis and 83% of rice causal genes when the top 20% ranked genes were considered, which is similar to the performance of models trained with known causal genes. The average precision was 0.027 for Arabidopsis and 0.029 for rice. We further extended the algorithm to include polymorphisms in conserved non-coding sequences and gene presence/absence variation as additional features. Using this algorithm, QTG-Finder2, we trained and cross-validated Sorghum bicolor and Setaria viridis models. The S. bicolor model was validated by causal genes curated from the literature and could recall 70% of causal genes when the top 20% ranked genes were considered. In addition, we applied the S. viridis model and public transcriptome data to prioritize a plant height QTL and identified 13 candidate genes. QTL-Finder2 can accelerate the discovery of causal genes in any plant species and facilitate agricultural trait improvement.
Collapse
|
18
|
Kliebenstein DJ. Using networks to identify and interpret natural variation. CURRENT OPINION IN PLANT BIOLOGY 2020; 54:122-126. [PMID: 32413801 DOI: 10.1016/j.pbi.2020.04.005] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/06/2019] [Accepted: 04/20/2020] [Indexed: 06/11/2023]
Abstract
Studies on natural variation and network biology inherently work to summarize vast amounts of information and data. The combination of these two areas of study while creating datasets of immense complexity is critical to their mutual progress. Networks are necessary as a way to work to reduce the dimensionality inherent in natural variation with 100 s to 1000 s of genotypes. Correspondingly natural variation is essential for testing how networks may or may not be shared across individuals or species. Advances in this area of cross-fertilization including using networks directly as phenotypes and the use of networks to help in prioritizing candidate gene validation efforts. Interesting new observations on frequent presence-absence variation in gene content and adaptation is beginning to highlight the potential for natural variation in network presence-absence. This review attempts to delve into these new insights.
Collapse
Affiliation(s)
- Daniel J Kliebenstein
- Department of Plant Sciences, University of California, Davis, One Shields Avenue, Davis, CA, 95616, USA; DynaMo Center of Excellence, University of Copenhagen, Thorvaldsensvej 40, DK-1871, Frederiksberg C, Denmark.
| |
Collapse
|
19
|
Ahmed H, Nazir MF, Pan Z, Gong W, Iqbal MS, He S, Du X. Genotyping by Sequencing Revealed QTL Hotspots for Trichome-Based Plant Defense in Gossypium hirsutum. Genes (Basel) 2020; 11:E368. [PMID: 32231109 PMCID: PMC7231212 DOI: 10.3390/genes11040368] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2020] [Revised: 03/19/2020] [Accepted: 03/26/2020] [Indexed: 01/23/2023] Open
Abstract
Cotton possesses certain physical features, including leaf and stem trichomes that help plants deter damage caused by insect pests, and to some extent, from abiotic factors as well. Among those features, trichomes (pubescence) hold a special place as a first line of defense and a managemental tool against sucking insect pests of cotton. Different insect pests of cotton (whiteflies, aphids, jassids, and boll weevil) severely damage the yield and quality of the crop. Likewise, whiteflies, aphids, jassids, and other insect pests are considered as potential carriers for cotton leaf curl viruses and other diseases. Genotyping by sequencing (GBS) study was conducted to understand and explore the genomic regions governing hairy (Pubescence) leaves and stem phenotypes. A total of 224 individuals developed from an intraspecific cross (densely haired cotton (Liaoyang duomao mian) × hairless cotton (Zong 128)) and characterized phenotypically for leaf and stem pubescence in different environments. Here we identify and report significant QTLs (quantitative trait loci) associated with leaf and stem pubescence, and the response of plant under pest (aphid) infestation. Further, we identified putative genes colocalized on chromosome A06 governing mechanism for trichome development and host-pest interaction. Our study provides a comprehensive insight into genetic architecture that can be employed to improve molecular marker-assisted breeding programs aimed at developing biotic (insect pests) resilient cotton cultivars.
Collapse
Affiliation(s)
- Haris Ahmed
- Institute of Cotton Research, Chinese Academy of Agricultural Sciences, State Key Laboratory of Cotton Biology, Anyang, Henan 455000, China; (H.A.); (M.F.N.); (Z.P.); (M.S.I.); (S.H.)
| | - Mian Faisal Nazir
- Institute of Cotton Research, Chinese Academy of Agricultural Sciences, State Key Laboratory of Cotton Biology, Anyang, Henan 455000, China; (H.A.); (M.F.N.); (Z.P.); (M.S.I.); (S.H.)
| | - Zhoe Pan
- Institute of Cotton Research, Chinese Academy of Agricultural Sciences, State Key Laboratory of Cotton Biology, Anyang, Henan 455000, China; (H.A.); (M.F.N.); (Z.P.); (M.S.I.); (S.H.)
| | - Wenfang Gong
- Key Laboratory of Cultivation and Protection for Non-Wood Forest Trees, Ministry of Education, Central South University of Forestry and Technology, Changsha 410004, China;
| | - Muhammad Shahid Iqbal
- Institute of Cotton Research, Chinese Academy of Agricultural Sciences, State Key Laboratory of Cotton Biology, Anyang, Henan 455000, China; (H.A.); (M.F.N.); (Z.P.); (M.S.I.); (S.H.)
- Cotton Research Station, Ayub Agriculture Research Institute, Faisalabad 38000, Pakistan
| | - Shoupu He
- Institute of Cotton Research, Chinese Academy of Agricultural Sciences, State Key Laboratory of Cotton Biology, Anyang, Henan 455000, China; (H.A.); (M.F.N.); (Z.P.); (M.S.I.); (S.H.)
| | - Xiongming Du
- Institute of Cotton Research, Chinese Academy of Agricultural Sciences, State Key Laboratory of Cotton Biology, Anyang, Henan 455000, China; (H.A.); (M.F.N.); (Z.P.); (M.S.I.); (S.H.)
- Zhengzhou Research Base, State Key Laboratory of Cotton Biology, Zhengzhou University, Zhengzhou 450001, China
| |
Collapse
|