1
|
Sun R, Sun B, Zheng Z, Zhang Q, Hu X, Guo R, Feng L, Chai S, Wang J, Qiu P, Yu P, Liu Y, Song W, Li Y, Qiu L. Chromosome-level genome assembly of a high-yield Chinese soybean variety Mengdou1137 unlocks genetic potential of disease and lodging resistance. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2025; 138:119. [PMID: 40369104 PMCID: PMC12078370 DOI: 10.1007/s00122-025-04881-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/12/2024] [Accepted: 03/11/2025] [Indexed: 05/16/2025]
Abstract
KEY MESSAGE We assembled the genome of Mengdou1137 with high quality and revealed the specific disease resistance genes and a large number of genomic variations related to agronomic traits. As a cornerstone in the global agricultural landscape, soybean stands as a pivotal oilseed crop, underpinning both nutritional and industrial applications. The burgeoning development of novel soybean varieties significantly propels the crop's industrialization, offering enhanced traits that cater to diverse agricultural and commercial needs. In this study, we present the de novo assembly of the genome a high-yield Chinese soybean variety Mengdou1137, employing an integrated approach of both long-read and short-read sequencing technologies to achieve comprehensive genomic insights. Achieving a notable assembly with a genome size of 999.99 Mb, our work features a contig N50 of 14.92 Mb and a scaffold N50 of 50.26 Mb, successfully anchoring 98.24% of sequences across the 20 chromosomes. Through meticulous comparative analysis with existing soybean genomes, our research unveiled 115 Mengdou1137-specific disease resistance genes alongside a substantial array of agronomical trait-associated genomic variants. Among the salient genomic features, we identified a favorable haplotype of the dwarf gene PH13, a critical determinant of plant stature, underscoring its potential for breeding compact soybean varieties with lodging resistance. This high-quality assembly of the Mengdou1137 genome not only enriches the repository of soybean genetic resources but also paves the way for future innovations in soybean breeding and trait improvement, offering valuable insights for the enhancement of this crucial agricultural commodity.
Collapse
Affiliation(s)
- Rujian Sun
- State Key Laboratory of Crop Gene Resources and Breeding/the National Key Facility for Crop Gene Resources and Genetic Improvement (NFCRI)/Key Laboratory of Grain Crop Genetic Resources Evaluation and Utilization (MARA), Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
- Hulunbuir Institute of Agriculture and Animal Husbandry, Hulunbuir, 021000, Inner Mongolia, China
| | - Bincheng Sun
- Hulunbuir Institute of Agriculture and Animal Husbandry, Hulunbuir, 021000, Inner Mongolia, China
| | - Zihao Zheng
- Department of Agronomy, Iowa State University, Ames, IA, 50011-1051, USA
| | - Qi Zhang
- Hulunbuir Institute of Agriculture and Animal Husbandry, Hulunbuir, 021000, Inner Mongolia, China
| | - Xingguo Hu
- Hulunbuir Institute of Agriculture and Animal Husbandry, Hulunbuir, 021000, Inner Mongolia, China
| | - Rongqi Guo
- Hulunbuir Institute of Agriculture and Animal Husbandry, Hulunbuir, 021000, Inner Mongolia, China
| | - Lei Feng
- Hulunbuir Institute of Agriculture and Animal Husbandry, Hulunbuir, 021000, Inner Mongolia, China
| | - Shen Chai
- Hulunbuir Institute of Agriculture and Animal Husbandry, Hulunbuir, 021000, Inner Mongolia, China
| | - Jingshun Wang
- Hulunbuir Institute of Agriculture and Animal Husbandry, Hulunbuir, 021000, Inner Mongolia, China
| | - Ping Qiu
- Hulunbuir Institute of Agriculture and Animal Husbandry, Hulunbuir, 021000, Inner Mongolia, China
| | - Ping Yu
- Hulunbuir Institute of Agriculture and Animal Husbandry, Hulunbuir, 021000, Inner Mongolia, China
| | - Ying Liu
- Hulunbuir Institute of Agriculture and Animal Husbandry, Hulunbuir, 021000, Inner Mongolia, China
| | - Wei Song
- Hulunbuir Institute of Agriculture and Animal Husbandry, Hulunbuir, 021000, Inner Mongolia, China
| | - Yinghui Li
- State Key Laboratory of Crop Gene Resources and Breeding/the National Key Facility for Crop Gene Resources and Genetic Improvement (NFCRI)/Key Laboratory of Grain Crop Genetic Resources Evaluation and Utilization (MARA), Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100081, China.
| | - Lijuan Qiu
- State Key Laboratory of Crop Gene Resources and Breeding/the National Key Facility for Crop Gene Resources and Genetic Improvement (NFCRI)/Key Laboratory of Grain Crop Genetic Resources Evaluation and Utilization (MARA), Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100081, China.
| |
Collapse
|
2
|
Tian Z, Nepomuceno AL, Song Q, Stupar RM, Liu B, Kong F, Ma J, Lee SH, Jackson SA. Soybean2035: A decadal vision for soybean functional genomics and breeding. MOLECULAR PLANT 2025; 18:245-271. [PMID: 39772289 DOI: 10.1016/j.molp.2025.01.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/12/2024] [Revised: 12/29/2024] [Accepted: 01/05/2025] [Indexed: 01/31/2025]
Abstract
Soybean, the fourth most important crop in the world, uniquely serves as a source of both plant oil and plant protein for the world's food and animal feed. Although soybean production has increased approximately 13-fold over the past 60 years, the continually growing global population necessitates further increases in soybean production. In the past, especially in the last decade, significant progress has been made in both functional genomics and molecular breeding. However, many more challenges should be overcome to meet the anticipated future demand. Here, we summarize past achievements in the areas of soybean omics, functional genomics, and molecular breeding. Furthermore, we analyze trends in these areas, including shortages and challenges, and propose new directions, potential approaches, and possible outputs toward 2035. Our views and perspectives provide insight into accelerating the development of elite soybean varieties to meet the increasing demands of soybean production.
Collapse
Affiliation(s)
- Zhixi Tian
- Yazhouwan National Laboratory, Sanya, Hainan, China.
| | | | - Qingxin Song
- State Key Laboratory of Crop Genetics & Germplasm Enhancement and Utilization, Jiangsu Collaborative Innovation Center for Modern Crop Production, Nanjing Agricultural University, Nanjing, Jiangsu, China.
| | - Robert M Stupar
- Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, MN, USA.
| | - Bin Liu
- State Key Laboratory of Crop Gene Resources and Breeding, Key Laboratory of Soybean Biology (Beijing) (MARA), Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China.
| | - Fanjiang Kong
- Guangdong Provincial Key Laboratory of Plant Adaptation and Molecular Design, Innovative Center of Molecular Genetics and Evolution, School of Life Sciences, Guangzhou University, Guangzhou, China.
| | - Jianxin Ma
- Department of Agronomy, Purdue University, West Lafayette, IN, USA.
| | - Suk-Ha Lee
- Department of Agriculture, Forestry and Bioresources and Research Institute of Agriculture and Life Sciences, Seoul National University, Seoul, Republic of Korea.
| | - Scott A Jackson
- Center for Applied Genetic Technologies, University of Georgia, Athens, GA, USA.
| |
Collapse
|
3
|
Potapova NA, Zorkoltseva IV, Zlobin AS, Shcherban AB, Fedyaeva AV, Salina EA, Svishcheva GR, Aksenovich TI, Tsepilov YA. Genome-Wide Association Study on Imputed Genotypes of 180 Eurasian Soybean Glycine max Varieties for Oil and Protein Contents in Seeds. PLANTS (BASEL, SWITZERLAND) 2025; 14:255. [PMID: 39861608 PMCID: PMC11768550 DOI: 10.3390/plants14020255] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/04/2024] [Revised: 01/13/2025] [Accepted: 01/15/2025] [Indexed: 01/27/2025]
Abstract
Soybean (Glycine max) is a leguminous plant with a broad range of applications, particularly in agriculture and food production, where its seed composition-especially oil and protein content-is highly valued. Improving these traits is a primary focus of soybean breeding programs. In this study, we conducted a genome-wide association study (GWAS) to identify genetic loci linked to oil and protein content in seeds, using imputed genotype data for 180 Eurasian soybean varieties and the novel "genotypic twins" approach. This dataset encompassed 87 Russian and European cultivars and 93 breeding lines from Western Siberia. We identified 11 novel loci significantly associated with oil and protein content in seeds (p-value < 1.5 × 10-6), including one locus on chromosome 11 linked to protein content and 10 loci associated with oil content (chromosomes 1, 5, 11, 16, 17, and 18). The protein-associated locus is located near a gene encoding a CBL-interacting protein kinase, which is involved in key biological processes, including stress response mechanisms such as drought and osmotic stress. The oil-associated loci were linked to genes with diverse functions, including lipid transport, nutrient reservoir activity, and stress responses, such as Sec14p-like phosphatidylinositol transfer proteins and Germin-like proteins. These findings suggest that the loci identified not only influence oil and protein content but may also contribute to plant resilience under environmental stress conditions. The data obtained from this study provide valuable genetic markers that can be used in breeding programs to optimize oil and protein content, particularly in varieties adapted to Russian climates, and contribute to the development of high-yielding, nutritionally enhanced soybean cultivars.
Collapse
Affiliation(s)
- Nadezhda A. Potapova
- Kurchatov Genomics Center, Institute of Cytology and Genetics SB RAS, Lavrentiev Av. 10, 630090 Novosibirsk, Russia; (A.S.Z.); (A.B.S.); (E.A.S.); (G.R.S.); (Y.A.T.)
| | - Irina V. Zorkoltseva
- Kurchatov Genomics Center, Institute of Cytology and Genetics SB RAS, Lavrentiev Av. 10, 630090 Novosibirsk, Russia; (A.S.Z.); (A.B.S.); (E.A.S.); (G.R.S.); (Y.A.T.)
- The Federal Research Center, Institute of Cytology and Genetics SB RAS, Lavrentiev Av. 10, 630090 Novosibirsk, Russia; (A.V.F.); (T.I.A.)
| | - Alexander S. Zlobin
- Kurchatov Genomics Center, Institute of Cytology and Genetics SB RAS, Lavrentiev Av. 10, 630090 Novosibirsk, Russia; (A.S.Z.); (A.B.S.); (E.A.S.); (G.R.S.); (Y.A.T.)
| | - Andrey B. Shcherban
- Kurchatov Genomics Center, Institute of Cytology and Genetics SB RAS, Lavrentiev Av. 10, 630090 Novosibirsk, Russia; (A.S.Z.); (A.B.S.); (E.A.S.); (G.R.S.); (Y.A.T.)
- The Federal Research Center, Institute of Cytology and Genetics SB RAS, Lavrentiev Av. 10, 630090 Novosibirsk, Russia; (A.V.F.); (T.I.A.)
| | - Anna V. Fedyaeva
- The Federal Research Center, Institute of Cytology and Genetics SB RAS, Lavrentiev Av. 10, 630090 Novosibirsk, Russia; (A.V.F.); (T.I.A.)
| | - Elena A. Salina
- Kurchatov Genomics Center, Institute of Cytology and Genetics SB RAS, Lavrentiev Av. 10, 630090 Novosibirsk, Russia; (A.S.Z.); (A.B.S.); (E.A.S.); (G.R.S.); (Y.A.T.)
- The Federal Research Center, Institute of Cytology and Genetics SB RAS, Lavrentiev Av. 10, 630090 Novosibirsk, Russia; (A.V.F.); (T.I.A.)
| | - Gulnara R. Svishcheva
- Kurchatov Genomics Center, Institute of Cytology and Genetics SB RAS, Lavrentiev Av. 10, 630090 Novosibirsk, Russia; (A.S.Z.); (A.B.S.); (E.A.S.); (G.R.S.); (Y.A.T.)
- The Federal Research Center, Institute of Cytology and Genetics SB RAS, Lavrentiev Av. 10, 630090 Novosibirsk, Russia; (A.V.F.); (T.I.A.)
- Institute of General Genetics RAS, Gubkin St. 3, 119333 Moscow, Russia
| | - Tatiana I. Aksenovich
- The Federal Research Center, Institute of Cytology and Genetics SB RAS, Lavrentiev Av. 10, 630090 Novosibirsk, Russia; (A.V.F.); (T.I.A.)
| | - Yakov A. Tsepilov
- Kurchatov Genomics Center, Institute of Cytology and Genetics SB RAS, Lavrentiev Av. 10, 630090 Novosibirsk, Russia; (A.S.Z.); (A.B.S.); (E.A.S.); (G.R.S.); (Y.A.T.)
- The Federal Research Center, Institute of Cytology and Genetics SB RAS, Lavrentiev Av. 10, 630090 Novosibirsk, Russia; (A.V.F.); (T.I.A.)
| |
Collapse
|
4
|
Tek AL, Nagaki K, Yıldız Akkamış H, Tanaka K, Kobayashi H. Chromosome-specific barcode system with centromeric repeat in cultivated soybean and wild progenitor. Life Sci Alliance 2024; 7:e202402802. [PMID: 39353738 PMCID: PMC11447526 DOI: 10.26508/lsa.202402802] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2024] [Revised: 09/21/2024] [Accepted: 09/24/2024] [Indexed: 10/04/2024] Open
Abstract
Wild soybean Glycine soja is the progenitor of cultivated soybean Glycine max Information on soybean functional centromeres is limited despite extensive genome analysis. These species are an ideal model for studying centromere dynamics for domestication and breeding. We performed a detailed chromatin immunoprecipitation analysis using centromere-specific histone H3 protein to delineate two distinct centromeric DNA sequences with unusual repeating units with monomer sizes of 90-92 bp (CentGm-1) and 413-bp (CentGm-4) shorter and longer than standard nucleosomes. These two unrelated DNA sequences with no sequence similarity are part of functional centromeres in both species. Our results provide a comparison of centromere properties between a cultivated and a wild species under the effect of the same kinetochore protein. Possible sequence homogenization specific to each chromosome could highlight the mechanism for evolutionary conservation of centromeric properties independent of domestication and breeding. Moreover, a unique barcode system to track each chromosome is developed using CentGm-4 units. Our results with a unifying centromere composition model using CentGm-1 and CentGm-4 superfamilies could have far-reaching implications for comparative and evolutionary genome research.
Collapse
Affiliation(s)
- Ahmet L Tek
- Department of Agricultural Genetic Engineering, Ayhan Şahenk Faculty of Agricultural Sciences and Technologies, Niğde Ömer Halisdemir University, Niğde, Türkiye
| | - Kiyotaka Nagaki
- Institute of Plant Science and Resources, Okayama University, Kurashiki, Japan
| | - Hümeyra Yıldız Akkamış
- Department of Agricultural Genetic Engineering, Ayhan Şahenk Faculty of Agricultural Sciences and Technologies, Niğde Ömer Halisdemir University, Niğde, Türkiye
| | - Keisuke Tanaka
- NODAI Genome Research Center, Tokyo University of Agriculture, Setagaya, Japan
| | - Hisato Kobayashi
- NODAI Genome Research Center, Tokyo University of Agriculture, Setagaya, Japan
| |
Collapse
|
5
|
Kaur H, Shannon LM, Samac DA. A stepwise guide for pangenome development in crop plants: an alfalfa (Medicago sativa) case study. BMC Genomics 2024; 25:1022. [PMID: 39482604 PMCID: PMC11526573 DOI: 10.1186/s12864-024-10931-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2024] [Accepted: 10/21/2024] [Indexed: 11/03/2024] Open
Abstract
BACKGROUND The concept of pangenomics and the importance of structural variants is gaining recognition within the plant genomics community. Due to advancements in sequencing and computational technology, it has become feasible to sequence the entire genome of numerous individuals of a single species at a reasonable cost. Pangenomes have been constructed for many major diploid crops, including rice, maize, soybean, sorghum, pearl millet, peas, sunflower, grapes, and mustards. However, pangenomes for polyploid species are relatively scarce and are available in only few crops including wheat, cotton, rapeseed, and potatoes. MAIN BODY In this review, we explore the various methods used in crop pangenome development, discussing the challenges and implications of these techniques based on insights from published pangenome studies. We offer a systematic guide and discuss the tools available for constructing a pangenome and conducting downstream analyses. Alfalfa, a highly heterozygous, cross pollinated and autotetraploid forage crop species, is used as an example to discuss the concerns and challenges offered by polyploid crop species. We conducted a comparative analysis using linear and graph-based methods by constructing an alfalfa graph pangenome using three publicly available genome assemblies. To illustrate the intricacies captured by pangenome graphs for a complex crop genome, we used five different gene sequences and aligned them against the three graph-based pangenomes. The comparison of the three graph pangenome methods reveals notable variations in the genomic variation captured by each pipeline. CONCLUSION Pangenome resources are proving invaluable by offering insights into core and dispensable genes, novel gene discovery, and genome-wide patterns of variation. Developing user-friendly online portals for linear pangenome visualization has made these resources accessible to the broader scientific and breeding community. However, challenges remain with graph-based pangenomes including compatibility with other tools, extraction of sequence for regions of interest, and visualization of genetic variation captured in pangenome graphs. These issues necessitate further refinement of tools and pipelines to effectively address the complexities of polyploid, highly heterozygous, and cross-pollinated species.
Collapse
Affiliation(s)
- Harpreet Kaur
- Department of Horticultural Science, University of Minnesota, St. Paul, MN, 55108, USA.
| | - Laura M Shannon
- Department of Horticultural Science, University of Minnesota, St. Paul, MN, 55108, USA
| | - Deborah A Samac
- USDA-ARS, Plant Science Research Unit, St. Paul, MN, 55108, USA
| |
Collapse
|
6
|
Zhang C, Shao Z, Kong Y, Du H, Li W, Yang Z, Li X, Ke H, Sun Z, Shao J, Chen S, Zhang H, Chu J, Xing X, Tian R, Qin N, Li J, Huang M, Sun Y, Huo X, Meng C, Wang G, Liu Y, Ma Z, Tian S, Li X. High-quality genome of a modern soybean cultivar and resequencing of 547 accessions provide insights into the role of structural variation. Nat Genet 2024; 56:2247-2258. [PMID: 39251789 DOI: 10.1038/s41588-024-01901-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2022] [Accepted: 08/08/2024] [Indexed: 09/11/2024]
Abstract
Soybean provides protein, oil and multiple health-related compounds. Understanding the effects of structural variations (SVs) on economic traits in modern breeding is important for soybean improvement. Here we assembled the high-quality genome of modern cultivar Nongdadou2 (NDD2) and identified 25,814 SV-gene pairs compared to 29 reported genomes, with 13 NDD2-private SVs validated in 547 deep-resequencing (average = 18.05-fold) accessions, which advances our understanding of genomic variation biology. We found some insertions/deletions involved in seed protein and weight formation, an inversion related to adaptation to drought and a large intertranslocation implicated in a key divergence event in soybean. Of 749,714 SVs from 547 accessions, 6,013 were significantly associated with 22 yield-related and seed-quality-related traits determined in ten location × year environments. We uncovered 1,761 associated SVs that hit genes or regulatory regions, with 12 in GmMQT influencing oil and isoflavone contents. Our work provides resources and insights into SV roles in soybean improvement.
Collapse
Affiliation(s)
- Caiying Zhang
- State Key Laboratory of North China Crop Improvement and Regulation, North China Key Laboratory for Crop Germplasm Resources of Education Ministry, Hebei Agricultural University, Baoding, China.
| | - Zhenqi Shao
- State Key Laboratory of North China Crop Improvement and Regulation, North China Key Laboratory for Crop Germplasm Resources of Education Ministry, Hebei Agricultural University, Baoding, China
| | - Youbin Kong
- State Key Laboratory of North China Crop Improvement and Regulation, North China Key Laboratory for Crop Germplasm Resources of Education Ministry, Hebei Agricultural University, Baoding, China
| | - Hui Du
- State Key Laboratory of North China Crop Improvement and Regulation, North China Key Laboratory for Crop Germplasm Resources of Education Ministry, Hebei Agricultural University, Baoding, China
| | - Wenlong Li
- State Key Laboratory of North China Crop Improvement and Regulation, North China Key Laboratory for Crop Germplasm Resources of Education Ministry, Hebei Agricultural University, Baoding, China
| | - Zhanwu Yang
- State Key Laboratory of North China Crop Improvement and Regulation, North China Key Laboratory for Crop Germplasm Resources of Education Ministry, Hebei Agricultural University, Baoding, China
| | - Xiangkong Li
- Novogene Bioinformatics Institute, Beijing, China
| | - Huifeng Ke
- State Key Laboratory of North China Crop Improvement and Regulation, North China Key Laboratory for Crop Germplasm Resources of Education Ministry, Hebei Agricultural University, Baoding, China
| | - Zhengwen Sun
- State Key Laboratory of North China Crop Improvement and Regulation, North China Key Laboratory for Crop Germplasm Resources of Education Ministry, Hebei Agricultural University, Baoding, China
| | - Jiabiao Shao
- State Key Laboratory of North China Crop Improvement and Regulation, North China Key Laboratory for Crop Germplasm Resources of Education Ministry, Hebei Agricultural University, Baoding, China
| | - Shiliang Chen
- State Key Laboratory of North China Crop Improvement and Regulation, North China Key Laboratory for Crop Germplasm Resources of Education Ministry, Hebei Agricultural University, Baoding, China
| | - Hua Zhang
- State Key Laboratory of North China Crop Improvement and Regulation, North China Key Laboratory for Crop Germplasm Resources of Education Ministry, Hebei Agricultural University, Baoding, China
| | - Jiahao Chu
- State Key Laboratory of North China Crop Improvement and Regulation, North China Key Laboratory for Crop Germplasm Resources of Education Ministry, Hebei Agricultural University, Baoding, China
| | - Xinzhu Xing
- State Key Laboratory of North China Crop Improvement and Regulation, North China Key Laboratory for Crop Germplasm Resources of Education Ministry, Hebei Agricultural University, Baoding, China
| | - Rui Tian
- State Key Laboratory of North China Crop Improvement and Regulation, North China Key Laboratory for Crop Germplasm Resources of Education Ministry, Hebei Agricultural University, Baoding, China
| | - Ning Qin
- State Key Laboratory of North China Crop Improvement and Regulation, North China Key Laboratory for Crop Germplasm Resources of Education Ministry, Hebei Agricultural University, Baoding, China
| | - Junru Li
- State Key Laboratory of North China Crop Improvement and Regulation, North China Key Laboratory for Crop Germplasm Resources of Education Ministry, Hebei Agricultural University, Baoding, China
| | - Meihong Huang
- State Key Laboratory of North China Crop Improvement and Regulation, North China Key Laboratory for Crop Germplasm Resources of Education Ministry, Hebei Agricultural University, Baoding, China
| | - Yaqian Sun
- State Key Laboratory of North China Crop Improvement and Regulation, North China Key Laboratory for Crop Germplasm Resources of Education Ministry, Hebei Agricultural University, Baoding, China
| | - Xiaobo Huo
- State Key Laboratory of North China Crop Improvement and Regulation, North China Key Laboratory for Crop Germplasm Resources of Education Ministry, Hebei Agricultural University, Baoding, China
| | - Chengsheng Meng
- State Key Laboratory of North China Crop Improvement and Regulation, North China Key Laboratory for Crop Germplasm Resources of Education Ministry, Hebei Agricultural University, Baoding, China
| | - Guoning Wang
- State Key Laboratory of North China Crop Improvement and Regulation, North China Key Laboratory for Crop Germplasm Resources of Education Ministry, Hebei Agricultural University, Baoding, China
| | - Yuan Liu
- State Key Laboratory of North China Crop Improvement and Regulation, North China Key Laboratory for Crop Germplasm Resources of Education Ministry, Hebei Agricultural University, Baoding, China
| | - Zhiying Ma
- State Key Laboratory of North China Crop Improvement and Regulation, North China Key Laboratory for Crop Germplasm Resources of Education Ministry, Hebei Agricultural University, Baoding, China.
| | - Shilin Tian
- Novogene Bioinformatics Institute, Beijing, China.
| | - Xihuan Li
- State Key Laboratory of North China Crop Improvement and Regulation, North China Key Laboratory for Crop Germplasm Resources of Education Ministry, Hebei Agricultural University, Baoding, China.
| |
Collapse
|
7
|
Huang Y, Koo DH, Mao Y, Herman EM, Zhang J, Schmidt MA. A complete reference genome for the soybean cv. Jack. PLANT COMMUNICATIONS 2024; 5:100765. [PMID: 37968906 PMCID: PMC10873888 DOI: 10.1016/j.xplc.2023.100765] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/16/2023] [Revised: 11/08/2023] [Accepted: 11/09/2023] [Indexed: 11/17/2023]
Affiliation(s)
- Yicheng Huang
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan 430070, China
| | - Dal-Hoe Koo
- Wheat Genetics Resource Center, Department of Plant Pathology, Kansas State University, Manhattan, KS 66506, USA
| | - Yizhou Mao
- BIO5 Institute, School of Plant Sciences, University of Arizona, Tucson, AZ 85721, USA
| | - Eliot M Herman
- BIO5 Institute, School of Plant Sciences, University of Arizona, Tucson, AZ 85721, USA
| | - Jianwei Zhang
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan 430070, China.
| | - Monica A Schmidt
- BIO5 Institute, School of Plant Sciences, University of Arizona, Tucson, AZ 85721, USA.
| |
Collapse
|
8
|
Zhou Y, Kathiresan N, Yu Z, Rivera LF, Yang Y, Thimma M, Manickam K, Chebotarov D, Mauleon R, Chougule K, Wei S, Gao T, Green CD, Zuccolo A, Xie W, Ware D, Zhang J, McNally KL, Wing RA. A high-performance computational workflow to accelerate GATK SNP detection across a 25-genome dataset. BMC Biol 2024; 22:13. [PMID: 38273258 PMCID: PMC10809545 DOI: 10.1186/s12915-024-01820-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2023] [Accepted: 01/09/2024] [Indexed: 01/27/2024] Open
Abstract
BACKGROUND Single-nucleotide polymorphisms (SNPs) are the most widely used form of molecular genetic variation studies. As reference genomes and resequencing data sets expand exponentially, tools must be in place to call SNPs at a similar pace. The genome analysis toolkit (GATK) is one of the most widely used SNP calling software tools publicly available, but unfortunately, high-performance computing versions of this tool have yet to become widely available and affordable. RESULTS Here we report an open-source high-performance computing genome variant calling workflow (HPC-GVCW) for GATK that can run on multiple computing platforms from supercomputers to desktop machines. We benchmarked HPC-GVCW on multiple crop species for performance and accuracy with comparable results with previously published reports (using GATK alone). Finally, we used HPC-GVCW in production mode to call SNPs on a "subpopulation aware" 16-genome rice reference panel with ~ 3000 resequenced rice accessions. The entire process took ~ 16 weeks and resulted in the identification of an average of 27.3 M SNPs/genome and the discovery of ~ 2.3 million novel SNPs that were not present in the flagship reference genome for rice (i.e., IRGSP RefSeq). CONCLUSIONS This study developed an open-source pipeline (HPC-GVCW) to run GATK on HPC platforms, which significantly improved the speed at which SNPs can be called. The workflow is widely applicable as demonstrated successfully for four major crop species with genomes ranging in size from 400 Mb to 2.4 Gb. Using HPC-GVCW in production mode to call SNPs on a 25 multi-crop-reference genome data set produced over 1.1 billion SNPs that were publicly released for functional and breeding studies. For rice, many novel SNPs were identified and were found to reside within genes and open chromatin regions that are predicted to have functional consequences. Combined, our results demonstrate the usefulness of combining a high-performance SNP calling architecture solution with a subpopulation-aware reference genome panel for rapid SNP discovery and public deployment.
Collapse
Affiliation(s)
- Yong Zhou
- Center for Desert Agriculture (CDA), Biological and Environmental Sciences & Engineering Division (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
- Arizona Genomics Institute (AGI), School of Plant Sciences, University of Arizona, Tucson, AZ, 85721, USA
| | - Nagarajan Kathiresan
- KAUST Supercomputing Laboratory (KSL), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
| | - Zhichao Yu
- Center for Desert Agriculture (CDA), Biological and Environmental Sciences & Engineering Division (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, 430070, China
| | - Luis F Rivera
- Center for Desert Agriculture (CDA), Biological and Environmental Sciences & Engineering Division (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
| | - Yujian Yang
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, 430070, China
| | - Manjula Thimma
- Center for Desert Agriculture (CDA), Biological and Environmental Sciences & Engineering Division (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
| | - Keerthana Manickam
- Center for Desert Agriculture (CDA), Biological and Environmental Sciences & Engineering Division (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
| | - Dmytro Chebotarov
- International Rice Research Institute (IRRI), Los Baños, Laguna, 4031, Philippines
| | - Ramil Mauleon
- International Rice Research Institute (IRRI), Los Baños, Laguna, 4031, Philippines
| | - Kapeel Chougule
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 11724, USA
| | - Sharon Wei
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 11724, USA
| | - Tingting Gao
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, 430070, China
| | - Carl D Green
- Information Technology Department, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
| | - Andrea Zuccolo
- Center for Desert Agriculture (CDA), Biological and Environmental Sciences & Engineering Division (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
- Crop Science Research Center (CSRC), Scuola Superiore Sant'Anna, Pisa, 56127, Italy
| | - Weibo Xie
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, 430070, China
| | - Doreen Ware
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 11724, USA
- USDA ARS NEA Plant, Soil & Nutrition Laboratory Research Unit, Ithaca, NY, 14853, USA
| | - Jianwei Zhang
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, 430070, China
| | - Kenneth L McNally
- International Rice Research Institute (IRRI), Los Baños, Laguna, 4031, Philippines
| | - Rod A Wing
- Center for Desert Agriculture (CDA), Biological and Environmental Sciences & Engineering Division (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia.
- Arizona Genomics Institute (AGI), School of Plant Sciences, University of Arizona, Tucson, AZ, 85721, USA.
- International Rice Research Institute (IRRI), Los Baños, Laguna, 4031, Philippines.
| |
Collapse
|
9
|
Hu Y, Liu Y, Wei JJ, Zhang WK, Chen SY, Zhang JS. Regulation of seed traits in soybean. ABIOTECH 2023; 4:372-385. [PMID: 38106437 PMCID: PMC10721594 DOI: 10.1007/s42994-023-00122-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/01/2023] [Accepted: 10/18/2023] [Indexed: 12/19/2023]
Abstract
Soybean (Glycine max) is an essential economic crop that provides vegetative oil and protein for humans, worldwide. Increasing soybean yield as well as improving seed quality is of great importance. Seed weight/size, oil and protein content are the three major traits determining seed quality, and seed weight also influences soybean yield. In recent years, the availability of soybean omics data and the development of related techniques have paved the way for better research on soybean functional genomics, providing a comprehensive understanding of gene functions. This review summarizes the regulatory genes that influence seed size/weight, oil content and protein content in soybean. We also provided a general overview of the pleiotropic effect for the genes in controlling seed traits and environmental stresses. Ultimately, it is expected that this review will be beneficial in breeding improved traits in soybean.
Collapse
Affiliation(s)
- Yang Hu
- State Key Lab of Plant Genomics, Institute of Genetics and Developmental Biology, Innovative Academy of Seed Design, Chinese Academy of Sciences, Beijing, 100101 China
- College of Advanced Agricultural Sciences, University of Chinese Academy of Sciences, Beijing, 100049 China
| | - Yue Liu
- State Key Lab of Plant Genomics, Institute of Genetics and Developmental Biology, Innovative Academy of Seed Design, Chinese Academy of Sciences, Beijing, 100101 China
- College of Advanced Agricultural Sciences, University of Chinese Academy of Sciences, Beijing, 100049 China
| | - Jun-Jie Wei
- State Key Lab of Plant Genomics, Institute of Genetics and Developmental Biology, Innovative Academy of Seed Design, Chinese Academy of Sciences, Beijing, 100101 China
- College of Advanced Agricultural Sciences, University of Chinese Academy of Sciences, Beijing, 100049 China
| | - Wan-Ke Zhang
- State Key Lab of Plant Genomics, Institute of Genetics and Developmental Biology, Innovative Academy of Seed Design, Chinese Academy of Sciences, Beijing, 100101 China
| | - Shou-Yi Chen
- State Key Lab of Plant Genomics, Institute of Genetics and Developmental Biology, Innovative Academy of Seed Design, Chinese Academy of Sciences, Beijing, 100101 China
| | - Jin-Song Zhang
- State Key Lab of Plant Genomics, Institute of Genetics and Developmental Biology, Innovative Academy of Seed Design, Chinese Academy of Sciences, Beijing, 100101 China
- College of Advanced Agricultural Sciences, University of Chinese Academy of Sciences, Beijing, 100049 China
| |
Collapse
|
10
|
Du H, Fang C, Li Y, Kong F, Liu B. Understandings and future challenges in soybean functional genomics and molecular breeding. JOURNAL OF INTEGRATIVE PLANT BIOLOGY 2023; 65:468-495. [PMID: 36511121 DOI: 10.1111/jipb.13433] [Citation(s) in RCA: 21] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/04/2022] [Accepted: 12/11/2022] [Indexed: 06/17/2023]
Abstract
Soybean (Glycine max) is a major source of plant protein and oil. Soybean breeding has benefited from advances in functional genomics. In particular, the release of soybean reference genomes has advanced our understanding of soybean adaptation to soil nutrient deficiencies, the molecular mechanism of symbiotic nitrogen (N) fixation, biotic and abiotic stress tolerance, and the roles of flowering time in regional adaptation, plant architecture, and seed yield and quality. Nevertheless, many challenges remain for soybean functional genomics and molecular breeding, mainly related to improving grain yield through high-density planting, maize-soybean intercropping, taking advantage of wild resources, utilization of heterosis, genomic prediction and selection breeding, and precise breeding through genome editing. This review summarizes the current progress in soybean functional genomics and directs future challenges for molecular breeding of soybean.
Collapse
Affiliation(s)
- Haiping Du
- Guangdong Key Laboratory of Plant Adaptation and Molecular Design, Guangzhou Key Laboratory of Crop Gene Editing, Innovative Center of Molecular Genetics and Evolution, School of Life Sciences, Guangzhou University, Guangzhou, 510006, China
| | - Chao Fang
- Guangdong Key Laboratory of Plant Adaptation and Molecular Design, Guangzhou Key Laboratory of Crop Gene Editing, Innovative Center of Molecular Genetics and Evolution, School of Life Sciences, Guangzhou University, Guangzhou, 510006, China
| | - Yaru Li
- Guangdong Key Laboratory of Plant Adaptation and Molecular Design, Guangzhou Key Laboratory of Crop Gene Editing, Innovative Center of Molecular Genetics and Evolution, School of Life Sciences, Guangzhou University, Guangzhou, 510006, China
| | - Fanjiang Kong
- Guangdong Key Laboratory of Plant Adaptation and Molecular Design, Guangzhou Key Laboratory of Crop Gene Editing, Innovative Center of Molecular Genetics and Evolution, School of Life Sciences, Guangzhou University, Guangzhou, 510006, China
| | - Baohui Liu
- Guangdong Key Laboratory of Plant Adaptation and Molecular Design, Guangzhou Key Laboratory of Crop Gene Editing, Innovative Center of Molecular Genetics and Evolution, School of Life Sciences, Guangzhou University, Guangzhou, 510006, China
| |
Collapse
|
11
|
McIntyre LM. Celebrating discovery across the tree of life. G3 (BETHESDA, MD.) 2023; 13:6986389. [PMID: 36634225 PMCID: PMC9836344 DOI: 10.1093/g3journal/jkac318] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/14/2023]
|
12
|
Liu J, Chen S, Liu M, Chen Y, Fan W, Lee S, Xiao H, Kudrna D, Li Z, Chen X, Peng Y, Tian K, Zhang B, Wing RA, Zhang J, Wang X. Full-Length Transcriptome Sequencing Reveals Alternative Splicing and lncRNA Regulation during Nodule Development in Glycine max. Int J Mol Sci 2022; 23:7371. [PMID: 35806374 PMCID: PMC9266934 DOI: 10.3390/ijms23137371] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2022] [Revised: 06/25/2022] [Accepted: 06/29/2022] [Indexed: 02/04/2023] Open
Abstract
Alternative splicing (AS) is a ubiquitous phenomenon among eukaryotic intron-containing genes, which greatly contributes to transcriptome and proteome diversity. Here we performed the isoform sequencing (Iso-Seq) of soybean underground tissues inoculated and uninoculated with Rhizobium and obtained 200,681 full-length transcripts covering 26,183 gene loci. It was found that 80.78% of the multi-exon loci produced more than one splicing variant. Comprehensive analysis of these identified 7874 differentially splicing events with highly diverse splicing patterns during nodule development, especially in defense and transport-related processes. We further profiled genes with differential isoform usage and revealed that 2008 multi-isoform loci underwent stage-specific or simultaneous major isoform switches after Rhizobium inoculation, indicating that AS is a vital way to regulate nodule development. Moreover, we took the lead in identifying 1563 high-confidence long non-coding RNAs (lncRNAs) in soybean, and 157 of them are differentially expressed during nodule development. Therefore, our study uncovers the landscape of AS during the soybean-Rhizobium interaction and provides systematic transcriptomic data for future study of multiple novel directions in soybean.
Collapse
Affiliation(s)
- Jing Liu
- College of Life Science and Technology, Huazhong Agricultural University, Wuhan 430070, China; (J.L.); (S.C.); (M.L.); (Y.C.); (W.F.); (Z.L.); (X.C.); (K.T.)
- State Key Laboratory of Crop Stress Adaptation and Improvement, School of Life Sciences, Henan University, Kaifeng 475001, China; (H.X.); (Y.P.); (B.Z.)
| | - Shengcai Chen
- College of Life Science and Technology, Huazhong Agricultural University, Wuhan 430070, China; (J.L.); (S.C.); (M.L.); (Y.C.); (W.F.); (Z.L.); (X.C.); (K.T.)
- State Key Laboratory of Crop Stress Adaptation and Improvement, School of Life Sciences, Henan University, Kaifeng 475001, China; (H.X.); (Y.P.); (B.Z.)
| | - Min Liu
- College of Life Science and Technology, Huazhong Agricultural University, Wuhan 430070, China; (J.L.); (S.C.); (M.L.); (Y.C.); (W.F.); (Z.L.); (X.C.); (K.T.)
| | - Yimian Chen
- College of Life Science and Technology, Huazhong Agricultural University, Wuhan 430070, China; (J.L.); (S.C.); (M.L.); (Y.C.); (W.F.); (Z.L.); (X.C.); (K.T.)
- State Key Laboratory of Crop Stress Adaptation and Improvement, School of Life Sciences, Henan University, Kaifeng 475001, China; (H.X.); (Y.P.); (B.Z.)
| | - Wei Fan
- College of Life Science and Technology, Huazhong Agricultural University, Wuhan 430070, China; (J.L.); (S.C.); (M.L.); (Y.C.); (W.F.); (Z.L.); (X.C.); (K.T.)
- State Key Laboratory of Crop Stress Adaptation and Improvement, School of Life Sciences, Henan University, Kaifeng 475001, China; (H.X.); (Y.P.); (B.Z.)
| | - Seunghee Lee
- Arizona Genomics Institute, School of Plant Sciences, University of Arizona, Tucson, AZ 85721, USA; (S.L.); (D.K.); (R.A.W.); (J.Z.)
| | - Han Xiao
- State Key Laboratory of Crop Stress Adaptation and Improvement, School of Life Sciences, Henan University, Kaifeng 475001, China; (H.X.); (Y.P.); (B.Z.)
| | - Dave Kudrna
- Arizona Genomics Institute, School of Plant Sciences, University of Arizona, Tucson, AZ 85721, USA; (S.L.); (D.K.); (R.A.W.); (J.Z.)
| | - Zixin Li
- College of Life Science and Technology, Huazhong Agricultural University, Wuhan 430070, China; (J.L.); (S.C.); (M.L.); (Y.C.); (W.F.); (Z.L.); (X.C.); (K.T.)
| | - Xu Chen
- College of Life Science and Technology, Huazhong Agricultural University, Wuhan 430070, China; (J.L.); (S.C.); (M.L.); (Y.C.); (W.F.); (Z.L.); (X.C.); (K.T.)
- State Key Laboratory of Crop Stress Adaptation and Improvement, School of Life Sciences, Henan University, Kaifeng 475001, China; (H.X.); (Y.P.); (B.Z.)
| | - Yaqi Peng
- State Key Laboratory of Crop Stress Adaptation and Improvement, School of Life Sciences, Henan University, Kaifeng 475001, China; (H.X.); (Y.P.); (B.Z.)
| | - Kewei Tian
- College of Life Science and Technology, Huazhong Agricultural University, Wuhan 430070, China; (J.L.); (S.C.); (M.L.); (Y.C.); (W.F.); (Z.L.); (X.C.); (K.T.)
- State Key Laboratory of Crop Stress Adaptation and Improvement, School of Life Sciences, Henan University, Kaifeng 475001, China; (H.X.); (Y.P.); (B.Z.)
| | - Bao Zhang
- State Key Laboratory of Crop Stress Adaptation and Improvement, School of Life Sciences, Henan University, Kaifeng 475001, China; (H.X.); (Y.P.); (B.Z.)
| | - Rod A. Wing
- Arizona Genomics Institute, School of Plant Sciences, University of Arizona, Tucson, AZ 85721, USA; (S.L.); (D.K.); (R.A.W.); (J.Z.)
| | - Jianwei Zhang
- Arizona Genomics Institute, School of Plant Sciences, University of Arizona, Tucson, AZ 85721, USA; (S.L.); (D.K.); (R.A.W.); (J.Z.)
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan 430070, China
| | - Xuelu Wang
- State Key Laboratory of Crop Stress Adaptation and Improvement, School of Life Sciences, Henan University, Kaifeng 475001, China; (H.X.); (Y.P.); (B.Z.)
| |
Collapse
|