1
|
Wang Y, Xia J, Wang Z, Ying Z, Xiong Z, Wang C, Shi R. Combined analysis of multi-omics reveals the potential mechanism of flower color and aroma formation in Macadamia integrifolia. FRONTIERS IN PLANT SCIENCE 2023; 13:1095644. [PMID: 36816481 PMCID: PMC9931397 DOI: 10.3389/fpls.2022.1095644] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/11/2022] [Accepted: 12/30/2022] [Indexed: 06/18/2023]
Abstract
INTRODUCTION Macadamia integrifolia Maiden & Betche is a domesticated high-value nut crop. The development of nut flower affects the fruit setting rate, yield and quality of nuts. Therefore, in this experiment, two varieties with different flower color, flowering time, flowering quantity and nut yield (single fruit weight) were selected as the research objects. METHODS Transcriptome (RNA-Seq) and metabolome (LC-MS/MS, GC-MS) analyses were performed to study the regulatory mechanisms of nut flower development, color and aroma. RESULTS The results indicated that plant hormone signal transduction, starch sucrose metabolism, phenylpropanoid metabolism, flavonoid biosynthesis, and anthocyanin biosynthesis pathways were related to nut flower development and flower color formation. In the early stage of flowering, most of the differentially expressed genes (DEGs) are involved in the IAA signal transduction pathway, while in the later stage, the brassinolide signal pathway is mainly involved. In starch and sugar metabolism, DEGs are mainly involved in regulating and hydrolyzing stored starch into small molecular sugars in flower tissues. In the phenylpropanoid biosynthesis pathway, DEGs are mainly related to the color and aroma (volatile organic compounds, VOCs) formation of nut flowers. Four color formation metabolites (anthocyanins) in nut flowers were found by LC-MS/MS detection. In addition, the VOCs showed no significant difference between red nut flowers (R) and white nut flowers (W), which was mainly reflected in the aroma formation stage (flowering time). And 12 common differentially accumulation metabolites (DAMs) were detected by GC-MS and LC-MS/MS. At the same time, the DEGs, AAT, LOX and PAL genes, were also identified to regulate key metabolite synthesis during nut flower development. These genes were further verified by qRT-PCR. CONCLUSION Our results provide insights to clarify the molecular mechanism of color and aroma formation during M. integrifolia flower development that pave the way for nut quality and yield breeding.
Collapse
Affiliation(s)
- Yonggui Wang
- Key Laboratory for Forest Resources Conservation and Utilization in the Southwest Mountains of China, Ministry of Education, International Ecological Forestry Research Center of Kunming, Southwest Forestry University, Kunming, China
| | - Jing Xia
- Key Laboratory for Forest Resources Conservation and Utilization in the Southwest Mountains of China, Ministry of Education, International Ecological Forestry Research Center of Kunming, Southwest Forestry University, Kunming, China
| | - Zile Wang
- Yunnan Agricultural University College of Plant Protection, Kunming, China
| | - Zhiping Ying
- Key Laboratory for Forest Resources Conservation and Utilization in the Southwest Mountains of China, Ministry of Education, International Ecological Forestry Research Center of Kunming, Southwest Forestry University, Kunming, China
| | - Zhi Xiong
- Key Laboratory for Forest Resources Conservation and Utilization in the Southwest Mountains of China, Ministry of Education, International Ecological Forestry Research Center of Kunming, Southwest Forestry University, Kunming, China
| | - Changming Wang
- Key Laboratory for Forest Resources Conservation and Utilization in the Southwest Mountains of China, Ministry of Education, International Ecological Forestry Research Center of Kunming, Southwest Forestry University, Kunming, China
| | - Rui Shi
- Key Laboratory for Forest Resources Conservation and Utilization in the Southwest Mountains of China, Ministry of Education, International Ecological Forestry Research Center of Kunming, Southwest Forestry University, Kunming, China
| |
Collapse
|
2
|
Mohd Sanusi NSN, Rosli R, Chan KL, Halim MAA, Ting NC, Singh R, Low ETL. Integrated consensus genetic map and genomic scaffold re-ordering of oil palm (Elaeis guineensis) genome. Comput Biol Chem 2023; 102:107801. [PMID: 36528019 DOI: 10.1016/j.compbiolchem.2022.107801] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2021] [Revised: 07/21/2022] [Accepted: 12/05/2022] [Indexed: 12/13/2022]
Abstract
A high-quality reference genome is an important resource that can help decipher the genetic basis of traits in combination with linkage or association analyses. The publicly available oil palm draft genome sequence of AVROS pisifera (EG5) accounts for 1.535 Gb of the 1.8 Gb oil palm genome. However, the assemblies are fragmented, and the earlier assembly only had 43% of the sequences placed on pseudo-chromosomes. By integrating a number of SNP and SSR-based genetic maps, a consensus map (AM_EG5.1), comprising of 828.243 Mb genomic scaffolds anchored to 16 pseudo-chromosomes, was generated. This accounted for 54% of the genome assembly, which is a significant improvement to the original assembly. The total length of N50 scaffolds anchored to the pseudo-chromosomes increased by ∼18% compared to the previous assembly. A total of 139 quantitative trait loci for agronomically important quantitative traits, sourced from literature, were successfully mapped on the new pseudo-chromosomes. The improved assembly could also be used as a reference to identify potential errors in placement of specific markers in the linkage groups of the genetic maps used to assemble the consensus map. The 3422 unique markers from five genetic maps, anchored to the pseudo-chromosomes of AM_EG5.1, are an important resource that can be used preferentially to either construct new maps or fill gaps in existing genetic maps. Synteny analysis further revealed that the AM_EG5.1 had high collinearity with the date palm genome cultivar 'Barhee BC4' and shared most of its segmental duplications. This improved chromosomal-level genome is a valuable resource for genetic research in oil palm.
Collapse
Affiliation(s)
| | - Rozana Rosli
- Malaysian Palm Oil Board, 6 Persiaran Institusi, Bandar Baru Bangi, 43000 Kajang, Selangor, Malaysia
| | - Kuang-Lim Chan
- Malaysian Palm Oil Board, 6 Persiaran Institusi, Bandar Baru Bangi, 43000 Kajang, Selangor, Malaysia
| | - Mohd Amin Ab Halim
- Malaysian Palm Oil Board, 6 Persiaran Institusi, Bandar Baru Bangi, 43000 Kajang, Selangor, Malaysia
| | - Ngoot-Chin Ting
- Malaysian Palm Oil Board, 6 Persiaran Institusi, Bandar Baru Bangi, 43000 Kajang, Selangor, Malaysia
| | - Rajinder Singh
- Malaysian Palm Oil Board, 6 Persiaran Institusi, Bandar Baru Bangi, 43000 Kajang, Selangor, Malaysia
| | - Eng-Ti Leslie Low
- Malaysian Palm Oil Board, 6 Persiaran Institusi, Bandar Baru Bangi, 43000 Kajang, Selangor, Malaysia.
| |
Collapse
|
3
|
Wang P, Mo Y, Wang Y, Fei Y, Huang J, Ni J, Xu ZF. Macadamia germplasm and genomic database (MacadamiaGGD): A comprehensive platform for germplasm innovation and functional genomics in Macadamia. FRONTIERS IN PLANT SCIENCE 2022; 13:1007266. [PMID: 36388568 PMCID: PMC9646992 DOI: 10.3389/fpls.2022.1007266] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/30/2022] [Accepted: 10/11/2022] [Indexed: 06/16/2023]
Abstract
As an important nut crop species, macadamia continues to gain increased amounts of attention worldwide. Nevertheless, with the vast increase in macadamia omic data, it is becoming difficult for researchers to effectively process and utilize the information. In this work, we developed the first integrated germplasm and genomic database for macadamia (MacadamiaGGD), which includes five genomes of four species; three chloroplast and mitochondrial genomes; genome annotations; transcriptomic data for three macadamia varieties, germplasm data for four species and 262 main varieties; nine genetic linkage maps; and 35 single-nucleotide polymorphisms (SNPs). The database serves as a valuable collection of simple sequence repeat (SSR) markers, including both markers that are based on macadamia genomic sequences and developed in this study and markers developed previously. MacadamiaGGD is also integrated with multiple bioinformatic tools, such as search, JBrowse, BLAST, primer designer, sequence fetch, enrichment analysis, multiple sequence alignment, genome alignment, and gene homology annotation, which allows users to conveniently analyze their data of interest. MacadamiaGGD is freely available online (http://MacadamiaGGD.net). We believe that the database and additional information of the SSR markers can help scientists better understand the genomic sequence information of macadamia and further facilitate molecular breeding efforts of this species.
Collapse
Affiliation(s)
- Pan Wang
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, College of Forestry, Guangxi University, Nanning, China
- Key Laboratory of National Forestry and Grassland Administration for Fast-Growing Tree Breeding and Cultivation in Central and Southern China, College of Forestry, Guangxi University, Nanning, China
| | - Yi Mo
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, College of Forestry, Guangxi University, Nanning, China
- Key Laboratory of National Forestry and Grassland Administration for Fast-Growing Tree Breeding and Cultivation in Central and Southern China, College of Forestry, Guangxi University, Nanning, China
| | - Yi Wang
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, College of Forestry, Guangxi University, Nanning, China
- Key Laboratory of National Forestry and Grassland Administration for Fast-Growing Tree Breeding and Cultivation in Central and Southern China, College of Forestry, Guangxi University, Nanning, China
| | - Yuchong Fei
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, College of Forestry, Guangxi University, Nanning, China
- Key Laboratory of National Forestry and Grassland Administration for Fast-Growing Tree Breeding and Cultivation in Central and Southern China, College of Forestry, Guangxi University, Nanning, China
| | - Jianting Huang
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, College of Forestry, Guangxi University, Nanning, China
- Key Laboratory of National Forestry and Grassland Administration for Fast-Growing Tree Breeding and Cultivation in Central and Southern China, College of Forestry, Guangxi University, Nanning, China
| | - Jun Ni
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, College of Forestry, Guangxi University, Nanning, China
- Key Laboratory of National Forestry and Grassland Administration for Fast-Growing Tree Breeding and Cultivation in Central and Southern China, College of Forestry, Guangxi University, Nanning, China
| | - Zeng-Fu Xu
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, College of Forestry, Guangxi University, Nanning, China
- Key Laboratory of National Forestry and Grassland Administration for Fast-Growing Tree Breeding and Cultivation in Central and Southern China, College of Forestry, Guangxi University, Nanning, China
| |
Collapse
|
4
|
Zakeel MCM, Alam M, Geering ADW, Topp B, Akinsanmi OA. Discovery of Single Nucleotide Polymorphisms for Resistance to Abnormal Vertical Growth in Macadamia. FRONTIERS IN PLANT SCIENCE 2021; 12:756815. [PMID: 35003155 PMCID: PMC8739493 DOI: 10.3389/fpls.2021.756815] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/11/2021] [Accepted: 12/07/2021] [Indexed: 06/14/2023]
Abstract
Abnormal vertical growth (AVG) syndrome is a serious threat to the Australian macadamia industry as it decreases the yield of nuts by as much as 70% per annum. A lack of information on the cause of AVG has hindered the development of an effective disease management strategy. Discovery of genetic markers associated with disease resistance can be used as tool for rapid selection of elite cultivars, hence helps in efficient disease management. Differences in field susceptibility of macadamia cultivars provide an opportunity for discovery of genetic markers that are associated with host resistance. REML mixed model analysis was performed to estimate the AVG rating of 51 cultivars from multiple origins using phenotypic data from 359 trees planted in four sites. Most of the Hawaiian cultivars were found as susceptible, while selections from the Australian macadamia industry breeding program were predominantly resistant. All the cultivars were genotyped for 13,221 DArTseq-based single nucleotide polymorphism (SNP) markers. A bulked sample analysis was performed using 20 genotypes each at the extremes of AVG phenotypic ratings. Ten SNP markers were predicted to be associated with AVG resistance and two arbitrarily selected SNP markers were validated using PCR and Sanger sequencing. Our findings suggest that AVG resistance in the commercial cultivars may be derived from the genomic introgression of Macadamia tetraphylla through interspecific hybridization. The results may support marker-assisted selection for macadamia germplasm with AVG resistance.
Collapse
Affiliation(s)
- Mohamed Cassim Mohamed Zakeel
- Centre for Horticultural Science, Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, Brisbane, QLD, Australia
| | - Mobashwer Alam
- Centre for Horticultural Science, Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, Nambour, QLD, Australia
| | - Andrew D. W. Geering
- Centre for Horticultural Science, Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, Brisbane, QLD, Australia
| | - Bruce Topp
- Centre for Horticultural Science, Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, Nambour, QLD, Australia
| | - Olufemi A. Akinsanmi
- Centre for Horticultural Science, Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, Brisbane, QLD, Australia
| |
Collapse
|
5
|
McCartney AM, Hilario E, Choi S, Guhlin J, Prebble JM, Houliston G, Buckley TR, Chagné D. An exploration of assembly strategies and quality metrics on the accuracy of the rewarewa (Knightia excelsa) genome. Mol Ecol Resour 2021; 21:2125-2144. [PMID: 33955186 PMCID: PMC8362059 DOI: 10.1111/1755-0998.13406] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2020] [Revised: 03/18/2021] [Accepted: 04/20/2021] [Indexed: 12/17/2022]
Abstract
We used long read sequencing data generated from Knightia excelsa, a nectar-producing Proteaceae tree endemic to Aotearoa (New Zealand), to explore how sequencing data type, volume and workflows can impact final assembly accuracy and chromosome reconstruction. Establishing a high-quality genome for this species has specific cultural importance to Māori and commercial importance to honey producers in Aotearoa. Assemblies were produced by five long read assemblers using data subsampled based on read lengths, two polishing strategies and two Hi-C mapping methods. Our results from subsampling the data by read length showed that each assembler tested performed differently depending on the coverage and the read length of the data. Subsampling highlighted that input data with longer read lengths but perhaps lower coverage constructed more contiguous, kmers and gene-complete assemblies than short read length input data with higher coverage. The final genome assembly was constructed into 14 pseudochromosomes using an initial flye long read assembly, a racon/medaka/pilon combined polishing strategy, salsa2 and allhic scaffolding, juicebox curation, and Macadamia linkage map validation. We highlighted the importance of developing assembly workflows based on the volume and read length of sequencing data and established a robust set of quality metrics for generating high-quality assemblies. Scaffolding analyses highlighted that problems found in the initial assemblies could not be resolved accurately by Hi-C data and that assembly scaffolding was more successful when the underlying contig assembly was of higher accuracy. These findings provide insight into how quality assessment tools can be implemented throughout genome assembly pipelines to inform the de novo reconstruction of a high-quality genome assembly for nonmodel organisms.
Collapse
Affiliation(s)
- Ann M. McCartney
- Manaaki Whenua ‐ Landcare ResearchAucklandNew Zealand
- Genomics AotearoaDunedinNew Zealand
| | - Elena Hilario
- Genomics AotearoaDunedinNew Zealand
- The New Zealand Institute for Plant and Food Research (Plant & Food Research)SandringhamNew Zealand
| | - Seung‐Sub Choi
- Manaaki Whenua ‐ Landcare ResearchAucklandNew Zealand
- Genomics AotearoaDunedinNew Zealand
- School of Biological SciencesThe University of AucklandAucklandNew Zealand
| | - Joseph Guhlin
- Genomics AotearoaDunedinNew Zealand
- University of OtagoDunedinNew Zealand
| | - Jessica M. Prebble
- Genomics AotearoaDunedinNew Zealand
- Manaaki Whenua Landcare ResearchLincolnNew Zealand
| | - Gary Houliston
- Genomics AotearoaDunedinNew Zealand
- Manaaki Whenua Landcare ResearchLincolnNew Zealand
| | - Thomas R. Buckley
- Manaaki Whenua ‐ Landcare ResearchAucklandNew Zealand
- Genomics AotearoaDunedinNew Zealand
- School of Biological SciencesThe University of AucklandAucklandNew Zealand
| | - David Chagné
- Genomics AotearoaDunedinNew Zealand
- Plant & Food ResearchFitzherbert, Palmerston NorthNew Zealand
| |
Collapse
|
6
|
Andrés-Hernández L, Halimi RA, Mauleon R, Mayes S, Baten A, King GJ. Challenges for FAIR-compliant description and comparison of crop phenotype data with standardized controlled vocabularies. Database (Oxford) 2021; 2021:baab028. [PMID: 33991093 PMCID: PMC8122365 DOI: 10.1093/database/baab028] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2020] [Revised: 04/14/2021] [Accepted: 04/30/2021] [Indexed: 12/04/2022]
Abstract
Crop phenotypic data underpin many pre-breeding efforts to characterize variation within germplasm collections. Although there has been an increase in the global capacity for accumulating and comparing such data, a lack of consistency in the systematic description of metadata often limits integration and sharing. We therefore aimed to understand some of the challenges facing findable, accesible, interoperable and reusable (FAIR) curation and annotation of phenotypic data from minor and underutilized crops. We used bambara groundnut (Vigna subterranea) as an exemplar underutilized crop to assess the ability of the Crop Ontology system to facilitate curation of trait datasets, so that they are accessible for comparative analysis. This involved generating a controlled vocabulary Trait Dictionary of 134 terms. Systematic quantification of syntactic and semantic cohesiveness of the full set of 28 crop-specific COs identified inconsistencies between trait descriptor names, a relative lack of cross-referencing to other ontologies and a flat ontological structure for classifying traits. We also evaluated the Minimal Information About a Phenotyping Experiment and FAIR compliance of bambara trait datasets curated within the CropStoreDB schema. We discuss specifications for a more systematic and generic approach to trait controlled vocabularies, which would benefit from representation of terms that adhere to Open Biological and Biomedical Ontologies principles. In particular, we focus on the benefits of reuse of existing definitions within pre- and post-composed axioms from other domains in order to facilitate the curation and comparison of datasets from a wider range of crops. Database URL: https://www.cropstoredb.org/cs_bambara.html.
Collapse
Affiliation(s)
- Liliana Andrés-Hernández
- Southern Cross Plant Science, Southern Cross University, PO Box 157, Lismore, NSW 2480, Australia
| | - Razlin Azman Halimi
- Southern Cross Plant Science, Southern Cross University, PO Box 157, Lismore, NSW 2480, Australia
| | - Ramil Mauleon
- Southern Cross Plant Science, Southern Cross University, PO Box 157, Lismore, NSW 2480, Australia
| | - Sean Mayes
- School of Biosciences, University of Nottingham, Sutton Bonington, Leicestershire, LE12 5RD,Nottingham, Nottingham, UK
| | - Abdul Baten
- Institute of Precision Medicine & Bioinformatics, Sydney Local Health District, Royal Prince Alfred Hospital, Missenden Road, Camperdown, NSW 2050, Australia
| | - Graham J King
- Southern Cross Plant Science, Southern Cross University, PO Box 157, Lismore, NSW 2480, Australia
| |
Collapse
|
7
|
Chromosome-Scale Assembly and Annotation of the Macadamia Genome ( Macadamia integrifolia HAES 741). G3-GENES GENOMES GENETICS 2020; 10:3497-3504. [PMID: 32747341 PMCID: PMC7534425 DOI: 10.1534/g3.120.401326] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/02/2022]
Abstract
Macadamia integrifolia is a representative of the large basal eudicot family Proteaceae and the main progenitor species of the Australian native nut crop macadamia. Since its commercialisation in Hawaii fewer than 100 years ago, global production has expanded rapidly. However, genomic resources are limited in comparison to other horticultural crops. The first draft assembly of M. integrifolia had good coverage of the functional gene space but its high fragmentation has restricted its use in comparative genomics and association studies. Here we have generated an improved assembly of cultivar HAES 741 (4,094 scaffolds, 745 Mb, N50 413 kb) using a combination of Illumina paired and PacBio long read sequences. Scaffolds were anchored to 14 pseudo-chromosomes using seven genetic linkage maps. This assembly has improved contiguity and coverage, with >120 Gb of additional sequence. Following annotation, 34,274 protein-coding genes were predicted, representing 90% of the expected gene content. Our results indicate that the macadamia genome is repetitive and heterozygous. The total repeat content was 55% and genome-wide heterozygosity, estimated by read mapping, was 0.98% or an average of one SNP per 102 bp. This is the first chromosome-scale genome assembly for macadamia and the Proteaceae. It is expected to be a valuable resource for breeding, gene discovery, conservation and evolutionary genomics.
Collapse
|