1
|
Wu P, Xue N, Yang J, Zhang Q, Sun Y, Zhang W. OGU: A Toolbox for Better Utilising Organelle Genomic Data. Mol Ecol Resour 2025; 25:e14044. [PMID: 39523951 PMCID: PMC11887606 DOI: 10.1111/1755-0998.14044] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2024] [Revised: 10/04/2024] [Accepted: 10/28/2024] [Indexed: 11/16/2024]
Abstract
Organelle genomes serve as crucial datasets for investigating the genetics and evolution of plants and animals, genome diversity, and species identification. To enhance the collection, analysis, and visualisation of such data, we have developed a novel open-source software tool named Organelle Genome Utilities (OGU). The software encompasses three modules designed to streamline the handling of organelle genome data. The data collection module is dedicated to retrieving, validating and organising sequence information. The evaluation module assesses sequence variance using a range of methods, including novel metrics termed stem and terminal phylogenetic diversity. The primer module designs universal primers for downstream applications. Finally, a visualisation pipeline has been developed to present comprehensive insights into organelle genomes across different lineages rather than focusing solely on individual species. The performance, compatibility and stability of OGU have been rigorously evaluated through benchmarking with four datasets, including one million mixed GenBank records, plastid genomic data from the Lamiaceae family, mitochondrial data from rodents, and 308 plastid genomes sourced from various angiosperm families. Based on software capabilities, we identified 30 plastid intergenic spacers. These spacers exhibit a moderate evolutionary rate and offer practical utility comparable to coding regions, highlighting the potential applications of intergenic spacers in organelle genomes. We anticipate that OGU will substantially enhance the efficient utilisation of organelle genomic data and broaden the prospects for related research endeavours.
Collapse
Affiliation(s)
- Ping Wu
- College of Life SciencesSichuan Normal UniversityChengduSichuanChina
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of BotanyChinese Academy of SciencesBeijingChina
| | - Ningning Xue
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of BotanyChinese Academy of SciencesBeijingChina
| | - Jie Yang
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of BotanyChinese Academy of SciencesBeijingChina
- College of Biology and FoodShangqiu Normal UniversityShangqiuHenanChina
| | - Qiang Zhang
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of BotanyChinese Academy of SciencesBeijingChina
- Institute of Marine Science and TechnologyShandong UniversityQingdaoChina
| | - Yuzhe Sun
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of BotanyChinese Academy of SciencesBeijingChina
| | - Wen Zhang
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of BotanyChinese Academy of SciencesBeijingChina
- China National GeneBank, BGI‐ShenzhenShenzhenChina
| |
Collapse
|
2
|
Yu Y, Wang X, Qu R, OuYang Z, Guo J, Zhao Y, Huang L. Extraction and analysis of high-quality chloroplast DNA with reduced nuclear DNA for medicinal plants. BMC Biotechnol 2024; 24:20. [PMID: 38637734 PMCID: PMC11025248 DOI: 10.1186/s12896-024-00843-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2023] [Accepted: 03/15/2024] [Indexed: 04/20/2024] Open
Abstract
BACKGROUND Obtaining high-quality chloroplast genome sequences requires chloroplast DNA (cpDNA) samples that meet the sequencing requirements. The quality of extracted cpDNA directly impacts the efficiency and accuracy of sequencing analysis. Currently, there are no reported methods for extracting cpDNA from Erigeron breviscapus. Therefore, we developed a suitable method for extracting cpDNA from E. breviscapus and further verified its applicability to other medicinal plants. RESULTS We conducted a comparative analysis of chloroplast isolation and cpDNA extraction using modified high-salt low-pH method, the high-salt method, and the NaOH low-salt method, respectively. Subsequently, the number of cpDNA copies relative to the nuclear DNA (nDNA ) was quantified via qPCR. As anticipated, chloroplasts isolated from E. breviscapus using the modified high-salt low-pH method exhibited intact structures with minimal cell debris. Moreover, the concentration, purity, and quality of E. breviscapus cpDNA extracted through this method surpassed those obtained from the other two methods. Furthermore, qPCR analysis confirmed that the modified high-salt low-pH method effectively minimized nDNA contamination in the extracted cpDNA. We then applied the developed modified high-salt low-pH method to other medicinal plant species, including Mentha haplocalyx, Taraxacum mongolicum, and Portulaca oleracea. The resultant effect on chloroplast isolation and cpDNA extraction further validated the generalizability and efficacy of this method across different plant species. CONCLUSIONS The modified high-salt low-pH method represents a reliable approach for obtaining high-quality cpDNA from E. breviscapus. Its universal applicability establishes a solid foundation for chloroplast genome sequencing and analysis of this species. Moreover, it serves as a benchmark for developing similar methods to extract chloroplast genomes from other medicinal plants.
Collapse
Affiliation(s)
- Yifan Yu
- State Key Laboratory for Quality Ensurance and Sustainable Use of Dao-di Herbs, National Resource Center for Chinese Materia Medica, China Academy of Chinese Medical Sciences, 100700, Beijing, China
- School of Pharmacy, Jiangsu University, 212013, Zhenjiang, China
| | - Xinxin Wang
- State Key Laboratory for Quality Ensurance and Sustainable Use of Dao-di Herbs, National Resource Center for Chinese Materia Medica, China Academy of Chinese Medical Sciences, 100700, Beijing, China
| | - Renjun Qu
- State Key Laboratory for Quality Ensurance and Sustainable Use of Dao-di Herbs, National Resource Center for Chinese Materia Medica, China Academy of Chinese Medical Sciences, 100700, Beijing, China
| | - Zhen OuYang
- School of Pharmacy, Jiangsu University, 212013, Zhenjiang, China
| | - Juan Guo
- State Key Laboratory for Quality Ensurance and Sustainable Use of Dao-di Herbs, National Resource Center for Chinese Materia Medica, China Academy of Chinese Medical Sciences, 100700, Beijing, China
| | - Yujun Zhao
- State Key Laboratory for Quality Ensurance and Sustainable Use of Dao-di Herbs, National Resource Center for Chinese Materia Medica, China Academy of Chinese Medical Sciences, 100700, Beijing, China.
| | - Luqi Huang
- State Key Laboratory for Quality Ensurance and Sustainable Use of Dao-di Herbs, National Resource Center for Chinese Materia Medica, China Academy of Chinese Medical Sciences, 100700, Beijing, China.
- School of Pharmacy, Jiangsu University, 212013, Zhenjiang, China.
| |
Collapse
|
3
|
Xu C, Li J, Song LY, Guo ZJ, Song SW, Zhang LD, Zheng HL. PlantC2U: deep learning of cross-species sequence landscapes predicts plastid C-to-U RNA editing in plants. JOURNAL OF EXPERIMENTAL BOTANY 2024; 75:2266-2279. [PMID: 38190348 DOI: 10.1093/jxb/erae007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/29/2023] [Accepted: 01/07/2024] [Indexed: 01/10/2024]
Abstract
In plants, C-to-U RNA editing mainly occurs in plastid and mitochondrial transcripts, which contributes to a complex transcriptional regulatory network. More evidence reveals that RNA editing plays critical roles in plant growth and development. However, accurate detection of RNA editing sites using transcriptome sequencing data alone is still challenging. In the present study, we develop PlantC2U, which is a convolutional neural network, to predict plastid C-to-U RNA editing based on the genomic sequence. PlantC2U achieves >95% sensitivity and 99% specificity, which outperforms the PREPACT tool, random forests, and support vector machines. PlantC2U not only further checks RNA editing sites from transcriptome data to reduce possible false positives, but also assesses the effect of different mutations on C-to-U RNA editing based on the flanking sequences. Moreover, we found the patterns of tissue-specific RNA editing in the mangrove plant Kandelia obovata, and observed reduced C-to-U RNA editing rates in the cold stress response of K. obovata, suggesting their potential regulatory roles in plant stress adaptation. In addition, we present RNAeditDB, available online at https://jasonxu.shinyapps.io/RNAeditDB/. Together, PlantC2U and RNAeditDB will help researchers explore the RNA editing events in plants and thus will be of broad utility for the plant research community.
Collapse
Affiliation(s)
- Chaoqun Xu
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen 361102, China
| | - Jing Li
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen 361102, China
| | - Ling-Yu Song
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen 361102, China
| | - Ze-Jun Guo
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen 361102, China
| | - Shi-Wei Song
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen 361102, China
| | - Lu-Dan Zhang
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen 361102, China
| | - Hai-Lei Zheng
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen 361102, China
| |
Collapse
|
4
|
Li T, Zhang S, Deng Y, Li Y. Comparative Analysis of Chloroplast Genomes for the Genus Manglietia Blume (Magnoliaceae): Molecular Structure and Phylogenetic Evolution. Genes (Basel) 2024; 15:406. [PMID: 38674341 PMCID: PMC11048997 DOI: 10.3390/genes15040406] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2024] [Revised: 03/19/2024] [Accepted: 03/20/2024] [Indexed: 04/28/2024] Open
Abstract
Manglietia Blume, belonging to the Magnoliaceae family and mainly distributed in tropical and subtropical regions of Asia, has great scientific and economic value. In this study, we employed next-generation sequencing followed by de novo assembly to investigate the adaptive evolution of Manglietia using plastid genetic information. We newly sequenced the complete or nearly complete plastomes of four Manglietia species (Manglietia aromatica, Manglietia calcarea, Manglietia kwangtungensis, and Manglietia glauca) and conducted comparative analysis with seventeen published plastomes to examine the evolutionary pattern within this genus. The plastomes of these five newly sequenced Manglietia species range from 157,093 bp (M. calcarea2) to 160,493 bp (M. kwangtungensis), all exhibiting circular structures when mapped. Nucleotide diversity was observed across the plastomes, leading us to identify 13 mutational hotspot regions, comprising eight intergenic spacer regions and five gene regions. Our phylogenetic analyses based on 77 protein-coding genes generated phylogenetic relationships with high support and resolution for Manglietia. This genus can be divided into three clades, and the previously proposed infrageneric classifications are not supported by our studies. Furthermore, the close affinity between M. aromatica and M. calcarea is supported by the present work, and further studies are necessary to conclude the taxonomic treatment for the latter. These results provide resources for the comparative plastome, breeding, and plastid genetic engineering of Magnoliaceae and flowering plants.
Collapse
Affiliation(s)
- Tingzhang Li
- College of Forestry and Landscape Architecture, South China Agricultural University, Guangzhou 510642, China; (T.L.); (S.Z.)
| | - Shuangyu Zhang
- College of Forestry and Landscape Architecture, South China Agricultural University, Guangzhou 510642, China; (T.L.); (S.Z.)
| | - Yunfei Deng
- State Key Laboratory of Plant Diversity and Specialty Crops, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou 510650, China
- Key Laboratory of National Forestry and Grassland Administration on Plant Conservation and Utilization in Southern China, Guangzhou 510650, China
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou 510650, China
| | - Yuling Li
- College of Forestry and Landscape Architecture, South China Agricultural University, Guangzhou 510642, China; (T.L.); (S.Z.)
| |
Collapse
|
5
|
Tian D, Xu T, Kang H, Luo H, Wang Y, Chen M, Li R, Ma L, Wang Z, Hao L, Tang B, Zou D, Xiao J, Zhao W, Bao Y, Zhang Z, Song S. Plant genomic resources at National Genomics Data Center: assisting in data-driven breeding applications. ABIOTECH 2024; 5:94-106. [PMID: 38576435 PMCID: PMC10987443 DOI: 10.1007/s42994-023-00134-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/10/2023] [Accepted: 12/18/2023] [Indexed: 04/06/2024]
Abstract
Genomic data serve as an invaluable resource for unraveling the intricacies of the higher plant systems, including the constituent elements within and among species. Through various efforts in genomic data archiving, integrative analysis and value-added curation, the National Genomics Data Center (NGDC), which is a part of the China National Center for Bioinformation (CNCB), has successfully established and currently maintains a vast amount of database resources. This dedicated initiative of the NGDC facilitates a data-rich ecosystem that greatly strengthens and supports genomic research efforts. Here, we present a comprehensive overview of central repositories dedicated to archiving, presenting, and sharing plant omics data, introduce knowledgebases focused on variants or gene-based functional insights, highlight species-specific multiple omics database resources, and briefly review the online application tools. We intend that this review can be used as a guide map for plant researchers wishing to select effective data resources from the NGDC for their specific areas of study. Supplementary Information The online version contains supplementary material available at 10.1007/s42994-023-00134-4.
Collapse
Affiliation(s)
- Dongmei Tian
- National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences & China National Center for Bioinformation, Beijing, 100101 China
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences & China National Center for Bioinformation, Beijing, 100101 China
| | - Tianyi Xu
- National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences & China National Center for Bioinformation, Beijing, 100101 China
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences & China National Center for Bioinformation, Beijing, 100101 China
| | - Hailong Kang
- National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences & China National Center for Bioinformation, Beijing, 100101 China
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences & China National Center for Bioinformation, Beijing, 100101 China
- University of Chinese Academy of Sciences, Beijing, 100049 China
| | - Hong Luo
- National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences & China National Center for Bioinformation, Beijing, 100101 China
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences & China National Center for Bioinformation, Beijing, 100101 China
| | - Yanqing Wang
- National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences & China National Center for Bioinformation, Beijing, 100101 China
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences & China National Center for Bioinformation, Beijing, 100101 China
| | - Meili Chen
- National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences & China National Center for Bioinformation, Beijing, 100101 China
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences & China National Center for Bioinformation, Beijing, 100101 China
| | - Rujiao Li
- National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences & China National Center for Bioinformation, Beijing, 100101 China
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences & China National Center for Bioinformation, Beijing, 100101 China
| | - Lina Ma
- National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences & China National Center for Bioinformation, Beijing, 100101 China
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences & China National Center for Bioinformation, Beijing, 100101 China
| | - Zhonghuang Wang
- National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences & China National Center for Bioinformation, Beijing, 100101 China
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences & China National Center for Bioinformation, Beijing, 100101 China
- University of Chinese Academy of Sciences, Beijing, 100049 China
| | - Lili Hao
- National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences & China National Center for Bioinformation, Beijing, 100101 China
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences & China National Center for Bioinformation, Beijing, 100101 China
| | - Bixia Tang
- National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences & China National Center for Bioinformation, Beijing, 100101 China
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences & China National Center for Bioinformation, Beijing, 100101 China
| | - Dong Zou
- National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences & China National Center for Bioinformation, Beijing, 100101 China
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences & China National Center for Bioinformation, Beijing, 100101 China
| | - Jingfa Xiao
- National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences & China National Center for Bioinformation, Beijing, 100101 China
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences & China National Center for Bioinformation, Beijing, 100101 China
- University of Chinese Academy of Sciences, Beijing, 100049 China
| | - Wenming Zhao
- National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences & China National Center for Bioinformation, Beijing, 100101 China
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences & China National Center for Bioinformation, Beijing, 100101 China
- University of Chinese Academy of Sciences, Beijing, 100049 China
| | - Yiming Bao
- National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences & China National Center for Bioinformation, Beijing, 100101 China
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences & China National Center for Bioinformation, Beijing, 100101 China
- University of Chinese Academy of Sciences, Beijing, 100049 China
| | - Zhang Zhang
- National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences & China National Center for Bioinformation, Beijing, 100101 China
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences & China National Center for Bioinformation, Beijing, 100101 China
- University of Chinese Academy of Sciences, Beijing, 100049 China
| | - Shuhui Song
- National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences & China National Center for Bioinformation, Beijing, 100101 China
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences & China National Center for Bioinformation, Beijing, 100101 China
- University of Chinese Academy of Sciences, Beijing, 100049 China
| |
Collapse
|
6
|
CNCB-NGDC Members and Partners, Bai X, Bao Y, Bei S, Bu C, Cao R, Cao Y, Cen H, Chao J, Chen F, Chen H, Chen K, Chen M, Chen M, Chen M, Chen Q, Chen R, Chen S, Chen T, Chen X, Chen X, Cheng Y, Chu Y, Cui Q, Dong L, Du Z, Duan G, Fan S, Fan Z, Fang X, Fang Z, Feng Z, Fu S, Gao F, Gao G, Gao H, Gao W, Gao X, Gao X, Gao X, Gong J, Gong J, Gou Y, Gu S, Guo AY, Guo G, Guo X, Han C, Hao D, Hao L, He Q, He S, He S, Hu W, Huang K, Huang T, Huang X, Huang Y, Jia P, Jia Y, Jiang C, Jiang M, Jiang S, Jiang T, Jiang X, Jin E, Jin W, Kang H, Kang H, Kong D, Lan L, Lei W, Li CY, Li C, Li C, Li H, Li J, Li J, Li L, Li P, Li R, Li X, Li Y, Li Y, Li Z, Liao X, Lin S, Lin Y, Ling Y, Liu B, Liu CJ, Liu D, Liu GH, Liu L, Liu S, Liu W, Liu X, Liu X, Liu Y, Liu Y, et alCNCB-NGDC Members and Partners, Bai X, Bao Y, Bei S, Bu C, Cao R, Cao Y, Cen H, Chao J, Chen F, Chen H, Chen K, Chen M, Chen M, Chen M, Chen Q, Chen R, Chen S, Chen T, Chen X, Chen X, Cheng Y, Chu Y, Cui Q, Dong L, Du Z, Duan G, Fan S, Fan Z, Fang X, Fang Z, Feng Z, Fu S, Gao F, Gao G, Gao H, Gao W, Gao X, Gao X, Gao X, Gong J, Gong J, Gou Y, Gu S, Guo AY, Guo G, Guo X, Han C, Hao D, Hao L, He Q, He S, He S, Hu W, Huang K, Huang T, Huang X, Huang Y, Jia P, Jia Y, Jiang C, Jiang M, Jiang S, Jiang T, Jiang X, Jin E, Jin W, Kang H, Kang H, Kong D, Lan L, Lei W, Li CY, Li C, Li C, Li H, Li J, Li J, Li L, Li P, Li R, Li X, Li Y, Li Y, Li Z, Liao X, Lin S, Lin Y, Ling Y, Liu B, Liu CJ, Liu D, Liu GH, Liu L, Liu S, Liu W, Liu X, Liu X, Liu Y, Liu Y, Lu M, Lu T, Luo H, Luo H, Luo M, Luo S, Luo X, Ma L, Ma Y, Mai J, Meng J, Meng X, Meng Y, Meng Y, Miao W, Miao YR, Ni L, Nie Z, Niu G, Niu X, Niu Y, Pan R, Pan S, Peng D, Peng J, Qi J, Qi Y, Qian Q, Qin Y, Qu H, Ren J, Ren J, Sang Z, Shang K, Shen WK, Shen Y, Shi Y, Song S, Song T, Su T, Sun J, Sun Y, Sun Y, Sun Y, Tang B, Tang D, Tang Q, Tang Z, Tian D, Tian F, Tian W, Tian Z, Wang A, Wang G, Wang G, Wang J, Wang J, Wang P, Wang P, Wang W, Wang Y, Wang Y, Wang Y, Wang Y, Wang Z, Wei H, Wei Y, Wei Z, Wu D, Wu G, Wu S, Wu S, Wu W, Wu W, Wu Z, Xia Z, Xiao J, Xiao L, Xiao Y, Xie G, Xie GY, Xie J, Xie Y, Xiong J, Xiong Z, Xu D, Xu S, Xu T, Xu T, Xue Y, Xue Y, Yan C, Yang D, Yang F, Yang F, Yang H, Yang J, Yang K, Yang N, Yang QY, Yang S, Yang X, Yang X, Yang X, Yang YG, Ye W, Yu C, Yu F, Yu S, Yuan C, Yuan H, Zeng J, Zhai S, Zhang C, Zhang F, Zhang G, Zhang M, Zhang P, Zhang Q, Zhang R, Zhang S, Zhang W, Zhang W, Zhang W, Zhang X, Zhang X, Zhang Y, Zhang Y, Zhang Y, Zhang YE, Zhang Y, Zhang Z, Zhang Z, Zhao D, Zhao F, Zhao G, Zhao M, Zhao W, Zhao W, Zhao X, Zhao Y, Zhao Y, Zhao Z, Zheng X, Zheng Y, Zhou C, Zhou H, Zhou X, Zhou X, Zhou Y, Zhou Y, Zhu J, Zhu L, Zhu R, Zhu T, Zong W, Zou D, Zuo Z. Database Resources of the National Genomics Data Center, China National Center for Bioinformation in 2024. Nucleic Acids Res 2024; 52:D18-D32. [PMID: 38018256 PMCID: PMC10767964 DOI: 10.1093/nar/gkad1078] [Show More Authors] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Revised: 10/12/2023] [Accepted: 10/27/2023] [Indexed: 11/30/2023] Open
Abstract
The National Genomics Data Center (NGDC), which is a part of the China National Center for Bioinformation (CNCB), provides a family of database resources to support the global academic and industrial communities. With the rapid accumulation of multi-omics data at an unprecedented pace, CNCB-NGDC continuously expands and updates core database resources through big data archiving, integrative analysis and value-added curation. Importantly, NGDC collaborates closely with major international databases and initiatives to ensure seamless data exchange and interoperability. Over the past year, significant efforts have been dedicated to integrating diverse omics data, synthesizing expanding knowledge, developing new resources, and upgrading major existing resources. Particularly, several database resources are newly developed for the biodiversity of protists (P10K), bacteria (NTM-DB, MPA) as well as plant (PPGR, SoyOmics, PlantPan) and disease/trait association (CROST, HervD Atlas, HALL, MACdb, BioKA, BioKA, RePoS, PGG.SV, NAFLDkb). All the resources and services are publicly accessible at https://ngdc.cncb.ac.cn.
Collapse
|
7
|
Niu T, Tian C, Yang Y, Liu Q, Liu L, Tao Q, Li Z, Wu Z. Complete Chloroplast Genome of Corethrodendron fruticosum (Papilionoideae: Fabaceae): Comparative and Phylogenetic Analysis. Genes (Basel) 2023; 14:1289. [PMID: 37372469 DOI: 10.3390/genes14061289] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2023] [Revised: 06/16/2023] [Accepted: 06/17/2023] [Indexed: 06/29/2023] Open
Abstract
Corethrodendron fruticosum is an endemic forage grasses in China with high ecological value. In this study, the complete chloroplast genome of C. fruticosum was sequenced using Illumina paired-end sequencing. The C. fruticosum chloroplast genome was 123,100 bp and comprised 105 genes, including 74 protein-coding genes, 4 rRNA-coding genes, and 27 tRNA-coding genes. The genome had a GC content of 34.53%, with 50 repetitive sequences and 63 simple repeat repetitive sequences that did not contain reverse repeats. The simple repeats included 45 single-nucleotide repeats, which accounted for the highest proportion and primarily comprised A/T repeats. A comparative analysis of C. fruticosum, C. multijugum, and four Hedysarum species revealed that the six genomes were highly conserved, with differentials primarily located in the conserved non-coding regions. Moreover, the accD and clpP genes in the coding regions exhibited high nucleotide variability. Accordingly, these genes may serve as molecular markers for the classification and phylogenetic analysis of Corethrodendron species. Phylogenetic analysis further revealed that C. fruticosum and C. multijugum appeared in different clades than the four Hedysarum species. The newly sequenced chloroplast genome provides further insights into the phylogenetic position of C. fruticosum, which is useful for the classification and identification of Corethrodendron.
Collapse
Affiliation(s)
- Tianxiu Niu
- Key Laboratory of National Forestry and Grassland Administration on Grassland Resources and Ecology in the Yellow River Delta, Qingdao Key Laboratory of Specialty Plant Germplasm Innovation and Utilization in Saline Soils of Coastal Beach, College of Grassland Science, Qingdao Agricultural University, Qingdao 266109, China
- Institute of Grassland Research, Chinese Academy of Agricultural Sciences, Hohhot 010010, China
| | - Chunyu Tian
- Institute of Grassland Research, Chinese Academy of Agricultural Sciences, Hohhot 010010, China
- Key Laboratory of Grassland Resources and Utilization of Ministry of Agriculture, Hohhot 010010, China
| | - Yanting Yang
- Institute of Grassland Research, Chinese Academy of Agricultural Sciences, Hohhot 010010, China
- Key Laboratory of Grassland Resources and Utilization of Ministry of Agriculture, Hohhot 010010, China
| | - Qian Liu
- Institute of Grassland Research, Chinese Academy of Agricultural Sciences, Hohhot 010010, China
- Key Laboratory of Grassland Resources and Utilization of Ministry of Agriculture, Hohhot 010010, China
| | - Lemeng Liu
- Institute of Grassland Research, Chinese Academy of Agricultural Sciences, Hohhot 010010, China
- Key Laboratory of Grassland Resources and Utilization of Ministry of Agriculture, Hohhot 010010, China
| | - Qibo Tao
- Key Laboratory of National Forestry and Grassland Administration on Grassland Resources and Ecology in the Yellow River Delta, Qingdao Key Laboratory of Specialty Plant Germplasm Innovation and Utilization in Saline Soils of Coastal Beach, College of Grassland Science, Qingdao Agricultural University, Qingdao 266109, China
| | - Zhiyong Li
- Institute of Grassland Research, Chinese Academy of Agricultural Sciences, Hohhot 010010, China
- Key Laboratory of Grassland Resources and Utilization of Ministry of Agriculture, Hohhot 010010, China
| | - Zinian Wu
- Institute of Grassland Research, Chinese Academy of Agricultural Sciences, Hohhot 010010, China
- Key Laboratory of Grassland Resources and Utilization of Ministry of Agriculture, Hohhot 010010, China
| |
Collapse
|
8
|
Qu XJ, Zou D, Zhang RY, Stull GW, Yi TS. Progress, challenge and prospect of plant plastome annotation. FRONTIERS IN PLANT SCIENCE 2023; 14:1166140. [PMID: 37324662 PMCID: PMC10266425 DOI: 10.3389/fpls.2023.1166140] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/14/2023] [Accepted: 05/02/2023] [Indexed: 06/17/2023]
Abstract
The plastome (plastid genome) represents an indispensable molecular data source for studying phylogeny and evolution in plants. Although the plastome size is much smaller than that of nuclear genome, and multiple plastome annotation tools have been specifically developed, accurate annotation of plastomes is still a challenging task. Different plastome annotation tools apply different principles and workflows, and annotation errors frequently occur in published plastomes and those issued in GenBank. It is therefore timely to compare available annotation tools and establish standards for plastome annotation. In this review, we review the basic characteristics of plastomes, trends in the publication of new plastomes, the annotation principles and application of major plastome annotation tools, and common errors in plastome annotation. We propose possible methods to judge pseudogenes and RNA-editing genes, jointly consider sequence similarity, customed algorithms, conserved domain or protein structure. We also propose the necessity of establishing a database of reference plastomes with standardized annotations, and put forward a set of quantitative standards for evaluating plastome annotation quality for the scientific community. In addition, we discuss how to generate standardized GenBank annotation flatfiles for submission and downstream analysis. Finally, we prospect future technologies for plastome annotation integrating plastome annotation approaches with diverse evidences and algorithms of nuclear genome annotation tools. This review will help researchers more efficiently use available tools to achieve high-quality plastome annotation, and promote the process of standardized annotation of the plastome.
Collapse
Affiliation(s)
- Xiao-Jian Qu
- Shandong Provincial Key Laboratory of Plant Stress Research, College of Life Sciences, Shandong Normal University, Ji’nan, Shandong, China
| | - Dan Zou
- Shandong Provincial Key Laboratory of Plant Stress Research, College of Life Sciences, Shandong Normal University, Ji’nan, Shandong, China
| | - Rui-Yu Zhang
- Shandong Provincial Key Laboratory of Plant Stress Research, College of Life Sciences, Shandong Normal University, Ji’nan, Shandong, China
| | - Gregory W. Stull
- Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Ting-Shuang Yi
- Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan, China
- University of Chinese Academy of Sciences, Beijing, China
| |
Collapse
|
9
|
CNCB-NGDC Members and Partners, Xue Y, Bao Y, Zhang Z, Zhao W, Xiao J, He S, Zhang G, Li Y, Zhao G, Chen R, Ma Y, Chen M, Li C, Jiang S, Zou D, Gong Z, Zhao X, Wang Y, Zhu J, Zhang Z, Zhao W, Xue Y, Bao Y, Song S, Zhang G, Ling Y, Wang Y, Yang J, Zhuang X, Duan G, Wu G, Chen X, Tian D, Li Z, Sun Y, Du Z, Hao L, Song S, Gao Y, Xiao J, Zhang Z, Bao Y, Tang B, Zhao W, Zhang Y, Zhang H, Zhang Z, Qian Q, Zhang Z, Xiao J, Kang H, Huang T, Chen X, Xia Z, Zhou X, Chao J, Tang B, Wang Z, Zhu J, Du Z, Zhang S, Xiao J, Tian W, Wang W, Zhao W, Wu S, Huang Y, Zhang M, Gong Z, Wang G, Zheng X, Zong W, Zhao W, Xing P, Li R, Liu Z, Bao Y, Lu M, Zhang Y, Yang F, Mai J, Gao Q, Xu X, Kang H, Hou L, Shang Y, Qain Q, Liu J, Jiang M, Zhang H, Bu C, Wang J, Zhang Z, Zhang Z, Zeng J, Li J, Xiao J, Pan S, Kang H, et alCNCB-NGDC Members and Partners, Xue Y, Bao Y, Zhang Z, Zhao W, Xiao J, He S, Zhang G, Li Y, Zhao G, Chen R, Ma Y, Chen M, Li C, Jiang S, Zou D, Gong Z, Zhao X, Wang Y, Zhu J, Zhang Z, Zhao W, Xue Y, Bao Y, Song S, Zhang G, Ling Y, Wang Y, Yang J, Zhuang X, Duan G, Wu G, Chen X, Tian D, Li Z, Sun Y, Du Z, Hao L, Song S, Gao Y, Xiao J, Zhang Z, Bao Y, Tang B, Zhao W, Zhang Y, Zhang H, Zhang Z, Qian Q, Zhang Z, Xiao J, Kang H, Huang T, Chen X, Xia Z, Zhou X, Chao J, Tang B, Wang Z, Zhu J, Du Z, Zhang S, Xiao J, Tian W, Wang W, Zhao W, Wu S, Huang Y, Zhang M, Gong Z, Wang G, Zheng X, Zong W, Zhao W, Xing P, Li R, Liu Z, Bao Y, Lu M, Zhang Y, Yang F, Mai J, Gao Q, Xu X, Kang H, Hou L, Shang Y, Qain Q, Liu J, Jiang M, Zhang H, Bu C, Wang J, Zhang Z, Zhang Z, Zeng J, Li J, Xiao J, Pan S, Kang H, Liu X, Lin S, Yuan N, Zhang Z, Bao Y, Jia P, Zheng X, Zong W, Li Z, Sun Y, Ma Y, Xiong Z, Wu S, Yang F, Zhao W, Bu C, Du Z, Xiao J, Bao Y, Chen X, Chen T, Zhang S, Sun Y, Yu C, Tang B, Zhu J, Dong L, Zhai S, Sun Y, Chen Q, Yang X, Zhang X, Sang Z, Wang Y, Zhao Y, Chen H, Lan L, Wang Y, Zhao W, Wang A, Yu C, Wang Y, Zhang S, Ma Y, Jia Y, Zhao X, Chen M, Li C, Tian D, Tang B, Pan Y, Dong L, Liu X, Song S, Liu X, Tian D, Li C, Tang B, Wang Z, Zhang R, Pan Y, Wang Y, Zou D, Song S, Li C, Zou D, Ma L, Gong Z, Zhu J, Teng X, Li L, Li N, Cui Y, Duan G, Zhang M, Jin T, Kang H, Wang Z, Wu G, Huang T, Zhao W, Jin E, Zhang T, Zhang Z, Zhao W, Xue Y, Bao Y, Song S, Xu T, Zou D, Chen M, Niu G, Pan R, Zhu T, Chu Y, Hao L, Sang J, Pan R, Zou D, Zhang Y, Wang Z, Chen M, Zhang Y, Xu T, Yao Q, Zhu T, Niu G, Hao L, Xiong Z, Yang F, Wang G, Li R, Zong W, Zhang M, Zou D, Zhao W, Wang G, Yang F, Wu S, Zhang X, Guo X, Ma Y, Xiong Z, Li R, Li Z, Liu L, Feng C, Qin Y, Xiao J, Ma L, Jing W, Luo S, Li Z, Ma L, Jiang S, Qian Q, Zhu T, Zong W, Shang Y, Jin T, Zhang Y, Chen M, Wu Z, Chu Y, Zhang R, Luo S, Jing W, Zou D, Bao Y, Xiao J, Zhang Z, Zou D, Liu L, Qin Y, Luo S, Jing W, Li Q, Liu P, Sun Y, Ma L, Jiang S, Fan Z, Zhao W, Xiao J, Bao Y, Zhang Z, Shen WK, Guo AY, Zuo Z, Ren J, Zhang X, Xiao Y, Li X, Zhang X, Xiao Y, Li X, Liu D, Zhang C, Xue Y, Zhao Z, Jiang T, Wu W, Zhao F, Meng X, Chen M, Gou Y, Chen M, Xue Y, Peng D, Xue Y, Luo H, Gao F, Ning W, Xue Y, Liu W, Ling Y, Cao R, Zhang G, Wei Y, Xue Y, Liu CJ, Guo AY, Xie GY, Guo AY, Yuan H, Su T, Zhang YE, Zhou C, Wang P, Zhang G, Zhou Y, Chen M, Guo G, Zhang Q, Guo AY, Fu S, Tan X, Xue Y, Tang D, Xue Y, Zhang W, Xue Y, Luo M, Guo AY, Xie Y, Ren J, Miao YR, Guo AY, Zhou Y, Chen M, Guo G, Huang X, Feng Z, Xue Y, Liu CJ, Guo AY, Liao X, Gao X, Wang J, Xie G, Guo AY, Yuan C, Chen M, Yang D, Tian F, Gao G, Wu W, Chen M, Han C, Xue Y, Cui Q, Xiao C, Li CY, Luo X, Ren J, Zhang X, Xiao Y, Li X, Tang Q, Guo AY, Luo H, Gao F, Xue Y, Bao Y, Zhang Z, Zhao W, Xiao J, He S, Zhang G, Li Y, Zhao G, Chen R. Database Resources of the National Genomics Data Center, China National Center for Bioinformation in 2023. Nucleic Acids Res 2023; 51:D18-D28. [PMID: 36420893 PMCID: PMC9825504 DOI: 10.1093/nar/gkac1073] [Show More Authors] [Citation(s) in RCA: 97] [Impact Index Per Article: 48.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2022] [Revised: 10/14/2022] [Accepted: 10/27/2022] [Indexed: 11/27/2022] Open
Abstract
The National Genomics Data Center (NGDC), part of the China National Center for Bioinformation (CNCB), provides a family of database resources to support global academic and industrial communities. With the explosive accumulation of multi-omics data generated at an unprecedented rate, CNCB-NGDC constantly expands and updates core database resources by big data archive, integrative analysis and value-added curation. In the past year, efforts have been devoted to integrating multiple omics data, synthesizing the growing knowledge, developing new resources and upgrading a set of major resources. Particularly, several database resources are newly developed for infectious diseases and microbiology (MPoxVR, KGCoV, ProPan), cancer-trait association (ASCancer Atlas, TWAS Atlas, Brain Catalog, CCAS) as well as tropical plants (TCOD). Importantly, given the global health threat caused by monkeypox virus and SARS-CoV-2, CNCB-NGDC has newly constructed the monkeypox virus resource, along with frequent updates of SARS-CoV-2 genome sequences, variants as well as haplotypes. All the resources and services are publicly accessible at https://ngdc.cncb.ac.cn.
Collapse
|
10
|
Zeng Q, Chen M, Wang S, Xu X, Li T, Xiang Z, He N. Comparative and phylogenetic analyses of the chloroplast genome reveal the taxonomy of the Morus genus. FRONTIERS IN PLANT SCIENCE 2022; 13:1047592. [PMID: 36507423 PMCID: PMC9729782 DOI: 10.3389/fpls.2022.1047592] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/18/2022] [Accepted: 10/24/2022] [Indexed: 06/17/2023]
Abstract
Mulberry (genus Morus) is an economically important woody plant with an altered ploidy level. The variable number of Morus species recognized by different studies indicates that the genus is in need of revision. In this study, the chloroplast (CP) genomes of 123 Morus varieties were de novo assembled and systematically analyzed. The 123 varieties represented six Morus species, namely, Morus alba, Morus nigra, Morus notabilis, Morus rubra, Morus celtidifolia, and Morus serrata. The Morus CP genome was found to be 158,969~159,548 bp in size with 125 genes, including 81 protein coding, 36 tRNA, and 8 rRNA genes. The 87 out of 123 mulberry accessions were assigned to 14 diverse groups with identical CP genome, which indicated that they are maternally inherited and share 14 common ancestors. Then 50 diverse CP genomes occurred in 123 mulberry accessions for further study. The CP genomes of the Morus genus with a quadripartite structure have two inverted repeat (IR) regions (25,654~25,702 bp) dividing the circular genome into a large single-copy (LSC) region (87,873~88,243 bp) and small single-copy (SSC) region (19,740~19,994 bp). Analysis of the phylogenetic tree constructed using the complete CP genome sequences of Morus revealed a monophyletic genus and that M. alba consisted of two clades, M. alba var. alba and M. alba var. multicaulis. The Japanese cultivated germplasms were derived from M. alba var. multicaulis. We propose that the Morus genus be classified into six species, M. nigra, M. notabilis, M. serrata, M. celtidifolia, M. rubra, and M. alba with two subspecies, M. alba var. alba and M. alba var. multicaulis. Our findings provide a valuable resource for the classification, domestication, and breeding improvement of mulberry.
Collapse
|