1
Zhang K, Qu G, Zhang Y, Liu J. Assembly and comparative analysis of the first complete mitochondrial genome of Astragalus membranaceus (Fisch.) Bunge: an invaluable traditional Chinese medicine. BMC PLANT BIOLOGY 2024; 24:1055. [PMID: 39511474 PMCID: PMC11546474 DOI: 10.1186/s12870-024-05780-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/14/2024] [Accepted: 11/04/2024] [Indexed: 11/15/2024]
Abstract
BACKGROUND Astragalus membranaceus (Fisch.) Bunge is one of the best-known tonic herbs in traditional Chinese medicine, renowned for its remarkable medicinal value in various clinical contexts. Its chloroplast (cp) and nuclear genomes have already been sequenced, providing valuable information for breeding and phylogenetic studies. However, the mitochondrial genome (mitogenome) of A. membranaceus remains unexplored, which hinders a comprehensive understanding of the evolution of its genome. RESULTS In this study, we de novo assembled the mitogenome of A. membranaceus (Fisch.) Bunge var. mongholicus (Bunge) P. K. Hsiao using a strategy integrating Illumina and Nanopore sequencing technology and subsequently performed comparative analysis with its close relatives. The mitogenome has a multi-chromosome structure, consisting of two circular chromosomes with a total length of 398,048 bp and an overall GC content of 45.3%. It encodes 54 annotated functional genes, comprising 33 protein-coding genes (PCGs), 18 tRNA genes, and 3 rRNA genes. An investigation of codon usage in the PCGs revealed a clear preference for codons ending in A or U (T). RNA editing analysis identified 500 sites in the coding regions of mt PCGs, all of which are C-to-U conversions, a process that tends to convert hydrophilic amino acids into hydrophobic ones. In total, 399 SSRs, 4 tandem repeats, and 77 dispersed repeats were found in the mitogenome, indicating that A. membranaceus possesses fewer repeats than its close relatives with similarly sized mitogenomes. Selection pressure analysis indicated that most mt PCGs were under purifying selection, while only five PCGs (ccmB, ccmFc, ccmFn, nad3, and nad9) were under positive selection.
Notably, positive selection emerged as a critical factor in the evolution of ccmB and nad9 in all the pairwise species comparisons, suggesting the extremely critical role of these genes in the evolution of A. membranaceus. Moreover, we inferred that 22 homologous fragments have been transferred from cp to mitochondria (mt), in which 5 cp-derived tRNA genes remain intact in the mitogenome. Further comparative analysis revealed that the syntenic region and mt gene organization are relatively conserved within the provided legumes. The comparison of gene content indicated that the gene composition of Fabaceae mitogenomes differed. Finally, the phylogenetic tree established from analysis is largely congruent with the taxonomic relationships of Fabaceae species and highlights the close relationship between Astragalus and Oxytropis. CONCLUSIONS We provide the first report of the assembled and annotated A. membranaceus mitogenome, which enriches the genetic resources available for the Astragalus genus and lays the foundation for comprehensive exploration of this invaluable medicinal plant.
Affiliation(s)
- Kun Zhang
  - College of Agriculture and Life Sciences, Shanxi Datong University, Datong, Shanxi, China.
  - Key Laboratory of Organic Dry Farming for Special Crops in Datong City, Datong, Shanxi, China.
- Gaoyang Qu
  - College of Horticulture, Shenyang Agricultural University, Shenyang, Liaoning, China
- Yue Zhang
  - College of Agriculture and Life Sciences, Shanxi Datong University, Datong, Shanxi, China
- Jianxia Liu
  - College of Agriculture and Life Sciences, Shanxi Datong University, Datong, Shanxi, China
  - Key Laboratory of Organic Dry Farming for Special Crops in Datong City, Datong, Shanxi, China
2
Ha YH, Chang KS, Gil HY. Characteristics of chloroplast and mitochondrial genomes and intracellular gene transfer in the Korean endemic shrub, Sophora koreensis Nakai (Fabaceae). Gene 2024; 894:147963. [PMID: 37926173 DOI: 10.1016/j.gene.2023.147963] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/22/2023] [Revised: 10/24/2023] [Accepted: 11/01/2023] [Indexed: 11/07/2023]
Abstract
Sophora koreensis Nakai, an endemic species distributed only in the Korean Peninsula, is of great geographical, economic, and taxonomic importance. Although its complete chloroplast (cp) genome sequence has been reported, its mitochondrial (mt) genome sequence has not yet been studied. Therefore, in this study, we aimed to investigate its mt genome sequence and compare it with those reported for other Fabaceae species. Total genomic DNA was extracted from fresh S. koreensis leaves collected from natural habitats in Gangwon-do Province, South Korea. This was followed by polymerase chain reaction (PCR) amplification of cpDNA insertions in the mt genome and the detection of microsatellites and dispersed repeats in the cp and mt genomes. Finally, the cp and mt genomes of S. koreensis were compared with those reported for other Fabaceae species. The cp sequence of S. koreensis showed the same gene order and content as previously reported; only six substitutions and one deletion were detected, with 99% homology. Conversely, the complete mt genome sequence, which was 517,845 bp in length and encoded 61 genes, including 43 protein-coding, 15 transfer RNA, and 3 ribosomal RNA genes, was considerably different from that of S. japonica in terms of gene order and composition. Further, the mt genome of S. koreensis included ca. 7 and 3 kb insertions, representing an intracellular gene transfer (IGT) event, and the regions with these insertions were determined to be originally present in the cp genome. This IGT event was also confirmed via PCR amplification. IGT events can be induced via biological gene expression control or the use of repetitive sequences, and they provide important insights into the evolutionary lineage of S. koreensis. However, further studies are needed to clarify the gene transfer mechanisms between the two organelles.
Affiliation(s)
- Young-Ho Ha
  - Division of Forest Biodiversity, Korea National Arboretum, Pocheon-si, Gyeonggi-do 11186, Republic of Korea
- Kae Sun Chang
  - DMZ Botanic Garden, Korea National Arboretum, Yanggu-gun, Gangwon-do 24564, Republic of Korea
- Hee-Young Gil
  - Division of Forest Biodiversity, Korea National Arboretum, Pocheon-si, Gyeonggi-do 11186, Republic of Korea.
3
Zhou P, Zhang Q, Li F, Huang J, Zhang M. Assembly and comparative analysis of the complete mitochondrial genome of Ilex metabaptista (Aquifoliaceae), a Chinese endemic species with a narrow distribution. BMC PLANT BIOLOGY 2023; 23:393. [PMID: 37580695 PMCID: PMC10424370 DOI: 10.1186/s12870-023-04377-7] [Citation(s) in RCA: 22] [Impact Index Per Article: 11.0] [Received: 05/07/2023] [Accepted: 07/12/2023] [Indexed: 08/16/2023]
Abstract
BACKGROUND Ilex metabaptista is a woody tree species with strong waterlogging tolerance and is also admired as a landscape plant with high development prospects and scientific research value. Unfortunately, populations of this species have declined due to habitat loss. Thus, it is a great challenge for us to efficiently protect I. metabaptista resources from extinction. Molecular biology research can provide the scientific basis for the conservation of species. However, the study of I. metabaptista genetics is still in its infancy. To date, no mitochondrial genome (mitogenome) in the genus Ilex has been analysed in detail. RESULTS The mitogenome of I. metabaptista was assembled based on the reads from Illumina and Nanopore sequencing platforms; it was a typical circular DNA molecule of 529,560 bp with a GC content of 45.61% and contained 67 genes, including 42 protein-coding genes, 22 tRNA genes, and 3 rRNA genes. Repeat sequence analysis and prediction of RNA editing sites revealed a total of 286 dispersed repeats, 140 simple repeats, 18 tandem repeats, and 543 RNA editing sites. Analysis of codon usage showed that codons ending in A/T were preferred. Gene migration was observed to occur between the mitogenome and chloroplast genome via the detection of homologous fragments. In addition, Ka/Ks analysis revealed that most of the protein-coding genes in the mitogenome had undergone negative selection, and only the ccmB gene had undergone potential positive selection in most asterids. Nucleotide polymorphism analysis revealed the variation in each gene, with atp9 being the most notable. Furthermore, comparative analysis showed that the GC contents were conserved, but the sizes and structure of mitogenomes varied greatly among asterids. Phylogenetic analysis based on the mitogenomes reflected the exact evolutionary and taxonomic status of I. metabaptista. CONCLUSION In this study, we sequenced and annotated the mitogenome of I. metabaptista and compared it with the mitogenomes of other asterids, which provided essential background information for further understanding of the genetics of this plant and helped lay the foundation for future studies on molecular breeding of I. metabaptista.
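The Ka/Ks result in this abstract rests on a conventional decision rule: a ratio below 1 is read as negative (purifying) selection, a ratio above 1 as positive selection, and a ratio near 1 as neutral evolution. A minimal Python sketch of that rule follows. The function name, the tolerance, and the numeric values in the usage note are illustrative assumptions and are not taken from the paper.

```python
def classify_selection(ka: float, ks: float, tol: float = 1e-9) -> str:
    """Classify a gene by the conventional reading of its Ka/Ks ratio.

    ka: nonsynonymous substitution rate; ks: synonymous substitution rate.
    The tolerance used to call a ratio "neutral" is an assumption.
    """
    if ks <= 0:
        # The ratio is undefined when no synonymous change is observed.
        return "undefined"
    ratio = ka / ks
    if abs(ratio - 1.0) <= tol:
        return "neutral"
    return "positive" if ratio > 1.0 else "purifying"
```

For example, with made-up rates, `classify_selection(0.02, 0.10)` falls in the purifying class and `classify_selection(0.30, 0.10)` in the positive class.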
Affiliation(s)
- Peng Zhou
  - Jiangsu Academy of Forestry, 109 Danyang Road, Dongshanqiao, Nanjing, 211153, China
- Qiang Zhang
  - Co-Innovation Center for Sustainable Forestry in Southern China, Key Laboratory of State Forestry and Grassland Administration on Subtropical Forest Biodiversity Conservation, College of Biology and the Environment, Nanjing Forestry University, 210037, Nanjing, China.
- Fei Li
  - Jiangsu Academy of Forestry, 109 Danyang Road, Dongshanqiao, Nanjing, 211153, China
- Jing Huang
  - Jiangsu Academy of Forestry, 109 Danyang Road, Dongshanqiao, Nanjing, 211153, China
- Min Zhang
  - Jiangsu Academy of Forestry, 109 Danyang Road, Dongshanqiao, Nanjing, 211153, China.
4
Edera AA, Howell KA, Nevill PG, Small I, Sanchez-Puerta MV. Evolution of cox2 introns in angiosperm mitochondria and efficient splicing of an elongated cox2i691 intron. Gene 2023; 869:147393. [PMID: 36966978 DOI: 10.1016/j.gene.2023.147393] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/06/2023] [Revised: 03/08/2023] [Accepted: 03/21/2023] [Indexed: 04/03/2023]
Abstract
In angiosperms, the mitochondrial cox2 gene harbors up to two introns, commonly referred to as cox2i373 and cox2i691. We studied the cox2 gene from 222 fully sequenced mitogenomes from 30 angiosperm orders and analyzed the evolution of their introns. Unlike cox2i373, cox2i691 shows a distribution among plants that is shaped by frequent intron loss events driven by localized retroprocessing. In addition, cox2i691 exhibits sporadic elongations, frequently in domain IV of introns. Such elongations are poorly related to repeat content and two of them showed the presence of LINE transposons, suggesting that increasing intron size is very likely due to nuclear intracellular DNA transfer followed by incorporation into the mitochondrial DNA. Surprisingly, we found that cox2i691 is erroneously annotated as absent in 30 mitogenomes deposited in public databases. Although each of the cox2 introns is ∼1.5 kb in length, a cox2i691 of 4.2 kb has been reported in Acacia ligulata (Fabaceae). It is still unclear whether its unusual length is due to a trans-splicing arrangement or the loss of functionality of the interrupted cox2. By analyzing short-read RNA sequencing data from Acacia with a multi-step computational strategy, we found that the Acacia cox2 is functional and its long intron is spliced in cis in a very efficient manner despite its length.
Affiliation(s)
- Alejandro A Edera
  - Research Institute for Signals, Systems and Computational Intelligence, sinc(i), FICH-UNL, CONICET, Ciudad Universitaria UNL, 3000 Santa Fe, Argentina.
- Katharine A Howell
  - Australian Research Council Centre of Excellence in Plant Energy Biology, The University of Western Australia, Crawley, Western Australia, Australia
- Paul G Nevill
  - Botanic Gardens and Parks Authority, Kings Park and Botanic Garden, Fraser Avenue, Kings Park, Western Australia, Australia; School of Plant Biology, The University of Western Australia, Crawley, Western Australia, Australia
- Ian Small
  - Australian Research Council Centre of Excellence in Plant Energy Biology, The University of Western Australia, Crawley, Western Australia, Australia; Centre of Excellence in Computational Systems Biology, The University of Western Australia, Crawley, Western Australia, Australia
- M Virginia Sanchez-Puerta
  - IBAM, Universidad Nacional de Cuyo, CONICET, Facultad de Ciencias Agrarias, Almirante Brown 500, M5528AHB Chacras de Coria, Argentina; Facultad de Ciencias Exactas y Naturales, Universidad Nacional de Cuyo, 5500 Mendoza, Argentina
5
Munasinghe M, Ågren JA. When and why are mitochondria paternally inherited? Curr Opin Genet Dev 2023; 80:102053. [PMID: 37245242 DOI: 10.1016/j.gde.2023.102053] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Received: 02/27/2023] [Revised: 04/17/2023] [Accepted: 04/26/2023] [Indexed: 05/30/2023]
Abstract
In contrast with nuclear genes that are passed on through both parents, mitochondrial genes are maternally inherited in most species, most of the time. The genetic conflict stemming from this transmission asymmetry is well-documented, and there is an abundance of population-genetic theory associated with it. While occasional or aberrant paternal inheritance occurs, there are only a few cases where exclusive paternal inheritance of mitochondrial genomes is the evolved state. Why this is remains poorly understood. By examining commonalities between species with exclusive paternal inheritance, we discuss what they may tell us about the evolutionary forces influencing mitochondrial inheritance patterns. We end by discussing recent technological advances that make exploring the causes and consequences of paternal inheritance feasible.
Affiliation(s)
- Manisha Munasinghe
  - Department of Plant and Microbial Biology, University of Minnesota, St. Paul, MN, USA. https://twitter.com/@ManishaMuna
- J Arvid Ågren
  - Department of Evolutionary Biology, Uppsala University, Uppsala, Sweden; Lerner Research Institute, Cleveland Clinic Foundation, Cleveland, OH, USA.
6
Gao L, Xu W, Xin T, Song J. Application of third-generation sequencing to herbal genomics. FRONTIERS IN PLANT SCIENCE 2023; 14:1124536. [PMID: 36959935 PMCID: PMC10027759 DOI: 10.3389/fpls.2023.1124536] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Received: 12/15/2022] [Accepted: 02/02/2023] [Indexed: 06/18/2023]
Abstract
There is a long history of traditional medicine use. However, little genetic information is available for the plants used in traditional medicine, which limits the exploitation of these natural resources. Third-generation sequencing (TGS) techniques have made it possible to gather invaluable genetic information and develop herbal genomics. In this review, we introduce two main TGS techniques, PacBio SMRT technology and Oxford Nanopore technology, and compare the two techniques against Illumina, the predominant next-generation sequencing technique. In addition, we summarize the nuclear and organelle genome assemblies of commonly used medicinal plants, choose several examples from genomics, transcriptomics, and molecular identification studies to dissect the specific processes and summarize the advantages and disadvantages of the two TGS techniques when applied to medicinal organisms. Finally, we describe how we expect that TGS techniques will be widely utilized to assemble telomere-to-telomere (T2T) genomes and in epigenomics research involving medicinal plants.
7
Multichromosomal Mitochondrial Genome of Paphiopedilum micranthum: Compact and Fragmented Genome, and Rampant Intracellular Gene Transfer. Int J Mol Sci 2023; 24:ijms24043976. [PMID: 36835385 PMCID: PMC9966765 DOI: 10.3390/ijms24043976] [Citation(s) in RCA: 27] [Impact Index Per Article: 13.5] [Received: 01/26/2023] [Revised: 02/11/2023] [Accepted: 02/13/2023] [Indexed: 02/18/2023]
Abstract
Orchidaceae is one of the largest families of angiosperms. Considering the large number of species in this family and its symbiotic relationship with fungi, Orchidaceae provides an ideal model to study the evolution of plant mitogenomes. However, to date, only one draft mitochondrial genome is available for this family. Here, we present a fully assembled and annotated sequence of the mitochondrial genome (mitogenome) of Paphiopedilum micranthum, a species with high economic and ornamental value. The mitogenome of P. micranthum was 447,368 bp in length and comprised 26 circular subgenomes ranging in size from 5973 bp to 32,281 bp. The genome encoded 39 protein-coding genes of mitochondrial origin; 16 tRNAs (three of plastome origin); three rRNAs; and 16 ORFs, while rpl10 and sdh3 were lost from the mitogenome. Moreover, interorganellar DNA transfer was identified in 14 of the 26 chromosomes. These plastid-derived DNA fragments represented 28.32% (46,273 bp) of the P. micranthum plastome, including 12 intact genes of plastome origin. Remarkably, the mitogenomes of P. micranthum and Gastrodia elata shared 18% (about 81 kb) of their mitochondrial DNA sequences. Additionally, we found a positive correlation between repeat length and recombination frequency. The mitogenome of P. micranthum had more compact and fragmented chromosomes compared to other species with multichromosomal structures. We suggest that repeat-mediated homologous recombination enables the dynamic structure of mitochondrial genomes in Orchidaceae.
8
Wang X, Li LL, Xiao Y, Chen XY, Chen JH, Hu XS. A complete sequence of mitochondrial genome of Neolamarckia cadamba and its use for systematic analysis. Sci Rep 2021; 11:21452. [PMID: 34728739 PMCID: PMC8564537 DOI: 10.1038/s41598-021-01040-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 05/31/2021] [Accepted: 10/22/2021] [Indexed: 11/09/2022]
Abstract
Neolamarckia cadamba is an important tropical and subtropical tree for the timber industry in southern China and is also a medicinal plant because of the secondary product cadambine. N. cadamba belongs to the Rubiaceae family, and its taxonomic relationships with other species have not been fully evaluated based on genome sequences. Here, we report the complete sequence of the mitochondrial genome of N. cadamba, which is 414,980 bp in length and was successfully assembled into two circles (109,836 bp and 305,144 bp). The mtDNA harbors 83 genes in total, including 40 protein-coding genes (PCGs), 31 transfer RNA genes, 6 ribosomal RNA genes, and 6 other genes. The base composition of the whole genome is estimated as 27.26% for base A, 22.63% for C, 22.53% for G, and 27.56% for T, with an A + T content of 54.82% (54.45% in the small circle and 54.79% in the large circle). Repetitive sequences account for ~0.14% of the whole genome. A maximum likelihood (ML) tree based on DNA sequences of 24 PCGs supports the placement of N. cadamba in the order Gentianales. An ML tree based on the rps3 gene of 60 species in the family Rubiaceae shows that N. cadamba is more closely related to Cephalanthus occidentalis and Hymenodictyon parvifolium and belongs to the Cinchonoideae subfamily. The result indicates that N. cadamba is genetically distant from the species and genera of Rubiaceae in systematic position. As the first sequence of the mitochondrial genome of N. cadamba, it will provide a useful resource to investigate genetic variation and develop molecular markers for genetic breeding in the future.
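The base-composition figures in this abstract follow from simple counting: each base's share is its count divided by the number of unambiguous bases, and the A + T content is the sum of the A and T shares. A minimal Python sketch is given below. The function names are illustrative assumptions, and the only figures used are the four whole-genome percentages quoted in the abstract.

```python
from collections import Counter


def base_composition(seq: str) -> dict[str, float]:
    """Return the percentage of A, C, G and T among unambiguous bases."""
    counts = Counter(b for b in seq.upper() if b in "ACGT")
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no A/C/G/T bases")
    return {b: 100.0 * counts[b] / total for b in "ACGT"}


def at_content(comp: dict[str, float]) -> float:
    """A + T content, in percent, from a base-composition mapping."""
    return comp["A"] + comp["T"]


# Whole-genome percentages as quoted in the abstract above.
reported = {"A": 27.26, "C": 22.63, "G": 22.53, "T": 27.56}
print(round(at_content(reported), 2))  # 54.82
```

Applied to a real assembly, `base_composition` would take the concatenated sequence of both circles as a plain string; reading a FASTA file is left out to keep the sketch self-contained.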
Affiliation(s)
- Xi Wang
  - College of Forestry and Landscape Architecture, South China Agricultural University, Guangdong, 510642, China
  - Guangdong Key Laboratory for Innovative Development and Utilization of Forest Plant Germplasm, South China Agricultural University, Guangdong, 510642, China
- Ling-Ling Li
  - College of Forestry and Landscape Architecture, South China Agricultural University, Guangdong, 510642, China
  - Guangdong Key Laboratory for Innovative Development and Utilization of Forest Plant Germplasm, South China Agricultural University, Guangdong, 510642, China
- Yu Xiao
  - College of Forestry and Landscape Architecture, South China Agricultural University, Guangdong, 510642, China
  - Guangdong Key Laboratory for Innovative Development and Utilization of Forest Plant Germplasm, South China Agricultural University, Guangdong, 510642, China
- Xiao-Yang Chen
  - College of Forestry and Landscape Architecture, South China Agricultural University, Guangdong, 510642, China
  - Guangdong Key Laboratory for Innovative Development and Utilization of Forest Plant Germplasm, South China Agricultural University, Guangdong, 510642, China
- Jie-Hu Chen
  - Science Corporation of Gene (SCGene), Guangzhou, 510000, China
- Xin-Sheng Hu
  - College of Forestry and Landscape Architecture, South China Agricultural University, Guangdong, 510642, China
  - Guangdong Key Laboratory for Innovative Development and Utilization of Forest Plant Germplasm, South China Agricultural University, Guangdong, 510642, China
9
Shidhi PR, Biju VC, Anu S, Vipin CL, Deelip KR, Achuthsankar SN. Genome Characterization, Comparison and Phylogenetic Analysis of Complete Mitochondrial Genome of Evolvulus alsinoides Reveals Highly Rearranged Gene Order in Solanales. Life (Basel) 2021; 11:769. [PMID: 34440513 PMCID: PMC8398076 DOI: 10.3390/life11080769] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Received: 06/13/2021] [Revised: 07/25/2021] [Accepted: 07/27/2021] [Indexed: 11/23/2022]
Abstract
Mitogenome sequencing provides an understanding of the evolutionary mechanism of mitogenome formation, mechanisms driving plant gene order, genome structure, and migration sequences. Data on the mitochondrial genome for family Convolvulaceae members is lacking. E. alsinoides, also known as shankhpushpi, is an important medicinal plant in the family Convolvulaceae, widely used in the Ayurvedic system of medicine. In the present study, we sequenced the mitogenome of E. alsinoides using the Illumina mate-pair sequencing platform and annotated it using bioinformatics approaches. The mitogenome of E. alsinoides was 344,184 bp in length and comprised 46 unique coding genes, including 31 protein-coding genes (PCGs), 12 tRNA genes, and 3 rRNA genes. The secondary structure of tRNAs shows that all the tRNAs can be folded into canonical clover-leaf secondary structures, except three: trnW, trnG, and trnC. Measurement of the skewness of the nucleotide composition showed that the AT and GC skews are positive, indicating more A than T and more G than C in the mitogenome of E. alsinoides. The Ka/Ks ratios of 11 protein-coding genes (atp1, ccmC, cob, cox1, rps19, rps12, nad3, nad9, atp9, rpl5, nad4L) were <1, indicating that these genes were under purifying selection. Synteny and gene order analysis were performed to identify homologous genes among the related species. Synteny blocks representing nine genes (nad9, nad2, ccmFc, nad1, nad4, nad5, matR, cox1, nad7) were observed in all the species of Solanales. Gene order comparison showed that a high level of gene rearrangement has occurred among all the species of Solanales. The mitogenome data obtained in the present study could be used as the Convolvulaceae family representative for future studies, as there is no complex taxonomic history associated with this plant.
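The skew measurement mentioned in this abstract is usually computed with the standard definitions AT skew = (A − T)/(A + T) and GC skew = (G − C)/(G + C), so a positive value means an excess of A over T or of G over C. A minimal Python sketch under that assumption follows. The function name is hypothetical, and the abstract does not state which tool or formula the authors actually used.

```python
def nucleotide_skews(seq: str) -> tuple[float, float]:
    """Return (AT skew, GC skew) for a nucleotide sequence.

    Uses the standard definitions (A - T)/(A + T) and (G - C)/(G + C);
    characters other than A, C, G and T are ignored.
    """
    seq = seq.upper()
    a, t, g, c = (seq.count(base) for base in "ATGC")
    if a + t == 0 or g + c == 0:
        raise ValueError("skew is undefined without both A/T and G/C bases")
    return (a - t) / (a + t), (g - c) / (g + c)
```

On the toy string `"AAATGGGC"`, which has three A, one T, three G and one C, both skews come out as 0.5.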
Affiliation(s)
- Pattayampadam Ramakrishnan Shidhi
  - Department of Computational Biology and Bioinformatics, University of Kerala, Thiruvananthapuram 695581, India
- Vadakkemukadiyil Chellappan Biju
  - Department of Computational Biology and Bioinformatics, University of Kerala, Thiruvananthapuram 695581, India
- Sasi Anu
  - Department of Computational Biology and Bioinformatics, University of Kerala, Thiruvananthapuram 695581, India
- Chandrasekharan Laila Vipin
  - Department of Computational Biology and Bioinformatics, University of Kerala, Thiruvananthapuram 695581, India
- Kumar Raveendran Deelip
  - Campus Computing Facility (CCF) at the Central Laboratory for Instrumentation and Facilitation, University of Kerala, Thiruvananthapuram 695581, India
- Sukumaran Nair Achuthsankar
  - Department of Computational Biology and Bioinformatics, University of Kerala, Thiruvananthapuram 695581, India
10
Choi IS, Wojciechowski MF, Ruhlman TA, Jansen RK. In and out: Evolution of viral sequences in the mitochondrial genomes of legumes (Fabaceae). Mol Phylogenet Evol 2021; 163:107236. [PMID: 34147655 DOI: 10.1016/j.ympev.2021.107236] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Received: 02/11/2021] [Revised: 06/11/2021] [Accepted: 06/14/2021] [Indexed: 10/21/2022]
Abstract
Plant-specific mitoviruses in the 'genus' Mitovirus (Narnaviridae) and their integrated sequences (non-retroviral endogenous RNA viral elements or NERVEs) have been recently identified in various plant lineages. However, the sparse phylogenetic coverage of complete plant mitochondrial genome (mitogenome) sequences and the non-conserved nature of mitochondrial intergenic regions have hindered comparative studies on mitovirus NERVEs in plants. In this study, 10 new mitogenomes were sequenced from legumes (Fabaceae). Based on comparative genomic analysis of 27 total mitogenomes, we identified mitovirus NERVEs and transposable elements across the family. All legume mitogenomes included NERVEs and total NERVE length varied from ca. 2 kb in the papilionoid Trifolium to 35 kb in the mimosoid Acacia. Most of the NERVE integration sites were in highly variable intergenic regions; however, some were positioned in six cis-spliced mitochondrial introns. In the Acacia mitogenome, there were L1-like transposon sequences including an almost full-length copy with target site duplications (TSDs). The integration sites of NERVEs in four introns showed evidence of L1-like retrotransposition events. Phylogenetic analysis revealed that there were multiple instances of precise deletion of NERVEs between TSDs. This study provides clear evidence that an L1-like retrotransposition mechanism has a long history of contributing to the integration of viral RNA into plant mitogenomes while microhomology-mediated deletion can restore the integration site.
Affiliation(s)
- In-Su Choi
  - Department of Integrative Biology, University of Texas at Austin, Austin, TX 78712, USA; School of Life Sciences, Arizona State University, Tempe, AZ 85287, USA.
- Tracey A Ruhlman
  - Department of Integrative Biology, University of Texas at Austin, Austin, TX 78712, USA.
- Robert K Jansen
  - Department of Integrative Biology, University of Texas at Austin, Austin, TX 78712, USA; Centre of Excellence in Bionanoscience Research, Department of Biological Sciences, Faculty of Science, King Abdulaziz University, Jeddah 21589, Saudi Arabia.
11
Masutani B, Arimura SI, Morishita S. Investigating the mitochondrial genomic landscape of Arabidopsis thaliana by long-read sequencing. PLoS Comput Biol 2021; 17:e1008597. [PMID: 33434206 PMCID: PMC7833223 DOI: 10.1371/journal.pcbi.1008597] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Received: 08/23/2020] [Revised: 01/25/2021] [Accepted: 12/01/2020] [Indexed: 11/18/2022]
Abstract
Plant mitochondrial genomes have distinctive features compared to those of animals; namely, they are large and divergent, with sizes ranging from hundreds of thousands to a few million bases. Recombination among repetitive regions is thought to produce similar structures that differ slightly, known as "multipartite structures," which contribute to different phenotypes. Although many reference plant mitochondrial genomes represent almost all the genes in mitochondria, the full spectrum of their structures remains largely unknown. The emergence of long-read sequencing technology is expected to yield this landscape; however, many studies aimed to assemble only one representative circular genome, because properly understanding multipartite structures using existing assemblers is not feasible. To elucidate multipartite structures, we leveraged the information in existing reference genomes and classified long reads according to their corresponding structures. We developed a method that exploits two classic algorithms, partial order alignment (POA) and the hidden Markov model (HMM), to construct a sensitive read classifier. This method enables us to represent a set of reads as a POA graph and analyze it using the HMM. We can then calculate the likelihood of a read occurring in a given cluster, resulting in an iterative clustering algorithm. For synthetic data, our proposed method reliably detected one variation site out of 9,000-bp synthetic long reads with a 15% sequencing-error rate and produced accurate clustering. It was also capable of clustering long reads from six very similar sequences containing only slight differences. For real data, we assembled putative multipartite structures of mitochondrial genomes of Arabidopsis thaliana from nine accessions sequenced using PacBio Sequel. The results indicated that there are recurrent and strain-specific structures in A. thaliana mitochondrial genomes.
Affiliation(s)
- Bansho Masutani
  - Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Chiba, Japan
- Shin-ichi Arimura
  - Laboratory of Plant Molecular Genetics, Graduate School of Agricultural and Life Sciences, The University of Tokyo, Tokyo, Japan
- Shinichi Morishita
  - Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Chiba, Japan
12
He W, Chen C, Adedze YMN, Dong X, Xi K, Sun Y, Dang T, Jin D. Multicentric origin and diversification of atp6-orf79-like structures reveal mitochondrial gene flows in Oryza rufipogon and Oryza sativa. Evol Appl 2020; 13:2284-2299. [PMID: 33005224 PMCID: PMC7513716 DOI: 10.1111/eva.13022] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Received: 02/09/2020] [Revised: 04/26/2020] [Accepted: 05/13/2020] [Indexed: 11/27/2022]
Abstract
Cytoplasmic male sterility (CMS) is a widely used genetic tool in modern hybrid rice breeding. Most genes conferring rice gametophytic CMS are homologous to orf79 and are co-transcribed with atp6. However, the origin, differentiation and flow of these mitochondrial genes in wild and cultivated rice species remain unclear. In this study, we performed de novo assembly of the mitochondrial genomes of 221 common wild rice (Oryza rufipogon Griff.) and 369 Asian cultivated rice (Oryza sativa L.) accessions, and identified 16 haplotypes of atp6-orf79-like structures and 11 orf79 alleles. These homologous structures were classified into 4 distinct groups (AO-I, AO-II, AO-III and AO-IV). All four groups were observed in O. rufipogon, but only AO-I was detected in O. sativa, causing a decrease in the frequency of atp6-orf79-like structures from 19.9% to 8.1%. Phylogenetic and biogeographic analyses revealed that the different groups of these gametophytic CMS-related genes in O. rufipogon evolved in a multicentric pattern. The geographical origin of the atp6-orf79-like structures was further traced back, and a candidate region in the north-east of the Gangetic Plain on the Indian Peninsula (South Asia) was identified as the origin centre of AO-I. The orf79 alleles were detected in all three cytoplasmic types (Or-CT0, Or-CT1 and Or-CT2) of O. rufipogon, but only two alleles (orf79a and orf79b) were observed in the Or-CT0 type of O. sativa, while no orf79 allele was found in other types of O. sativa. Our results also revealed that the orf79 alleles in cultivated rice originated from the wild rice population in South and South-East Asia. In addition, strong positive selection pressure was detected on the sequence variations of orf79 alleles, and a special evolutionary strategy was noted in these gametophytic CMS-related genes, suggesting that their divergence could be beneficial to their survival in evolution.
Affiliation(s)
- Wenchuang He
  - MOA Key Laboratory of Crop Ecophysiology and Farming System in the Middle Reaches of the Yangtze River, College of Plant Science and Technology, Huazhong Agricultural University, Wuhan, China
- Caijin Chen
  - MOA Key Laboratory of Crop Ecophysiology and Farming System in the Middle Reaches of the Yangtze River, College of Plant Science and Technology, Huazhong Agricultural University, Wuhan, China
  - Institute of Biological and Environmental Sciences, University of Aberdeen, Aberdeen, UK
- Xilong Dong
  - MOA Key Laboratory of Crop Ecophysiology and Farming System in the Middle Reaches of the Yangtze River, College of Plant Science and Technology, Huazhong Agricultural University, Wuhan, China
- Kun Xi
  - MOA Key Laboratory of Crop Ecophysiology and Farming System in the Middle Reaches of the Yangtze River, College of Plant Science and Technology, Huazhong Agricultural University, Wuhan, China
- Yongsheng Sun
  - MOA Key Laboratory of Crop Ecophysiology and Farming System in the Middle Reaches of the Yangtze River, College of Plant Science and Technology, Huazhong Agricultural University, Wuhan, China
- Tengfei Dang
  - MOA Key Laboratory of Crop Ecophysiology and Farming System in the Middle Reaches of the Yangtze River, College of Plant Science and Technology, Huazhong Agricultural University, Wuhan, China
- Deming Jin
  - MOA Key Laboratory of Crop Ecophysiology and Farming System in the Middle Reaches of the Yangtze River, College of Plant Science and Technology, Huazhong Agricultural University, Wuhan, China
13
The draft mitochondrial genome of Magnolia biondii and mitochondrial phylogenomics of angiosperms. PLoS One 2020; 15:e0231020. [PMID: 32294100 PMCID: PMC7159230 DOI: 10.1371/journal.pone.0231020] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Received: 11/18/2019] [Accepted: 03/13/2020] [Indexed: 12/15/2022]
Abstract
The mitochondrial genomes of flowering plants are well known for their large size, variable coding-gene set and fluid genome structure. The available mitochondrial genomes of the early angiosperms show extreme genetic diversity in genome size, structure, and sequences, such as rampant HGTs in the Amborella mt genome, numerous repeated sequences in the Nymphaea mt genome, and conserved gene evolution in the Liriodendron mt genome. However, currently available early angiosperm mt genomes are still limited, preventing us from obtaining an overall picture of mitogenomic evolution in angiosperms. Here we sequenced and assembled the draft mitochondrial genome of Magnolia biondii Pamp. from Magnoliaceae (magnoliids) using Oxford Nanopore sequencing technology. We recovered a single linear mitochondrial contig of 967,100 bp with an average read coverage of 122× and a GC content of 46.6%. This draft mitochondrial genome contains a rich 64-gene set, similar to those of Liriodendron and Nymphaea, including 41 protein-coding genes, 20 tRNAs, and 3 rRNAs. Twenty cis-spliced and five trans-spliced introns break ten protein-coding genes in the Magnolia mt genome. Repeated sequences account for 27% of the draft genome, with 17 out of the 1,145 repeats showing recombination evidence. Although partially assembled, the approximately 1-Mb mt genome of Magnolia is still among the largest in angiosperms, which is possibly due to the expansion of repeated sequences, retention of ancestral mtDNAs, and the incorporation of nuclear genome sequences. Mitochondrial phylogenomic analysis of the concatenated datasets of 38 conserved protein-coding genes from 91 representatives of angiosperm species supports the sister relationship of magnoliids with monocots and eudicots, which is congruent with plastid evidence.
14
Methods and Tools for Plant Organelle Genome Sequencing, Assembly, and Downstream Analysis. Methods Mol Biol 2020; 2107:49-98. [PMID: 31893443 DOI: 10.1007/978-1-0716-0235-5_4] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Indexed: 01/03/2023]
Abstract
Organelles play an important role in a eukaryotic cell. Two of them, chloroplasts and mitochondria, are responsible for the critical functions of photosynthesis and aerobic respiration, respectively. Organellar genomes are also very important for plant systematic studies. Here we describe the methods for isolating mitochondrial and plastid DNA and subsequently sequencing it with NGS technology. We also discuss in detail the various tools available for assembly, annotation, and visualization of organelle genome sequences.
15
Martins G, Balbino E, Marques A, Almeida C. Complete mitochondrial genomes of the Spondias tuberosa Arr. Cam and Spondias mombin L. reveal highly repetitive DNA sequences. Gene 2019; 720:144026. [DOI: 10.1016/j.gene.2019.144026] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Received: 12/19/2018] [Revised: 07/09/2019] [Accepted: 07/29/2019] [Indexed: 11/30/2022]
16
Choi IS, Schwarz EN, Ruhlman TA, Khiyami MA, Sabir JSM, Hajarah NH, Sabir MJ, Rabah SO, Jansen RK. Fluctuations in Fabaceae mitochondrial genome size and content are both ancient and recent. BMC PLANT BIOLOGY 2019; 19:448. [PMID: 31653201 PMCID: PMC6814987 DOI: 10.1186/s12870-019-2064-8] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.8] [Received: 06/18/2019] [Accepted: 10/02/2019] [Indexed: 05/12/2023]
Abstract
BACKGROUND Organelle genome studies of Fabaceae, an economically and ecologically important plant family, have been biased towards the plastid genome (plastome). Thus far, fewer than 15 mitochondrial genome (mitogenome) sequences of Fabaceae have been published, all but four of which belong to the subfamily Papilionoideae, limiting the understanding of size variation and content across the family. To address this, four mitogenomes were sequenced and assembled from three different subfamilies (Cercidoideae, Detarioideae and Caesalpinioideae). RESULTS Phylogenetic analysis based on shared mitochondrial protein coding regions produced a fully resolved and well-supported phylogeny that was completely congruent with the plastome tree. Comparative analyses suggest that two kinds of mitogenome expansions have occurred in Fabaceae. Size expansion of four genera (Tamarindus, Libidibia, Haematoxylum, and Leucaena) in two subfamilies (Detarioideae and Caesalpinioideae) occurred in relatively deep nodes, and was mainly caused by intercellular gene transfer and/or interspecific horizontal gene transfer (HGT). The second, more recent expansion occurred in the Papilionoideae as a result of duplication of native mitochondrial sequences. Family-wide gene content analysis revealed 11 gene losses, four (rps2, 7, 11 and 13) of which occurred in the ancestor of Fabaceae. Losses of the remaining seven genes (cox2, rpl2, rpl10, rps1, rps19, sdh3, sdh4) were restricted to specific lineages or occurred independently in different clades. Introns of three genes (cox2, ccmFc and rps10) showed extensive lineage-specific length variation due to large sequence insertions and deletions. Shared DNA analysis among Fabaceae mitogenomes demonstrated a substantial decay of intergenic spacers and provided further insight into HGT between the mimosoid clade of Caesalpinioideae and the holoparasitic Lophophytum (Balanophoraceae).
CONCLUSION This study represents the most exhaustive analysis of Fabaceae mitogenomes so far, and extends the understanding of the dynamic variation in size and gene/intron content. The four newly sequenced mitogenomes reported here expand the phylogenetic coverage to four subfamilies. The family has experienced multiple mitogenome size fluctuations in both ancient and recent times. The causes of these size variations are distinct in different lineages. Fabaceae mitogenomes experienced extensive size fluctuation by recruitment of exogenous DNA and duplication of native mitochondrial DNA.
Affiliation(s)
- In-Su Choi
  - Department of Integrative Biology, University of Texas at Austin, Austin, TX 78712 USA
- Erika N. Schwarz
  - Department of Biological Sciences, St. Edward’s University, Austin, TX 78704 USA
- Tracey A. Ruhlman
  - Department of Integrative Biology, University of Texas at Austin, Austin, TX 78712 USA
- Mohammad A. Khiyami
  - King Abdulaziz City for Science and Technology (KACST), Riyadh, 11442 Saudi Arabia
- Jamal S. M. Sabir
  - Centre of Excellence in Bionanoscience Research, Department of Biological Sciences, Faculty of Science, King Abdulaziz University, Jeddah, 21589 Saudi Arabia
- Nahid H. Hajarah
  - Centre of Excellence in Bionanoscience Research, Department of Biological Sciences, Faculty of Science, King Abdulaziz University, Jeddah, 21589 Saudi Arabia
- Mernan J. Sabir
  - Centre of Excellence in Bionanoscience Research, Department of Biological Sciences, Faculty of Science, King Abdulaziz University, Jeddah, 21589 Saudi Arabia
- Samar O. Rabah
  - Department of Biological Sciences, Faculty of Science, King Abdulaziz University, Jeddah, 21589 Saudi Arabia
- Robert K. Jansen
  - Department of Integrative Biology, University of Texas at Austin, Austin, TX 78712 USA
  - Centre of Excellence in Bionanoscience Research, Department of Biological Sciences, Faculty of Science, King Abdulaziz University, Jeddah, 21589 Saudi Arabia
17
Rawal HC, Kumar PM, Bera B, Singh NK, Mondal TK. Decoding and analysis of organelle genomes of Indian tea (Camellia assamica) for phylogenetic confirmation. Genomics 2019; 112:659-668. [PMID: 31029862 DOI: 10.1016/j.ygeno.2019.04.018] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Received: 11/20/2018] [Revised: 03/03/2019] [Accepted: 04/24/2019] [Indexed: 01/16/2023]
Abstract
The NCBI database has >15 chloroplast (cp) genome sequences available for different Camellia species but none for C. assamica. There is no report of any mitochondrial (mt) genome in the Camellia genus or the Theaceae family. In the strong belief that these organelle genomes can serve as a powerful tool for taxonomic and phylogenetic analysis, we assembled and analyzed the cp and mt genomes of C. assamica. We assembled the complete mt genome of C. assamica in a single circular contig 707,441 bp in length, comprising a total of 66 annotated genes, including 35 protein-coding genes, 29 tRNAs and two rRNAs. The first ever cp genome of C. assamica resulted in a circular contig 157,353 bp in length with a typical quadripartite structure. Phylogenetic analysis based on these organelle genomes showed that C. assamica was closely related to C. sinensis and C. leptophylla. It also supports Caryophyllales as Superasterids.
Affiliation(s)
- Hukam C Rawal
  - ICAR-National Research Centre on Plant Biotechnology, Pusa, New Delhi 110012, India
- P Mohan Kumar
  - Tea Board, Ministry of Commerce and Industry, Govt. of India, 14, B.T.M. Sarani, Kolkata 700 001, India
- Biswajit Bera
  - Tea Board, Ministry of Commerce and Industry, Govt. of India, 14, B.T.M. Sarani, Kolkata 700 001, India
- Nagendra Kumar Singh
  - ICAR-National Research Centre on Plant Biotechnology, Pusa, New Delhi 110012, India
- Tapan Kumar Mondal
  - ICAR-National Research Centre on Plant Biotechnology, Pusa, New Delhi 110012, India
18
Genome-scale transfer of mitochondrial DNA from legume hosts to the holoparasite Lophophytum mirabile (Balanophoraceae). Mol Phylogenet Evol 2019; 132:243-250. [DOI: 10.1016/j.ympev.2018.12.006] [Citation(s) in RCA: 31] [Impact Index Per Article: 5.2] [Received: 09/08/2018] [Revised: 12/06/2018] [Accepted: 12/06/2018] [Indexed: 11/23/2022]