1
|
Melia T, Fatayat, Wahibah NN, Fatonah S, Roslim DI, Adnan A. Genome-wide DNA polymorphisms in two peatland adapted Coffea liberica varieties. BMC Genom Data 2025; 26:11. [PMID: 39953379 PMCID: PMC11829567 DOI: 10.1186/s12863-025-01305-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2024] [Accepted: 02/04/2025] [Indexed: 02/17/2025] Open
Abstract
OBJECTIVES Coffea liberica is one of the species within the Coffea genus known for its distinctive flavor and resistance to leaf rust disease. Through breeding approaches, two superior varieties of C. liberica, designated as Liberoid Meranti 1 (Lim 1) and Liberoid Meranti 2 (Lim 2), were introduced in 2015. These varieties are known for their high adaptability in peatlands. The genetic basis of plant adaptability to peatlands remains largely unknown. It is therefore essential to identify genome-wide DNA polymorphisms in Lim 1 and 2 in order to gain insights into its capacity for adaptation in peatlands. DATA DESCRIPTION Whole genome sequencing was performed on three plants from each variety (Lim 1 and 2), resulting in 430 million sequencing reads. The mean depth of sequencing for each sample was 36.90x. The reads were mapped to the Coffea canephora genome, with an average mapping rate of 96.34%. The sequencing data revealed the presence of 3,766,805 single-nucleotide polymorphisms (SNPs) and 1,123,683 insertion-deletions (indels) in all six plants. Among the SNPs, there was a notable prevalence of transitions, with a ratio of approximately twofold compared to transversions. The generated data offers invaluable genomic resources for marker development, with significant implications for understanding peatlands adaptability.
Collapse
Affiliation(s)
- Tisha Melia
- Computer Science Department, Faculty of Mathematics and Natural Sciences, Universitas Riau, Pekanbaru, Riau, Indonesia.
| | - Fatayat
- Computer Science Department, Faculty of Mathematics and Natural Sciences, Universitas Riau, Pekanbaru, Riau, Indonesia
| | - Ninik Nihayatul Wahibah
- Biology Department, Faculty of Mathematics and Natural Sciences, Universitas Riau, Pekanbaru, Riau, Indonesia
| | - Siti Fatonah
- Biology Department, Faculty of Mathematics and Natural Sciences, Universitas Riau, Pekanbaru, Riau, Indonesia
| | - Dewi Indriyani Roslim
- Biology Department, Faculty of Mathematics and Natural Sciences, Universitas Riau, Pekanbaru, Riau, Indonesia
| | - Arisman Adnan
- Mathematics Department, Faculty of Mathematics and Natural Sciences, Universitas Riau, Pekanbaru, Riau, Indonesia
| |
Collapse
|
2
|
Galanti D, Jung JH, Müller C, Bossdorf O. Discarded sequencing reads uncover natural variation in pest resistance in Thlaspi arvense. eLife 2024; 13:RP95510. [PMID: 39699583 DOI: 10.7554/elife.95510] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2024] Open
Abstract
Understanding the genomic basis of natural variation in plant pest resistance is an important goal in plant science, but it usually requires large and labor-intensive phenotyping experiments. Here, we explored the possibility that non-target reads from plant DNA sequencing can serve as phenotyping proxies for addressing such questions. We used data from a whole-genome and -epigenome sequencing study of 207 natural lines of field pennycress (Thlaspi arvense) that were grown in a common environment and spontaneously colonized by aphids, mildew, and other microbes. We found that the numbers of non-target reads assigned to the pest species differed between populations, had significant SNP-based heritability, and were associated with climate of origin and baseline glucosinolate contents. Specifically, pennycress lines from cold and thermally fluctuating habitats, presumably less favorable to aphids, showed higher aphid DNA load, i.e., decreased aphid resistance. Genome-wide association analyses identified genetic variants at known defense genes but also novel genomic regions associated with variation in aphid and mildew DNA load. Moreover, we found several differentially methylated regions associated with pathogen loads, in particular differential methylation at transposons and hypomethylation in the promoter of a gene involved in stomatal closure, likely induced by pathogens. Our study provides first insights into the defense mechanisms of Thlaspi arvense, a rising crop and model species, and demonstrates that non-target whole-genome sequencing reads, usually discarded, can be leveraged to estimate intensities of plant biotic interactions. With rapidly increasing numbers of large sequencing datasets worldwide, this approach should have broad application in fundamental and applied research.
Collapse
Affiliation(s)
- Dario Galanti
- Plant Evolutionary Ecology, Institute of Evolution and Ecology, University of Tübingen, Tübingen, Germany
- Royal Botanic Gardens, Kew, Richmond upon Thames, United Kingdom
| | - Jun Hee Jung
- Plant Evolutionary Ecology, Institute of Evolution and Ecology, University of Tübingen, Tübingen, Germany
| | | | - Oliver Bossdorf
- Plant Evolutionary Ecology, Institute of Evolution and Ecology, University of Tübingen, Tübingen, Germany
| |
Collapse
|
3
|
Ortiz AJ, Sharbrough J. Genome-wide patterns of homoeologous gene flow in allotetraploid coffee. APPLICATIONS IN PLANT SCIENCES 2024; 12:e11584. [PMID: 39184198 PMCID: PMC11342229 DOI: 10.1002/aps3.11584] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/11/2023] [Revised: 01/11/2024] [Accepted: 01/25/2024] [Indexed: 08/27/2024]
Abstract
Premise Allopolyploidy-a hybridization-induced whole-genome duplication event-has been a major driver of plant diversification. The extent to which chromosomes pair with their proper homolog vs. with their homoeolog in allopolyploids varies across taxa, and methods to detect homoeologous gene flow (HGF) are needed to understand how HGF has shaped polyploid lineages. Methods The ABBA-BABA test represents a classic method for detecting introgression between closely related species, but here we developed a modified use of the ABBA-BABA test to characterize the extent and direction of HGF in allotetraploid Coffea arabica. Results We found that HGF is abundant in the C. arabica genome, with both subgenomes serving as donors and recipients of variation. We also found that HGF is highly maternally biased in plastid-targeted-but not mitochondrial-targeted-genes, as would be expected if plastid-nuclear incompatibilities exist between the two parent species. Discussion Together, our analyses provide a simple framework for detecting HGF and new evidence consistent with selection favoring overwriting of paternally derived alleles by maternally derived alleles to ameliorate plastid-nuclear incompatibilities. Natural selection therefore appears to shape the direction and intensity of HGF in allopolyploid coffee, indicating that cytoplasmic inheritance has long-term consequences for polyploid lineages.
Collapse
Affiliation(s)
- Andre J. Ortiz
- Department of BiologyNew Mexico Institute of Mining and TechnologySocorroNew MexicoUSA
| | - Joel Sharbrough
- Department of BiologyNew Mexico Institute of Mining and TechnologySocorroNew MexicoUSA
| |
Collapse
|
4
|
Scalabrin S, Magris G, Liva M, Vitulo N, Vidotto M, Scaglione D, Del Terra L, Ruosi MR, Navarini L, Pellegrino G, Berny Mier Y Teran JC, Toniutti L, Suggi Liverani F, Cerutti M, Di Gaspero G, Morgante M. A chromosome-scale assembly reveals chromosomal aberrations and exchanges generating genetic diversity in Coffea arabica germplasm. Nat Commun 2024; 15:463. [PMID: 38263403 PMCID: PMC10805892 DOI: 10.1038/s41467-023-44449-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2023] [Accepted: 12/13/2023] [Indexed: 01/25/2024] Open
Abstract
In order to better understand the mechanisms generating genetic diversity in the recent allotetraploid species Coffea arabica, here we present a chromosome-level assembly obtained with long read technology. Two genomic compartments with different structural and functional properties are identified in the two homoeologous genomes. The resequencing data from a large set of accessions reveals low intraspecific diversity in the center of origin of the species. Across a limited number of genomic regions, diversity increases in some cultivated genotypes to levels similar to those observed within one of the progenitor species, Coffea canephora, presumably as a consequence of introgressions deriving from the so-called Timor hybrid. It also reveals that, in addition to few, early-occurring exchanges between homoeologous chromosomes, there are numerous recent chromosomal aberrations including aneuploidies, deletions, duplications and exchanges. These events are still polymorphic in the germplasm and could represent a fundamental source of genetic variation in such a lowly variable species.
Collapse
Affiliation(s)
| | - Gabriele Magris
- Istituto di Genomica Applicata, 33100, Udine, Italy
- Department of Agricultural, Food, Environmental and Animal Sciences, University of Udine, 33100, Udine, Italy
| | - Mario Liva
- IGA Technology Services, 33100, Udine, Italy
- Istituto di Genomica Applicata, 33100, Udine, Italy
- Department of Agricultural, Food, Environmental and Animal Sciences, University of Udine, 33100, Udine, Italy
| | - Nicola Vitulo
- Department of Biotechnology, University of Verona, 37134, Verona, Italy
| | | | | | | | | | | | | | | | - Lucile Toniutti
- World Coffee Research, Portland, 97225, OR, USA
- CIRAD, UMR AGAP Institut, 97130, Capesterre-Belle-Eau, Guadeloupe, France
- UMR AGAP Institut, University of Montpellier, CIRAD, INRAE, Institut Agro, 34060, Montpellier, France
| | | | | | | | - Michele Morgante
- Istituto di Genomica Applicata, 33100, Udine, Italy.
- Department of Agricultural, Food, Environmental and Animal Sciences, University of Udine, 33100, Udine, Italy.
| |
Collapse
|
5
|
Ni Y, Zhang X, Li J, Lu Q, Chen H, Ma B, Liu C. Genetic diversity of Coffea arabica L. mitochondrial genomes caused by repeat- mediated recombination and RNA editing. FRONTIERS IN PLANT SCIENCE 2023; 14:1261012. [PMID: 37885664 PMCID: PMC10598636 DOI: 10.3389/fpls.2023.1261012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/18/2023] [Accepted: 09/25/2023] [Indexed: 10/28/2023]
Abstract
Background Coffea arabica L. is one of the most important crops widely cultivated in 70 countries across Asia, Africa, and Latin America. Mitochondria are essential organelles that play critical roles in cellular respiration, metabolism, and differentiation. C. arabica's nuclear and chloroplast genomes have been reported. However, its mitochondrial genome remained unreported. Here, we intended to sequence and characterize its mitochondrial genome to maximize the potential of its genomes for evolutionary studies, molecular breeding, and molecular marker developments. Results We sequenced the total DNA of C. arabica using Illumina and Nanopore platforms. We then assembled the mitochondrial genome with a hybrid strategy using Unicycler software. We found that the mitochondrial genome comprised two circular chromosomes with lengths of 867,678 bp and 153,529 bp, encoding 40 protein-coding genes, 26 tRNA genes, and three rRNA genes. We also detected 270 Simple Sequence Repeats and 34 tandem repeats in the mitochondrial genome. We found 515 high-scoring sequence pairs (HSPs) for a self-to-self similarity comparison using BLASTn. Three HSPs were found to mediate recombination by the mapping of long reads. Furthermore, we predicted 472 using deep-mt with the convolutional neural network model. Then we randomly validated 90 RNA editing events by PCR amplification and Sanger sequencing, with the majority being non-synonymous substitutions and only three being synonymous substitutions. These findings provide valuable insights into the genetic characteristics of the C. arabica mitochondrial genome, which can be helpful for future study on coffee breeding and mitochondrial genome evolution. Conclusion Our study sheds new light on the evolution of C. arabica organelle genomes and their potential use in genetic breeding, providing valuable data for developing molecular markers that can improve crop productivity and quality. Furthermore, the discovery of RNA editing events in the mitochondrial genome of C. arabica offers insights into the regulation of gene expression in this species, contributing to a better understanding of coffee genetics and evolution.
Collapse
Affiliation(s)
| | | | | | | | | | | | - Chang Liu
- Center for Bioinformatics, Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences, Peking Union Medical College, Beijing, China
| |
Collapse
|
6
|
Ruperao P, Bajaj P, Subramani R, Yadav R, Reddy Lachagari VB, Lekkala SP, Rathore A, Archak S, Angadi UB, Singh R, Singh K, Mayes S, Rangan P. A pilot-scale comparison between single and double-digest RAD markers generated using GBS strategy in sesame (Sesamum indicum L.). PLoS One 2023; 18:e0286599. [PMID: 37267340 DOI: 10.1371/journal.pone.0286599] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2023] [Accepted: 05/19/2023] [Indexed: 06/04/2023] Open
Abstract
To reduce the genome sequence representation, restriction site-associated DNA sequencing (RAD-seq) protocols is being widely used either with single-digest or double-digest methods. In this study, we genotyped the sesame population (48 sample size) in a pilot scale to compare single and double-digest RAD-seq (sd and ddRAD-seq) methods. We analysed the resulting short-read data generated from both protocols and assessed their performance impacting the downstream analysis using various parameters. The distinct k-mer count and gene presence absence variation (PAV) showed a significant difference between the sesame samples studied. Additionally, the variant calling from both datasets (sdRAD-seq and ddRAD-seq) exhibits a significant difference between them. The combined variants from both datasets helped in identifying the most diverse samples and possible sub-groups in the sesame population. The most diverse samples identified from each analysis (k-mer, gene PAV, SNP count, Heterozygosity, NJ and PCA) can possibly be representative samples holding major diversity of the small sesame population used in this study. The best possible strategies with suggested inputs for modifications to utilize the RAD-seq strategy efficiently on a large dataset containing thousands of samples to be subjected to molecular analysis like diversity, population structure and core development studies were discussed.
Collapse
Affiliation(s)
- Pradeep Ruperao
- Center of Excellence in Genomics and Systems Biology, International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Hyderabad, India
| | - Prasad Bajaj
- Center of Excellence in Genomics and Systems Biology, International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Hyderabad, India
| | - Rajkumar Subramani
- ICAR-National Bureau of Plant Genetic Resources, PUSA Campus, New Delhi, India
| | - Rashmi Yadav
- ICAR-National Bureau of Plant Genetic Resources, PUSA Campus, New Delhi, India
| | | | | | | | - Sunil Archak
- ICAR-National Bureau of Plant Genetic Resources, PUSA Campus, New Delhi, India
| | - Ulavappa B Angadi
- ICAR-Indian Agricultural Statistical Research Institute, New Delhi, India
| | - Rakesh Singh
- ICAR-National Bureau of Plant Genetic Resources, PUSA Campus, New Delhi, India
| | - Kuldeep Singh
- Genebank, International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Hyderabad, India
| | - Sean Mayes
- Center of Excellence in Genomics and Systems Biology, International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Hyderabad, India
| | - Parimalan Rangan
- ICAR-National Bureau of Plant Genetic Resources, PUSA Campus, New Delhi, India
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St. Lucia, Australia
| |
Collapse
|
7
|
Del Giudice L, Bazakos C, Vassiliou MF. Study of genetic variation and its association with tensile strength among bamboo species through whole genome resequencing. FRONTIERS IN PLANT SCIENCE 2022; 13:935751. [PMID: 35968086 PMCID: PMC9365670 DOI: 10.3389/fpls.2022.935751] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/04/2022] [Accepted: 07/05/2022] [Indexed: 06/15/2023]
Abstract
Moso bamboo (Phyllostachys edulis) is a versatile plant species that is widely used as a construction material by many low-income countries due to the lack of major construction materials such as steel and reinforced concrete. It is also widely used in China. Bamboo is an economically sustainable material that behaves exceptionally in natural disasters such as earthquakes and it can offer viable solutions for contemporary engineering challenges. Despite bamboo's potential in the engineering sector, biological features such as its long generation time, its large genome size, and its polyploidy are constraining factors for genetic and genomic studies that potentially can assist the breeding efforts. This study re-sequenced 8 Phyllostachys species and 18 natural accessions of Ph. edulis, generating a large set of functionally annotated molecular markers (Single Nucleotide Polymorphisms (SNPs) and InDels) providing key genomic resource information. Moreover, all this genomic information was used to carry out a preliminary genome-wide association analysis and several candidate genes were identified to be correlated with a mechanical property that is of high interest to structural engineers: its tensile strength normal to its fibers (i.e., splitting).
Collapse
Affiliation(s)
- Lorenzo Del Giudice
- Chair of Seismic Design and Analysis, Institute of Structural Engineering, ETH Zurich, Zurich, Switzerland
| | - Christos Bazakos
- Chair of Seismic Design and Analysis, Institute of Structural Engineering, ETH Zurich, Zurich, Switzerland
- Institute of Plant Breeding and Genetic Resources, ELGO-Dimitra, Thessaloniki, Greece
- Joint Laboratory of Horticulture, ELGO-Dimitra, Thessaloniki, Greece
- Department of Comparative Development and Genetics, Max Planck Institute for Plant Breeding Research, Cologne, Germany
| | - Michalis F. Vassiliou
- Chair of Seismic Design and Analysis, Institute of Structural Engineering, ETH Zurich, Zurich, Switzerland
| |
Collapse
|