1
Lecomte L, Árnyasi M, Ferchaud A, Kent M, Lien S, Stenløkk K, Sylvestre F, Bernatchez L, Mérot C. Investigating structural variant, indel and single nucleotide polymorphism differentiation between locally adapted Atlantic salmon populations. Evol Appl 2024; 17:e13653. [PMID: 38495945 PMCID: PMC10940791 DOI: 10.1111/eva.13653] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/23/2023] [Revised: 12/14/2023] [Accepted: 01/13/2024] [Indexed: 03/19/2024] [Open access]
Abstract
Genomic structural variants (SVs) are now recognized as an integral component of intraspecific polymorphism and are known to contribute to evolutionary processes in various organisms. However, they are inherently difficult to detect and genotype from readily available short-read sequencing data, and therefore remain poorly documented in wild populations. Salmonid species displaying strong interpopulation variability in both life history traits and habitat characteristics, such as Atlantic salmon (Salmo salar), offer a prime context for studying adaptive polymorphism, but the contribution of SVs to fine-scale local adaptation has yet to be explored. Here, we performed a comparative analysis of SVs, single nucleotide polymorphisms (SNPs) and small indels (<50 bp) segregating in the Romaine and Puyjalon salmon, two putatively locally adapted populations inhabiting neighboring rivers (Québec, Canada) and showing pronounced variation in life history traits, namely growth, fecundity, and age at maturity and smoltification. We first catalogued polymorphism using a hybrid SV characterization approach pairing both short- (16X) and long-read sequencing (20X) for variant discovery with graph-based genotyping of SVs across 60 salmon genomes, along with characterization of SNPs and small indels from short reads. We thus identified 115,907 SVs, 8,777,832 SNPs and 1,089,321 short indels, with SVs covering 4.8 times more base pairs than SNPs. All three variant types revealed a highly congruent population structure and similar patterns of FST and density variation along the genome. Finally, we performed outlier detection and redundancy analysis (RDA) to identify variants of interest in the putative local adaptation of Romaine and Puyjalon salmon. Genes located near these variants were enriched for biological processes related to nervous system function, suggesting that observed variation in traits such as age at smoltification could arise from differences in neural development.
This study therefore demonstrates the feasibility of large-scale SV characterization and highlights its relevance for salmonid population genomics.
Affiliation(s)
- Laurie Lecomte
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, Canada
- Département de Biologie, Université Laval, Québec, Canada
- Mariann Árnyasi
- Department of Animal and Aquacultural Sciences (IHA), Faculty of Life Sciences (BIOVIT), Centre for Integrative Genetics (CIGENE), Norwegian University of Life Sciences (NMBU), Ås, Norway
- Anne-Laure Ferchaud
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, Canada
- Département de Biologie, Université Laval, Québec, Canada
- Present address: Parks Canada, Office of the Chief Ecosystem Scientist, Québec, QC, Canada
- Matthew Kent
- Department of Animal and Aquacultural Sciences (IHA), Faculty of Life Sciences (BIOVIT), Centre for Integrative Genetics (CIGENE), Norwegian University of Life Sciences (NMBU), Ås, Norway
- Sigbjørn Lien
- Department of Animal and Aquacultural Sciences (IHA), Faculty of Life Sciences (BIOVIT), Centre for Integrative Genetics (CIGENE), Norwegian University of Life Sciences (NMBU), Ås, Norway
- Kristina Stenløkk
- Department of Animal and Aquacultural Sciences (IHA), Faculty of Life Sciences (BIOVIT), Centre for Integrative Genetics (CIGENE), Norwegian University of Life Sciences (NMBU), Ås, Norway
- Florent Sylvestre
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, Canada
- Département de Biologie, Université Laval, Québec, Canada
- Louis Bernatchez
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, Canada
- Département de Biologie, Université Laval, Québec, Canada
- Claire Mérot
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, Canada
- Département de Biologie, Université Laval, Québec, Canada
- Present address: UMR 6553 Ecobio, OSUR, CNRS, Université de Rennes, Rennes, France
2
Garcia JF, Morales-Cruz A, Cochetel N, Minio A, Figueroa-Balderas R, Rolshausen PE, Baumgartner K, Cantu D. Comparative Pangenomic Insights into the Distinct Evolution of Virulence Factors Among Grapevine Trunk Pathogens. MOLECULAR PLANT-MICROBE INTERACTIONS : MPMI 2024; 37:127-142. [PMID: 37934016 DOI: 10.1094/mpmi-09-23-0129-r] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/08/2023]
Abstract
The permanent organs of grapevines (Vitis vinifera L.), like those of other woody perennials, are colonized by various unrelated pathogenic ascomycete fungi secreting cell wall-degrading enzymes and phytotoxic secondary metabolites that contribute to host damage and disease symptoms. Trunk pathogens differ in the symptoms they induce and the extent and speed of damage. Isolates of the same species often display a wide virulence range, even within the same vineyard. This study focuses on Eutypa lata, Neofusicoccum parvum, and Phaeoacremonium minimum, causal agents of Eutypa dieback, Botryosphaeria dieback, and Esca, respectively. We sequenced 50 isolates from viticulture regions worldwide and built nucleotide-level, reference-free pangenomes for each species. Through examination of genomic diversity and pangenome structure, we analyzed intraspecific conservation and variability of putative virulence factors, focusing on functions under positive selection and recent gene family dynamics of contraction and expansion. Our findings reveal contrasting distributions of putative virulence factors in the core, dispensable, and private genomes of each pangenome. For example, carbohydrate active enzymes (CAZymes) were prevalent in the core genomes of each pangenome, whereas biosynthetic gene clusters were prevalent in the dispensable genomes of E. lata and P. minimum. The dispensable fractions were also enriched in Gypsy transposable elements and virulence factors under positive selection (polyketide synthase genes in E. lata and P. minimum, glycosyltransferases in N. parvum). Our findings underscore the complexity of the genomic architecture in each species and provide insights into their adaptive strategies, enhancing our understanding of the underlying mechanisms of virulence. Copyright © 2024 The Author(s). This is an open access article distributed under the CC BY-NC-ND 4.0 International license.
Affiliation(s)
- Jadran F Garcia
- Department of Viticulture and Enology, University of California, Davis, Davis, CA, U.S.A
- Abraham Morales-Cruz
- Department of Viticulture and Enology, University of California, Davis, Davis, CA, U.S.A
- U.S. Department of Energy, Joint Genome Institute, Lawrence Berkeley National Lab, Berkeley, CA, U.S.A
- Noé Cochetel
- Department of Viticulture and Enology, University of California, Davis, Davis, CA, U.S.A
- Andrea Minio
- Department of Viticulture and Enology, University of California, Davis, Davis, CA, U.S.A
- Rosa Figueroa-Balderas
- Department of Viticulture and Enology, University of California, Davis, Davis, CA, U.S.A
- Philippe E Rolshausen
- Department of Botany and Plant Sciences, University of California, Riverside, Riverside, CA, U.S.A
- Kendra Baumgartner
- Crops Pathology and Genetics Research Unit, U.S. Department of Agriculture-Agricultural Research Service, Davis, CA, U.S.A
- Dario Cantu
- Department of Viticulture and Enology, University of California, Davis, Davis, CA, U.S.A
- Genome Center, University of California, Davis, Davis, CA, U.S.A
3
Otto M, Wiehe T. The structured coalescent in the context of gene copy number variation. Theor Popul Biol 2023; 154:67-78. [PMID: 37657649 DOI: 10.1016/j.tpb.2023.08.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/28/2023] [Revised: 08/16/2023] [Accepted: 08/22/2023] [Indexed: 09/03/2023]
Abstract
The Structured Coalescent was introduced to describe the coalescent process in spatially subdivided populations with migration. Here, we re-interpret migration routes of individuals in the original model as "migration routes" of single genes in tandemly arranged gene arrays. A gene copy may change its position within the array via unequal recombination. Hence, in a coalescent framework, two copies sampled from two chromosomes may coalesce only if they are at exactly homologous positions. Otherwise, one or multiple recombination events have to occur before they can coalesce, thereby increasing mean coalescence time and expected genetic diversity among the copies in a gene array. We explicitly calculate the transition probabilities on these routes backward in time. We simulate the structured coalescent with migration and coalescence rates informed by the unequal recombination process of gene copies. With this novel interpretation of population structure models we determine coalescence times and expected genetic diversity in samples of orthologous and paralogous copies from a gene family. As a case study, we discuss the site frequency spectrum of a small gene family in the two scenarios of high and of no gene copy number variation among individuals. These examples underline the significance of our model, since standard test-statistics may lead to misinterpretations when analyzing sequence data of multi-copy genes due to their different expected genetic diversity.
Affiliation(s)
- Moritz Otto
- University of Cologne, Institute for Genetics, Zuelpicher Str. 47a, Cologne, 50674, Germany
- Thomas Wiehe
- University of Cologne, Institute for Genetics, Zuelpicher Str. 47a, Cologne, 50674, Germany.
4
Lyu X, Xia Y, Wang C, Zhang K, Deng G, Shen Q, Gao W, Zhang M, Liao N, Ling J, Bo Y, Hu Z, Yang J, Zhang M. Pan-genome analysis sheds light on structural variation-based dissection of agronomic traits in melon crops. PLANT PHYSIOLOGY 2023; 193:1330-1348. [PMID: 37477947 DOI: 10.1093/plphys/kiad405] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/22/2023] [Accepted: 06/21/2023] [Indexed: 07/22/2023]
Abstract
Sweetness and appearance of fresh fruits are key palatable and preference attributes for consumers and are often controlled by multiple genes. However, fine-mapping the key loci or genes of interest by single genome-based genetic analysis is challenging. Herein, we present the chromosome-level genome assembly of 1 landrace melon accession (Cucumis melo ssp. agrestis) with wild morphologic features and thus construct a melon pan-genome atlas via integrating sequenced melon genome datasets. Our comparative genomic analysis reveals a total of 3.4 million genetic variations, of which the presence/absence variations (PAVs) are mainly involved in regulating the function of genes for sucrose metabolism during melon domestication and improvement. We further resolved several loci that are accountable for sucrose contents, flesh color, rind stripe, and suture using a structural variation (SV)-based genome-wide association study. Furthermore, via bulked segregation analysis (BSA)-seq and map-based cloning, we uncovered that a single gene, CmPIRL6, determines the edible or inedible characteristics of melon fruit exocarp. These findings provide important melon pan-genome information and provide a powerful toolkit for future pan-genome-informed cultivar breeding of melon.
Affiliation(s)
- Xiaolong Lyu
- Laboratory of Germplasm Innovation and Molecular Breeding, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou 310058, China
- Yuelin Xia
- Laboratory of Germplasm Innovation and Molecular Breeding, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou 310058, China
- Chenhao Wang
- Laboratory of Germplasm Innovation and Molecular Breeding, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou 310058, China
- Kejia Zhang
- Laboratory of Germplasm Innovation and Molecular Breeding, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou 310058, China
- Guancong Deng
- Laboratory of Germplasm Innovation and Molecular Breeding, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou 310058, China
- Qinghui Shen
- Laboratory of Germplasm Innovation and Molecular Breeding, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou 310058, China
- Wei Gao
- Laboratory of Germplasm Innovation and Molecular Breeding, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou 310058, China
- Hainan Institute, Zhejiang University, Yazhou District, Sanya 572025, China
- Mengyi Zhang
- Laboratory of Germplasm Innovation and Molecular Breeding, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou 310058, China
- Hainan Institute, Zhejiang University, Yazhou District, Sanya 572025, China
- Nanqiao Liao
- Laboratory of Germplasm Innovation and Molecular Breeding, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou 310058, China
- Jian Ling
- Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, 12 Zhongguancun South Street, Beijing 100081, China
- Yongming Bo
- Key Laboratory of Vegetable Breeding, Ningbo Weimeng Seed Co., Ltd, Ningbo 315100, China
- Zhongyuan Hu
- Laboratory of Germplasm Innovation and Molecular Breeding, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou 310058, China
- Hainan Institute, Zhejiang University, Yazhou District, Sanya 572025, China
- Key Laboratory of Horticultural Plant Growth, Development and Quality Improvement, Ministry of Agriculture, Hangzhou 310058, China
- Jinghua Yang
- Laboratory of Germplasm Innovation and Molecular Breeding, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou 310058, China
- Hainan Institute, Zhejiang University, Yazhou District, Sanya 572025, China
- Key Laboratory of Horticultural Plant Growth, Development and Quality Improvement, Ministry of Agriculture, Hangzhou 310058, China
- Mingfang Zhang
- Laboratory of Germplasm Innovation and Molecular Breeding, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou 310058, China
- Hainan Institute, Zhejiang University, Yazhou District, Sanya 572025, China
- Key Laboratory of Horticultural Plant Growth, Development and Quality Improvement, Ministry of Agriculture, Hangzhou 310058, China
5
Wang J, Yang W, Zhang S, Hu H, Yuan Y, Dong J, Chen L, Ma Y, Yang T, Zhou L, Chen J, Liu B, Li C, Edwards D, Zhao J. A pangenome analysis pipeline provides insights into functional gene identification in rice. Genome Biol 2023; 24:19. [PMID: 36703158 PMCID: PMC9878884 DOI: 10.1186/s13059-023-02861-9] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Received: 06/16/2022] [Accepted: 01/18/2023] [Indexed: 01/27/2023] [Open access]
Abstract
BACKGROUND: A pangenome aims to capture the complete genetic diversity within a species and reduce bias in genetic analysis inherent in using a single reference genome. However, the current linear format of most plant pangenomes limits the presentation of position information for novel sequences. Graph pangenomes have been developed to overcome this limitation. However, bioinformatics analysis tools for graph format genomes are lacking. RESULTS: To overcome this problem, we develop a novel strategy for pangenome construction and a downstream pangenome analysis pipeline (PSVCP) that captures genetic variants' position information while maintaining a linearized layout. Using PSVCP, we construct a high-quality rice pangenome using 12 representative rice genomes and analyze an international rice panel with 413 diverse accessions using the pangenome as the reference. We show that PSVCP successfully identifies causal structural variations for rice grain weight and plant height. Our results provide insights into rice population structure and genomic diversity. We characterize a new locus (qPH8-1) associated with plant height on chromosome 8 undetected by the SNP-based genome-wide association study (GWAS). CONCLUSIONS: Our results demonstrate that the pangenome constructed by our pipeline combined with a presence and absence variation-based GWAS can provide additional power for genomic and genetic analysis. The pangenome constructed in this study and the associated genome sequence and genetic variants data provide valuable genomic resources for rice genomics research and improvement in future.
Affiliation(s)
- Jian Wang
- grid.135769.f, 0000 0001 0561 6611, Rice Research Institute & Guangdong Key Laboratory of New Technology in Rice Breeding & Guangdong Rice Engineering Laboratory, Guangdong Academy of Agricultural Sciences, Guangzhou, 510640 China
- Wu Yang
- grid.135769.f, 0000 0001 0561 6611, Rice Research Institute & Guangdong Key Laboratory of New Technology in Rice Breeding & Guangdong Rice Engineering Laboratory, Guangdong Academy of Agricultural Sciences, Guangzhou, 510640 China
- Shaohong Zhang
- grid.135769.f, 0000 0001 0561 6611, Rice Research Institute & Guangdong Key Laboratory of New Technology in Rice Breeding & Guangdong Rice Engineering Laboratory, Guangdong Academy of Agricultural Sciences, Guangzhou, 510640 China
- Haifei Hu
- grid.135769.f, 0000 0001 0561 6611, Rice Research Institute & Guangdong Key Laboratory of New Technology in Rice Breeding & Guangdong Rice Engineering Laboratory, Guangdong Academy of Agricultural Sciences, Guangzhou, 510640 China
- grid.1025.6, 0000 0004 0436 6763, Western Crop Genetics Alliance, Murdoch University, Murdoch, Western Australia 6150 Australia
- Yuxuan Yuan
- grid.10784.3a, 0000 0004 1937 0482, School of Life Sciences and State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Hong Kong, SAR China
- Jingfang Dong
- grid.135769.f, 0000 0001 0561 6611, Rice Research Institute & Guangdong Key Laboratory of New Technology in Rice Breeding & Guangdong Rice Engineering Laboratory, Guangdong Academy of Agricultural Sciences, Guangzhou, 510640 China
- Luo Chen
- grid.135769.f, 0000 0001 0561 6611, Rice Research Institute & Guangdong Key Laboratory of New Technology in Rice Breeding & Guangdong Rice Engineering Laboratory, Guangdong Academy of Agricultural Sciences, Guangzhou, 510640 China
- Yamei Ma
- grid.135769.f, 0000 0001 0561 6611, Rice Research Institute & Guangdong Key Laboratory of New Technology in Rice Breeding & Guangdong Rice Engineering Laboratory, Guangdong Academy of Agricultural Sciences, Guangzhou, 510640 China
- Tifeng Yang
- grid.135769.f, 0000 0001 0561 6611, Rice Research Institute & Guangdong Key Laboratory of New Technology in Rice Breeding & Guangdong Rice Engineering Laboratory, Guangdong Academy of Agricultural Sciences, Guangzhou, 510640 China
- Lian Zhou
- grid.135769.f, 0000 0001 0561 6611, Rice Research Institute & Guangdong Key Laboratory of New Technology in Rice Breeding & Guangdong Rice Engineering Laboratory, Guangdong Academy of Agricultural Sciences, Guangzhou, 510640 China
- Jiansong Chen
- grid.135769.f, 0000 0001 0561 6611, Rice Research Institute & Guangdong Key Laboratory of New Technology in Rice Breeding & Guangdong Rice Engineering Laboratory, Guangdong Academy of Agricultural Sciences, Guangzhou, 510640 China
- Bin Liu
- grid.135769.f, 0000 0001 0561 6611, Rice Research Institute & Guangdong Key Laboratory of New Technology in Rice Breeding & Guangdong Rice Engineering Laboratory, Guangdong Academy of Agricultural Sciences, Guangzhou, 510640 China
- Chengdao Li
- grid.1025.6, 0000 0004 0436 6763, Western Crop Genetics Alliance, Murdoch University, Murdoch, Western Australia 6150 Australia
- David Edwards
- grid.1012.2, 0000 0004 1936 7910, School of Biological Sciences and Centre for Applied Bioinformatics, University of Western Australia, Perth, WA Australia
- Junliang Zhao
- grid.135769.f, 0000 0001 0561 6611, Rice Research Institute & Guangdong Key Laboratory of New Technology in Rice Breeding & Guangdong Rice Engineering Laboratory, Guangdong Academy of Agricultural Sciences, Guangzhou, 510640 China
6
Graph Pangenomes Track Genetic Variants for Crop Improvement. Int J Mol Sci 2022; 23:ijms232113420. [DOI: 10.3390/ijms232113420] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/06/2022] [Revised: 10/28/2022] [Accepted: 10/29/2022] [Indexed: 11/06/2022] [Open access]
Abstract
Global climate change and the urgency to transform crops require an exhaustive genetic evaluation. The large polyploid genomes of food crops, such as cereals, make it difficult to identify candidate genes with confirmed hereditary. Although genome-wide association studies (GWAS) have been proficient in identifying genetic variants that are associated with complex traits, the resolution of acquired heritability faces several significant bottlenecks such as incomplete detection of structural variants (SV), genetic heterogeneity, and/or locus heterogeneity. Consequently, a biased estimate is generated with respect to agronomically complex traits. The graph pangenomes have resolved this missing heritability and provide significant details in terms of specific loci segregating among individuals and evolving to variations. The graph pangenome approach facilitates crop improvements through genome-linked fast breeding.