Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

Download

Total Articles

45
(from Reference Citation Analysis)

Article PDFs (23)

Cited by > 0 (36)

Searched Name

Julien Y Dutheil

Ranked By

Results Analysis

Year Published Analysis
Article Type Analysis
Publication Title Analysis
Category Analysis

Results Analysis

Indexed Articles

Year Published

Show more Refine

Article Type

Show more Refine

Article Statistics

Refine

MESH Headings

Show more Refine

First Author

Show more Refine

First Author Affiliations

Show more Refine

Authors

Show more Refine

Publication Titles

Show more Refine

Grant Agencies

Show more Refine

Countries/Regions

Show more Refine

Affiliations

Show more Refine

Corresponding Author Affiliations

Show more Refine

Category

Show more Refine

Number

Citation Analysis

Dutheil JY. On the estimation of genome-average recombination rates. Genetics 2024:iyae051. [PMID: 38565705 DOI: 10.1093/genetics/iyae051] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2024] [Revised: 03/13/2024] [Accepted: 03/20/2024] [Indexed: 04/04/2024] Open

Langebrake C, Manthey G, Frederiksen A, Lugo Ramos JS, Dutheil JY, Chetverikova R, Solov'yov IA, Mouritsen H, Liedvogel M. Adaptive evolution and loss of a putative magnetoreceptor in passerines. Proc Biol Sci 2024;291:20232308. [PMID: 38320616 PMCID: PMC10846946 DOI: 10.1098/rspb.2023.2308] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2023] [Accepted: 01/08/2024] [Indexed: 02/08/2024] Open

Dutheil JY, Hamidi D, Pajot B. The Site/Group Extended Data Format and Tools. Genome Biol Evol 2024;16:evae011. [PMID: 38252924 PMCID: PMC10849175 DOI: 10.1093/gbe/evae011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2024] [Accepted: 01/12/2024] [Indexed: 01/24/2024] Open

Bascón-Cardozo K, Bours A, Manthey G, Durieux G, Dutheil JY, Pruisscher P, Odenthal-Hesse L, Liedvogel M. Fine-Scale Map Reveals Highly Variable Recombination Rates Associated with Genomic Features in the Eurasian Blackcap. Genome Biol Evol 2024;16:evad233. [PMID: 38198800 PMCID: PMC10781513 DOI: 10.1093/gbe/evad233] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/12/2023] [Indexed: 01/12/2024] Open

Abstract

Recombination is responsible for breaking up haplotypes, influencing genetic variability, and the efficacy of selection. Bird genomes lack the protein PR domain-containing protein 9, a key determinant of recombination dynamics in most metazoans. Historical recombination maps in birds show an apparent stasis in positioning recombination events. This highly conserved recombination pattern over long timescales may constrain the evolution of recombination in birds. At the same time, extensive variation in recombination rate is observed across the genome and between different species of birds. Here, we characterize the fine-scale historical recombination map of an iconic migratory songbird, the Eurasian blackcap (Sylvia atricapilla), using a linkage disequilibrium-based approach that accounts for population demography. Our results reveal variable recombination rates among and within chromosomes, which associate positively with nucleotide diversity and GC content and negatively with chromosome size. Recombination rates increased significantly at regulatory regions but not necessarily at gene bodies. CpG islands are associated strongly with recombination rates, though their specific position and local DNA methylation patterns likely influence this relationship. The association with retrotransposons varied according to specific family and location. Our results also provide evidence of heterogeneous intrachromosomal conservation of recombination maps between the blackcap and its closest sister taxon, the garden warbler. These findings highlight the considerable variability of recombination rates at different scales and the role of specific genomic features in shaping this variation. This study opens the possibility of further investigating the impact of recombination on specific population-genomic features.

Collapse

Rivas-González I, Rousselle M, Li F, Zhou L, Dutheil JY, Munch K, Shao Y, Wu D, Schierup MH, Zhang G. Pervasive incomplete lineage sorting illuminates speciation and selection in primates. Science 2023;380:eabn4409. [PMID: 37262154 DOI: 10.1126/science.abn4409] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2021] [Accepted: 01/19/2023] [Indexed: 06/03/2023]

Affiliation(s)

Iker Rivas-González Bioinformatics Research Centre, Aarhus University, DK-8000 Aarhus C, Denmark
Marjolaine Rousselle Bioinformatics Research Centre, Aarhus University, DK-8000 Aarhus C, Denmark
Fang Li BGI-Research, BGI-Wuhan, Wuhan 430074, China Institute of Animal Sex and Development, ZhejiangWanli University, Ningbo 315104, China BGI-Research, BGI-Shenzhen, Shenzhen 518083, China
Long Zhou Evolutionary & Organismal Biology Research Center, Zhejiang University School of Medicine, Hangzhou 310058, China Women's Hospital, School of Medicine, Zhejiang University, Shangcheng District, Hangzhou 310006, China
Julien Y Dutheil Max Planck Institute for Evolutionary Biology, Plön, Germany Institute of Evolution Sciences of Montpellier (ISEM), CNRS, University of Montpellier, IRD, EPHE, 34095 Montpellier, France
Kasper Munch Bioinformatics Research Centre, Aarhus University, DK-8000 Aarhus C, Denmark
Yong Shao State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan 650223, China
Dongdong Wu State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan 650223, China Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, Yunnan 650223, China National Resource Center for Non-Human Primates, Kunming Primate Research Center, and National Research Facility for Phenotypic and Genetic Analysis of Model Animals (Primate Facility), Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan 650107, China Kunming Natural History Museum of Zoology, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan 650223, China
Mikkel H Schierup Bioinformatics Research Centre, Aarhus University, DK-8000 Aarhus C, Denmark
Guojie Zhang Evolutionary & Organismal Biology Research Center, Zhejiang University School of Medicine, Hangzhou 310058, China Women's Hospital, School of Medicine, Zhejiang University, Shangcheng District, Hangzhou 310006, China State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan 650223, China Liangzhu Laboratory, Zhejiang University Medical Center, Hangzhou 311121, China Villum Centre for Biodiversity Genomics, Section for Ecology and Evolution, Department of Biology, University of Copenhagen, DK-2100 Copenhagen, Denmark

Collapse

Raas MWD, Dutheil JY. The rate of adaptive molecular evolution in wild and domesticated Saccharomyces cerevisiae populations. Mol Ecol 2023. [PMID: 37157166 DOI: 10.1111/mec.16980] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2022] [Revised: 04/22/2023] [Accepted: 04/26/2023] [Indexed: 05/10/2023]

Abstract

Through its fermentative capacities, Saccharomyces cerevisiae was central in the development of civilisation during the Neolithic period, and the yeast remains of importance in industry and biotechnology, giving rise to bona fide domesticated populations. Here, we conduct a population genomic study of domesticated and wild populations of S. cerevisiae. Using coalescent analyses, we report that the effective population size of yeast populations decreased since the divergence with S. paradoxus. We fitted models of distributions of fitness effects to infer the rate of adaptive ( ω a $$ {\omega}_a $$ ) and non-adaptive ( ω na $$ {\omega}_{na} $$ ) non-synonymous substitutions in protein-coding genes. We report an overall limited contribution of positive selection to S. cerevisiae protein evolution, albeit with higher rates of adaptive evolution in wild compared to domesticated populations. Our analyses revealed the signature of background selection and possibly Hill-Robertson interference, as recombination was found to be negatively correlated with ω na $$ {\omega}_{na} $$ and positively correlated with ω a $$ {\omega}_a $$ . However, the effect of recombination on ω a $$ {\omega}_a $$ was found to be labile, as it is only apparent after removing the impact of codon usage bias on the synonymous site frequency spectrum and disappears if we control for the correlation with ω na $$ {\omega}_{na} $$ , suggesting that it could be an artefact of the decreasing population size. Furthermore, the rate of adaptive non-synonymous substitutions is significantly correlated with the residue solvent exposure, a relation that cannot be explained by the population's demography. Together, our results provide a detailed characterisation of adaptive mutations in protein-coding genes across S. cerevisiae populations.

Collapse

Puzović N, Madaan T, Dutheil JY. Being noisy in a crowd: Differential selective pressure on gene expression noise in model gene regulatory networks. PLoS Comput Biol 2023;19:e1010982. [PMID: 37079488 PMCID: PMC10118199 DOI: 10.1371/journal.pcbi.1010982] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2022] [Accepted: 02/27/2023] [Indexed: 04/21/2023] Open

Moutinho AF, Eyre-Walker A, Dutheil JY. Strong evidence for the adaptive walk model of gene evolution in Drosophila and Arabidopsis. PLoS Biol 2022;20:e3001775. [PMID: 36099311 PMCID: PMC9470001 DOI: 10.1371/journal.pbio.3001775] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2021] [Accepted: 08/01/2022] [Indexed: 11/19/2022] Open

Meteyer CU, Dutheil JY, Keel MK, Boyles JG, Stukenbrock EH. Plant pathogens provide clues to the potential origin of bat white-nose syndrome Pseudogymnoascus destructans. Virulence 2022;13:1020-1031. [PMID: 35635339 PMCID: PMC9176227 DOI: 10.1080/21505594.2022.2082139] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open

Chaurasia S, Dutheil JY. The Structural Determinants of Intra-Protein Compensatory Substitutions. Mol Biol Evol 2022;39:6555661. [PMID: 35349721 PMCID: PMC9004419 DOI: 10.1093/molbev/msac063] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

Locke DP, Hillier LW, Warren WC, Worley KC, Nazareth LV, Muzny DM, Yang SP, Wang Z, Chinwalla AT, Minx P, Mitreva M, Cook L, Delehaunty KD, Fronick C, Schmidt H, Fulton LA, Fulton RS, Nelson JO, Magrini V, Pohl C, Graves TA, Markovic C, Cree A, Dinh HH, Hume J, Kovar CL, Fowler GR, Lunter G, Meader S, Heger A, Ponting CP, Marques-Bonet T, Alkan C, Chen L, Cheng Z, Kidd JM, Eichler EE, White S, Searle S, Vilella AJ, Chen Y, Flicek P, Ma J, Raney B, Suh B, Burhans R, Herrero J, Haussler D, Faria R, Fernando O, Darré F, Farré D, Gazave E, Oliva M, Navarro A, Roberto R, Capozzi O, Archidiacono N, Della Valle G, Purgato S, Rocchi M, Konkel MK, Walker JA, Ullmer B, Batzer MA, Smit AFA, Hubley R, Casola C, Schrider DR, Hahn MW, Quesada V, Puente XS, Ordoñez GR, López-Otín C, Vinar T, Brejova B, Ratan A, Harris RS, Miller W, Kosiol C, Lawson HA, Taliwal V, Martins AL, Siepel A, RoyChoudhury A, Ma X, Degenhardt J, Bustamante CD, Gutenkunst RN, Mailund T, Dutheil JY, Hobolth A, Schierup MH, Ryder OA, Yoshinaga Y, de Jong PJ, Weinstock GM, Rogers J, Mardis ER, Gibbs RA, Wilson RK. Author Correction: Comparative and demographic analysis of orang-utan genomes. Nature 2022;608:E36. [PMID: 35962045 PMCID: PMC9402433 DOI: 10.1038/s41586-022-04799-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]

Affiliation(s)

Devin P. Locke grid.4367.60000 0001 2355 7002The Genome Center at Washington University, Washington University School of Medicine, Saint Louis, Missouri USA
LaDeana W. Hillier grid.4367.60000 0001 2355 7002The Genome Center at Washington University, Washington University School of Medicine, Saint Louis, Missouri USA
Wesley C. Warren grid.4367.60000 0001 2355 7002The Genome Center at Washington University, Washington University School of Medicine, Saint Louis, Missouri USA
Kim C. Worley grid.39382.330000 0001 2160 926XHuman Genome Sequencing Center, Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, Texas USA
Lynne V. Nazareth grid.39382.330000 0001 2160 926XHuman Genome Sequencing Center, Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, Texas USA
Donna M. Muzny grid.39382.330000 0001 2160 926XHuman Genome Sequencing Center, Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, Texas USA
Shiaw-Pyng Yang grid.4367.60000 0001 2355 7002The Genome Center at Washington University, Washington University School of Medicine, Saint Louis, Missouri USA
Zhengyuan Wang grid.4367.60000 0001 2355 7002The Genome Center at Washington University, Washington University School of Medicine, Saint Louis, Missouri USA
Asif T. Chinwalla grid.4367.60000 0001 2355 7002The Genome Center at Washington University, Washington University School of Medicine, Saint Louis, Missouri USA
Pat Minx grid.4367.60000 0001 2355 7002The Genome Center at Washington University, Washington University School of Medicine, Saint Louis, Missouri USA
Makedonka Mitreva grid.4367.60000 0001 2355 7002The Genome Center at Washington University, Washington University School of Medicine, Saint Louis, Missouri USA
Lisa Cook grid.4367.60000 0001 2355 7002The Genome Center at Washington University, Washington University School of Medicine, Saint Louis, Missouri USA
Kim D. Delehaunty grid.4367.60000 0001 2355 7002The Genome Center at Washington University, Washington University School of Medicine, Saint Louis, Missouri USA
Catrina Fronick grid.4367.60000 0001 2355 7002The Genome Center at Washington University, Washington University School of Medicine, Saint Louis, Missouri USA
Heather Schmidt grid.4367.60000 0001 2355 7002The Genome Center at Washington University, Washington University School of Medicine, Saint Louis, Missouri USA
Lucinda A. Fulton grid.4367.60000 0001 2355 7002The Genome Center at Washington University, Washington University School of Medicine, Saint Louis, Missouri USA
Robert S. Fulton grid.4367.60000 0001 2355 7002The Genome Center at Washington University, Washington University School of Medicine, Saint Louis, Missouri USA
Joanne O. Nelson grid.4367.60000 0001 2355 7002The Genome Center at Washington University, Washington University School of Medicine, Saint Louis, Missouri USA
Vincent Magrini grid.4367.60000 0001 2355 7002The Genome Center at Washington University, Washington University School of Medicine, Saint Louis, Missouri USA
Craig Pohl grid.4367.60000 0001 2355 7002The Genome Center at Washington University, Washington University School of Medicine, Saint Louis, Missouri USA
Tina A. Graves grid.4367.60000 0001 2355 7002The Genome Center at Washington University, Washington University School of Medicine, Saint Louis, Missouri USA
Chris Markovic grid.4367.60000 0001 2355 7002The Genome Center at Washington University, Washington University School of Medicine, Saint Louis, Missouri USA
Andy Cree grid.39382.330000 0001 2160 926XHuman Genome Sequencing Center, Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, Texas USA
Huyen H. Dinh grid.39382.330000 0001 2160 926XHuman Genome Sequencing Center, Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, Texas USA
Jennifer Hume grid.39382.330000 0001 2160 926XHuman Genome Sequencing Center, Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, Texas USA
Christie L. Kovar grid.39382.330000 0001 2160 926XHuman Genome Sequencing Center, Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, Texas USA
Gerald R. Fowler grid.39382.330000 0001 2160 926XHuman Genome Sequencing Center, Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, Texas USA
Gerton Lunter grid.4991.50000 0004 1936 8948MRC Functional Genomics Unit and Department of Physiology, Anatomy and Genetics, University of Oxford, Le Gros Clark Building, Oxford, UK ,4grid.270683.80000 0004 0641 4511Wellcome Trust Centre for Human Genetics, Oxford, UK
Stephen Meader grid.4991.50000 0004 1936 8948MRC Functional Genomics Unit and Department of Physiology, Anatomy and Genetics, University of Oxford, Le Gros Clark Building, Oxford, UK
Andreas Heger grid.4991.50000 0004 1936 8948MRC Functional Genomics Unit and Department of Physiology, Anatomy and Genetics, University of Oxford, Le Gros Clark Building, Oxford, UK
Chris P. Ponting grid.4991.50000 0004 1936 8948MRC Functional Genomics Unit and Department of Physiology, Anatomy and Genetics, University of Oxford, Le Gros Clark Building, Oxford, UK
Tomas Marques-Bonet grid.34477.330000000122986657Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington USA ,6grid.5612.00000 0001 2172 2676IBE, Institut de Biologia Evolutiva (UPF-CSIC), Universitat Pompeu Fabra, PRBB, Doctor Aiguader, 88, Barcelona, Spain
Can Alkan grid.34477.330000000122986657Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington USA
Lin Chen grid.34477.330000000122986657Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington USA
Ze Cheng grid.34477.330000000122986657Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington USA
Jeffrey M. Kidd grid.34477.330000000122986657Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington USA
Evan E. Eichler grid.34477.330000000122986657Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington USA ,7grid.413575.10000 0001 2167 1581Howard Hughes Medical Institute, Seattle, Washington USA
Simon White grid.10306.340000 0004 0606 5382Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Cambridge, UK
Stephen Searle grid.10306.340000 0004 0606 5382Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Cambridge, UK
Albert J. Vilella grid.52788.300000 0004 0427 7672European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge UK
Yuan Chen grid.52788.300000 0004 0427 7672European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge UK
Paul Flicek grid.52788.300000 0004 0427 7672European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge UK
Jian Ma grid.205975.c0000 0001 0740 6917Center for Biomolecular Science and Engineering, University of California, Santa Cruz, California USA ,32grid.35403.310000 0004 1936 9991Present Address: Department of Bioengineering, University of Illinois at Urbana-Champaign, Urbana, Illinois USA
Brian Raney grid.205975.c0000 0001 0740 6917Center for Biomolecular Science and Engineering, University of California, Santa Cruz, California USA
Bernard Suh grid.205975.c0000 0001 0740 6917Center for Biomolecular Science and Engineering, University of California, Santa Cruz, California USA
Richard Burhans grid.29857.310000 0001 2097 4281Center for Comparative Genomics and Bioinformatics, Penn State University, University Park, Pennsylvania, USA
Javier Herrero grid.52788.300000 0004 0427 7672European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge UK
David Haussler grid.205975.c0000 0001 0740 6917Center for Biomolecular Science and Engineering, University of California, Santa Cruz, California USA
Rui Faria grid.5612.00000 0001 2172 2676IBE, Institut de Biologia Evolutiva (UPF-CSIC), Universitat Pompeu Fabra, PRBB, Doctor Aiguader, 88, Barcelona, Spain ,12grid.5808.50000 0001 1503 7226CIBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, Universidade do Porto, Campus Agrário de Vairão, Vairão, Portugal
Olga Fernando grid.5612.00000 0001 2172 2676IBE, Institut de Biologia Evolutiva (UPF-CSIC), Universitat Pompeu Fabra, PRBB, Doctor Aiguader, 88, Barcelona, Spain ,13grid.10772.330000000121511713Instituto de Tecnologia Química e Biológica, Universidade Nova de Lisboa, Oeiras, Portugal
Fleur Darré grid.5612.00000 0001 2172 2676IBE, Institut de Biologia Evolutiva (UPF-CSIC), Universitat Pompeu Fabra, PRBB, Doctor Aiguader, 88, Barcelona, Spain
Domènec Farré grid.5612.00000 0001 2172 2676IBE, Institut de Biologia Evolutiva (UPF-CSIC), Universitat Pompeu Fabra, PRBB, Doctor Aiguader, 88, Barcelona, Spain
Elodie Gazave grid.5612.00000 0001 2172 2676IBE, Institut de Biologia Evolutiva (UPF-CSIC), Universitat Pompeu Fabra, PRBB, Doctor Aiguader, 88, Barcelona, Spain
Meritxell Oliva grid.5612.00000 0001 2172 2676IBE, Institut de Biologia Evolutiva (UPF-CSIC), Universitat Pompeu Fabra, PRBB, Doctor Aiguader, 88, Barcelona, Spain
Arcadi Navarro grid.5612.00000 0001 2172 2676IBE, Institut de Biologia Evolutiva (UPF-CSIC), Universitat Pompeu Fabra, PRBB, Doctor Aiguader, 88, Barcelona, Spain ,14grid.425902.80000 0000 9601 989XICREA (Institució Catalana de Recerca i Estudis Avançats) and INB (Instituto Nacional de Bioinformática) PRBB, Doctor Aiguader, 88, Barcelona, Spain
Roberta Roberto grid.7644.10000 0001 0120 3326Department of Biology, University of Bari, Bari, Italy
Oronzo Capozzi grid.7644.10000 0001 0120 3326Department of Biology, University of Bari, Bari, Italy
Nicoletta Archidiacono grid.7644.10000 0001 0120 3326Department of Biology, University of Bari, Bari, Italy
Giuliano Della Valle grid.6292.f0000 0004 1757 1758Department of Biology, University of Bologna, Bologna, Italy
Stefania Purgato grid.6292.f0000 0004 1757 1758Department of Biology, University of Bologna, Bologna, Italy
Mariano Rocchi grid.7644.10000 0001 0120 3326Department of Biology, University of Bari, Bari, Italy
Miriam K. Konkel grid.64337.350000 0001 0662 7451Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana USA
Jerilyn A. Walker grid.64337.350000 0001 0662 7451Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana USA
Brygg Ullmer grid.64337.350000 0001 0662 7451Center for Computation and Technology, Department of Computer Sciences, Louisiana State University, Baton Rouge, Louisiana USA
Mark A. Batzer grid.64337.350000 0001 0662 7451Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana USA
Arian F. A. Smit grid.64212.330000 0004 0463 2320Institute for Systems Biology, Seattle, Washington USA
Robert Hubley grid.64212.330000 0004 0463 2320Institute for Systems Biology, Seattle, Washington USA
Claudio Casola grid.411377.70000 0001 0790 959XDepartment of Biology and School of Informatics and Computing, Indiana University, Bloomington, Indiana USA
Daniel R. Schrider grid.411377.70000 0001 0790 959XDepartment of Biology and School of Informatics and Computing, Indiana University, Bloomington, Indiana USA
Matthew W. Hahn grid.411377.70000 0001 0790 959XDepartment of Biology and School of Informatics and Computing, Indiana University, Bloomington, Indiana USA
Victor Quesada grid.10863.3c0000 0001 2164 6351Instituto Universitario de Oncologia, Departamento de Bioquimica y Biologia Molecular, Universidad de Oviedo, Oviedo, Spain
Xose S. Puente grid.10863.3c0000 0001 2164 6351Instituto Universitario de Oncologia, Departamento de Bioquimica y Biologia Molecular, Universidad de Oviedo, Oviedo, Spain
Gonzalo R. Ordoñez grid.10863.3c0000 0001 2164 6351Instituto Universitario de Oncologia, Departamento de Bioquimica y Biologia Molecular, Universidad de Oviedo, Oviedo, Spain
Carlos López-Otín grid.10863.3c0000 0001 2164 6351Instituto Universitario de Oncologia, Departamento de Bioquimica y Biologia Molecular, Universidad de Oviedo, Oviedo, Spain
Tomas Vinar grid.7634.60000000109409708Faculty of Mathematics, Physics and Informatics, Comenius University, Mlynska Dolina, Bratislava, Slovakia
Brona Brejova grid.7634.60000000109409708Faculty of Mathematics, Physics and Informatics, Comenius University, Mlynska Dolina, Bratislava, Slovakia
Aakrosh Ratan grid.29857.310000 0001 2097 4281Center for Comparative Genomics and Bioinformatics, Penn State University, University Park, Pennsylvania, USA
Robert S. Harris grid.29857.310000 0001 2097 4281Center for Comparative Genomics and Bioinformatics, Penn State University, University Park, Pennsylvania, USA
Webb Miller grid.29857.310000 0001 2097 4281Center for Comparative Genomics and Bioinformatics, Penn State University, University Park, Pennsylvania, USA
Carolin Kosiol Institut für Populations genetik, Vetmeduni Vienna, Wien, Austria
Heather A. Lawson grid.4367.60000 0001 2355 7002Department of Anatomy and Neurobiology, Washington University School of Medicine, Saint Louis, Missouri USA
Vikas Taliwal grid.5386.8000000041936877XDepartment of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York USA
André L. Martins grid.5386.8000000041936877XDepartment of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York USA
Adam Siepel grid.5386.8000000041936877XDepartment of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York USA
Arindam RoyChoudhury grid.21729.3f0000000419368729Department of Biostatistics, Columbia University, New York, New York USA
Xin Ma grid.5386.8000000041936877XDepartment of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York USA
Jeremiah Degenhardt grid.5386.8000000041936877XDepartment of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York USA
Carlos D. Bustamante grid.168010.e0000000419368956Department of Genetics, Stanford University, Stanford, California USA
Ryan N. Gutenkunst grid.134563.60000 0001 2168 186XDepartment of Molecular and Cellular Biology, University of Arizona, Tucson, Arizona USA
Thomas Mailund grid.7048.b0000 0001 1956 2722Bioinformatics Research Centre, Aarhus University, Aarhus C, Denmark
Julien Y. Dutheil grid.7048.b0000 0001 1956 2722Bioinformatics Research Centre, Aarhus University, Aarhus C, Denmark
Asger Hobolth grid.7048.b0000 0001 1956 2722Bioinformatics Research Centre, Aarhus University, Aarhus C, Denmark
Mikkel H. Schierup grid.7048.b0000 0001 1956 2722Bioinformatics Research Centre, Aarhus University, Aarhus C, Denmark
Oliver A. Ryder grid.452788.40000 0004 0458 5309San Diego Zoo’s Institute for Conservation Research, Escondido, California USA
Yuko Yoshinaga grid.414016.60000 0004 0433 7727Children’s Hospital Oakland Research Institute, Oakland, California USA
Pieter J. de Jong grid.414016.60000 0004 0433 7727Children’s Hospital Oakland Research Institute, Oakland, California USA
George M. Weinstock grid.4367.60000 0001 2355 7002The Genome Center at Washington University, Washington University School of Medicine, Saint Louis, Missouri USA
Jeffrey Rogers grid.39382.330000 0001 2160 926XHuman Genome Sequencing Center, Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, Texas USA
Elaine R. Mardis grid.4367.60000 0001 2355 7002The Genome Center at Washington University, Washington University School of Medicine, Saint Louis, Missouri USA
Richard A. Gibbs grid.39382.330000 0001 2160 926XHuman Genome Sequencing Center, Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, Texas USA
Richard K. Wilson grid.4367.60000 0001 2355 7002The Genome Center at Washington University, Washington University School of Medicine, Saint Louis, Missouri USA

Collapse

Schweizer G, Haider MB, Barroso GV, Rössel N, Münch K, Kahmann R, Dutheil JY. Population Genomics of the Maize Pathogen Ustilago maydis: Demographic History and Role of Virulence Clusters in Adaptation. Genome Biol Evol 2021;13:evab073. [PMID: 33837781 PMCID: PMC8120014 DOI: 10.1093/gbe/evab073] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/06/2021] [Indexed: 11/14/2022] Open

Dutheil JY, Münch K, Schotanus K, Stukenbrock EH, Kahmann R. The insertion of a mitochondrial selfish element into the nuclear genome and its consequences. Ecol Evol 2020;10:11117-11132. [PMID: 33144953 PMCID: PMC7593156 DOI: 10.1002/ece3.6749] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2020] [Accepted: 08/12/2020] [Indexed: 12/15/2022] Open

Potgieter L, Feurtey A, Dutheil JY, Stukenbrock EH. On Variant Discovery in Genomes of Fungal Plant Pathogens. Front Microbiol 2020;11:626. [PMID: 32373089 PMCID: PMC7176817 DOI: 10.3389/fmicb.2020.00626] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2019] [Accepted: 03/19/2020] [Indexed: 11/13/2022] Open

Barroso GV, Moutinho AF, Dutheil JY. A Population Genomics Lexicon. Methods Mol Biol 2020;2090:3-17. [PMID: 31975161 DOI: 10.1007/978-1-0716-0199-0_1] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]

Dutheil JY. Correction to: Processing and Analyzing Multiple Genomes Alignments with MafFilter. Methods Mol Biol 2020;2090:C1. [PMID: 33635534 DOI: 10.1007/978-1-0716-0199-0_20] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]

Moutinho AF, Bataillon T, Dutheil JY. Variation of the adaptive substitution rate between species and within genomes. Evol Ecol 2019. [DOI: 10.1007/s10682-019-10026-z] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]

V. Barroso G, Puzović N, Dutheil JY. Inference of recombination maps from a single pair of genomes and its application to ancient samples. PLoS Genet 2019;15:e1008449. [PMID: 31725722 PMCID: PMC6879166 DOI: 10.1371/journal.pgen.1008449] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2019] [Revised: 11/26/2019] [Accepted: 09/30/2019] [Indexed: 12/11/2022] Open

Grandaubert J, Dutheil JY, Stukenbrock EH. The genomic determinants of adaptive evolution in a fungal pathogen. Evol Lett 2019;3:299-312. [PMID: 31171985 PMCID: PMC6546377 DOI: 10.1002/evl3.117] [Citation(s) in RCA: 40] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2018] [Revised: 04/02/2019] [Accepted: 04/05/2019] [Indexed: 12/16/2022] Open

Abstract

Unravelling the strength, frequency, and distribution of selective variants along the genome as well as the underlying factors shaping this distribution are fundamental goals of evolutionary biology. Antagonistic host-pathogen coevolution is thought to be a major driver of genome evolution between interacting species. While rapid evolution of pathogens has been documented in several model organisms, the genetic mechanisms of their adaptation are still poorly understood and debated, particularly the role of sexual reproduction. Here, we apply a population genomic approach to infer genome-wide patterns of selection among 13 isolates of Zymoseptoria tritici, a fungal pathogen characterized by extremely high genetic diversity, gene density, and recombination rates. We report that the genome of Z. tritici undergoes a high rate of adaptive substitutions, with 44% of nonsynonymous substitutions being adaptive on average. This fraction reaches 68% in so-called effector genes encoding determinants of pathogenicity, and the distribution of fitness effects differs in this class of genes as they undergo adaptive mutations with stronger positive fitness effects, but also more slightly deleterious mutations. Besides the globally high rate of adaptive substitutions, we report a negative relationship between pN/pS and the fine-scale recombination rate and a strong positive correlation between the rate of adaptive nonsynonymous substitutions (ωa) and recombination rate. This result suggests a pervasive role of both background selection and Hill-Robertson interference even in a species with an exceptionally high recombination rate (60 cM/Mb on average). While transposable elements (TEs) have been suggested to contribute to adaptation by creating compartments of fast-evolving genomic regions, we do not find a significant effect of TEs on the rate of adaptive mutations. Overall our study suggests that sexual recombination is a significant driver of genome evolution, even in rapidly evolving organisms subject to recurrent mutations with large positive effects.

Collapse

Dutheil JY, Hobolth A. Ancestral Population Genomics. Methods Mol Biol 2019;1910:555-589. [PMID: 31278677 DOI: 10.1007/978-1-4939-9074-0_18] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]

Stukenbrock EH, Dutheil JY. Fine-Scale Recombination Maps of Fungal Plant Pathogens Reveal Dynamic Recombination Landscapes and Intragenic Hotspots. Genetics 2018;208:1209-1229. [PMID: 29263029 PMCID: PMC5844332 DOI: 10.1534/genetics.117.300502] [Citation(s) in RCA: 45] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2017] [Accepted: 12/15/2017] [Indexed: 11/18/2022] Open

Abstract

Meiotic recombination is an important driver of evolution. Variability in the intensity of recombination across chromosomes can affect sequence composition, nucleotide variation, and rates of adaptation. In many organisms, recombination events are concentrated within short segments termed recombination hotspots. The variation in recombination rate and positions of recombination hotspot can be studied using population genomics data and statistical methods. In this study, we conducted population genomics analyses to address the evolution of recombination in two closely related fungal plant pathogens: the prominent wheat pathogen Zymoseptoria tritici and a sister species infecting wild grasses Z. ardabiliae We specifically addressed whether recombination landscapes, including hotspot positions, are conserved in the two recently diverged species and if recombination contributes to rapid evolution of pathogenicity traits. We conducted a detailed simulation analysis to assess the performance of methods of recombination rate estimation based on patterns of linkage disequilibrium, in particular in the context of high nucleotide diversity. Our analyses reveal overall high recombination rates, a lack of suppressed recombination in centromeres, and significantly lower recombination rates on chromosomes that are known to be accessory. The comparison of the recombination landscapes of the two species reveals a strong correlation of recombination rate at the megabase scale, but little correlation at smaller scales. The recombination landscapes in both pathogen species are dominated by frequent recombination hotspots across the genome including coding regions, suggesting a strong impact of recombination on gene evolution. A significant but small fraction of these hotspots colocalize between the two species, suggesting that hotspot dynamics contribute to the overall pattern of fast evolving recombination in these species.

Collapse

Schweizer G, Münch K, Mannhaupt G, Schirawski J, Kahmann R, Dutheil JY. Positively Selected Effector Genes and Their Contribution to Virulence in the Smut Fungus Sporisorium reilianum. Genome Biol Evol 2018;10:629-645. [PMID: 29390140 PMCID: PMC5811872 DOI: 10.1093/gbe/evy023] [Citation(s) in RCA: 33] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/29/2018] [Indexed: 12/13/2022] Open

Barroso GV, Puzovic N, Dutheil JY. The Evolution of Gene-Specific Transcriptional Noise Is Driven by Selection at the Pathway Level. Genetics 2018;208:173-189. [PMID: 29097405 PMCID: PMC5753856 DOI: 10.1534/genetics.117.300467] [Citation(s) in RCA: 37] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2017] [Accepted: 10/13/2017] [Indexed: 11/18/2022] Open

Odenthal-Hesse L, Dutheil JY, Klötzl F, Haubold B. hotspot: software to support sperm-typing for investigating recombination hotspots. Bioinformatics 2016;32:2554-5. [PMID: 27153632 PMCID: PMC4978934 DOI: 10.1093/bioinformatics/btw195] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2016] [Revised: 03/24/2016] [Accepted: 04/07/2016] [Indexed: 11/14/2022] Open

Tollot M, Assmann D, Becker C, Altmüller J, Dutheil JY, Wegner CE, Kahmann R. The WOPR Protein Ros1 Is a Master Regulator of Sporogenesis and Late Effector Gene Expression in the Maize Pathogen Ustilago maydis. PLoS Pathog 2016;12:e1005697. [PMID: 27332891 PMCID: PMC4917244 DOI: 10.1371/journal.ppat.1005697] [Citation(s) in RCA: 53] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2015] [Accepted: 05/20/2016] [Indexed: 12/31/2022] Open

Abstract

The biotrophic basidiomycete fungus Ustilago maydis causes smut disease in maize. Hallmarks of the disease are large tumors that develop on all aerial parts of the host in which dark pigmented teliospores are formed. We have identified a member of the WOPR family of transcription factors, Ros1, as major regulator of spore formation in U. maydis. ros1 expression is induced only late during infection and hence Ros1 is neither involved in plant colonization of dikaryotic fungal hyphae nor in plant tumor formation. However, during late stages of infection Ros1 is essential for fungal karyogamy, massive proliferation of diploid fungal cells and spore formation. Premature expression of ros1 revealed that Ros1 counteracts the b-dependent filamentation program and induces morphological alterations resembling the early steps of sporogenesis. Transcriptional profiling and ChIP-seq analyses uncovered that Ros1 remodels expression of about 30% of all U. maydis genes with 40% of these being direct targets. In total the expression of 80 transcription factor genes is controlled by Ros1. Four of the upregulated transcription factor genes were deleted and two of the mutants were affected in spore development. A large number of b-dependent genes were differentially regulated by Ros1, suggesting substantial changes in this regulatory cascade that controls filamentation and pathogenic development. Interestingly, 128 genes encoding secreted effectors involved in the establishment of biotrophic development were downregulated by Ros1 while a set of 70 “late effectors” was upregulated. These results indicate that Ros1 is a master regulator of late development in U. maydis and show that the biotrophic interaction during sporogenesis involves a drastic shift in expression of the fungal effectome including the downregulation of effectors that are essential during early stages of infection.

The fungus Ustilago maydis is a pathogen of maize which induces tumor formation in the infected tissue. In these tumors huge amounts of fungal spores develop. As a biotrophic pathogen, U. maydis establishes itself in the plant with the help of a large number of secreted effector proteins. Many effector proteins are important for virulence because they counteract plant defense reactions. In this manuscript we have identified and characterized Ros1, a master regulator for the late stages of U. maydis development. This transcription factor is expressed late during infection and controls nuclear fusion, hyphal aggregation and late proliferation. ros1 mutants are still able to induce tumor formation but these are a dead end because they do not contain any spores. We show that Ros1 interferes with the early regulatory cascade controlled by a complex of two homeodomain proteins. In addition, Ros1 triggers a major switch in the effector repertoire, suggesting that different sets of effectors are needed for different stages of fungal development inside the plant.

Collapse

Dutheil JY, Mannhaupt G, Schweizer G, M K Sieber C, Münsterkötter M, Güldener U, Schirawski J, Kahmann R. A Tale of Genome Compartmentalization: The Evolution of Virulence Clusters in Smut Fungi. Genome Biol Evol 2016;8:681-704. [PMID: 26872771 PMCID: PMC4824034 DOI: 10.1093/gbe/evw026] [Citation(s) in RCA: 94] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open

Dutheil JY, Munch K, Nam K, Mailund T, Schierup MH. Strong Selective Sweeps on the X Chromosome in the Human-Chimpanzee Ancestor Explain Its Low Divergence. PLoS Genet 2015;11:e1005451. [PMID: 26274919 PMCID: PMC4537231 DOI: 10.1371/journal.pgen.1005451] [Citation(s) in RCA: 45] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2015] [Accepted: 07/20/2015] [Indexed: 11/18/2022] Open

Dutheil JY, Figuet E. Optimization of sequence alignments according to the number of sequences vs. number of sites trade-off. BMC Bioinformatics 2015;16:190. [PMID: 26055961 PMCID: PMC4459672 DOI: 10.1186/s12859-015-0619-8] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2014] [Accepted: 05/18/2015] [Indexed: 12/02/2022] Open

Abstract

Background

Comparative analysis of homologous sequences enables the understanding of evolutionary patterns at the molecular level, unraveling the functional constraints that shaped the underlying genes. Bioinformatic pipelines for comparative sequence analysis typically include procedures for (i) alignment quality assessment and (ii) control of sequence redundancy. An additional, underassessed step is the control of the amount and distribution of missing data in sequence alignments. While the number of sequences available for a given gene typically increases with time, the site-specific coverage of each alignment position remains highly variable because of differences in sequencing and annotation quality, or simply because of biological variation. For any given alignment-based analysis, the selection of sequences thus defines a trade-off between the species representation and the quantity of sites with sufficient coverage to be included in the subsequent analyses.

Results

We introduce an algorithm for the optimization of sequence alignments according to the number of sequences vs. number of sites trade-off. The algorithm uses a guide tree to compute scores for each bipartition of the alignment, allowing the recursive selection of sequence subsets with optimal combinations of sequence and site numbers. By applying our methods to two large data sets of several thousands of gene families, we show that significant site-specific coverage increases can be achieved while controlling for the species representation.

Conclusions

The algorithm introduced in this work allows the control of the distribution of missing data in any sequence alignment by removing sequences to increase the number of sites with a defined minimum coverage. We advocate that our missing data optimization procedure in an important step which should be considered in comparative analysis pipelines, together with alignment quality assessment and control of sampled diversity. An open source C++ implementation is available at http://bioweb.me/physamp.

Electronic supplementary material

The online version of this article (doi:10.1186/s12859-015-0619-8) contains supplementary material, which is available to authorized users.

Collapse

Figuet E, Romiguier J, Dutheil JY, Galtier N. Mitochondrial DNA as a tool for reconstructing past life-history traits in mammals. J Evol Biol 2014;27:899-910. [PMID: 24720883 DOI: 10.1111/jeb.12361] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2014] [Revised: 02/27/2014] [Accepted: 02/28/2014] [Indexed: 12/23/2022]

Dutheil JY, Gaillard S, Stukenbrock EH. MafFilter: a highly flexible and extensible multiple genome alignment files processor. BMC Genomics 2014;15:53. [PMID: 24447531 PMCID: PMC3904536 DOI: 10.1186/1471-2164-15-53] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2013] [Accepted: 01/16/2014] [Indexed: 11/10/2022] Open

Munch K, Mailund T, Dutheil JY, Schierup MH. A fine-scale recombination map of the human-chimpanzee ancestor reveals faster change in humans than in chimpanzees and a strong impact of GC-biased gene conversion. Genome Res 2013;24:467-74. [PMID: 24190946 PMCID: PMC3941111 DOI: 10.1101/gr.158469.113] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]

Guéguen L, Gaillard S, Boussau B, Gouy M, Groussin M, Rochette NC, Bigot T, Fournier D, Pouyet F, Cahais V, Bernard A, Scornavacca C, Nabholz B, Haudry A, Dachary L, Galtier N, Belkhir K, Dutheil JY. Bio++: Efficient Extensible Libraries and Tools for Computational Molecular Evolution. Mol Biol Evol 2013;30:1745-50. [DOI: 10.1093/molbev/mst097] [Citation(s) in RCA: 132] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023] Open

Mailund T, Halager AE, Westergaard M, Dutheil JY, Munch K, Andersen LN, Lunter G, Prüfer K, Scally A, Hobolth A, Schierup MH. A new isolation with migration model along complete genomes infers very different divergence processes among closely related great ape species. PLoS Genet 2012;8:e1003125. [PMID: 23284294 PMCID: PMC3527290 DOI: 10.1371/journal.pgen.1003125] [Citation(s) in RCA: 73] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2012] [Accepted: 10/14/2012] [Indexed: 11/18/2022] Open

Romiguier J, Figuet E, Galtier N, Douzery EJP, Boussau B, Dutheil JY, Ranwez V. Fast and robust characterization of time-heterogeneous sequence evolutionary processes using substitution mapping. PLoS One 2012;7:e33852. [PMID: 22479459 PMCID: PMC3313935 DOI: 10.1371/journal.pone.0033852] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2011] [Accepted: 02/22/2012] [Indexed: 12/22/2022] Open

Scally A, Dutheil JY, Hillier LW, Jordan GE, Goodhead I, Herrero J, Hobolth A, Lappalainen T, Mailund T, Marques-Bonet T, McCarthy S, Montgomery SH, Schwalie PC, Tang YA, Ward MC, Xue Y, Yngvadottir B, Alkan C, Andersen LN, Ayub Q, Ball EV, Beal K, Bradley BJ, Chen Y, Clee CM, Fitzgerald S, Graves TA, Gu Y, Heath P, Heger A, Karakoc E, Kolb-Kokocinski A, Laird GK, Lunter G, Meader S, Mort M, Mullikin JC, Munch K, O'Connor TD, Phillips AD, Prado-Martinez J, Rogers AS, Sajjadian S, Schmidt D, Shaw K, Simpson JT, Stenson PD, Turner DJ, Vigilant L, Vilella AJ, Whitener W, Zhu B, Cooper DN, de Jong P, Dermitzakis ET, Eichler EE, Flicek P, Goldman N, Mundy NI, Ning Z, Odom DT, Ponting CP, Quail MA, Ryder OA, Searle SM, Warren WC, Wilson RK, Schierup MH, Rogers J, Tyler-Smith C, Durbin R. Insights into hominid evolution from the gorilla genome sequence. Nature 2012;483:169-75. [PMID: 22398555 PMCID: PMC3303130 DOI: 10.1038/nature10842] [Citation(s) in RCA: 457] [Impact Index Per Article: 38.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2011] [Accepted: 01/10/2012] [Indexed: 12/13/2022]

Corbi J, Dutheil JY, Damerval C, Tenaillon MI, Manicacci D. Accelerated evolution and coevolution drove the evolutionary history of AGPase sub-units during angiosperm radiation. Ann Bot 2012;109:693-708. [PMID: 22307567 PMCID: PMC3286274 DOI: 10.1093/aob/mcr303] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/15/2011] [Accepted: 11/07/2011] [Indexed: 05/10/2023]

Abstract

BACKGROUND AND AIMS

ADP-glucose pyrophosphorylase (AGPase) is a key enzyme of starch biosynthesis. In the green plant lineage, it is composed of two large (LSU) and two small (SSU) sub-units encoded by paralogous genes, as a consequence of several rounds of duplication. First, our aim was to detect specific patterns of molecular evolution following duplication events and the divergence between monocotyledons and dicotyledons. Secondly, we investigated coevolution between amino acids both within and between sub-units.

METHODS

A phylogeny of each AGPase sub-unit was built using all gymnosperm and angiosperm sequences available in databases. Accelerated evolution along specific branches was tested using the ratio of the non-synonymous to the synonymous substitution rate. Coevolution between amino acids was investigated taking into account compensatory changes between co-substitutions.

KEY RESULTS

We showed that SSU paralogues evolved under high functional constraints during angiosperm radiation, with a significant level of coevolution between amino acids that participate in SSU major functions. In contrast, in the LSU paralogues, we identified residues under positive selection (1) following the first LSU duplication that gave rise to two paralogues mainly expressed in angiosperm source and sink tissues, respectively; and (2) following the emergence of grass-specific paralogues expressed in the endosperm. Finally, we found coevolution between residues that belong to the interaction domains of both sub-units.

CONCLUSIONS

Our results support the view that coevolution among amino acid residues, especially those lying in the interaction domain of each sub-unit, played an important role in AGPase evolution. First, within SSU, coevolution allowed compensating mutations in a highly constrained context. Secondly, the LSU paralogues probably acquired tissue-specific expression and regulatory properties via the coevolution between sub-unit interacting domains. Finally, the pattern we observed during LSU evolution is consistent with repeated sub-functionalization under 'Escape from Adaptive Conflict', a model rarely illustrated in the literature.

Collapse

Dutheil JY, Galtier N, Romiguier J, Douzery EJ, Ranwez V, Boussau B. Efficient Selection of Branch-Specific Models of Sequence Evolution. Mol Biol Evol 2012;29:1861-74. [DOI: 10.1093/molbev/mss059] [Citation(s) in RCA: 44] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

Stukenbrock EH, Dutheil JY. Comparing fungal genomes: insight into functional and evolutionary processes. Methods Mol Biol 2012;835:531-548. [PMID: 22183676 DOI: 10.1007/978-1-61779-501-5_33] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]

Dutheil JY, Hobolth A. Ancestral population genomics. Methods Mol Biol 2012;856:293-313. [PMID: 22399464 DOI: 10.1007/978-1-61779-585-5_12] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]

Stukenbrock EH, Bataillon T, Dutheil JY, Hansen TT, Li R, Zala M, McDonald BA, Wang J, Schierup MH. The making of a new pathogen: insights from comparative population genomics of the domesticated wheat pathogen Mycosphaerella graminicola and its wild sister species. Genome Res 2011;21:2157-66. [PMID: 21994252 DOI: 10.1101/gr.118851.110] [Citation(s) in RCA: 159] [Impact Index Per Article: 12.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023]

Dutheil JY. Detecting coevolving positions in a molecule: why and how to account for phylogeny. Brief Bioinform 2011;13:228-43. [PMID: 21949241 DOI: 10.1093/bib/bbr048] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

Maldonado E, Dutheil JY, da Fonseca RR, Vasconcelos V, Antunes A. IMPACT: integrated multiprogram platform for analyses in ConTest. J Hered 2011;102:366-9. [PMID: 21414966 DOI: 10.1093/jhered/esr003] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

Mailund T, Dutheil JY, Hobolth A, Lunter G, Schierup MH. Estimating divergence time and ancestral effective population size of Bornean and Sumatran orangutan subspecies using a coalescent hidden Markov model. PLoS Genet 2011;7:e1001319. [PMID: 21408205 PMCID: PMC3048369 DOI: 10.1371/journal.pgen.1001319] [Citation(s) in RCA: 70] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2009] [Accepted: 01/25/2011] [Indexed: 12/01/2022] Open

Abstract

Due to genetic variation in the ancestor of two populations or two species, the divergence time for DNA sequences from two populations is variable along the genome. Within genomic segments all bases will share the same divergence—because they share a most recent common ancestor—when no recombination event has occurred to split them apart. The size of these segments of constant divergence depends on the recombination rate, but also on the speciation time, the effective population size of the ancestral population, as well as demographic effects and selection. Thus, inference of these parameters may be possible if we can decode the divergence times along a genomic alignment. Here, we present a new hidden Markov model that infers the changing divergence (coalescence) times along the genome alignment using a coalescent framework, in order to estimate the speciation time, the recombination rate, and the ancestral effective population size. The model is efficient enough to allow inference on whole-genome data sets. We first investigate the power and consistency of the model with coalescent simulations and then apply it to the whole-genome sequences of the two orangutan sub-species, Bornean (P. p. pygmaeus) and Sumatran (P. p. abelii) orangutans from the Orangutan Genome Project. We estimate the speciation time between the two sub-species to be thousand years ago and the effective population size of the ancestral orangutan species to be , consistent with recent results based on smaller data sets. We also report a negative correlation between chromosome size and ancestral effective population size, which we interpret as a signature of recombination increasing the efficacy of selection.

We present a hidden Markov model that uses variation in coalescence times between two distantly related populations, or closely related species, to infer population genetics parameters in ancestral population or species. The model infers the divergence times in segments along the alignment. Using coalescent simulations, we show that the model accurately estimates the divergence time between the two populations and the effective population size of the ancestral population. We apply the model to the recently sequenced orangutan sub-species and estimate their divergence time and the effective population size of their ancestor population.

Collapse

Hobolth A, Dutheil JY, Hawks J, Schierup MH, Mailund T. Incomplete lineage sorting patterns among human, chimpanzee, and orangutan suggest recent orangutan speciation and widespread selection. Genome Res 2011;21:349-56. [PMID: 21270173 PMCID: PMC3044849 DOI: 10.1101/gr.114751.110] [Citation(s) in RCA: 168] [Impact Index Per Article: 12.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]

Dutheil JY, Jossinet F, Westhof E. Base pairing constraints drive structural epistasis in ribosomal RNA sequences. Mol Biol Evol 2010;27:1868-76. [PMID: 20211929 DOI: 10.1093/molbev/msq069] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open

Abstract

It has long been accepted that the structural constraints stemming from the 3D structure of ribosomal RNA (rRNA) lead to coevolution through compensating mutations between interacting sites. State-of-the-art methods for detecting coevolving sites, however, while reaching high levels of specificity and sensitivity for Watson-Crick (WC) pairs of the helices defining the secondary structure, only scarcely reveal tertiary interactions occurring at the level of the 3D structure. In order to understand the relative failure of coevolutionary methods to detect such interactions, we analyze 2,682 interacting sites derived from high-resolution structures, which include a comprehensive data set of rRNA sequences from Archaea and Bacteria. We report a striking difference in the amount of coevolution between WC and non-WC pairs. In order to understand this pattern, we derive fitness landscapes from the geometry of base pairing interactions and construct neutral networks of substitutions for each type of interaction. These networks show that coevolution is a property of WC pairs because, unlike non-WC pairs, their landscapes exhibit fitness valleys, a single mutation in a WC pair resulting in a fitness drop. Second, we used the inferred neutral networks to estimate the level of constraint acting on each type of base pair and show that it correlates negatively with the observed rate of substitutions for all non-WC pairs. WC pairs appear as outliers, fixing more substitutions than expected according to their level of constraint. We here propose that the rate of substitution in WC pairs is due to coevolution resulting from constraints acting at intermediate levels of organization, namely the one of the helical stem with its forming WC pairs. In agreement with this hypothesis, we report a significant excess of intrahelical, inter-WC pairs coevolution compared with interhelices pairs. Altogether, these results show that detailed biochemical knowledge is required and has to be incorporated into evolutionary reasoning in order to understand the fine patterns of variation at the molecular level. They also demonstrate that coevolutionary analysis provides almost exclusively 2D information and only little 3D signal.

Collapse