1
|
Phillips A, Schultz CJ, Burton RA. New crops on the block: effective strategies to broaden our food, fibre, and fuel repertoire in the face of increasingly volatile agricultural systems. JOURNAL OF EXPERIMENTAL BOTANY 2025; 76:2043-2063. [PMID: 40036544 DOI: 10.1093/jxb/eraf023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/18/2024] [Accepted: 02/26/2025] [Indexed: 03/06/2025]
Abstract
Climate change poses significant challenges to our ability to keep a growing global population fed, clothed, and fuelled. This review sets the scene by summarizing the impacts of climate change on production of the major grain crop species rice, wheat, and maize, with a focus on yield reductions due to abiotic stresses and altered disease pressures. We discuss efforts to improve resilience, emphasizing traits such as water use efficiency, heat tolerance, and disease resistance. We move on to exploring production trends of established, re-emerging, and new crops, highlighting the challenges of developing and maintaining new arrivals in the global market. We analyse the potential of wild relatives for improving domesticated crops, or as candidates for de novo domestication. The importance of pangenomes for uncovering genetic variation for crop improvement is also discussed. We examine the impact of climate change on non-cereals, including fruit, nut, and fibre crops, and the potential of alternative multiuse crops to increase global sustainability and address climate change-related challenges. Agave is used as an exemplar to demonstrate the strategic pathway for developing a robust new crop option. There is a need for sustained investment in research and development across the entire value chain to facilitate the exploration of diverse species and genetic resources to enhance crop resilience and adaptability to future environmental conditions.
Collapse
Affiliation(s)
- Aaron Phillips
- School of Agriculture, Food and Wine, Plant Genomics Centre, Hartley Grove, Urrbrae SA 5064, Australia
| | - Carolyn J Schultz
- School of Agriculture, Food and Wine, Plant Genomics Centre, Hartley Grove, Urrbrae SA 5064, Australia
| | - Rachel A Burton
- School of Agriculture, Food and Wine, Plant Genomics Centre, Hartley Grove, Urrbrae SA 5064, Australia
| |
Collapse
|
2
|
Perkins DO, Jeffries CD, Clark SR, Upthegrove R, Wannan CMJ, Wray NR, Li QS, Do KQ, Walker E, Paul Amminger G, Anticevic A, Cotter D, Ellman LM, Mongan D, Phassouliotis C, Barbee J, Roth S, Billah T, Corcoran C, Calkins ME, Cerrato F, Khadimallah I, Klauser P, Winter-van Rossum I, Nunez AR, Bleggi RS, Martin AR, Bouix S, Pasternak O, Shah JL, Toben C, Wolf DH, Accelerating Medicines Partnership® Schizophrenia (AMP® SCZ), Kahn RS, Kane JM, McGorry PD, Bearden CE, Nelson B, Shenton ME, Woods SW. Body fluid biomarkers and psychosis risk in The Accelerating Medicines Partnership® Schizophrenia Program: design considerations. SCHIZOPHRENIA (HEIDELBERG, GERMANY) 2025; 11:78. [PMID: 40399418 PMCID: PMC12095529 DOI: 10.1038/s41537-025-00610-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 09/16/2024] [Accepted: 02/11/2025] [Indexed: 05/23/2025]
Abstract
Advances in proteomic assay methodologies and genomics have significantly improved our understanding of the blood proteome. Schizophrenia and psychosis risk are linked to polygenic scores for schizophrenia and other mental disorders, as well as to altered blood and saliva levels of biomarkers involved in hormonal signaling, redox balance, and chronic systemic inflammation. The Accelerating Medicines Partnership® Schizophrenia (AMP®SCZ) aims to ascertain biomarkers that both predict clinical outcomes and provide insights into the biological processes driving clinical outcomes in persons meeting CHR criteria. AMP®SCZ will follow almost 2000 CHR and 640 community study participants for two years, assessing biomarkers at baseline and two-month follow-up including the collection of blood and saliva samples. The following provides the rationale and methods for plans to utilize polygenic risk scores for schizophrenia and other disorders, salivary cortisol levels, and a discovery-based proteomic platform for plasma analyses. We also provide details about the standardized methods used to collect and store these biological samples, as well as the study participant metadata and quality control measures related to preanalytical factors that could influence the values of the biomarkers. Finally, we discuss our plans for analyzing the results of blood- and saliva-based biomarkers. Watch Dr. Perkins discuss their work and this article: https://vimeo.com/1062879582?share=copy#t=0 .
Collapse
Affiliation(s)
- Diana O Perkins
- Department of Psychiatry, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA.
| | - Clark D Jeffries
- Rennaisance Computing Institute, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Scott R Clark
- Discipline of Psychiatry, University of Adelaide, Adelaide, SA, Australia
- Basil Hetzel Institute, Woodville, SA, Australia
| | - Rachel Upthegrove
- Institute for Mental Health, University of Birmingham, Birmingham, UK
- Birmingham Womens and Childrens, NHS Foundation Trust, Birmingham, UK
| | - Cassandra M J Wannan
- Centre for Youth Mental Health, The University of Melbourne, Parkville, VIC, Australia
- Orygen, Parkville, VIC, Australia
| | - Naomi R Wray
- Department of Psychiatry, University of Oxford, Oxford, UK
- Institute for Molecular Biosciences, University of Queensland, Queensland, Australia
| | - Qingqin S Li
- JRD Data Science, Janssen Research & Development, LLC, Titusville, NJ, USA
| | - Kim Q Do
- Department of Psychiatry, Center for Psychiatric Neuroscience, Lausanne University Hospital and University of Lausanne, Lausanne, Switzerland
- Department of Psychological Medicine, Institute of Psychiatry, Psychology and Neuroscience King's College London, London, UK
| | - Elaine Walker
- Departments of Psychology and Psychiatry, Emory University, Atlanta, GA, United States of America
| | - G Paul Amminger
- Centre for Youth Mental Health, The University of Melbourne, Parkville, VIC, Australia
- Orygen, Parkville, VIC, Australia
| | - Alan Anticevic
- Department of Psychiatry, Yale University School of Medicine, New Haven, CT, USA
| | - David Cotter
- Department of Psychiatry, Royal College of Surgeons in Ireland, Dublin, Ireland
- School of Computing and Information Systems, The University of Melbourne, Parkville, VIC, Australia
| | - Lauren M Ellman
- Department of Psychology and Neuroscience, Temple University, Philadelphia, PA, USA
| | - David Mongan
- Department of Psychiatry, Royal College of Surgeons in Ireland, Dublin, Ireland
- Centre for Public Health, Queen's University Belfast, Belfast, United Kingdom
| | - Christina Phassouliotis
- Centre for Youth Mental Health, The University of Melbourne, Parkville, VIC, Australia
- Orygen, Parkville, VIC, Australia
| | - Jenna Barbee
- Department of Psychiatry, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Sharin Roth
- Genomics and Biomarker Research, Otsuka Pharmaceutical Development & Commercialization, Inc, Rockville, MD, USA
| | - Tashrif Billah
- Department of Psychiatry, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
| | - Cheryl Corcoran
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Monica E Calkins
- Department of Psychiatry, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Felecia Cerrato
- Stanley Center for Psychiatric Research and Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Ines Khadimallah
- Department of Psychiatry, Center for Psychiatric Neuroscience, Lausanne University Hospital and University of Lausanne, Lausanne, Switzerland
- Department of Psychological Medicine, Institute of Psychiatry, Psychology and Neuroscience King's College London, London, UK
| | - Paul Klauser
- Department of Psychiatry, Center for Psychiatric Neuroscience, Lausanne University Hospital and University of Lausanne, Lausanne, Switzerland
- Department of Psychiatry, Service of Child and Adolescent Psychiatry, Lausanne University Hospital and University of Lausanne, Lausanne, Switzerland
| | | | - Angela R Nunez
- Department of Psychiatry, Yale University School of Medicine, New Haven, CT, USA
- Connecticut Mental Health Center, New Haven, CT, USA
| | - Rachel S Bleggi
- Department of Psychiatry, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Alicia R Martin
- Stanley Center for Psychiatric Research and Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
| | - Sylvain Bouix
- Department of Psychiatry, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Department of Radiology, Brigham and Women's Hospital, Boston, MA, USA
- Department of Psychiatry, MGB, Massachusetts General Hospital, Boston, MA, USA
| | - Ofer Pasternak
- Department of Psychiatry, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Department of Radiology, Brigham and Women's Hospital, Boston, MA, USA
- Department of Psychiatry, MGB, Massachusetts General Hospital, Boston, MA, USA
| | - Jai L Shah
- Douglas Research Centre, Montreal, Quebec, Canada
- Department of Psychiatry, McGill University, Montreal, Quebec, Canada
| | - Catherine Toben
- Discipline of Psychiatry, University of Adelaide, Adelaide, SA, Australia
| | - Daniel H Wolf
- Department of Psychiatry, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | | | - Rene S Kahn
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - John M Kane
- Department of Psychiatry, Donald and Barbara Zucker School of Medicine, Hempstead, N.Y, USA
- Institute for Behavioral Science, Feinstein Institute for Medical Research, Manhasset, NY, USA
| | - Patrick D McGorry
- Centre for Youth Mental Health, The University of Melbourne, Parkville, VIC, Australia
- Orygen, Parkville, VIC, Australia
| | - Carrie E Bearden
- Department of Psychology, University of California, Los Angeles, Los Angeles, CA, USA
- Department of Psychiatry and Biobehavioral Sciences, Semel Institute for Neuroscience and Human Behavior, University of California, Los Angeles, Los Angeles, CA, USA
| | - Barnaby Nelson
- Centre for Youth Mental Health, The University of Melbourne, Parkville, VIC, Australia
- Orygen, Parkville, VIC, Australia
| | - Martha E Shenton
- Department of Psychiatry, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Department of Radiology, Brigham and Women's Hospital, Boston, MA, USA
- Department of Psychiatry, MGB, Massachusetts General Hospital, Boston, MA, USA
| | - Scott W Woods
- Department of Psychiatry, Yale University School of Medicine, New Haven, CT, USA
- Connecticut Mental Health Center, New Haven, CT, USA
| |
Collapse
|
3
|
Zheng W, Ma W, Chen Z, Wang C, Sun T, Dong W, Zhang W, Zhang S, Tang Z, Li K, Zhao Y, Liu Y. DPImpute: A Genotype Imputation Framework for Ultra-Low Coverage Whole-Genome Sequencing and its Application in Genomic Selection. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2025; 12:e2412482. [PMID: 40013759 PMCID: PMC12021046 DOI: 10.1002/advs.202412482] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/07/2024] [Revised: 02/05/2025] [Indexed: 02/28/2025]
Abstract
Whole-genome sequencing is pivotal for elucidating the complex relationships between genotype and phenotype. However, its widespread application is hindered by the high sequencing depth and large sample sizes required, especially for genomic selection (GS) reliant on precise phenotype prediction from high-density genotype data. To address this, DPImpute (Dual-Phase Impute) is developed, an two-step imputation pipeline enabling accurate whole-genome SNP genotyping under ultra-low coverage whole-genome sequencing (ulcWGS) depths, small testing sample sizes, and limited reference populations. DPImpute achieved 98.06% SNP imputation accuracy with minimal testing samples (≤10), reference samples (≤100), and an ultra-low sequencing depth of 0.3X, surpassing the accuracy of existing imputation methods. Moreover, this high accuracy is maintained across multi-ancestry human populations. Remarkably, DPImpute demonstrated accurate SNP imputation from low-coverage sequencing data from single blood cells and single blastocyst cells, highlighting its potential in embryo GS. To enhance the accessibility of DPImpute, a user-friendly web server (https://agdb.ecenr.com/DPImpute/home) is developed and a Docker container for seamless implementation. In summary, DPImpute can significantly expedite breeding programs through precise and cost-effective genotyping and serve as a valuable tool for diverse population genotyping, encompassing both human and animal studies.
Collapse
Affiliation(s)
- Weigang Zheng
- Key Laboratory of Agricultural Animal GeneticsBreeding and Reproduction of Ministry of Education & Key Lab of Swine Genetics and Breeding of Ministry of Agriculture and Rural AffairsCollege of Animal Science and TechnologyHuazhong Agricultural UniversityWuhan430070China
- Shenzhen BranchGuangdong Laboratory for Lingnan Modern AgricultureKey Laboratory of Livestock and Poultry Multi‐Omics of MARAAgricultural Genomics Institute at ShenzhenChinese Academy of Agricultural SciencesShenzhen518124China
- Innovation Group of Pig Genome Design and BreedingResearch Centre for Animal GenomeAgricultural Genomics Institute at ShenzhenChinese Academy of Agricultural SciencesShenzhen518124China
| | - Wenlong Ma
- Shenzhen BranchGuangdong Laboratory for Lingnan Modern AgricultureKey Laboratory of Livestock and Poultry Multi‐Omics of MARAAgricultural Genomics Institute at ShenzhenChinese Academy of Agricultural SciencesShenzhen518124China
- Innovation Group of Pig Genome Design and BreedingResearch Centre for Animal GenomeAgricultural Genomics Institute at ShenzhenChinese Academy of Agricultural SciencesShenzhen518124China
| | - Zhilong Chen
- Shenzhen BranchGuangdong Laboratory for Lingnan Modern AgricultureKey Laboratory of Livestock and Poultry Multi‐Omics of MARAAgricultural Genomics Institute at ShenzhenChinese Academy of Agricultural SciencesShenzhen518124China
- Innovation Group of Pig Genome Design and BreedingResearch Centre for Animal GenomeAgricultural Genomics Institute at ShenzhenChinese Academy of Agricultural SciencesShenzhen518124China
| | - Chao Wang
- Key Laboratory of Agricultural Animal GeneticsBreeding and Reproduction of Ministry of Education & Key Lab of Swine Genetics and Breeding of Ministry of Agriculture and Rural AffairsCollege of Animal Science and TechnologyHuazhong Agricultural UniversityWuhan430070China
- Shenzhen BranchGuangdong Laboratory for Lingnan Modern AgricultureKey Laboratory of Livestock and Poultry Multi‐Omics of MARAAgricultural Genomics Institute at ShenzhenChinese Academy of Agricultural SciencesShenzhen518124China
- Innovation Group of Pig Genome Design and BreedingResearch Centre for Animal GenomeAgricultural Genomics Institute at ShenzhenChinese Academy of Agricultural SciencesShenzhen518124China
| | - Tao Sun
- Shenzhen BranchGuangdong Laboratory for Lingnan Modern AgricultureKey Laboratory of Livestock and Poultry Multi‐Omics of MARAAgricultural Genomics Institute at ShenzhenChinese Academy of Agricultural SciencesShenzhen518124China
- Innovation Group of Pig Genome Design and BreedingResearch Centre for Animal GenomeAgricultural Genomics Institute at ShenzhenChinese Academy of Agricultural SciencesShenzhen518124China
| | - Wenjun Dong
- Key Laboratory of Agricultural Animal GeneticsBreeding and Reproduction of Ministry of Education & Key Lab of Swine Genetics and Breeding of Ministry of Agriculture and Rural AffairsCollege of Animal Science and TechnologyHuazhong Agricultural UniversityWuhan430070China
- Shenzhen BranchGuangdong Laboratory for Lingnan Modern AgricultureKey Laboratory of Livestock and Poultry Multi‐Omics of MARAAgricultural Genomics Institute at ShenzhenChinese Academy of Agricultural SciencesShenzhen518124China
- Innovation Group of Pig Genome Design and BreedingResearch Centre for Animal GenomeAgricultural Genomics Institute at ShenzhenChinese Academy of Agricultural SciencesShenzhen518124China
| | - Wenjing Zhang
- State Key Laboratory of Swine and Poultry Breeding IndustryNational Engineering Research Center for Breeding Swine IndustryGuangdong Provincial Key Lab of Agro‐Animal Genomics and Molecular BreedingCollege of Animal ScienceSouth China Agricultural UniversityGuangzhou510642China
| | - Song Zhang
- Shenzhen BranchGuangdong Laboratory for Lingnan Modern AgricultureKey Laboratory of Livestock and Poultry Multi‐Omics of MARAAgricultural Genomics Institute at ShenzhenChinese Academy of Agricultural SciencesShenzhen518124China
- Innovation Group of Pig Genome Design and BreedingResearch Centre for Animal GenomeAgricultural Genomics Institute at ShenzhenChinese Academy of Agricultural SciencesShenzhen518124China
| | - Zhonglin Tang
- Shenzhen BranchGuangdong Laboratory for Lingnan Modern AgricultureKey Laboratory of Livestock and Poultry Multi‐Omics of MARAAgricultural Genomics Institute at ShenzhenChinese Academy of Agricultural SciencesShenzhen518124China
- Innovation Group of Pig Genome Design and BreedingResearch Centre for Animal GenomeAgricultural Genomics Institute at ShenzhenChinese Academy of Agricultural SciencesShenzhen518124China
- Kunpeng Institute of Modern Agriculture at FoshanChinese Academy of Agricultural SciencesFoshan528226China
| | - Kui Li
- Shenzhen BranchGuangdong Laboratory for Lingnan Modern AgricultureKey Laboratory of Livestock and Poultry Multi‐Omics of MARAAgricultural Genomics Institute at ShenzhenChinese Academy of Agricultural SciencesShenzhen518124China
- Innovation Group of Pig Genome Design and BreedingResearch Centre for Animal GenomeAgricultural Genomics Institute at ShenzhenChinese Academy of Agricultural SciencesShenzhen518124China
| | - Yunxiang Zhao
- Guangxi Key Laboratory of Animal Breeding, Disease Control and Prevention, College of Animal Science and TechnologyGuangxi UniversityNanning530004China
| | - Yuwen Liu
- Key Laboratory of Agricultural Animal GeneticsBreeding and Reproduction of Ministry of Education & Key Lab of Swine Genetics and Breeding of Ministry of Agriculture and Rural AffairsCollege of Animal Science and TechnologyHuazhong Agricultural UniversityWuhan430070China
- Shenzhen BranchGuangdong Laboratory for Lingnan Modern AgricultureKey Laboratory of Livestock and Poultry Multi‐Omics of MARAAgricultural Genomics Institute at ShenzhenChinese Academy of Agricultural SciencesShenzhen518124China
- Innovation Group of Pig Genome Design and BreedingResearch Centre for Animal GenomeAgricultural Genomics Institute at ShenzhenChinese Academy of Agricultural SciencesShenzhen518124China
- Kunpeng Institute of Modern Agriculture at FoshanChinese Academy of Agricultural SciencesFoshan528226China
| |
Collapse
|
4
|
Lin YS, Tan T, Wang Y, Pasaniuc B, Martin AR, Atkinson EG. Differential performance of polygenic prediction across traits and populations depending on genotype discovery approach. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.03.18.644029. [PMID: 40166153 PMCID: PMC11957064 DOI: 10.1101/2025.03.18.644029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/02/2025]
Abstract
Polygenic scores (PGS) are widely used for estimating genetic predisposition to complex traits by aggregating the effects of common variants into a single measure. They hold promise in identifying individuals at increased risk for diseases, allowing earlier screening and interventions. Genotyping arrays, commonly used for PGS computation, are affordable and computationally efficient, while whole-genome sequencing (WGS) offers a comprehensive view of genetic variation. Using the same set of individuals, we compared PGS derived from arrays and WGS across multiple traits to evaluate differences in predictive performance, portability across populations, and computational efficiency. We computed PGS for 10 traits across the spectrum of heritability and polygenicity in the three largest genetic ancestry groups in All of Us (European, African American, Admixed American), trained on the multi-ancestry meta-analyses from the Pan-UK Biobank. Using the clumping and thresholding (C+T) method, we found that WGS-based PGS outperformed array-based PRS for highly polygenic traits but showed differentially reduced accuracy for sparse traits in certain populations. This may be attributable to the lower allele frequency observed in clumped variants from WGS compared to arrays. Using the LD-informed PRS-CS method, we observed overall improved prediction performance compared to C+T, with WGS outperforming arrays across most non-cancer traits. In conclusion, while PGS computed using WGS generally provide superior predictive power with PRS-CS, the advantage over arrays is context-dependent, varying by trait, population, and the PGS method. This study provides insights into the complexities and potential advantages of using different genotype discovery approach for polygenic predictions in diverse populations. Graphical abstract
Collapse
|
5
|
Corey V, Chavez M, Qasim L, Dincer TU, Henry A, Bagayan S, Treadup S, Mehan M, de Feo E, Kim S. Calculating maternal polygenic risk scores from prenatal screening by cell-free DNA data. Front Genet 2025; 16:1495604. [PMID: 40051705 PMCID: PMC11882851 DOI: 10.3389/fgene.2025.1495604] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2024] [Accepted: 01/27/2025] [Indexed: 03/09/2025] Open
Abstract
Polygenic Risk Scores (PRS) have enabled quantification of genetic risk for many common and complex traits. Here we developed a novel method to estimate maternal PRS using low-coverage whole genome sequencing data from prenatal screening by cell-free DNA data intended to screen for fetal chromosomal aneuploidies. A prospective study was conducted where 455 consented patients that performed prenatal screening by cell-free DNA as part of their standard of care were randomly selected. Cell-free DNA and genomic DNA were isolated from the plasma and buffy coat of the blood drawn from pregnant women, respectively. Cell-free DNA was sequenced at ∼0.25x coverage while genomic DNA was sequenced at ∼15x coverage. The sequence data was used to impute genotypes which were then used to calculate PRS for paired comparisons. There was a high correlation (average = ∼0.9 across different PRS panels and panel sizes) between PRS from prenatal screening by cfDNA data and PRS from genome sequence data of the buffy coat. This proof-of-concept study illustrates that maternal PRS can be calculated using low-coverage prenatal screening by cfDNA sequence data with high accuracy.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | - Sung Kim
- Illumina, Inc., San Diego, CA, United States
| |
Collapse
|
6
|
Zavala EI, Rohlfs RV, Moorjani P. Benchmarking for genotyping and imputation using degraded DNA for forensic applications across diverse populations. Forensic Sci Int Genet 2025; 75:103177. [PMID: 39586186 PMCID: PMC12056732 DOI: 10.1016/j.fsigen.2024.103177] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2024] [Revised: 11/14/2024] [Accepted: 11/14/2024] [Indexed: 11/27/2024]
Abstract
Advancements in sequencing and laboratory technologies have enabled forensic genetic analysis on increasingly low quality and degraded DNA samples. However, existing computational methods applied to genotyping and imputation for generating DNA profiles from degraded DNA have not been tested for forensic applications. Here we simulated sequencing data of varying qualities-coverage, fragment lengths, and deamination patterns-from forty individuals of diverse genetic ancestries. We used this dataset to test the performance of commonly used genotype and imputation methods (SAMtools, GATK, ATLAS, Beagle, and GLIMPSE) on five different SNP panels (MPS-plex, FORCE, two extended kinship panels, and the Human Origins array) that are used for forensic and population genetics applications. For genome mapping and variant calling with degraded DNA, we find use of parameters and methods (such as ATLAS) developed for ancient DNA analysis provides a marked improvement over conventional standards used for next generation sequencing analysis. We find that ATLAS outperforms GATK and SAMtools, achieving over 90 % genotyping accuracy for the four largest SNP panels with coverages greater than 10X. For lower coverages, decreased concordance rates are correlated with increased rates of heterozygosity. Genotype refinement and imputation improve the accuracy at lower coverages by leveraging population reference data. For all five SNP panels, we find that using a population reference panel representative of worldwide populations (e.g., the 1000 Genomes Project) results in increased genotype accuracies across genetic ancestries, compared to ancestry-matched population reference panels. Importantly, we find that the low SNP density of commonly used forensics SNP panels can impact the reliability and performance of genotype refinement and imputation. This highlights a critical trade-off between enhancing privacy by using panels with fewer SNPs and maintaining the effectiveness of genomic tools. We provide benchmarks and recommendations for analyzing degraded DNA from diverse populations with widely used genomic methods in forensic casework.
Collapse
Affiliation(s)
- Elena I Zavala
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, United States.
| | - Rori V Rohlfs
- Department of Data Science, Institute of Ecology and Evolution, University of Oregon, Eugene, OR, United States.
| | - Priya Moorjani
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, United States; Center for Computational Biology, University of California, Berkeley, Berkeley, CA, United States.
| |
Collapse
|
7
|
Yao X, Li J, Fu J, Wang X, Ma L, Nanaei HA, Shah AM, Zhang Z, Bian P, Zhou S, Wang A, Wang X, Jiang Y. Genomic Landscape and Prediction of Udder Traits in Saanen Dairy Goats. Animals (Basel) 2025; 15:261. [PMID: 39858261 PMCID: PMC11759135 DOI: 10.3390/ani15020261] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2024] [Revised: 01/05/2025] [Accepted: 01/07/2025] [Indexed: 01/27/2025] Open
Abstract
Goats are essential to the dairy industry in Shaanxi, China, with udder traits playing a critical role in determining milk production and economic value for breeding programs. However, the direct measurement of these traits in dairy goats is challenging and resource-intensive. This study leveraged genotyping imputation to explore the genetic parameters and architecture of udder traits and assess the efficiency of genomic prediction methods. Using data from 635 Saanen dairy goats, genotyped for over 14,717,075 SNP markers and phenotyped for three udder traits, heritability was 0.16 for udder width, 0.32 for udder depth, and 0.13 for teat spacing, with genetic correlations of 0.79, 0.70, and 0.45 observed among the traits. Genome-wide association studies (GWAS) revealed four candidate genes with selection signatures linked to udder traits. Predictive models, including GBLUP, kernel ridge regression (KRR), and Adaboost.RT, were evaluated for genomic estimated breeding value (GEBV) prediction. Machine learning models (KRR and Adaboost.RT) outperformed GBLUP by 20% and 11% in predictive accuracy, showing superior stability and reliability. These results underscore the potential of machine learning approaches to enhance genomic prediction accuracy in dairy goats, providing valuable insights that could contribute to improvements in animal health, productivity, and economic outcomes within the dairy goat industry.
Collapse
Affiliation(s)
- Xiaoting Yao
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling 712100, China; (X.Y.); (J.L.); (J.F.); (X.W.); (L.M.); (A.M.S.); (Z.Z.); (P.B.); (S.Z.); (A.W.)
| | - Jiaxin Li
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling 712100, China; (X.Y.); (J.L.); (J.F.); (X.W.); (L.M.); (A.M.S.); (Z.Z.); (P.B.); (S.Z.); (A.W.)
| | - Jiaqi Fu
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling 712100, China; (X.Y.); (J.L.); (J.F.); (X.W.); (L.M.); (A.M.S.); (Z.Z.); (P.B.); (S.Z.); (A.W.)
| | - Xingquan Wang
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling 712100, China; (X.Y.); (J.L.); (J.F.); (X.W.); (L.M.); (A.M.S.); (Z.Z.); (P.B.); (S.Z.); (A.W.)
| | - Longgang Ma
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling 712100, China; (X.Y.); (J.L.); (J.F.); (X.W.); (L.M.); (A.M.S.); (Z.Z.); (P.B.); (S.Z.); (A.W.)
| | - Hojjat Asadollahpour Nanaei
- Animal Science Research Department, Fars Agricultural and Natural Resources Research and Education Center, Agricultural Research, Education and Extension Organization (AREEO), Shiraz 7155863511, Iran;
| | - Ali Mujtaba Shah
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling 712100, China; (X.Y.); (J.L.); (J.F.); (X.W.); (L.M.); (A.M.S.); (Z.Z.); (P.B.); (S.Z.); (A.W.)
| | - Zhuangbiao Zhang
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling 712100, China; (X.Y.); (J.L.); (J.F.); (X.W.); (L.M.); (A.M.S.); (Z.Z.); (P.B.); (S.Z.); (A.W.)
| | - Peipei Bian
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling 712100, China; (X.Y.); (J.L.); (J.F.); (X.W.); (L.M.); (A.M.S.); (Z.Z.); (P.B.); (S.Z.); (A.W.)
| | - Shishuo Zhou
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling 712100, China; (X.Y.); (J.L.); (J.F.); (X.W.); (L.M.); (A.M.S.); (Z.Z.); (P.B.); (S.Z.); (A.W.)
| | - Ao Wang
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling 712100, China; (X.Y.); (J.L.); (J.F.); (X.W.); (L.M.); (A.M.S.); (Z.Z.); (P.B.); (S.Z.); (A.W.)
| | - Xihong Wang
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling 712100, China; (X.Y.); (J.L.); (J.F.); (X.W.); (L.M.); (A.M.S.); (Z.Z.); (P.B.); (S.Z.); (A.W.)
| | - Yu Jiang
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling 712100, China; (X.Y.); (J.L.); (J.F.); (X.W.); (L.M.); (A.M.S.); (Z.Z.); (P.B.); (S.Z.); (A.W.)
| |
Collapse
|
8
|
Fonseca EM, Tran LN, Mendoza H, Gutenkunst RN. Modeling Biases from Low-Pass Genome Sequencing to Enable Accurate Population Genetic Inferences. Mol Biol Evol 2025; 42:msaf002. [PMID: 39847470 PMCID: PMC11756381 DOI: 10.1093/molbev/msaf002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2024] [Revised: 01/06/2025] [Accepted: 01/07/2025] [Indexed: 01/25/2025] Open
Abstract
Low-pass genome sequencing is cost-effective and enables analysis of large cohorts. However, it introduces biases by reducing heterozygous genotypes and low-frequency alleles, impacting subsequent analyses such as model-based demographic history inference. Several approaches exist for inferring an unbiased allele frequency spectrum (AFS) from low-pass data, but they can introduce spurious noise into the AFS. Rather than correcting the AFS, here, we developed an approach that incorporates low-pass biases into the demographic modeling and directly analyzes the AFS from low-pass data. Our probabilistic model captures biases from the Genome Analysis Toolkit multisample calling pipeline, and we implemented it in the population genomic inference software dadi. We evaluated the model using simulated low-pass datasets and found that it alleviated low-pass biases in inferred demographic parameters. We further validated the model by downsampling 1000 Genomes Project data, demonstrating its effectiveness on real data. Our model is widely applicable and substantially improves model-based inferences from low-pass population genomic data.
Collapse
Affiliation(s)
- Emanuel M Fonseca
- Department of Molecular and Cellular Biology, University of Arizona, Tucson, AZ 85721, USA
| | - Linh N Tran
- Department of Molecular and Cellular Biology, University of Arizona, Tucson, AZ 85721, USA
| | - Hannah Mendoza
- Department of Molecular and Cellular Biology, University of Arizona, Tucson, AZ 85721, USA
| | - Ryan N Gutenkunst
- Department of Molecular and Cellular Biology, University of Arizona, Tucson, AZ 85721, USA
| |
Collapse
|
9
|
Han Y, Mwesigwa S, Wu Q, Laska MN, Jilcott Pitts SB, Moran NE, Hanchard NA. Common and rare genetic variation intersects with ancestry to influence human skin and plasma carotenoid concentrations. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.12.20.24319465. [PMID: 39763521 PMCID: PMC11703293 DOI: 10.1101/2024.12.20.24319465] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/11/2025]
Abstract
Carotenoids are dietary bioactive compounds with health effects that are biomarkers of fruit and vegetable intake. Here, we examine genetic associations with plasma and skin carotenoid concentrations in two rigorously phenotyped human cohorts (n=317). Analysis of genome-wide SNPs revealed heritability to vary by genetic ancestry (h2=0.08-0.44) with ten SNPs at four loci reaching genome-wide significance (P<5E-08) in multivariate models, including at RAPGEF1 (rs3765544, P=8.86E-10, beta=0.75) with α-carotene, and near IGSF11 (rs80316816, P=6.25E-10, beta=0.74), with cryptoxanthin; these were replicated in the second cohort (n=110). Multiple SNPs near IGSF11 demonstrated genotype-dependent dietary effects on plasma cryptoxanthin. Deep sequencing of 35 candidate genes revealed associations between the PKD1L2-BCO1 locus and plasma β-carotene (Padj=0.04, beta=-1.3 to -0.3), and rare, ancestry-restricted, damaging variants in CETP (rs2303790) and APOA1 (rs756535387) in individuals with high skin carotenoids. Our findings implicate novel loci in carotenoid disposition and indicate the importance of including cohorts of diverse genetic ancestry.
Collapse
Affiliation(s)
- Yixing Han
- Center for Precision Health Research, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD
| | - Savannah Mwesigwa
- Department of Medical Microbiology, College of Health Sciences, Makerere University, Kampala, Uganda
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX
| | - Qiang Wu
- Department of Public Health, East Carolina University, Greenville, NC
| | - Melissa N Laska
- Division of Epidemiology & Community Health, School of Public Health, University of Minnesota, Minneapolis, MN
| | | | - Nancy E Moran
- USDA/ARS Children's Nutrition Research Center, Department of Pediatrics, Baylor College of Medicine, Houston, TX
| | - Neil A Hanchard
- Center for Precision Health Research, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD
- USDA/ARS Children's Nutrition Research Center, Department of Pediatrics, Baylor College of Medicine, Houston, TX
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX
| |
Collapse
|
10
|
Chandra G, Hossen MH, Scholz S, Dilthey AT, Gibney D, Jain C. Integer programming framework for pangenome-based genome inference. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.10.27.620212. [PMID: 39554168 PMCID: PMC11565907 DOI: 10.1101/2024.10.27.620212] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/19/2024]
Abstract
Affordable genotyping methods are essential in genomics. Commonly used genotyping methods primarily support single nucleotide variants and short indels but neglect structural variants. Additionally, accuracy of read alignments to a reference genome is unreliable in highly polymorphic and repetitive regions, further impacting genotyping performance. Recent works highlight the advantage of haplotype-resolved pangenome graphs in addressing these challenges. Building on these developments, we propose a rigorous alignment-free genotyping framework. Our formulation seeks a path through the pangenome graph that maximizes the matches between the path and substrings of sequencing reads (e.g., k-mers) while minimizing recombination events (haplotype switches) along the path. We prove that this problem is NP-Hard and develop efficient integer-programming solutions. We benchmarked the algorithm using downsampled short-read datasets from homozygous human cell lines with coverage ranging from 0.1× to 10×. Our algorithm accurately estimates complete major histocompatibility complex (MHC) haplotype sequences with small edit distances from the ground-truth sequences, providing a significant advantage over existing methods on low-coverage inputs. Although our algorithm is designed for haploid samples, we discuss future extensions to diploid samples.
Collapse
Affiliation(s)
- Ghanshyam Chandra
- Department of Computational and Data Sciences, Indian Institute of Science, Bangalore KA 560012, India
| | - Md Helal Hossen
- Department of Computer Science, The University of Texas at Dallas, TX 75080, USA
| | - Stephan Scholz
- Institute of Medical Microbiology and Hospital Hygiene, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
- Center for Digital Medicine, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
| | - Alexander T Dilthey
- Institute of Medical Microbiology and Hospital Hygiene, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
- Center for Digital Medicine, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
| | - Daniel Gibney
- Department of Computer Science, The University of Texas at Dallas, TX 75080, USA
| | - Chirag Jain
- Department of Computational and Data Sciences, Indian Institute of Science, Bangalore KA 560012, India
| |
Collapse
|
11
|
Hamid I, Raveloson SNS, Spiral GJ, Ravelonjanahary S, Raharivololona BM, Randria JM, Zafimaro M, Randriambola TA, Andriantsoa RM, Andriamahefa TJ, Rafidison BFL, Mughal M, Emde AK, Hendershott M, LeBaron von Baeyer S, Wasik KA, Ranaivoarisoa JF, Yerges-Armstrong L, Castel SE, Rakotoarivony R. Mid-pass whole-genome sequencing in a Malagasy cohort uncovers body composition associations. HGG ADVANCES 2024; 5:100343. [PMID: 39169618 PMCID: PMC11415767 DOI: 10.1016/j.xhgg.2024.100343] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2023] [Revised: 08/17/2024] [Accepted: 08/19/2024] [Indexed: 08/23/2024] Open
Abstract
The majority of human genomic research studies have been conducted in European-ancestry cohorts, reducing the likelihood of detecting potentially novel and globally impactful findings. Here, we present mid-pass whole-genome sequencing data and a genome-wide association study in a cohort of 264 self-reported Malagasy individuals from three locations on the island of Madagascar. We describe genetic variation in this Malagasy cohort, providing insight into the shared and unique patterns of genetic variation across the island. We observe phenotypic variation by location and find high rates of hypertension, particularly in the Southern Highlands sampling site, as well as elevated self-reported malaria prevalence in the West Coast site relative to other sites. After filtering to a subset of 214 minimally related individuals, we find a number of genetic associations with body composition traits, including many variants that are only observed in African populations or populations with admixed African ancestry from the 1000 Genomes Project. This study highlights the importance of including diverse populations in genomic research for the potential to gain novel insights, even with small cohort sizes. This project was conducted in partnership and consultation with local stakeholders in Madagascar and serves as an example of genomic research that prioritizes community engagement and potentially impacts our understanding of human health and disease.
Collapse
Affiliation(s)
- Iman Hamid
- Variant Bio, Inc., Seattle, WA 98109, USA
| | | | - Germain Jules Spiral
- University of Antananarivo, Faculty of Sciences, Mention Anthropobiologie et Développement Durable, Antananarivo 101, Madagascar
| | - Soanorolalao Ravelonjanahary
- University of Antananarivo, Faculty of Sciences, Mention Anthropobiologie et Développement Durable, Antananarivo 101, Madagascar
| | - Brigitte Marie Raharivololona
- University of Antananarivo, Faculty of Sciences, Mention Anthropobiologie et Développement Durable, Antananarivo 101, Madagascar
| | - José Mahenina Randria
- University of Antananarivo, Faculty of Medicine, Ministry of Public Health, Antananarivo 101, Madagascar
| | - Mosa Zafimaro
- University of Antananarivo, Faculty of Medicine, Ministry of Public Health, Antananarivo 101, Madagascar
| | - Tsiorimanitra Aimée Randriambola
- University of Antananarivo, Faculty of Sciences, Mention Anthropobiologie et Développement Durable, Antananarivo 101, Madagascar
| | - Rota Mamimbahiny Andriantsoa
- University of Antananarivo, Faculty of Sciences, Mention Anthropobiologie et Développement Durable, Antananarivo 101, Madagascar
| | - Tojo Julio Andriamahefa
- University of Antananarivo, Faculty of Sciences, Mention Anthropobiologie et Développement Durable, Antananarivo 101, Madagascar
| | | | | | | | | | | | | | - Jean Freddy Ranaivoarisoa
- University of Antananarivo, Faculty of Sciences, Mention Anthropobiologie et Développement Durable, Antananarivo 101, Madagascar
| | | | | | - Rindra Rakotoarivony
- University of Antananarivo, Faculty of Sciences, Mention Anthropobiologie et Développement Durable, Antananarivo 101, Madagascar.
| |
Collapse
|
12
|
Berdnikova AA, Zorkoltseva IV, Tsepilov YA, Elgaeva EE. Genotype imputation in human genomic studies. Vavilovskii Zhurnal Genet Selektsii 2024; 28:628-639. [PMID: 39440308 PMCID: PMC11491486 DOI: 10.18699/vjgb-24-70] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2023] [Revised: 04/23/2024] [Accepted: 07/04/2024] [Indexed: 10/25/2024] Open
Abstract
Imputation is a method that supplies missing information about genetic variants that could not be directly genotyped with DNA microarrays or low-coverage sequencing. Imputation plays a critical role in genome-wide association studies (GWAS). It leads to a significant increase in the number of studied variants, which improves the resolution of the method and enhances the comparability of data obtained in different cohorts and/or by using different technologies, which is important for conducting meta-analyses. When performing imputation, genotype information from the study sample, in which only part of the genetic variants are known, is complemented using the standard (reference) sample, which has more complete genotype data (most often the results of whole-genome sequencing). Imputation has become an integral part of human genomic research due to the benefits it provides and the increasing availability of imputation tools and reference sample data. This review focuses on imputation in human genomic research. The first section of the review provides a description of technologies for obtaining information about human genotypes and characteristics of these types of data. The second section describes the imputation methodology, lists the stages of its implementation and the corresponding programs, provides a description of the most popular reference panels and methods for assessing the quality of imputation. The review concludes with examples of the use of imputation in genomic studies of samples from Russia. This review shows the importance of imputation, provides information on how to carry it out, and systematizes the results of its application using Russian samples.
Collapse
Affiliation(s)
- A A Berdnikova
- Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia Novosibirsk State University, Novosibirsk, Russia
| | - I V Zorkoltseva
- Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| | - Y A Tsepilov
- Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| | - E E Elgaeva
- Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia Novosibirsk State University, Novosibirsk, Russia
| |
Collapse
|
13
|
Boltz TA, Chu BB, Liao C, Sealock JM, Ye R, Majara L, Fu JM, Service S, Zhan L, Medland SE, Chapman SB, Rubinacci S, DeFelice M, Grimsby JL, Abebe T, Alemayehu M, Ashaba FK, Atkinson EG, Bigdeli T, Bradway AB, Brand H, Chibnik LB, Fekadu A, Gatzen M, Gelaye B, Gichuru S, Gildea ML, Hill TC, Huang H, Hubbard KM, Injera WE, James R, Joloba M, Kachulis C, Kalmbach PR, Kamulegeya R, Kigen G, Kim S, Koen N, Kwobah EK, Kyebuzibwa J, Lee S, Lennon NJ, Lind PA, Lopera-Maya EA, Makale J, Mangul S, McMahon J, Mowlem P, Musinguzi H, Mwema RM, Nakasujja N, Newman CP, Nkambule LL, O'Neil CR, Olivares AM, Olsen CM, Ongeri L, Parsa SJ, Pretorius A, Ramesar R, Reagan FL, Sabatti C, Schneider JA, Shiferaw W, Stevenson A, Stricker E, Stroud RE, Tang J, Whiteman D, Yohannes MT, Yu M, Yuan K, Akena D, Atwoli L, Kariuki SM, Koenen KC, Newton CRJC, Stein DJ, Teferra S, Zingela Z, Pato CN, Pato MT, Lopez-Jaramillo C, Freimer N, Ophoff RA, Olde Loohuis LM, Talkowski ME, Neale BM, Howrigan DP, Martin AR. A blended genome and exome sequencing method captures genetic variation in an unbiased, high-quality, and cost-effective manner. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.09.06.611689. [PMID: 39282356 PMCID: PMC11398523 DOI: 10.1101/2024.09.06.611689] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 09/21/2024]
Abstract
We deployed the Blended Genome Exome (BGE), a DNA library blending approach that generates low pass whole genome (1-4× mean depth) and deep whole exome (30-40× mean depth) data in a single sequencing run. This technology is cost-effective, empowers most genomic discoveries possible with deep whole genome sequencing, and provides an unbiased method to capture the diversity of common SNP variation across the globe. To evaluate this new technology at scale, we applied BGE to sequence >53,000 samples from the Populations Underrepresented in Mental Illness Associations Studies (PUMAS) Project, which included participants across African, African American, and Latin American populations. We evaluated the accuracy of BGE imputed genotypes against raw genotype calls from the Illumina Global Screening Array. All PUMAS cohorts hadR 2 concordance ≥95% among SNPs with MAF≥1%, and never fell below ≥90%R 2 for SNPs with MAF<1%. Furthermore, concordance rates among local ancestries within two recently admixed cohorts were consistent among SNPs with MAF≥1%, with only minor deviations in SNPs with MAF<1%. We also benchmarked the discovery capacity of BGE to access protein-coding copy number variants (CNVs) against deep whole genome data, finding that deletions and duplications spanning at least 3 exons had a positive predicted value of ~90%. Our results demonstrate BGE scalability and efficacy in capturing SNPs, indels, and CNVs in the human genome at 28% of the cost of deep whole-genome sequencing. BGE is poised to enhance access to genomic testing and empower genomic discoveries, particularly in underrepresented populations.
Collapse
Affiliation(s)
- Toni A Boltz
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital, Boston, MA, USA
| | - Benjamin B Chu
- Department of Psychiatry, School of Medicine, College of Health Sciences, Addis Ababa University, Addis Ababa, Ethiopia
| | - Calwing Liao
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Julia M Sealock
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Robert Ye
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Lerato Majara
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital, Boston, MA, USA
- Department of Psychiatry and Mental Health and South African Medical Council Research Unit on Risk and Resilience in Mental Disorders, Neuroscience Institute, University of Cape Town and Groote Schuur Hospital, Cape Town, South Africa
| | - Jack M Fu
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
| | - Susan Service
- Center for Neurobehavioral Genetics, Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, CA, USA
| | - Lingyu Zhan
- Department of Psychiatry, Center for Neurobehavioral Genetics, Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, CA, USA
- The Collaboratory, Institute for Quantitative and Computational Biosciences, University of California Los Angeles, Los Angeles, CA, USA
| | - Sarah E Medland
- QIMR Berghofer Medical Research Institute, Brisbane, QLD, Australia
| | - Sinéad B Chapman
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- BD2: Breakthrough Discoveries for thriving with Bipolar Disorder, Santa Monica, CA, USA
| | - Simone Rubinacci
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Division of Genetics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School,, Boston, MA, USA
- Center for Data Sciences, Brigham and Women's Hospital and Harvard Medical School,, Boston, MA, USA
| | - Matthew DeFelice
- Broad Clinical Labs (BCL), Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Jonna L Grimsby
- Broad Clinical Labs (BCL), Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Tamrat Abebe
- Department of Microbiology, Immunology, and Parasitology, School of Medicine, College of Health Sciences, Addis Ababa University, Addis Ababa, Ethiopia
| | - Melkam Alemayehu
- Department of Psychiatry, School of Medicine, College of Health Sciences, Addis Ababa University, Addis Ababa, Ethiopia
| | - Fred K Ashaba
- Department of Immunology & Molecular Biology, College of Health Sciences, Makerere University, Kampala, Uganda
| | - Elizabeth G Atkinson
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
- Dan Duncan Neurological Research Institute, Texas Children's Hospital, Houston, TX, USA
| | - Tim Bigdeli
- Institute for Genomics in Health, The State University of New York, Brooklyn, NY, USA
| | - Amanda B Bradway
- Broad Clinical Labs (BCL), Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Harrison Brand
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
| | - Lori B Chibnik
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Neurology, Massachusetts General Hospital, Boston, MA, USA
- Department of Epidemiology, Harvard T. H. Chan School of Public Health, Boston, MA, USA
| | - Abebaw Fekadu
- Department of Psychiatry, School of Medicine, College of Health Sciences, Addis Ababa University, Addis Ababa, Ethiopia
- Centre for Innovative Drug Development & Therapeutic Trials for Africa, Addis Ababa University, Addis Ababa, Ethiopia
| | - Michael Gatzen
- Broad Clinical Labs (BCL), Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Bizu Gelaye
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Epidemiology, Harvard T. H. Chan School of Public Health, Boston, MA, USA
- Department of Psychiatry, Harvard Medical School and The Chester M. Pierce MD, Division of Global Psychiatry, Massachusetts General Hospital, Boston, MA, USA
| | - Stella Gichuru
- Department of Mental Health, Moi Teaching and Referral Hospital, Eldoret, Kenya
| | - Marissa L Gildea
- Broad Clinical Labs (BCL), Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Toni C Hill
- Broad Clinical Labs (BCL), Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Hailiang Huang
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- BD2: Breakthrough Discoveries for thriving with Bipolar Disorder, Santa Monica, CA, USA
| | - Kalyn M Hubbard
- Broad Clinical Labs (BCL), Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Wilfred E Injera
- Department of Medical Laboratory Sciences, School of Health Sciences, Alupe University, Busia, Kenya
| | - Roxanne James
- Department of Psychiatry and Mental Health, University of Cape Town, Cape Town, South Africa
| | - Moses Joloba
- School of Biomedical Sciences, School of Medicine, College of Health Sciences, Makerere University, Kampala, Uganda
| | - Christopher Kachulis
- Broad Clinical Labs (BCL), Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Phillip R Kalmbach
- Broad Clinical Labs (BCL), Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Rogers Kamulegeya
- School of Biomedical Sciences, School of Medicine, College of Health Sciences, Makerere University, Kampala, Uganda
| | - Gabriel Kigen
- Department of Pharmacology and Toxicology, Moi University School of Medicine, Eldoret, Kenya
| | - Soyeon Kim
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Nastassja Koen
- Department of Psychiatry and Mental Health, University of Cape Town, Cape Town, South Africa
- SA MRC Unit on Risk & Resilience in Mental Disorders, University of Cape Town and Neuroscience Institute, Cape Town, South Africa
| | - Edith K Kwobah
- Department of Mental Health, Moi Teaching and Referral Hospital, Eldoret, Kenya
| | - Joseph Kyebuzibwa
- Department of Psychiatry, School of Medicine, College of Health Sciences, Makerere University, Kampala, Uganda
| | - Seungmo Lee
- Department of Computer Science, University of California Los Angeles, Los Angeles, CA, USA
| | - Niall J Lennon
- Broad Clinical Labs (BCL), Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Penelope A Lind
- QIMR Berghofer Medical Research Institute, Brisbane, QLD, Australia
| | - Esteban A Lopera-Maya
- Center for Neurobehavioral Genetics, Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, CA, USA
- Department of Human Genetics, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA, USA
| | - Johnstone Makale
- Epidemiology and Demography Department, KEMRI-Wellcome Trust Research Programme-Coast, Kilifi, Kenya
| | - Serghei Mangul
- Department of Quantitative and Computational Biology, Dana and David Dornsife College of Letters, Arts and Sciences, University of Southern California, Los Angeles, CA, USA
- Department of Clinical Pharmacy, Alfred E. Mann School of Pharmacy, University of Southern California, Los Angeles, CA, USA
| | - Justin McMahon
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Epidemiology, Harvard T. H. Chan School of Public Health, Boston, MA, USA
| | - Pierre Mowlem
- Ampath Laboratories, Moi University School of Medicine, Eldoret, Kenya
| | - Henry Musinguzi
- Department of Immunology & Molecular Biology, College of Health Sciences, Makerere University, Kampala, Uganda
| | - Rehema M Mwema
- Neurosciences Unit, Clinical Department, KEMRI-Wellcome Trust Research Programme-Coast,, Kilifi, Kenya
| | - Noeline Nakasujja
- Department of Psychiatry, School of Medicine, College of Health Sciences, Makerere University, Kampala, Uganda
| | - Carter P Newman
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Epidemiology, Harvard T. H. Chan School of Public Health, Boston, MA, USA
| | - Lethukuthula L Nkambule
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital, Boston, MA, USA
| | - Conor R O'Neil
- Broad Clinical Labs (BCL), Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Ana Maria Olivares
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | | | - Linnet Ongeri
- Centre for Clinical Research, Kenya Medical Research Institute, Nairobi, Kenya
| | - Sophie J Parsa
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital, Boston, MA, USA
| | - Adele Pretorius
- Department of Psychiatry and Mental Health, University of Cape Town, Cape Town, South Africa
| | - Raj Ramesar
- SA MRC Unit on Risk & Resilience in Mental Disorders, University of Cape Town and Neuroscience Institute, Cape Town, South Africa
| | - Faye L Reagan
- Broad Clinical Labs (BCL), Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Chiara Sabatti
- Department of Biomedical Data Sciences, Stanford University, Stanford, CA, USA
| | | | - Welelta Shiferaw
- Department of Psychiatry, School of Medicine, College of Health Sciences, Addis Ababa University, Addis Ababa, Ethiopia
| | - Anne Stevenson
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Epidemiology, Harvard T. H. Chan School of Public Health, Boston, MA, USA
- Department of Psychiatry, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
| | - Erik Stricker
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
| | - Rocky E Stroud
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Epidemiology, Harvard T. H. Chan School of Public Health, Boston, MA, USA
| | - Jessie Tang
- Broad Clinical Labs (BCL), Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - David Whiteman
- QIMR Berghofer Medical Research Institute, Brisbane, QLD, Australia
| | - Mary T Yohannes
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital, Boston, MA, USA
| | - Mingrui Yu
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Kai Yuan
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Dickens Akena
- Department of Psychiatry, School of Medicine, College of Health Sciences, Makerere University, Kampala, Uganda
| | - Lukoye Atwoli
- Department of Mental Health and Behavioural Sciences, School of Medicine, Moi University College of Health Sciences, Eldoret, Kenya
- Brain and Mind Institute, The Aga Khan University, Nairobi, Kenya
- Department of Medicine, Medical College East Africa, The Aga Khan University, Nairobi, Kenya
| | - Symon M Kariuki
- Neurosciences Unit, Clinical Department, KEMRI-Wellcome Trust Research Programme-Coast, Kilifi, Kenya
- Department of Psychiatry, University of Oxford, Oxford, UK
| | - Karestan C Koenen
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Epidemiology, Harvard T. H. Chan School of Public Health, Boston, MA, USA
- Department of Social and Behavioral Sciences, Harvard T. H. Chan School of Public Health, Boston, MA, USA
- Psychiatric & Neurodevelopmental Genetics Unit, Department of Psychiatry, Massachusetts General Hospital, Boston, MA, USA
| | - Charles R J C Newton
- Neurosciences Unit, Clinical Department, KEMRI-Wellcome Trust Research Programme-Coast,, Kilifi, Kenya
- Department of Psychiatry, University of Oxford, Oxford, UK
| | - Dan J Stein
- SA MRC Unit on Risk & Resilience in Mental Disorders, University of Cape Town and Neuroscience Institute, Cape Town, South Africa
| | - Solomon Teferra
- Department of Psychiatry, School of Medicine, College of Health Sciences, Addis Ababa University, Addis Ababa, Ethiopia
| | - Zukiswa Zingela
- Executive Dean's Office, Faculty of Health Sciences, Nelson Mandela University, Gqebera, South Africa
| | - Carlos N Pato
- Rutgers University, New Brunswick, NJ, USA
- BD2: Breakthrough Discoveries for thriving with Bipolar Disorder, Santa Monica, CA, USA
| | - Michele T Pato
- Rutgers University, New Brunswick, NJ, USA
- BD2: Breakthrough Discoveries for thriving with Bipolar Disorder, Santa Monica, CA, USA
| | - Carlos Lopez-Jaramillo
- Department of Psychiatry, University of Antioquia, University of Antioquia, Medellín, Antioquia, Colombia
| | - Nelson Freimer
- BD2: Breakthrough Discoveries for thriving with Bipolar Disorder, Santa Monica, CA, USA
- Center for Neurobehavioral Genetics, Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, CA, USA
- Department of Human Genetics, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA, USA
| | - Roel A Ophoff
- Center for Neurobehavioral Genetics, Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, CA, USA
- Department of Human Genetics, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA, USA
| | - Loes M Olde Loohuis
- Center for Neurobehavioral Genetics, Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, CA, USA
- Department of Human Genetics, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA, USA
- Department of Computational Medicine, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA, USA
| | - Michael E Talkowski
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
| | - Benjamin M Neale
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- BD2: Breakthrough Discoveries for thriving with Bipolar Disorder, Santa Monica, CA, USA
| | - Daniel P Howrigan
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Alicia R Martin
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- BD2: Breakthrough Discoveries for thriving with Bipolar Disorder, Santa Monica, CA, USA
| |
Collapse
|
14
|
Liu S, Martin KE, Snelling WM, Long R, Leeds TD, Vallejo RL, Wiens GD, Palti Y. Accurate genotype imputation from low-coverage whole-genome sequencing data of rainbow trout. G3 (BETHESDA, MD.) 2024; 14:jkae168. [PMID: 39041837 PMCID: PMC11373650 DOI: 10.1093/g3journal/jkae168] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/19/2024] [Revised: 04/19/2024] [Accepted: 07/05/2024] [Indexed: 07/24/2024]
Abstract
With the rapid and significant cost reduction of next-generation sequencing, low-coverage whole-genome sequencing (lcWGS), followed by genotype imputation, is becoming a cost-effective alternative to single-nucleotide polymorphism (SNP)-array genotyping. The objectives of this study were 2-fold: (1) construct a haplotype reference panel for genotype imputation from lcWGS data in rainbow trout (Oncorhynchus mykiss); and (2) evaluate the concordance between imputed genotypes and SNP-array genotypes in 2 breeding populations. Medium-coverage (12×) whole-genome sequences were obtained from a total of 410 fish representing 5 breeding populations with various spawning dates. The short-read sequences were mapped to the rainbow trout reference genome, and genetic variants were identified using GATK. After data filtering, 20,434,612 biallelic SNPs were retained. The reference panel was phased with SHAPEIT5 and was used as a reference to impute genotypes from lcWGS data employing GLIMPSE2. A total of 90 fish from the Troutlodge November breeding population were sequenced with an average coverage of 1.3×, and these fish were also genotyped with the Axiom 57K rainbow trout SNP array. The concordance between array-based genotypes and imputed genotypes was 99.1%. After downsampling the coverage to 0.5×, 0.2×, and 0.1×, the concordance between array-based genotypes and imputed genotypes was 98.7, 97.8, and 96.7%, respectively. In the USDA odd-year breeding population, the concordance between array-based genotypes and imputed genotypes was 97.8% for 109 fish downsampled to 0.5× coverage. Therefore, the reference haplotype panel reported in this study can be used to accurately impute genotypes from lcWGS data in rainbow trout breeding populations.
Collapse
Affiliation(s)
- Sixin Liu
- United States Department of Agriculture, National Center for Cool and Cold Water Aquaculture, Agricultural Research Service, Kearneysville, WV 25430, USA
| | | | - Warren M Snelling
- United States Department of Agriculture, US Meat Animal Research Center, Agricultural Research Service, Clay Center, NE 68933, USA
| | - Roseanna Long
- United States Department of Agriculture, National Center for Cool and Cold Water Aquaculture, Agricultural Research Service, Kearneysville, WV 25430, USA
| | - Timothy D Leeds
- United States Department of Agriculture, National Center for Cool and Cold Water Aquaculture, Agricultural Research Service, Kearneysville, WV 25430, USA
| | - Roger L Vallejo
- United States Department of Agriculture, National Center for Cool and Cold Water Aquaculture, Agricultural Research Service, Kearneysville, WV 25430, USA
| | - Gregory D Wiens
- United States Department of Agriculture, National Center for Cool and Cold Water Aquaculture, Agricultural Research Service, Kearneysville, WV 25430, USA
| | - Yniv Palti
- United States Department of Agriculture, National Center for Cool and Cold Water Aquaculture, Agricultural Research Service, Kearneysville, WV 25430, USA
| |
Collapse
|
15
|
Nystrom SE, Soldano KL, Rockett M, Datta S, Li G, Silas D, Garrett ME, Ashley-Koch AE, Olabisi OA. APOL1 High-Risk Genotype is Not Associated With New or Worsening of Proteinuria or Kidney Function Decline Following COVID-19 Vaccination. Kidney Int Rep 2024; 9:2657-2666. [PMID: 39291186 PMCID: PMC11403097 DOI: 10.1016/j.ekir.2024.06.023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Revised: 06/05/2024] [Accepted: 06/10/2024] [Indexed: 09/19/2024] Open
Abstract
Introduction SARS-CoV-2 infection increases systemic inflammatory cytokines which act as a second-hit driver of Apolipoprotein L1 (APOL1)-mediated collapsing glomerulopathy. SARS-CoV-2 vaccination also increases cytokines. Recent reports of new glomerular disease in individuals with APOL1 high-risk genotype (HRG) following SARS-CoV-2 vaccination raised the concern SARS-CoV-2 vaccination may also act as a second-hit driver of APOL1-mediated glomerulopathy. Methods We screened 1507 adults in the Duke's Measurement to Understand Reclassification of Disease of Cabarrus and Kannapolis (MURDOCK) registry and enrolled 105 eligible participants with available SARS-CoV-2 vaccination data, prevaccination and postvaccination serum creatinine, and urine protein measurements. Paired data were stratified by number of APOL1 risk alleles (RAs) and compared within groups using Wilcoxon signed rank test and across groups by analysis of variance. Results Among 105 participants, 30 (28.6%) had 2, 39 (37.1%) had 1, and 36 (34.3%) had 0 APOL1 RA. Most of the participants (94%) received at least 2 doses of vaccine. Most (98%) received the BNT162B2 (Pfizer) or mRNA-1273 (Moderna) vaccine. On average, the prevaccine and postvaccine laboratory samples were drawn 648 days apart. There were no detectable differences between pre- and post-serum creatinine or pre- and post-urine albumin creatinine ratio irrespective of the participants' APOL1 genotype. Finally, most participants with APOL1 RA had the most common haplotype (E150, I228, and K255) and lacked the recently described protective N264K haplotype. Conclusion In this observational study, APOL1 HRG is not associated with new or worsening of proteinuria or decline in kidney function following SARS-CoV-2 vaccination. Validation of this result in larger cohorts would further support the renal safety of SARS-CoV-2 vaccine in individuals with APOL1 HRG.
Collapse
Affiliation(s)
- Sarah E Nystrom
- Division of Nephrology, Duke Molecular Physiology Institute, Duke University School of Medicine, Durham, North Carolina, USA
| | - Karen L Soldano
- Division of Nephrology, Duke Molecular Physiology Institute, Duke University School of Medicine, Durham, North Carolina, USA
| | - Micki Rockett
- Duke Clinical and Translational Science Institute, Duke University School of Medicine, Durham, North Carolina, USA
| | - Somenath Datta
- Division of Nephrology, Duke Molecular Physiology Institute, Duke University School of Medicine, Durham, North Carolina, USA
| | - Guojie Li
- Division of Nephrology, Duke Molecular Physiology Institute, Duke University School of Medicine, Durham, North Carolina, USA
| | - Daniel Silas
- Division of Nephrology, Duke Molecular Physiology Institute, Duke University School of Medicine, Durham, North Carolina, USA
| | - Melanie E Garrett
- Division of Nephrology, Duke Molecular Physiology Institute, Duke University School of Medicine, Durham, North Carolina, USA
| | - Allison E Ashley-Koch
- Division of Nephrology, Duke Molecular Physiology Institute, Duke University School of Medicine, Durham, North Carolina, USA
| | - Opeyemi A Olabisi
- Division of Nephrology, Duke Molecular Physiology Institute, Duke University School of Medicine, Durham, North Carolina, USA
| |
Collapse
|
16
|
DeFelice M, Grimsby JL, Howrigan D, Yuan K, Chapman SB, Stevens C, DeLuca S, Townsend M, Buxbaum J, Pericak-Vance M, Qin S, Stein DJ, Teferra S, Xavier RJ, Huang H, Martin AR, Neale BM. Blended Genome Exome (BGE) as a Cost Efficient Alternative to Deep Whole Genomes or Arrays. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.04.03.587209. [PMID: 38645052 PMCID: PMC11030253 DOI: 10.1101/2024.04.03.587209] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/23/2024]
Abstract
Genomic scientists have long been promised cheaper DNA sequencing, but deep whole genomes are still costly, especially when considered for large cohorts in population-level studies. More affordable options include microarrays + imputation, whole exome sequencing (WES), or low-pass whole genome sequencing (WGS) + imputation. WES + array + imputation has recently been shown to yield 99% of association signals detected by WGS. However, a method free from ascertainment biases of arrays or the need for merging different data types that still benefits from deeper exome coverage to enhance novel coding variant detection does not exist. We developed a new, combined, "Blended Genome Exome" (BGE) in which a whole genome library is generated, an aliquot of that genome is amplified by PCR, the exome regions are selected and enriched, and the genome and exome libraries are combined back into a single tube for sequencing (33% exome, 67% genome). This creates a single CRAM with a low-coverage whole genome (2-3x) combined with a higher coverage exome (30-40x). This BGE can be used for imputing common variants throughout the genome as well as for calling rare coding variants. We tested this new method and observed >99% r 2 concordance between imputed BGE data and existing 30x WGS data for exome and genome variants. BGE can serve as a useful and cost-efficient alternative sequencing product for genomic researchers, requiring ten-fold less sequencing compared to 30x WGS without the need for complicated harmonization of array and sequencing data.
Collapse
|
17
|
Fonseca EM, Tran LN, Mendoza H, Gutenkunst RN. Modeling biases from low-pass genome sequencing to enable accurate population genetic inferences. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.07.19.604366. [PMID: 39091836 PMCID: PMC11291017 DOI: 10.1101/2024.07.19.604366] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 08/04/2024]
Abstract
Low-pass genome sequencing is cost-effective and enables analysis of large cohorts. However, it introduces biases by reducing heterozygous genotypes and low-frequency alleles, impacting subsequent analyses such as demographic history inference. We developed a probabilistic model of low-pass biases from the Genome Analysis Toolkit (GATK) multi-sample calling pipeline, and we implemented it in the population genomic inference software dadi. We evaluated the model using simulated low-pass datasets and found that it alleviated low-pass biases in inferred demographic parameters. We further validated the model by downsampling 1000 Genomes Project data, demonstrating its effectiveness on real data. Our model is widely applicable and substantially improves model-based inferences from low-pass population genomic data.
Collapse
Affiliation(s)
- Emanuel M. Fonseca
- Department of Molecular and Cellular Biology, University of Arizona, Tucson, AZ 85721, USA
| | - Linh N. Tran
- Department of Molecular and Cellular Biology, University of Arizona, Tucson, AZ 85721, USA
| | - Hannah Mendoza
- Department of Molecular and Cellular Biology, University of Arizona, Tucson, AZ 85721, USA
| | - Ryan N. Gutenkunst
- Department of Molecular and Cellular Biology, University of Arizona, Tucson, AZ 85721, USA
| |
Collapse
|
18
|
Lukacova E, Hanzlikova Z, Podlesnyi P, Sedlackova T, Szemes T, Grendar M, Samec M, Hurtova T, Malicherova B, Leskova K, Budis J, Burjanivova T. Novel liquid biopsy CNV biomarkers in malignant melanoma. Sci Rep 2024; 14:15786. [PMID: 38982214 PMCID: PMC11233564 DOI: 10.1038/s41598-024-65928-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2024] [Accepted: 06/25/2024] [Indexed: 07/11/2024] Open
Abstract
Malignant melanoma (MM) is known for its abundance of genetic alterations and a tendency for rapid metastasizing. Identification of novel plasma biomarkers may enhance non-invasive diagnostics and disease monitoring. Initially, we examined copy number variations (CNV) in CDK genes (CDKN2A, CDKN2B, CDK4) using MLPA (gDNA) and ddPCR (ctDNA) analysis. Subsequently, low-coverage whole genome sequencing (lcWGS) was used to identify the most common CNV in plasma samples, followed by ddPCR verification of chosen biomarkers. CNV alterations in CDK genes were identified in 33.3% of FFPE samples (Clark IV, V only). Detection of the same genes in MM plasma showed no significance, neither compared to healthy plasmas nor between pre- versus post-surgery plasma. Sequencing data showed the most common CNV occurring in 6q27, 4p16.1, 10p15.3, 10q22.3, 13q34, 18q23, 20q11.21-q13.12 and 22q13.33. CNV in four chosen genes (KIF25, E2F1, DIP2C and TFG) were verified by ddPCR using 2 models of interpretation. Model 1 was concordant with lcWGS results in 54% of samples, for model 2 it was 46%. Although CDK genes have not been proven to be suitable CNV liquid biopsy biomarkers, lcWGS defined the most frequently affected chromosomal regions by CNV. Among chosen genes, DIP2C demonstrated a potential for further analysis.
Collapse
Affiliation(s)
- E Lukacova
- Department of Molecular Biology and Genomics, Comenius University in Bratislava, Jessenius Faculty of Medicine in Martin (JFM CU), Martin, Slovakia
| | | | - P Podlesnyi
- Instituto de Investigaciones Biomedicas de Barcelona (IIBB), CSIC /Centro Investigacion Biomedica en Red Enfermedades Neurodegenerativas (CiberNed), Barcelona, Spain
| | - T Sedlackova
- Geneton Ltd., Bratislava, Slovakia
- Science Park, Comenius University in Bratislava, Bratislava, Slovakia
| | - T Szemes
- Geneton Ltd., Bratislava, Slovakia
- Science Park, Comenius University in Bratislava, Bratislava, Slovakia
| | - M Grendar
- Laboratory of Bioinformatics and Biostatistics, Biomedical Center Martin, Comenius University in Bratislava, Jessenius Faculty of Medicine in Martin (JFM CU), Martin, Slovakia
| | - M Samec
- Department of Medical Biology, Jessenius Faculty of Medicine in Martin, Comenius University in Bratislava, Martin, Slovakia
| | - T Hurtova
- Department of Dermatovenereology, Jessenius Faculty of Medicine in Martin, Comenius University in Bratislava, Martin, Slovakia
| | - B Malicherova
- Department of Clinical Biochemistry, University Hospital in Martin and Jessenius Faculty of Medicine, Comenius University, Martin, Slovakia
| | - K Leskova
- Department of Pathological Anatomy, Jessenius Faculty of Medicine and University Hospital in Martin, Comenius University, Martin, Slovakia
| | - J Budis
- Geneton Ltd., Bratislava, Slovakia
- Science Park, Comenius University in Bratislava, Bratislava, Slovakia
| | - T Burjanivova
- Department of Molecular Biology and Genomics, Comenius University in Bratislava, Jessenius Faculty of Medicine in Martin (JFM CU), Martin, Slovakia.
| |
Collapse
|
19
|
Kendall C, Robinson J, Debortoli G, Nooranikhojasteh A, Christian D, Newman D, Sayers K, Cole S, Parra E, Schillaci M, Viola B. Global and local ancestry estimation in a captive baboon colony. PLoS One 2024; 19:e0305157. [PMID: 38959276 PMCID: PMC11221750 DOI: 10.1371/journal.pone.0305157] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2024] [Accepted: 05/24/2024] [Indexed: 07/05/2024] Open
Abstract
The last couple of decades have highlighted the importance of studying hybridization, particularly among primate species, as it allows us to better understand our own evolutionary trajectory. Here, we report on genetic ancestry estimates using dense, full genome data from 881 olive (Papio anubus), yellow (Papio cynocephalus), or olive-yellow crossed captive baboons from the Southwest National Primate Research Center. We calculated global and local ancestry information, imputed low coverage genomes (n = 830) to improve marker quality, and updated the genetic resources of baboons available to assist future studies. We found evidence of historical admixture in some putatively purebred animals and identified errors within the Southwest National Primate Research Center pedigree. We also compared the outputs between two different phasing and imputation pipelines along with two different global ancestry estimation software. There was good agreement between the global ancestry estimation software, with R2 > 0.88, while evidence of phase switch errors increased depending on what phasing and imputation pipeline was used. We also generated updated genetic maps and created a concise set of ancestry informative markers (n = 1,747) to accurately obtain global ancestry estimates.
Collapse
Affiliation(s)
| | - Jacqueline Robinson
- Institute for Human Genetics, University of California, San Francisco, San Francisco, California, United States of America
| | - Guilherme Debortoli
- Department of Anthropology, University of Toronto Mississauga, Mississauga, Ontario, Canada
| | - Amin Nooranikhojasteh
- Epigenomics Lab, Princess Margaret Cancer Center, University Health Network, Toronto, Ontario, Canada
| | - Debbie Christian
- Southwest National Primate Research Center, Texas Biomedical Research Institute, San Antonio, Texas, United States of America
| | - Deborah Newman
- Southwest National Primate Research Center, Texas Biomedical Research Institute, San Antonio, Texas, United States of America
| | - Kenneth Sayers
- Southwest National Primate Research Center, Texas Biomedical Research Institute, San Antonio, Texas, United States of America
| | - Shelley Cole
- Southwest National Primate Research Center, Texas Biomedical Research Institute, San Antonio, Texas, United States of America
| | - Esteban Parra
- Department of Anthropology, University of Toronto Mississauga, Mississauga, Ontario, Canada
| | - Michael Schillaci
- Department of Anthropology, University of Toronto Scarborough, Scarborough, Ontario, Canada
| | - Bence Viola
- Department of Anthropology, University of Toronto, Toronto, Ontario, Canada
| |
Collapse
|
20
|
Koenig Z, Yohannes MT, Nkambule LL, Zhao X, Goodrich JK, Kim HA, Wilson MW, Tiao G, Hao SP, Sahakian N, Chao KR, Walker MA, Lyu Y, Rehm HL, Neale BM, Talkowski ME, Daly MJ, Brand H, Karczewski KJ, Atkinson EG, Martin AR. A harmonized public resource of deeply sequenced diverse human genomes. Genome Res 2024; 34:796-809. [PMID: 38749656 PMCID: PMC11216312 DOI: 10.1101/gr.278378.123] [Citation(s) in RCA: 28] [Impact Index Per Article: 28.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2023] [Accepted: 05/07/2024] [Indexed: 05/18/2024]
Abstract
Underrepresented populations are often excluded from genomic studies owing in part to a lack of resources supporting their analyses. The 1000 Genomes Project (1kGP) and Human Genome Diversity Project (HGDP), which have recently been sequenced to high coverage, are valuable genomic resources because of the global diversity they capture and their open data sharing policies. Here, we harmonized a high-quality set of 4094 whole genomes from 80 populations in the HGDP and 1kGP with data from the Genome Aggregation Database (gnomAD) and identified over 153 million high-quality SNVs, indels, and SVs. We performed a detailed ancestry analysis of this cohort, characterizing population structure and patterns of admixture across populations, analyzing site frequency spectra, and measuring variant counts at global and subcontinental levels. We also show substantial added value from this data set compared with the prior versions of the component resources, typically combined via liftOver and variant intersection; for example, we catalog millions of new genetic variants, mostly rare, compared with previous releases. In addition to unrestricted individual-level public release, we provide detailed tutorials for conducting many of the most common quality-control steps and analyses with these data in a scalable cloud-computing environment and publicly release this new phased joint callset for use as a haplotype resource in phasing and imputation pipelines. This jointly called reference panel will serve as a key resource to support research of diverse ancestry populations.
Collapse
Affiliation(s)
- Zan Koenig
- Stanley Center for Psychiatric Research, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, Massachusetts 02114, USA
| | - Mary T Yohannes
- Stanley Center for Psychiatric Research, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, Massachusetts 02114, USA
| | - Lethukuthula L Nkambule
- Stanley Center for Psychiatric Research, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, Massachusetts 02114, USA
| | - Xuefang Zhao
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, Massachusetts 02114, USA
- Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Boston, Massachusetts 02114, USA
| | - Julia K Goodrich
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, Massachusetts 02114, USA
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
| | - Heesu Ally Kim
- Stanley Center for Psychiatric Research, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
| | - Michael W Wilson
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
| | - Grace Tiao
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
| | - Stephanie P Hao
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, Massachusetts 02114, USA
- Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Boston, Massachusetts 02114, USA
| | - Nareh Sahakian
- Broad Genomics, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts, 02141, USA
| | - Katherine R Chao
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
| | - Mark A Walker
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, Massachusetts 02114, USA
- Data Sciences Platform, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
| | - Yunfei Lyu
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts 02115, USA
| | - Heidi L Rehm
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, Massachusetts 02114, USA
| | - Benjamin M Neale
- Stanley Center for Psychiatric Research, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, Massachusetts 02114, USA
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
- Novo Nordisk Foundation Center for Genomic Mechanisms of Disease, Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
| | - Michael E Talkowski
- Stanley Center for Psychiatric Research, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, Massachusetts 02114, USA
- Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Boston, Massachusetts 02114, USA
| | - Mark J Daly
- Stanley Center for Psychiatric Research, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, Massachusetts 02114, USA
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
- Department of Medicine, Harvard Medical School, Boston, Massachusetts 02115, USA
- Institute for Molecular Medicine Finland, 00290 Helsinki, Finland
| | - Harrison Brand
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, Massachusetts 02114, USA
- Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Boston, Massachusetts 02114, USA
| | - Konrad J Karczewski
- Stanley Center for Psychiatric Research, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, Massachusetts 02114, USA
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, Massachusetts 02114, USA
- Novo Nordisk Foundation Center for Genomic Mechanisms of Disease, Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
| | - Elizabeth G Atkinson
- Stanley Center for Psychiatric Research, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, Massachusetts 02114, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Alicia R Martin
- Stanley Center for Psychiatric Research, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA;
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, Massachusetts 02114, USA
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, Massachusetts 02114, USA
| |
Collapse
|
21
|
Attur M, Petrilli C, Adhikari S, Iturrate E, Li X, Tuminello S, Hu N, Chakravarti A, Beck D, Abramson SB. Interleukin-1 Receptor Antagonist Gene (IL1RN) Variants Modulate the Cytokine Release Syndrome and Mortality of COVID-19. J Infect Dis 2024; 229:1740-1749. [PMID: 38871359 PMCID: PMC11175666 DOI: 10.1093/infdis/jiae031] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2023] [Accepted: 01/26/2024] [Indexed: 06/15/2024] Open
Abstract
BACKGROUND We examined effects of single-nucleotide variants (SNVs) of IL1RN, the gene encoding the anti-inflammatory interleukin 1 receptor antagonist (IL-1Ra), on the cytokine release syndrome (CRS) and mortality in patients with acute severe respiratory syndrome coronavirus 2 (SARS-CoV-2) infection. METHODS IL1RN CTA haplotypes formed from 3 SNVs (rs419598, rs315952, rs9005) and the individual SNVs were assessed for association with laboratory markers of inflammation and mortality. We studied 2589 patients hospitalized with SARS-CoV-2 between March 2020 and March 2021. RESULTS Mortality was 15.3% and lower in women than men (13.1% vs 17.3%, P = .0003). Carriers of the CTA-1/2 IL1RN haplotypes exhibited decreased inflammatory markers and increased plasma IL-1Ra. Evaluation of the individual SNVs of the IL1RN, carriers of the rs419598 C/C SNV exhibited significantly reduced inflammatory biomarker levels and numerically lower mortality compared to the C/T-T/T genotype (10.0% vs 17.8%, P = .052) in men, with the most pronounced association observed in male patients ≤74 years old, whose mortality was reduced by 80% (3.1% vs 14.0%, P = .030). CONCLUSIONS The IL1RN haplotype CTA and C/C variant of rs419598 are associated with attenuation of the CRS and decreased mortality in men with acute SARS-CoV-2 infection. The data suggest that the IL1RN pathway modulates the severity of coronavirus disease 2019 (COVID-19) via endogenous anti-inflammatory mechanisms.
Collapse
Affiliation(s)
- Mukundan Attur
- Division of Rheumatology, Department of Medicine, New York University Langone Orthopedic Hospital, New York University Langone Health, New York, New York, USA
| | - Christopher Petrilli
- Department of Medicine, New York University Grossman School of Medicine, New York University Langone Health, New York, New York, USA
| | - Samrachana Adhikari
- Division of Biostatistics, Department of Population Health, New York University Grossman School of Medicine, New York, New York, USA
| | - Eduardo Iturrate
- Department of Medicine, New York University Grossman School of Medicine, New York University Langone Health, New York, New York, USA
| | - Xiyue Li
- Division of Biostatistics, Department of Population Health, New York University Grossman School of Medicine, New York, New York, USA
| | - Stephanie Tuminello
- Division of Biostatistics, Department of Population Health, New York University Grossman School of Medicine, New York, New York, USA
| | - Nan Hu
- Center for Human Genetics and Genomics, New York University Grossman School of Medicine, New York, New York, USA
| | - Aravinda Chakravarti
- Department of Medicine, New York University Grossman School of Medicine, New York University Langone Health, New York, New York, USA
- Center for Human Genetics and Genomics, New York University Grossman School of Medicine, New York, New York, USA
| | - David Beck
- Department of Medicine, New York University Grossman School of Medicine, New York University Langone Health, New York, New York, USA
- Center for Human Genetics and Genomics, New York University Grossman School of Medicine, New York, New York, USA
| | - Steven B Abramson
- Department of Medicine, New York University Grossman School of Medicine, New York University Langone Health, New York, New York, USA
| |
Collapse
|
22
|
Vaddadi K, Mun T, Langmead B. Minimizing Reference Bias with an Impute-First Approach. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.11.30.568362. [PMID: 38076784 PMCID: PMC10705441 DOI: 10.1101/2023.11.30.568362] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/23/2023]
Abstract
Pangenome indexes reduce reference bias in sequencing data analysis. However, bias can be reduced further by using a personalized reference, e.g. a diploid human reference constructed to match a donor individual's alleles. We present a novel impute-first alignment framework that combines elements of genotype imputation and pangenome alignment. It begins by genotyping the individual using only a subsample of the input reads. It next uses a reference panel and efficient imputation algorithm to impute a personalized diploid reference. Finally, it indexes the personalized reference and applies a read aligner, which could be a linear or graph aligner, to align the full read set to the personalized reference. This framework achieves higher variant-calling recall (99.54% vs. 99.37%), precision (99.36% vs. 99.18%), and F1 (99.45% vs. 99.28%) compared to a graph pangenome aligner. The personalized reference is also smaller and faster to query compared to a pangenome index, making it an overall advantageous choice for whole-genome DNA sequencing experiments.
Collapse
Affiliation(s)
- Kavya Vaddadi
- Department of Computer Science, Johns Hopkins University
| | - Taher Mun
- Department of Computer Science, Johns Hopkins University
| | - Ben Langmead
- Department of Computer Science, Johns Hopkins University
| |
Collapse
|
23
|
Koenig Z, Yohannes MT, Nkambule LL, Zhao X, Goodrich JK, Kim HA, Wilson MW, Tiao G, Hao SP, Sahakian N, Chao KR, Walker MA, Lyu Y, gnomAD Project Consortium, Rehm HL, Neale BM, Talkowski ME, Daly MJ, Brand H, Karczewski KJ, Atkinson EG, Martin AR. A harmonized public resource of deeply sequenced diverse human genomes. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.01.23.525248. [PMID: 36747613 PMCID: PMC9900804 DOI: 10.1101/2023.01.23.525248] [Citation(s) in RCA: 24] [Impact Index Per Article: 24.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]
Abstract
Underrepresented populations are often excluded from genomic studies due in part to a lack of resources supporting their analyses. The 1000 Genomes Project (1kGP) and Human Genome Diversity Project (HGDP), which have recently been sequenced to high coverage, are valuable genomic resources because of the global diversity they capture and their open data sharing policies. Here, we harmonized a high quality set of 4,094 whole genomes from HGDP and 1kGP with data from the Genome Aggregation Database (gnomAD) and identified over 153 million high-quality SNVs, indels, and SVs. We performed a detailed ancestry analysis of this cohort, characterizing population structure and patterns of admixture across populations, analyzing site frequency spectra, and measuring variant counts at global and subcontinental levels. We also demonstrate substantial added value from this dataset compared to the prior versions of the component resources, typically combined via liftover and variant intersection; for example, we catalog millions of new genetic variants, mostly rare, compared to previous releases. In addition to unrestricted individual-level public release, we provide detailed tutorials for conducting many of the most common quality control steps and analyses with these data in a scalable cloud-computing environment and publicly release this new phased joint callset for use as a haplotype resource in phasing and imputation pipelines. This jointly called reference panel will serve as a key resource to support research of diverse ancestry populations.
Collapse
Affiliation(s)
- Zan Koenig
- Stanley Center for Psychiatric Research, The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA
| | - Mary T. Yohannes
- Stanley Center for Psychiatric Research, The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA
| | - Lethukuthula L. Nkambule
- Stanley Center for Psychiatric Research, The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA
| | - Xuefang Zhao
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA 02114, USA
- Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Boston, MA 02114, USA
| | - Julia K. Goodrich
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Heesu Ally Kim
- Stanley Center for Psychiatric Research, The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Michael W. Wilson
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Grace Tiao
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Stephanie P. Hao
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA 02114, USA
- Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Boston, MA 02114, USA
| | - Nareh Sahakian
- Broad Genomics, The Broad Institute of MIT and Harvard, 320 Charles Street, Cambridge, MA, 02141, USA
| | - Katherine R. Chao
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Mark A. Walker
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA 02114, USA
- Data Sciences Platform, The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Yunfei Lyu
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | | | - Heidi L. Rehm
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA 02114, USA
| | - Benjamin M. Neale
- Stanley Center for Psychiatric Research, The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Michael E. Talkowski
- Stanley Center for Psychiatric Research, The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA 02114, USA
- Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Boston, MA 02114, USA
| | - Mark J. Daly
- Stanley Center for Psychiatric Research, The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Institute for Molecular Medicine Finland, Helsinki, Finland
| | - Harrison Brand
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA 02114, USA
- Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Boston, MA 02114, USA
| | - Konrad J. Karczewski
- Stanley Center for Psychiatric Research, The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA 02114, USA
| | - Elizabeth G. Atkinson
- Stanley Center for Psychiatric Research, The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Alicia R. Martin
- Stanley Center for Psychiatric Research, The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA 02114, USA
| |
Collapse
|
24
|
Li JH, Findley K, Pickrell JK, Blease K, Zhao J, Kruglyak S. Low-pass sequencing plus imputation using avidity sequencing displays comparable imputation accuracy to sequencing by synthesis while reducing duplicates. G3 (BETHESDA, MD.) 2024; 14:jkad276. [PMID: 38038370 PMCID: PMC10849336 DOI: 10.1093/g3journal/jkad276] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/21/2023] [Revised: 09/21/2023] [Accepted: 11/09/2023] [Indexed: 12/02/2023]
Abstract
Low-pass sequencing with genotype imputation has been adopted as a cost-effective method for genotyping. The most widely used method of short-read sequencing uses sequencing by synthesis (SBS). Here we perform a study of a novel sequencing technology-avidity sequencing. In this short note, we compare the performance of imputation from low-pass libraries sequenced on an Element AVITI system (which utilizes avidity sequencing) to those sequenced on an Illumina NovaSeq 6000 (which utilizes SBS) with an SP flow cell for the same set of biological samples across a range of genetic ancestries. We observed dramatically lower optical duplication rates in the data deriving from the AVITI system compared to the NovaSeq 6000, resulting in higher effective coverage given a fixed number of sequenced bases, and comparable imputation accuracy performance between sequencing chemistries across ancestries. This study demonstrates that avidity sequencing is a viable alternative to the standard SBS chemistries for applications involving low-pass sequencing plus imputation.
Collapse
|
25
|
Bhérer C, Eveleigh R, Trajanoska K, St-Cyr J, Paccard A, Nadukkalam Ravindran P, Caron E, Bader Asbah N, McClelland P, Wei C, Baumgartner I, Schindewolf M, Döring Y, Perley D, Lefebvre F, Lepage P, Bourgey M, Bourque G, Ragoussis J, Mooser V, Taliun D. A cost-effective sequencing method for genetic studies combining high-depth whole exome and low-depth whole genome. NPJ Genom Med 2024; 9:8. [PMID: 38326393 PMCID: PMC10850497 DOI: 10.1038/s41525-024-00390-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2023] [Accepted: 12/07/2023] [Indexed: 02/09/2024] Open
Abstract
Whole genome sequencing (WGS) at high-depth (30X) allows the accurate discovery of variants in the coding and non-coding DNA regions and helps elucidate the genetic underpinnings of human health and diseases. Yet, due to the prohibitive cost of high-depth WGS, most large-scale genetic association studies use genotyping arrays or high-depth whole exome sequencing (WES). Here we propose a cost-effective method which we call "Whole Exome Genome Sequencing" (WEGS), that combines low-depth WGS and high-depth WES with up to 8 samples pooled and sequenced simultaneously (multiplexed). We experimentally assess the performance of WEGS with four different depth of coverage and sample multiplexing configurations. We show that the optimal WEGS configurations are 1.7-2.0 times cheaper than standard WES (no-plexing), 1.8-2.1 times cheaper than high-depth WGS, reach similar recall and precision rates in detecting coding variants as WES, and capture more population-specific variants in the rest of the genome that are difficult to recover when using genotype imputation methods. We apply WEGS to 862 patients with peripheral artery disease and show that it directly assesses more known disease-associated variants than a typical genotyping array and thousands of non-imputable variants per disease-associated locus.
Collapse
Affiliation(s)
- Claude Bhérer
- Department of Human Genetics, Faculty of Medicine and Health Sciences, McGill University, Montréal, Québec, Canada
- Victor Phillip Dahdaleh Institute of Genomic Medicine at McGill University, Montréal, Québec, Canada
- Canada Excellence Research Chair in Genomic Medicine, McGill University, Montréal, Québec, Canada
| | - Robert Eveleigh
- Victor Phillip Dahdaleh Institute of Genomic Medicine at McGill University, Montréal, Québec, Canada
- Canadian Centre for Computational Genomics, McGill University, Montréal, Québec, Canada
| | - Katerina Trajanoska
- Department of Human Genetics, Faculty of Medicine and Health Sciences, McGill University, Montréal, Québec, Canada
- Victor Phillip Dahdaleh Institute of Genomic Medicine at McGill University, Montréal, Québec, Canada
- Canada Excellence Research Chair in Genomic Medicine, McGill University, Montréal, Québec, Canada
| | - Janick St-Cyr
- Victor Phillip Dahdaleh Institute of Genomic Medicine at McGill University, Montréal, Québec, Canada
| | - Antoine Paccard
- Victor Phillip Dahdaleh Institute of Genomic Medicine at McGill University, Montréal, Québec, Canada
| | - Praveen Nadukkalam Ravindran
- Victor Phillip Dahdaleh Institute of Genomic Medicine at McGill University, Montréal, Québec, Canada
- Canada Excellence Research Chair in Genomic Medicine, McGill University, Montréal, Québec, Canada
| | - Elizabeth Caron
- Victor Phillip Dahdaleh Institute of Genomic Medicine at McGill University, Montréal, Québec, Canada
| | - Nimara Bader Asbah
- Department of Human Genetics, Faculty of Medicine and Health Sciences, McGill University, Montréal, Québec, Canada
- Victor Phillip Dahdaleh Institute of Genomic Medicine at McGill University, Montréal, Québec, Canada
| | - Peyton McClelland
- Department of Human Genetics, Faculty of Medicine and Health Sciences, McGill University, Montréal, Québec, Canada
- Victor Phillip Dahdaleh Institute of Genomic Medicine at McGill University, Montréal, Québec, Canada
- Canada Excellence Research Chair in Genomic Medicine, McGill University, Montréal, Québec, Canada
| | - Clare Wei
- Victor Phillip Dahdaleh Institute of Genomic Medicine at McGill University, Montréal, Québec, Canada
- Canada Excellence Research Chair in Genomic Medicine, McGill University, Montréal, Québec, Canada
| | - Iris Baumgartner
- Division of Angiology, Swiss Cardiovascular Center, Inselspital, Bern University Hospital, University of Bern, Bern, Switzerland
- Department for BioMedical Research (DBMR), University of Bern, Bern, Switzerland
| | - Marc Schindewolf
- Division of Angiology, Swiss Cardiovascular Center, Inselspital, Bern University Hospital, University of Bern, Bern, Switzerland
- Department for BioMedical Research (DBMR), University of Bern, Bern, Switzerland
| | - Yvonne Döring
- Division of Angiology, Swiss Cardiovascular Center, Inselspital, Bern University Hospital, University of Bern, Bern, Switzerland
- Department for BioMedical Research (DBMR), University of Bern, Bern, Switzerland
- Institute for Cardiovascular Prevention (IPEK), Ludwig-Maximilians University Munich, Pettenkoferstr 9, 80336, Munich, Germany
| | - Danielle Perley
- Victor Phillip Dahdaleh Institute of Genomic Medicine at McGill University, Montréal, Québec, Canada
- Canadian Centre for Computational Genomics, McGill University, Montréal, Québec, Canada
| | - François Lefebvre
- Victor Phillip Dahdaleh Institute of Genomic Medicine at McGill University, Montréal, Québec, Canada
- Canadian Centre for Computational Genomics, McGill University, Montréal, Québec, Canada
| | - Pierre Lepage
- Victor Phillip Dahdaleh Institute of Genomic Medicine at McGill University, Montréal, Québec, Canada
| | | | - Guillaume Bourque
- Department of Human Genetics, Faculty of Medicine and Health Sciences, McGill University, Montréal, Québec, Canada
- Victor Phillip Dahdaleh Institute of Genomic Medicine at McGill University, Montréal, Québec, Canada
- Canadian Centre for Computational Genomics, McGill University, Montréal, Québec, Canada
| | - Jiannis Ragoussis
- Department of Human Genetics, Faculty of Medicine and Health Sciences, McGill University, Montréal, Québec, Canada
- Victor Phillip Dahdaleh Institute of Genomic Medicine at McGill University, Montréal, Québec, Canada
| | - Vincent Mooser
- Department of Human Genetics, Faculty of Medicine and Health Sciences, McGill University, Montréal, Québec, Canada
- Victor Phillip Dahdaleh Institute of Genomic Medicine at McGill University, Montréal, Québec, Canada
- Canada Excellence Research Chair in Genomic Medicine, McGill University, Montréal, Québec, Canada
| | - Daniel Taliun
- Department of Human Genetics, Faculty of Medicine and Health Sciences, McGill University, Montréal, Québec, Canada.
- Victor Phillip Dahdaleh Institute of Genomic Medicine at McGill University, Montréal, Québec, Canada.
- Canada Excellence Research Chair in Genomic Medicine, McGill University, Montréal, Québec, Canada.
| |
Collapse
|
26
|
Kachuri L, Chatterjee N, Hirbo J, Schaid DJ, Martin I, Kullo IJ, Kenny EE, Pasaniuc B, Witte JS, Ge T. Principles and methods for transferring polygenic risk scores across global populations. Nat Rev Genet 2024; 25:8-25. [PMID: 37620596 PMCID: PMC10961971 DOI: 10.1038/s41576-023-00637-2] [Citation(s) in RCA: 103] [Impact Index Per Article: 103.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/11/2023] [Indexed: 08/26/2023]
Abstract
Polygenic risk scores (PRSs) summarize the genetic predisposition of a complex human trait or disease and may become a valuable tool for advancing precision medicine. However, PRSs that are developed in populations of predominantly European genetic ancestries can increase health disparities due to poor predictive performance in individuals of diverse and complex genetic ancestries. We describe genetic and modifiable risk factors that limit the transferability of PRSs across populations and review the strengths and weaknesses of existing PRS construction methods for diverse ancestries. Developing PRSs that benefit global populations in research and clinical settings provides an opportunity for innovation and is essential for health equity.
Collapse
Affiliation(s)
- Linda Kachuri
- Department of Epidemiology and Population Health, Stanford University School of Medicine, Stanford, CA, USA
- Stanford Cancer Institute, Stanford University School of Medicine, Stanford, CA, USA
| | - Nilanjan Chatterjee
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Jibril Hirbo
- Department of Medicine Division of Genetic Medicine, Vanderbilt University Medical Center, Nashville, TN, USA
- Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, USA
| | - Daniel J Schaid
- Department of Quantitative Health Sciences, Mayo Clinic, Rochester, MN, USA
| | - Iman Martin
- Division of Genomic Medicine, National Human Genome Research Institute, Bethesda, MD, USA
| | - Iftikhar J Kullo
- Department of Cardiovascular Medicine, Mayo Clinic, Rochester, MN, USA
| | - Eimear E Kenny
- Institute for Genomic Health, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Bogdan Pasaniuc
- Department of Computational Medicine, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA
- Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA
- Department of Pathology and Laboratory Medicine, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA
| | - John S Witte
- Department of Epidemiology and Population Health, Stanford University School of Medicine, Stanford, CA, USA.
- Stanford Cancer Institute, Stanford University School of Medicine, Stanford, CA, USA.
- Department of Biomedical Data Science, Stanford University, Stanford, CA, USA.
- Department of Genetics, Stanford University, Stanford, CA, USA.
| | - Tian Ge
- Psychiatric and Neurodevelopmental Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA.
- Center for Precision Psychiatry, Department of Psychiatry, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA.
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
| |
Collapse
|
27
|
Childebayeva A, Zavala EI. Review: Computational analysis of human skeletal remains in ancient DNA and forensic genetics. iScience 2023; 26:108066. [PMID: 37927550 PMCID: PMC10622734 DOI: 10.1016/j.isci.2023.108066] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2023] Open
Abstract
Degraded DNA is used to answer questions in the fields of ancient DNA (aDNA) and forensic genetics. While aDNA studies typically center around human evolution and past history, and forensic genetics is often more concerned with identifying a specific individual, scientists in both fields face similar challenges. The overlap in source material has prompted periodic discussions and studies on the advantages of collaboration between fields toward mutually beneficial methodological advancements. However, most have been centered around wet laboratory methods (sampling, DNA extraction, library preparation, etc.). In this review, we focus on the computational side of the analytical workflow. We discuss limitations and considerations to consider when working with degraded DNA. We hope this review provides a framework to researchers new to computational workflows for how to think about analyzing highly degraded DNA and prompts an increase of collaboration between the forensic genetics and aDNA fields.
Collapse
Affiliation(s)
- Ainash Childebayeva
- Department of Archaeogenetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
- Department of Anthropology, University of Kansas, Lawrence, KS, USA
| | - Elena I. Zavala
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, USA
- Department of Biology, University of Oregon, Eugene, OR, USA
| |
Collapse
|
28
|
Smolyak D, Humphries EM, Parikh A, Gopalakrishnan M, Aycan F, Bjarnadóttir M, Ament SA, El-Metwally D, Beitelshees A, Agarwal R. Predicting Heterogeneity in Patient Response to Morphine Treatment for Neonatal Opioid Withdrawal Syndrome. Clin Pharmacol Ther 2023; 114:1015-1022. [PMID: 37470135 DOI: 10.1002/cpt.3007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2023] [Accepted: 07/06/2023] [Indexed: 07/21/2023]
Abstract
Infants with neonatal opioid withdrawal syndrome commonly receive morphine treatment to manage their withdrawal signs. However, the effectiveness of this pharmacotherapy in managing the infants' withdrawal signs vary widely. We sought to understand how information available early in infant monitoring can anticipate this treatment response, focusing on early modified Finnegan Neonatal Abstinence Scoring System (FNASS) scores, polygenic risk for opioid dependence (polygenic risk score (PRS)), and drug exposure. Using k-means clustering, we divided the 213 infants in our cohort into 3 groups based on their FNASS scores in the 12 hours before and after the initiation of pharmacotherapy. We found that these groups were pairwise significantly different for risk factors, including methadone exposure, and for in-hospital outcomes, including total morphine received, length of stay, and highest FNASS score. Whereas PRS was not predictive of receipt of treatment, PRS was pairwise significantly different between a subset of the groups. Using tree-based machine learning methods, we then constructed network graphs of the relationships among these groups, FNASS scores, PRS, drug exposures, and in-hospital outcomes. The resulting networks also showed meaningful connection between early FNASS scores and PRS, as well as between both of those and later in-hospital outcomes. These analyses present clinicians with the opportunity to better anticipate infant withdrawal progression and prepare accordingly, whether with expedited morphine treatment or non-pharmacotherapeutic alternative treatments.
Collapse
Affiliation(s)
- Daniel Smolyak
- Department of Computer Science, University of Maryland, College Park, Maryland, USA
| | - Elizabeth M Humphries
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, Maryland, USA
- Program in Molecular Epidemiology, University of Maryland School of Medicine, Baltimore, Maryland, USA
| | - Abhinav Parikh
- Department of Pediatrics, New York Presbyterian Brooklyn Methodist Hospital, Brooklyn, New York, USA
| | - Mathangi Gopalakrishnan
- Department of Practice, Science, and Health Outcomes Research, University of Maryland School of Pharmacy, Baltimore, Maryland, USA
| | - Fulden Aycan
- Department of Pediatrics, Division of Neonatology, University of Maryland School of Medicine, Baltimore, Maryland, USA
| | - Margrét Bjarnadóttir
- Robert H. Smith School of Business, University of Maryland, College Park, Maryland, USA
| | - Seth A Ament
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, Maryland, USA
- Department of Psychiatry, University of Maryland School of Medicine, Baltimore, Maryland, USA
- Department of Medicine, Division of Endocrinology, Diabetes and Nutrition and Program for Personalized and Genomic Medicine, University of Maryland School of Medicine, Baltimore, Maryland, USA
| | - Dina El-Metwally
- Department of Pediatrics, Division of Neonatology, University of Maryland School of Medicine, Baltimore, Maryland, USA
| | - Amber Beitelshees
- University of Maryland - Medicine Institute for Neuroscience Discovery (UM-MIND), Baltimore, Maryland, USA
| | - Ritu Agarwal
- Carey Business School, Center for Digital Health and Artificial Intelligence, Johns Hopkins University, Baltimore, Maryland, USA
| |
Collapse
|
29
|
Pan L, Wu J, Liang D, Yuan J, Wang J, Shen Y, Lu J, Xia A, Li J, Wu L. Association analysis between chromosomal abnormalities and fetal ultrasonographic soft markers based on 15,263 fetuses. Am J Obstet Gynecol MFM 2023; 5:101072. [PMID: 37393030 DOI: 10.1016/j.ajogmf.2023.101072] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2023] [Revised: 06/24/2023] [Accepted: 06/27/2023] [Indexed: 07/03/2023]
Abstract
BACKGROUND Soft markers are common prenatal ultrasonographic findings that indicate an increased risk for fetal aneuploidy. However, the association between soft markers and pathogenic or likely pathogenic copy number variations is still unclear, and clinicians lack clarity on which soft markers warrant a recommendation for invasive prenatal genetic testing of the fetus. OBJECTIVE This study aimed to provide guidance on ordering prenatal genetic testing for fetuses with different soft markers and to elucidate the association between specific types of chromosomal abnormalities and specific ultrasonographic soft markers. STUDY DESIGN Low-pass genome sequencing was performed for 15,263 fetuses, including 9123 with ultrasonographic soft markers and 6140 with normal ultrasonographic findings. The detection rate of pathogenic or likely pathogenic copy number variants among fetuses with various ultrasonographic soft markers were compared with that of fetuses with normal ultrasonography. The association of soft markers with aneuploidy and pathogenic or likely pathogenic copy number variants were investigated using Fisher exact tests with Bonferroni correction. RESULTS The detection rate of aneuploidy and pathogenic or likely pathogenic copy number variants was 3.04% (277/9123) and 3.40% (310/9123), respectively, in fetuses with ultrasonographic soft markers. An absent or a hypoplastic nasal bone was the soft marker in the second trimester with the highest diagnostic rate for aneuploidy of 5.22% (83/1591) among all isolated groups. Four types of isolated ultrasonographic soft markers, namely a thickened nuchal fold, single umbilical artery, mild ventriculomegaly, and absent or hypoplastic nasal bone, had higher diagnostic rates for pathogenic or likely pathogenic copy number variants (P<.05; odds ratio, 1.69-3.31). Furthermore, this study found that the 22q11.2 deletion was associated with an aberrant right subclavian artery, whereas the 16p13.11 deletion, 10q26.13-q26.3 deletion, and 8p23.3-p23.1 deletion were associated with a thickened nuchal fold, and the 16p11.2 deletion and 17p11.2 deletion were associated with mild ventriculomegaly (P<.05). CONCLUSION Ultrasonographic phenotype-based genetic testing should be considered in clinical consultations. Copy number variant analysis is recommended for fetuses with an isolated thickened nuchal fold, a single umbilical artery, mild ventriculomegaly, and an absent or a hypoplastic nasal bone. A comprehensive definition of genotype-phenotype correlations in aneuploidy and pathogenic or likely pathogenic copy number variants could provide better information for genetic counseling.
Collapse
Affiliation(s)
- Lijuan Pan
- Center for Medical Genetics & Hunan Key Laboratory of Medical Genetics, School of Life Sciences, Central South University, Changsha, Hunan, China (Drs Pan, J Wu, Liang, and L Wu); Department of Obstetrics, Xiangya Hospital, Central South University, Changsha, Hunan, China (Dr Pan)
| | - Jiayu Wu
- Center for Medical Genetics & Hunan Key Laboratory of Medical Genetics, School of Life Sciences, Central South University, Changsha, Hunan, China (Drs Pan, J Wu, Liang, and L Wu)
| | - Desheng Liang
- Center for Medical Genetics & Hunan Key Laboratory of Medical Genetics, School of Life Sciences, Central South University, Changsha, Hunan, China (Drs Pan, J Wu, Liang, and L Wu)
| | - Jing Yuan
- Prenatal Diagnosis Center, Department of Obstetrics and Gynecology, The First Affiliated Hospital of Anhui Medical University, Hefei, Anhui, China (Dr Yuan)
| | - Jue Wang
- Department of Obstetrics, The First Affiliated Hospital, Nanjing Medical University, Nanjing, Jiangsu, China (Dr Wang)
| | - Yinchen Shen
- Department of Maternity Care, Nanning Maternity and Child Health Hospital, Nanning, Guangxi, China (Dr Shen)
| | - Junjie Lu
- Department of Obstetrics and Gynecology, The Second Affiliated Hospital, Army Military Medical University, Chongqing, China (Dr Lu)
| | - Aihua Xia
- Department of Obstetrics, Beihai People's Hospital, Beihai, Guangxi, China (Dr Xia)
| | - Jinchen Li
- Bioinformatics Center and National Clinical Research Center for Geriatric Disorders, Department of Geriatrics, Xiangya Hospital, Central South University, Changsha, Hunan, China (Dr Li).
| | - Lingqian Wu
- Center for Medical Genetics & Hunan Key Laboratory of Medical Genetics, School of Life Sciences, Central South University, Changsha, Hunan, China (Drs Pan, J Wu, Liang, and L Wu).
| |
Collapse
|
30
|
Al-Jumaan M, Chu H, Alsulaiman A, Camp SY, Han S, Gillani R, Al Marzooq Y, Almulhim F, Vatte C, Al Nemer A, Almuhanna A, Van Allen EM, Al-Ali A, AlDubayan SH. Interplay of Mendelian and polygenic risk factors in Arab breast cancer patients. Genome Med 2023; 15:65. [PMID: 37658461 PMCID: PMC10474689 DOI: 10.1186/s13073-023-01220-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2023] [Accepted: 08/09/2023] [Indexed: 09/03/2023] Open
Abstract
BACKGROUND Breast cancer patients from the indigenous Arab population present much earlier than patients from Western countries and have traditionally been underrepresented in cancer genomics studies. The contribution of polygenic and Mendelian risk toward the earlier onset of breast cancer in the population remains elusive. METHODS We performed low-pass whole genome sequencing (lpWGS) and whole-exome sequencing (WES) from 220 female breast cancer patients unselected for positive family history from the indigenous Arab population. Using publicly available resources, we imputed population-specific variants and calculated breast cancer burden-sensitive polygenic risk scores (PRS). Variant pathogenicity was also evaluated on exome variants with high coverage. RESULTS Variants imputed from lpWGS showed high concordance with paired exome (median dosage correlation: 0.9459, Interquartile range: 0.9410-0.9490). After adjusting the PRS to the Arab population, we found significant associations between PRS performance in risk prediction and first-degree relative breast cancer history prediction (Spearman rho=0.43, p = 0.03), where breast cancer patients in the top PRS decile are 5.53 (95% CI 1.76-17.97, p = 0.003) times more likely also to have a first-degree relative diagnosed with breast cancer compared to those in the middle deciles. In addition, we found evidence for the genetic liability threshold model of breast cancer where among patients with a family history of breast cancer, pathogenic rare variant carriers had significantly lower PRS than non-carriers (p = 0.0205, Mann-Whitney U test) while for non-carriers every standard deviation increase in PRS corresponded to 4.52 years (95% CI 8.88-0.17, p = 0.042) earlier age of presentation. CONCLUSIONS Overall, our study provides a framework to assess polygenic risk in an understudied population using lpWGS and identifies common variant risk as a factor independent of pathogenic variant carrier status for earlier age of onset of breast cancer among indigenous Arab breast cancer patients.
Collapse
Affiliation(s)
- Mohammed Al-Jumaan
- College of Medicine, Imam Abdulrahman bin Faisal University, Dammam, Saudi Arabia
| | - Hoyin Chu
- Department of Medical Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA
- Cancer Program, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Abdullah Alsulaiman
- College of Medicine, Imam Abdulrahman bin Faisal University, Dammam, Saudi Arabia
| | - Sabrina Y Camp
- Department of Medical Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA
- Cancer Program, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Seunghun Han
- Department of Medical Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA
- Harvard Medical School, Boston, MA, USA
| | - Riaz Gillani
- Cancer Program, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Pediatric Oncology, Dana-Farber Cancer Institute, Boston, MA, USA
- Department of Pediatrics, Harvard Medical School, Boston, MA, USA
- Boston Children's Hospital, Boston, MA, USA
| | - Yousef Al Marzooq
- College of Medicine, Imam Abdulrahman bin Faisal University, Dammam, Saudi Arabia
| | - Fatmah Almulhim
- College of Medicine, Imam Abdulrahman bin Faisal University, Dammam, Saudi Arabia
| | - Chittibabu Vatte
- College of Medicine, Imam Abdulrahman bin Faisal University, Dammam, Saudi Arabia
| | - Areej Al Nemer
- College of Medicine, Imam Abdulrahman bin Faisal University, Dammam, Saudi Arabia
| | - Afnan Almuhanna
- College of Medicine, Imam Abdulrahman bin Faisal University, Dammam, Saudi Arabia
| | - Eliezer M Van Allen
- Department of Medical Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA
- Cancer Program, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Center for Cancer Genomics, Dana-Farber Cancer Institute, Boston, MA, 02115, USA
| | - Amein Al-Ali
- College of Medicine, Imam Abdulrahman bin Faisal University, Dammam, Saudi Arabia
| | - Saud H AlDubayan
- Department of Medical Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA.
- Cancer Program, The Broad Institute of MIT and Harvard, Cambridge, MA, USA.
- Division of Genetics, Brigham and Women's Hospital, Boston, MA, USA.
- College of Medicine, King Saud bin Abdulaziz University for Health Sciences, Riyadh, Saudi Arabia.
| |
Collapse
|
31
|
Bell SM, Evans JM, Greif EA, Tsai KL, Friedenberg SG, Clark LA. GWAS using low-pass whole genome sequence reveals a novel locus in canine congenital idiopathic megaesophagus. Mamm Genome 2023; 34:464-472. [PMID: 37041421 PMCID: PMC10600401 DOI: 10.1007/s00335-023-09991-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2023] [Accepted: 03/29/2023] [Indexed: 04/13/2023]
Abstract
Congenital idiopathic megaesophagus (CIM) is a gastrointestinal disorder of dogs wherein the esophagus is dilated and swallowing activity is reduced, causing regurgitation of ingesta. Affected individuals experience weight loss and malnourishment and are at risk for aspiration pneumonia, intussusception, and euthanasia. Great Danes have among the highest incidences of CIM across dog breeds, suggesting a genetic predisposition. We generated low-pass sequencing data for 83 Great Danes and used variant calls to impute missing whole genome single-nucleotide variants (SNVs) for each individual based on haplotypes phased from 624 high-coverage dog genomes, including 21 Great Danes. We validated the utility of our imputed data set for genome-wide association studies (GWASs) by mapping loci known to underlie coat phenotypes with simple and complex inheritance patterns. We conducted a GWAS for CIM with 2,010,300 SNVs, identifying a novel locus on canine chromosome 1 (P-val = 2.76 × 10-10). Associated SNVs are intergenic or intronic and are found in two clusters across a 1.7-Mb region. Inspection of coding regions in high-coverage genomes from affected Great Danes did not reveal candidate causal variants, suggesting that regulatory variants underlie CIM. Further studies are necessary to assess the role of these non-coding variants.
Collapse
Affiliation(s)
- Sarah M Bell
- Department of Genetics and Biochemistry, Clemson University, Clemson, SC, USA
| | - Jacquelyn M Evans
- College of Veterinary Medicine, Baker Institute for Animal Health, Cornell University, Ithaca, NY, USA
- Department of Biomedical Sciences, College of Veterinary Medicine, Cornell University, Ithaca, NY, USA
| | - Elizabeth A Greif
- Department of Genetics and Biochemistry, Clemson University, Clemson, SC, USA
| | - Kate L Tsai
- Department of Genetics and Biochemistry, Clemson University, Clemson, SC, USA
| | - Steven G Friedenberg
- Department of Veterinary Clinical Sciences, College of Veterinary Medicine, University of Minnesota, Saint Paul, MN, USA.
| | - Leigh Anne Clark
- Department of Genetics and Biochemistry, Clemson University, Clemson, SC, USA.
| |
Collapse
|
32
|
Sakaue S, Gurajala S, Curtis M, Luo Y, Choi W, Ishigaki K, Kang JB, Rumker L, Deutsch AJ, Schönherr S, Forer L, LeFaive J, Fuchsberger C, Han B, Lenz TL, de Bakker PIW, Okada Y, Smith AV, Raychaudhuri S. Tutorial: a statistical genetics guide to identifying HLA alleles driving complex disease. Nat Protoc 2023; 18:2625-2641. [PMID: 37495751 PMCID: PMC10786448 DOI: 10.1038/s41596-023-00853-4] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2022] [Accepted: 04/27/2023] [Indexed: 07/28/2023]
Abstract
The human leukocyte antigen (HLA) locus is associated with more complex diseases than any other locus in the human genome. In many diseases, HLA explains more heritability than all other known loci combined. In silico HLA imputation methods enable rapid and accurate estimation of HLA alleles in the millions of individuals that are already genotyped on microarrays. HLA imputation has been used to define causal variation in autoimmune diseases, such as type I diabetes, and in human immunodeficiency virus infection control. However, there are few guidelines on performing HLA imputation, association testing, and fine mapping. Here, we present a comprehensive tutorial to impute HLA alleles from genotype data. We provide detailed guidance on performing standard quality control measures for input genotyping data and describe options to impute HLA alleles and amino acids either locally or using the web-based Michigan Imputation Server, which hosts a multi-ancestry HLA imputation reference panel. We also offer best practice recommendations to conduct association tests to define the alleles, amino acids, and haplotypes that affect human traits. Along with the pipeline, we provide a step-by-step online guide with scripts and available software ( https://github.com/immunogenomics/HLA_analyses_tutorial ). This tutorial will be broadly applicable to large-scale genotyping data and will contribute to defining the role of HLA in human diseases across global populations.
Collapse
Affiliation(s)
- Saori Sakaue
- Center for Data Sciences, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
- Divisions of Genetics and Rheumatology, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Saisriram Gurajala
- Center for Data Sciences, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
- Divisions of Genetics and Rheumatology, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Michelle Curtis
- Center for Data Sciences, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
- Divisions of Genetics and Rheumatology, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Yang Luo
- Center for Data Sciences, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
- Divisions of Genetics and Rheumatology, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Kennedy Institute of Rheumatology, University of Oxford, Oxford, UK
| | - Wanson Choi
- Department of Biomedical Sciences, Seoul National University College of Medicine, Seoul, South Korea
| | - Kazuyoshi Ishigaki
- Center for Data Sciences, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
- Divisions of Genetics and Rheumatology, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Laboratory for Human Immunogenetics, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| | - Joyce B Kang
- Center for Data Sciences, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
- Divisions of Genetics and Rheumatology, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Laurie Rumker
- Center for Data Sciences, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
- Divisions of Genetics and Rheumatology, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Aaron J Deutsch
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Diabetes Unit, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
- Program in Metabolism, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Sebastian Schönherr
- Institute of Genetic Epidemiology, Department of Genetics, Medical University of Innsbruck, Innsbruck, Austria
| | - Lukas Forer
- Institute of Genetic Epidemiology, Department of Genetics, Medical University of Innsbruck, Innsbruck, Austria
| | - Jonathon LeFaive
- Center for Statistical Genetics, University of Michigan School of Public Health, Ann Arbor, MI, USA
- Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, MI, USA
| | - Christian Fuchsberger
- Institute of Genetic Epidemiology, Department of Genetics, Medical University of Innsbruck, Innsbruck, Austria
- Center for Statistical Genetics, University of Michigan School of Public Health, Ann Arbor, MI, USA
- Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, MI, USA
- Institute for Biomedicine, Eurac Research, Bolzano, Italy
| | - Buhm Han
- Department of Biomedical Sciences, Seoul National University College of Medicine, Seoul, South Korea
- Interdisciplinary Program in Bioengineering, Seoul National University, Seoul, South Korea
| | - Tobias L Lenz
- Research Unit for Evolutionary Immunogenomics, Department of Biology, University of Hamburg, Hamburg, Germany
| | - Paul I W de Bakker
- Data and Computational Sciences, Vertex Pharmaceuticals, Boston, MA, USA
| | - Yukinori Okada
- Department of Statistical Genetics, Osaka University Graduate School of Medicine, Suita, Japan
- Integrated Frontier Research for Medical Science Division, Institute for Open and Transdisciplinary Research Initiatives, Osaka University, Suita, Japan
- Laboratory of Statistical Immunology, Immunology Frontier Research Center (WPI-IFReC), Osaka University, Suita, Japan
- Laboratory for Systems Genetics, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
- Center for Infectious Disease Education and Research (CiDER), Osaka University, Suita, Japan
- Department of Genome Informatics, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan
| | - Albert V Smith
- Center for Statistical Genetics, University of Michigan School of Public Health, Ann Arbor, MI, USA
- Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, MI, USA
| | - Soumya Raychaudhuri
- Center for Data Sciences, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA.
- Divisions of Genetics and Rheumatology, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA.
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.
- Centre for Genetics and Genomics Versus Arthritis, University of Manchester, Manchester, UK.
| |
Collapse
|
33
|
Babadi M, Fu JM, Lee SK, Smirnov AN, Gauthier LD, Walker M, Benjamin DI, Zhao X, Karczewski KJ, Wong I, Collins RL, Sanchis-Juan A, Brand H, Banks E, Talkowski ME. GATK-gCNV enables the discovery of rare copy number variants from exome sequencing data. Nat Genet 2023; 55:1589-1597. [PMID: 37604963 PMCID: PMC10904014 DOI: 10.1038/s41588-023-01449-0] [Citation(s) in RCA: 42] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2022] [Accepted: 06/16/2023] [Indexed: 08/23/2023]
Abstract
Copy number variants (CNVs) are major contributors to genetic diversity and disease. While standardized methods, such as the genome analysis toolkit (GATK), exist for detecting short variants, technical challenges have confounded uniform large-scale CNV analyses from whole-exome sequencing (WES) data. Given the profound impact of rare and de novo coding CNVs on genome organization and human disease, we developed GATK-gCNV, a flexible algorithm to discover rare CNVs from sequencing read-depth information, complete with open-source distribution via GATK. We benchmarked GATK-gCNV in 7,962 exomes from individuals in quartet families with matched genome sequencing and microarray data, finding up to 95% recall of rare coding CNVs at a resolution of more than two exons. We used GATK-gCNV to generate a reference catalog of rare coding CNVs in WES data from 197,306 individuals in the UK Biobank, and observed strong correlations between per-gene CNV rates and measures of mutational constraint, as well as rare CNV associations with multiple traits. In summary, GATK-gCNV is a tunable approach for sensitive and specific CNV discovery in WES data, with broad applications.
Collapse
Affiliation(s)
- Mehrtash Babadi
- Data Sciences Platform, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
| | - Jack M Fu
- Program in Medical and Population Genetics and Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
- Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
| | - Samuel K Lee
- Data Sciences Platform, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Andrey N Smirnov
- Data Sciences Platform, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Laura D Gauthier
- Data Sciences Platform, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Mark Walker
- Data Sciences Platform, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
| | - David I Benjamin
- Data Sciences Platform, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Xuefang Zhao
- Program in Medical and Population Genetics and Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
- Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
| | - Konrad J Karczewski
- Program in Medical and Population Genetics and Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Isaac Wong
- Program in Medical and Population Genetics and Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
| | - Ryan L Collins
- Program in Medical and Population Genetics and Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
| | - Alba Sanchis-Juan
- Program in Medical and Population Genetics and Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
- Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
| | - Harrison Brand
- Program in Medical and Population Genetics and Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
- Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
| | - Eric Banks
- Data Sciences Platform, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Michael E Talkowski
- Program in Medical and Population Genetics and Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA.
- Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA.
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA.
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
| |
Collapse
|
34
|
Watowich MM, Chiou KL, Graves B, tague MJM, Brent LJ, Higham JP, Horvath JE, Lu A, Martinez MI, Platt ML, Schneider-Crease IA, Lea AJ, Snyder-Mackler N. Best practices for genotype imputation from low-coverage sequencing data in natural populations. Mol Ecol Resour 2023:10.1111/1755-0998.13854. [PMID: 37602981 PMCID: PMC10879460 DOI: 10.1111/1755-0998.13854] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2023] [Revised: 07/01/2023] [Accepted: 07/31/2023] [Indexed: 08/22/2023]
Abstract
Monitoring genetic diversity in wild populations is a central goal of ecological and evolutionary genetics and is critical for conservation biology. However, genetic studies of nonmodel organisms generally lack access to species-specific genotyping methods (e.g. array-based genotyping) and must instead use sequencing-based approaches. Although costs are decreasing, high-coverage whole-genome sequencing (WGS), which produces the highest confidence genotypes, remains expensive. More economical reduced representation sequencing approaches fail to capture much of the genome, which can hinder downstream inference. Low-coverage WGS combined with imputation using a high-confidence reference panel is a cost-effective alternative, but the accuracy of genotyping using low-coverage WGS and imputation in nonmodel populations is still largely uncharacterized. Here, we empirically tested the accuracy of low-coverage sequencing (0.1-10×) and imputation in two natural populations, one with a large (n = 741) reference panel, rhesus macaques (Macaca mulatta), and one with a smaller (n = 68) reference panel, gelada monkeys (Theropithecus gelada). Using samples sequenced to coverage as low as 0.5×, we could impute genotypes at >95% of the sites in the reference panel with high accuracy (median r2 ≥ 0.92). We show that low-coverage imputed genotypes can reliably calculate genetic relatedness and population structure. Based on these data, we also provide best practices and recommendations for researchers who wish to deploy this approach in other populations, with all code available on GitHub (https://github.com/mwatowich/LoCSI-for-non-model-species). Our results endorse accurate and effective genotype imputation from low-coverage sequencing, enabling the cost-effective generation of population-scale genetic datasets necessary for tackling many pressing challenges of wildlife conservation.
Collapse
Affiliation(s)
- Marina M. Watowich
- Department of Biology, University of Washington; Seattle, WA, 98195 USA
- Department of Biological Sciences, Vanderbilt University; Nashville, TN, 37235
| | - Kenneth L. Chiou
- Center for Evolution and Medicine, Arizona State University; Tempe, AZ, 85281 USA
- School of Life Sciences, Arizona State University; Tempe, AZ, 85281 USA
| | - Brian Graves
- Program in Ecology, Evolution, and Conservation Biology, University of Illinois at Urbana-Champaign; Urbana, IL 61801
| | - Michael J. Mon tague
- Department of Neuroscience, Perelman School of Medicine; University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Lauren J.N. Brent
- Centre for Research in Animal Behaviour, University of Exeter; Exeter EX4 4QG, UK
| | - James P. Higham
- Department of Anthropology, New York University; New York, NY 10003, USA
- New York Consortium in Evolutionary Primatology; New York, NY, 10016 USA
| | - Julie E. Horvath
- Department of Biological and Biomedical Sciences, North Carolina Central University; Durham, NC 27707, USA
- Research and Collections Section, North Carolina Museum of Natural Sciences; Raleigh, NC 27601, USA
- Department of Biological Sciences, North Carolina State University; Raleigh, NC 27695, USA
- Department of Evolutionary Anthropology, Duke University; Durham, NC 27708, USA
| | - Amy Lu
- Department of Anthropology, Stony Brook University, Stony Brook, NY, 11794, USA
| | - Melween I. Martinez
- Caribbean Primate Research Center, Unit of Comparative Medicine, University of Puerto Rico; San Juan, PR 00936, USA
| | - Michael L. Platt
- Department of Neuroscience, Perelman School of Medicine; University of Pennsylvania, Philadelphia, PA 19104, USA
- Department of Psychology, School of Arts and Sciences; University of Pennsylvania, Philadelphia, PA 19104, USA
- Marketing Department, Wharton School of Business; University of Pennsylvania, Philadelphia, PA 19104, USA
| | - India A. Schneider-Crease
- Center for Evolution and Medicine, Arizona State University; Tempe, AZ, 85281 USA
- School of Life Sciences, Arizona State University; Tempe, AZ, 85281 USA
- School of Human Evolution and Social Change, Arizona State University; Tempe, AZ, 85281 USA
| | - Amanda J. Lea
- Department of Biological Sciences, Vanderbilt University, Nashville, Tennessee, 37235, USA
- Child and Brain Development, Canadian Institute for Advanced Research, Toronto, Canada
| | - Noah Snyder-Mackler
- Center for Evolution and Medicine, Arizona State University; Tempe, AZ, 85281 USA
- School of Life Sciences, Arizona State University; Tempe, AZ, 85281 USA
- School of Human Evolution and Social Change, Arizona State University; Tempe, AZ, 85281 USA
- Neurodegenerative Disease Research Center, Arizona State University; Tempe, AZ, 85281 USA
| |
Collapse
|
35
|
Meadows JRS, Kidd JM, Wang GD, Parker HG, Schall PZ, Bianchi M, Christmas MJ, Bougiouri K, Buckley RM, Hitte C, Nguyen AK, Wang C, Jagannathan V, Niskanen JE, Frantz LAF, Arumilli M, Hundi S, Lindblad-Toh K, Ginja C, Agustina KK, André C, Boyko AR, Davis BW, Drögemüller M, Feng XY, Gkagkavouzis K, Iliopoulos G, Harris AC, Hytönen MK, Kalthoff DC, Liu YH, Lymberakis P, Poulakakis N, Pires AE, Racimo F, Ramos-Almodovar F, Savolainen P, Venetsani S, Tammen I, Triantafyllidis A, vonHoldt B, Wayne RK, Larson G, Nicholas FW, Lohi H, Leeb T, Zhang YP, Ostrander EA. Genome sequencing of 2000 canids by the Dog10K consortium advances the understanding of demography, genome function and architecture. Genome Biol 2023; 24:187. [PMID: 37582787 PMCID: PMC10426128 DOI: 10.1186/s13059-023-03023-7] [Citation(s) in RCA: 45] [Impact Index Per Article: 22.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2022] [Accepted: 07/25/2023] [Indexed: 08/17/2023] Open
Abstract
BACKGROUND The international Dog10K project aims to sequence and analyze several thousand canine genomes. Incorporating 20 × data from 1987 individuals, including 1611 dogs (321 breeds), 309 village dogs, 63 wolves, and four coyotes, we identify genomic variation across the canid family, setting the stage for detailed studies of domestication, behavior, morphology, disease susceptibility, and genome architecture and function. RESULTS We report the analysis of > 48 M single-nucleotide, indel, and structural variants spanning the autosomes, X chromosome, and mitochondria. We discover more than 75% of variation for 239 sampled breeds. Allele sharing analysis indicates that 94.9% of breeds form monophyletic clusters and 25 major clades. German Shepherd Dogs and related breeds show the highest allele sharing with independent breeds from multiple clades. On average, each breed dog differs from the UU_Cfam_GSD_1.0 reference at 26,960 deletions and 14,034 insertions greater than 50 bp, with wolves having 14% more variants. Discovered variants include retrogene insertions from 926 parent genes. To aid functional prioritization, single-nucleotide variants were annotated with SnpEff and Zoonomia phyloP constraint scores. Constrained positions were negatively correlated with allele frequency. Finally, the utility of the Dog10K data as an imputation reference panel is assessed, generating high-confidence calls across varied genotyping platform densities including for breeds not included in the Dog10K collection. CONCLUSIONS We have developed a dense dataset of 1987 sequenced canids that reveals patterns of allele sharing, identifies likely functional variants, informs breed structure, and enables accurate imputation. Dog10K data are publicly available.
Collapse
Affiliation(s)
- Jennifer R S Meadows
- Science for Life Laboratory, Department of Medical Biochemistry and Microbiology, Uppsala University, 75132, Uppsala, Sweden.
| | - Jeffrey M Kidd
- Department of Human Genetics, University of Michigan, Ann Arbor, MI, 48107, USA.
| | - Guo-Dong Wang
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, 650223, China
| | - Heidi G Parker
- National Human Genome Research Institute, National Institutes of Health, 50 South Drive, Building 50 Room 5351, Bethesda, MD, 20892, USA
| | - Peter Z Schall
- Department of Human Genetics, University of Michigan, Ann Arbor, MI, 48107, USA
| | - Matteo Bianchi
- Science for Life Laboratory, Department of Medical Biochemistry and Microbiology, Uppsala University, 75132, Uppsala, Sweden
| | - Matthew J Christmas
- Science for Life Laboratory, Department of Medical Biochemistry and Microbiology, Uppsala University, 75132, Uppsala, Sweden
| | - Katia Bougiouri
- Section for Molecular Ecology and Evolution, Globe Institute, University of Copenhagen, Øster Voldgade 5-7, 1350, Copenhagen, Denmark
| | - Reuben M Buckley
- National Human Genome Research Institute, National Institutes of Health, 50 South Drive, Building 50 Room 5351, Bethesda, MD, 20892, USA
| | - Christophe Hitte
- University of Rennes, CNRS, Institute Genetics and Development Rennes - UMR6290, 35000, Rennes, France
| | - Anthony K Nguyen
- Department of Human Genetics, University of Michigan, Ann Arbor, MI, 48107, USA
| | - Chao Wang
- Science for Life Laboratory, Department of Medical Biochemistry and Microbiology, Uppsala University, 75132, Uppsala, Sweden
| | - Vidhya Jagannathan
- Institute of Genetics, Vetsuisse Faculty, University of Bern, 3001, Bern, Switzerland
| | - Julia E Niskanen
- Department of Medical and Clinical Genetics, Department of Veterinary Biosciences, University of Helsinki and Folkhälsan Research Center, 02900, Helsinki, Finland
| | - Laurent A F Frantz
- School of Biological and Behavioural Sciences, Queen Mary University of London, London E14NS, UK and Palaeogenomics Group, Department of Veterinary Sciences, Ludwig Maximilian University, D-80539, Munich, Germany
| | - Meharji Arumilli
- Department of Medical and Clinical Genetics, Department of Veterinary Biosciences, University of Helsinki and Folkhälsan Research Center, 02900, Helsinki, Finland
| | - Sruthi Hundi
- Department of Medical and Clinical Genetics, Department of Veterinary Biosciences, University of Helsinki and Folkhälsan Research Center, 02900, Helsinki, Finland
| | - Kerstin Lindblad-Toh
- Science for Life Laboratory, Department of Medical Biochemistry and Microbiology, Uppsala University, 75132, Uppsala, Sweden
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
| | - Catarina Ginja
- BIOPOLIS-CIBIO-InBIO-Centro de Investigação Em Biodiversidade E Recursos Genéticos - ArchGen Group, Universidade Do Porto, 4485-661, Vairão, Portugal
| | | | - Catherine André
- University of Rennes, CNRS, Institute Genetics and Development Rennes - UMR6290, 35000, Rennes, France
| | - Adam R Boyko
- Department of Biomedical Sciences, Cornell University, 930 Campus Road, Ithaca, NY, 14853, USA
| | - Brian W Davis
- Department of Veterinary Integrative Biosciences, School of Veterinary Medicine and Biomedical Sciences, Texas A&M University, College Station, TX, 77843, USA
| | - Michaela Drögemüller
- Institute of Genetics, Vetsuisse Faculty, University of Bern, 3001, Bern, Switzerland
| | - Xin-Yao Feng
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, 650223, China
| | - Konstantinos Gkagkavouzis
- Department of Genetics, School of Biology, ), Aristotle University of Thessaloniki, Thessaloniki, Macedonia 54124, Greece and Genomics and Epigenomics Translational Research (GENeTres), Center for Interdisciplinary Research and Innovation (CIRI-AUTH, Balkan Center, Thessaloniki, Greece
| | - Giorgos Iliopoulos
- NGO "Callisto", Wildlife and Nature Conservation Society, 54621, Thessaloniki, Greece
| | - Alexander C Harris
- National Human Genome Research Institute, National Institutes of Health, 50 South Drive, Building 50 Room 5351, Bethesda, MD, 20892, USA
| | - Marjo K Hytönen
- Department of Medical and Clinical Genetics, Department of Veterinary Biosciences, University of Helsinki and Folkhälsan Research Center, 02900, Helsinki, Finland
| | - Daniela C Kalthoff
- NGO "Callisto", Wildlife and Nature Conservation Society, 54621, Thessaloniki, Greece
| | - Yan-Hu Liu
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, 650223, China
| | - Petros Lymberakis
- Natural History Museum of Crete & Department of Biology, University of Crete, 71202, Irakleio, Greece
- Biology Department, School of Sciences and Engineering, University of Crete, Heraklion, Greece
- Palaeogenomics and Evolutionary Genetics Lab, Institute of Molecular Biology and Biotechnology (IMBB), Foundation for Research and Technology - Hellas (FORTH), Heraklion, Greece
| | - Nikolaos Poulakakis
- Natural History Museum of Crete & Department of Biology, University of Crete, 71202, Irakleio, Greece
- Biology Department, School of Sciences and Engineering, University of Crete, Heraklion, Greece
- Palaeogenomics and Evolutionary Genetics Lab, Institute of Molecular Biology and Biotechnology (IMBB), Foundation for Research and Technology - Hellas (FORTH), Heraklion, Greece
| | - Ana Elisabete Pires
- BIOPOLIS-CIBIO-InBIO-Centro de Investigação Em Biodiversidade E Recursos Genéticos - ArchGen Group, Universidade Do Porto, 4485-661, Vairão, Portugal
| | - Fernando Racimo
- Section for Molecular Ecology and Evolution, Globe Institute, University of Copenhagen, Øster Voldgade 5-7, 1350, Copenhagen, Denmark
| | | | - Peter Savolainen
- Department of Gene Technology, Science for Life Laboratory, KTH - Royal Institute of Technology, 17121, Solna, Sweden
| | - Semina Venetsani
- Department of Genetics, School of Biology, Aristotle University of Thessaloniki, 54124, Thessaloniki, Macedonia, Greece
| | - Imke Tammen
- Sydney School of Veterinary Science, The University of Sydney, Sydney, NSW, 2570, Australia
| | - Alexandros Triantafyllidis
- Department of Genetics, School of Biology, ), Aristotle University of Thessaloniki, Thessaloniki, Macedonia 54124, Greece and Genomics and Epigenomics Translational Research (GENeTres), Center for Interdisciplinary Research and Innovation (CIRI-AUTH, Balkan Center, Thessaloniki, Greece
| | - Bridgett vonHoldt
- Department of Ecology and Evolutionary Biology, Princeton University, Princeton, NJ, 08544, USA
| | - Robert K Wayne
- Department of Ecology and Evolutionary Biology, Ecology and Evolutionary Biology, University of California, Los Angeles, CA, 90095-7246, USA
| | - Greger Larson
- Palaeogenomics and Bio-Archaeology Research Network, School of Archaeology, University of Oxford, Oxford, OX1 3TG, UK
| | - Frank W Nicholas
- Sydney School of Veterinary Science, The University of Sydney, Sydney, NSW, 2570, Australia
| | - Hannes Lohi
- Department of Medical and Clinical Genetics, Department of Veterinary Biosciences, University of Helsinki and Folkhälsan Research Center, 02900, Helsinki, Finland
| | - Tosso Leeb
- Institute of Genetics, Vetsuisse Faculty, University of Bern, 3001, Bern, Switzerland
| | - Ya-Ping Zhang
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, 650223, China
| | - Elaine A Ostrander
- National Human Genome Research Institute, National Institutes of Health, 50 South Drive, Building 50 Room 5351, Bethesda, MD, 20892, USA.
| |
Collapse
|
36
|
Rubinacci S, Hofmeister RJ, Sousa da Mota B, Delaneau O. Imputation of low-coverage sequencing data from 150,119 UK Biobank genomes. Nat Genet 2023:10.1038/s41588-023-01438-3. [PMID: 37386250 DOI: 10.1038/s41588-023-01438-3] [Citation(s) in RCA: 34] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2022] [Accepted: 05/31/2023] [Indexed: 07/01/2023]
Abstract
The release of 150,119 UK Biobank sequences represents an unprecedented opportunity as a reference panel to impute low-coverage whole-genome sequencing data with high accuracy but current methods cannot cope with the size of the data. Here we introduce GLIMPSE2, a low-coverage whole-genome sequencing imputation method that scales sublinearly in both the number of samples and markers, achieving efficient whole-genome imputation from the UK Biobank reference panel while retaining high accuracy for ancient and modern genomes, particularly at rare variants and for very low-coverage samples.
Collapse
Affiliation(s)
- Simone Rubinacci
- Department of Computational Biology, University of Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Robin J Hofmeister
- Department of Computational Biology, University of Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Bárbara Sousa da Mota
- Department of Computational Biology, University of Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Olivier Delaneau
- Department of Computational Biology, University of Lausanne, Lausanne, Switzerland.
- Swiss Institute of Bioinformatics, Lausanne, Switzerland.
| |
Collapse
|
37
|
Bebo A, Jarmul JA, Pletcher MJ, Hasbani NR, Couper D, Nambi V, Ballantyne CM, Fornage M, Morrison AC, Avery CL, de Vries PS. Coronary heart disease and ischemic stroke polygenic risk scores and atherosclerotic cardiovascular disease in a diverse, population-based cohort study. PLoS One 2023; 18:e0285259. [PMID: 37327218 PMCID: PMC10275447 DOI: 10.1371/journal.pone.0285259] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2022] [Accepted: 04/18/2023] [Indexed: 06/18/2023] Open
Abstract
The predictive ability of coronary heart disease (CHD) and ischemic stroke (IS) polygenic risk scores (PRS) have been evaluated individually, but whether they predict the combined outcome of atherosclerotic cardiovascular disease (ASCVD) remains insufficiently researched. It is also unclear whether associations of the CHD and IS PRS with ASCVD are independent of subclinical atherosclerosis measures. 7,286 White and 2,016 Black participants from the population-based Atherosclerosis Risk in Communities study who were free of cardiovascular disease and type 2 diabetes at baseline were included. We computed previously validated CHD and IS PRS consisting of 1,745,179 and 3,225,583 genetic variants, respectively. Cox proportional hazards models were used to test the association between each PRS and ASCVD, adjusting for traditional risk factors, ankle-brachial index, carotid intima media thickness, and carotid plaque. The hazard ratios (HR) for the CHD and IS PRS were significant with HR of 1.50 (95% CI: 1.36-1.66) and 1.31 (95% CI: 1.18-1.45) respectively for the risk of incident ASCVD per standard deviation increase in CHD and IS PRS among White participants after adjusting for traditional risk factors. The HR for the CHD PRS was not significant with an HR of 0.95 (95% CI: 0.79-1.13) for the risk of incident ASCVD in Black participants. The HR for the IS PRS was significant with an HR of 1.26 (95%CI: 1.05-1.51) for the risk of incident ASCVD in Black participants. The association of the CHD and IS PRS with ASCVD was not attenuated in White participants after adjustment for ankle-brachial index, carotid intima media thickness, and carotid plaque. The CHD and IS PRS do not cross-predict well, and predict better the outcome for which they were created than the composite ASCVD outcome. Thus, the use of the composite outcome of ASCVD may not be ideal for genetic risk prediction.
Collapse
Affiliation(s)
- Allison Bebo
- Human Genetics Center, Department of Epidemiology, Human Genetics and Environmental Sciences, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, United States of America
| | - Jamie A. Jarmul
- Gillings School of Public Health, Department of Health Policy and Management, University of North Carolina, Chapel Hill, Chapel Hill, NC, United States of America
- School of Medicine, University of North Carolina, Chapel Hill, Chapel Hill, NC, United States of America
| | - Mark J. Pletcher
- Departments of Epidemiology and Biostatistics and Medicine, University of California, San Francisco, San Francisco, CA, United States of America
| | - Natalie R. Hasbani
- Human Genetics Center, Department of Epidemiology, Human Genetics and Environmental Sciences, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, United States of America
| | - David Couper
- Gillings School of Public Health, Department of Biostatistics, University of North Carolina, Chapel Hill, Chapel Hill, NC, United States of America
- Collaborative Studies Coordinating Center, University of North Carolina, Chapel Hill, Chapel Hill, NC, United States of America
| | - Vijay Nambi
- Baylor College of Medicine, Houston, TX, United States of America
- Michael E DeBakey Veterans Affairs Medical Center, Houston, TX, United States of America
| | | | - Myriam Fornage
- Human Genetics Center, Department of Epidemiology, Human Genetics and Environmental Sciences, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, United States of America
- McGovern Medical School Institute of Molecular Medicine Research Center for Human Genetics, Houston, TX, United States of America
| | - Alanna C. Morrison
- Human Genetics Center, Department of Epidemiology, Human Genetics and Environmental Sciences, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, United States of America
| | - Christy L. Avery
- Gillings School of Public Health, Department of Epidemiology, University of North Carolina, Chapel Hill, Chapel Hill, NC, United States of America
- Carolina Population Center, University of North Carolina–Chapel Hill, Chapel Hill, NC, United States of America
| | - Paul S. de Vries
- Human Genetics Center, Department of Epidemiology, Human Genetics and Environmental Sciences, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, United States of America
| |
Collapse
|
38
|
Bocher O, Willer CJ, Zeggini E. Unravelling the genetic architecture of human complex traits through whole genome sequencing. Nat Commun 2023; 14:3520. [PMID: 37316478 DOI: 10.1038/s41467-023-39259-x] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2022] [Accepted: 06/06/2023] [Indexed: 06/16/2023] Open
Affiliation(s)
- Ozvan Bocher
- Institute of Translational Genomics, Helmholtz Zentrum München, German Research Center for Environmental Health, 85764, Neuherberg, Germany
| | - Cristen J Willer
- Department of Internal Medicine, Division of Cardiology, University of Michigan, Ann Arbor, MI, 48109, USA
- Department of Human Genetics, University of Michigan, Ann Arbor, MI, 48109, USA
- Department of Computational Medicine and Biostatistics, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Eleftheria Zeggini
- Institute of Translational Genomics, Helmholtz Zentrum München, German Research Center for Environmental Health, 85764, Neuherberg, Germany.
- Technical University of Munich (TUM) and Klinikum Rechts der Isar, TUM School of Medicine, Ismaninger Str. 22, 81675, Munich, Germany.
| |
Collapse
|
39
|
Jesudoss Chelladurai JRJ, Abraham A, Quintana TA, Ritchie D, Smith V. Comparative Genomic Analysis and Species Delimitation: A Case for Two Species in the Zoonotic Cestode Dipylidium caninum. Pathogens 2023; 12:pathogens12050675. [PMID: 37242345 DOI: 10.3390/pathogens12050675] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2023] [Revised: 04/06/2023] [Accepted: 04/20/2023] [Indexed: 05/28/2023] Open
Abstract
Dipylidium caninum (Linnaeus, 1758) is a common zoonotic cestode of dogs and cats worldwide. Previous studies have demonstrated the existence of largely host-associated canine and feline genotypes based on infection studies, differences at the 28S rDNA gene, and complete mitochondrial genomes. There have been no comparative genome-wide studies. Here, we sequenced the genomes of a dog and cat isolate of Dipylidium caninum from the United States using the Illumina platform at mean coverage depths of 45× and 26× and conducted comparative analyses with the reference draft genome. Complete mitochondrial genomes were used to confirm the genotypes of the isolates. Genomes of D. caninum canine and feline genotypes generated in this study, had an average identity of 98% and 89%, respectively, when compared to the reference genome. SNPs were 20 times higher in the feline isolate. Comparison and species delimitation using universally conserved orthologs and protein-coding mitochondrial genes revealed that the canine and feline isolates are different species. Data from this study build a base for future integrative taxonomy. Further genomic studies from geographically diverse populations are necessary to understand implications for taxonomy, epidemiology, veterinary clinical medicine, and anthelmintic resistance.
Collapse
Affiliation(s)
- Jeba R J Jesudoss Chelladurai
- Department of Diagnostic Medicine/Pathobiology, College of Veterinary Medicine, Kansas State University, Manhattan, KS 66506, USA
| | - Aloysius Abraham
- Department of Biotechnology, Alagappa University, Karaikudi 630003, India
| | - Theresa A Quintana
- Department of Diagnostic Medicine/Pathobiology, College of Veterinary Medicine, Kansas State University, Manhattan, KS 66506, USA
| | - Deb Ritchie
- Department of Diagnostic Medicine/Pathobiology, College of Veterinary Medicine, Kansas State University, Manhattan, KS 66506, USA
| | - Vicki Smith
- Department of Diagnostic Medicine/Pathobiology, College of Veterinary Medicine, Kansas State University, Manhattan, KS 66506, USA
| |
Collapse
|
40
|
Jesudoss Chelladurai JRJ, Abraham A, Quintana T, Smith V, Ritchie D. Genomic differences and species delimitation: a case for two species in the zoonotic cestode Dipylidium caninum. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.02.23.529708. [PMID: 36865108 PMCID: PMC9980070 DOI: 10.1101/2023.02.23.529708] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/25/2023]
Abstract
Dipylidium caninum (Linnaeus, 1758) is a common zoonotic cestode of dogs and cats worldwide. Previous studies have demonstrated the existence of largely host associated canine and feline genotypes based on infection studies, genetic differences at the nuclear 28S rDNA gene and complete mitochondrial genomes. There have been no comparative studies at a genome-wide scale. Here, we sequenced the genomes of a dog and cat isolate of Dipylidium caninum from the United States using the Illumina platform and conducted comparative analyses with the reference draft genome. Complete mitochondrial genomes were used to confirm the genotypes of the isolates. D. caninum canine and feline genomes generated in this study had mean coverage depths of 45x and 26x and an average identity of 98% and 89% respectively when compared to the reference genome. SNPs were 20 times higher in the feline isolate. Comparison and species delimitation using universally conserved orthologs and protein coding mitochondrial genes revealed that the canine and feline isolates are different species. Data from this study builds a base for future integrative taxonomy. Further genomic studies from geographically diverse populations are necessary to understand implications for taxonomy, epidemiology, veterinary clinical medicine, and anthelmintic resistance.
Collapse
|
41
|
Attur M, Petrilli C, Adhikari S, Iturrate E, Li X, Tuminello S, Hu N, Chakravarti A, Beck D, Abramson SB. Interleukin-1 receptor antagonist gene ( IL1RN ) variants modulate the cytokine release syndrome and mortality of SARS-CoV-2. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.01.09.23284348. [PMID: 36711766 PMCID: PMC9882468 DOI: 10.1101/2023.01.09.23284348] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
Abstract
Objective To explore the regulation of the inflammatory response in acute SARS-CoV-2 infection, we examined effects of single nucleotide variants (SNVs) of IL1RN , the gene encoding the anti-inflammatory IL-1 receptor antagonist (IL-1Ra), on the cytokine release syndrome and mortality. Methods We studied 2589 patients hospitalized with SARS-CoV-2 between March 2020 and March 2021 at NYU Langone's Tisch Hospital. CTA and TTG haplotypes formed from three SNVs (rs419598, rs315952, rs9005) and the individual SNVs of the IL1RN gene were assessed for association with laboratory markers of the cytokine release syndrome (CRS) and mortality. Results Mortality in the population was 15.3%, and was lower in women than men (13.1% vs.17.3%, p<0.0003). Carriers of the CTA-1/2 IL1RN haplotypes exhibited decreased inflammatory markers and increased plasma IL-1Ra relative to TTG carriers. Decreased mortality among CTA-1/2 carriers was observed in male patients between the ages of 55-74 [9.2% vs. 17.9%, p=0.001]. Evaluation of individual SNVs of the IL1RN gene (rs419598, rs315952, rs9005) indicated that carriers of the IL1RN rs419598 CC SNV exhibited lower inflammatory biomarker levels, and was associated with reduced mortality compared to the CT/TT genotype in men (OR 0.49 (0.23 - 1.00); 0.052), with the most pronounced effect observed between the ages of 55-74 [5.5% vs. 18.4%, p<0.001]. Conclusion The IL1RN haplotype CTA, and sequence variant of rs419598 are associated with attenuation of the cytokine release syndrome and decreased mortality in males with acute SARS-CoV2 infection. The data suggest that IL1RN modulates the COVID-19 cytokine release syndrome via endogenous " anti-inflammatory" mechanisms. Significance statement We provide evidence that variants of IL1RN modulate the severity of SARS-CoV-2 infection. The IL1RN CTA haplotype and rs419598 CC single nucleotide variant are associated with decreased plasma levels of inflammatory markers, interleukin-1 beta (IL-1β), interleukin-6 (IL-6), interleukin-2 (IL-2), C-reactive protein (CRP), D-dimer, ferritin, and procalcitonin, in association with higher levels of IL-1Ra and IL-10, anti-inflammatory proteins. Both haplotype CTA and rs419598 CC genotype are associated with a significant reduction in the mortality of men. These data provide genetic evidence that inflammasome activation and the IL-1 pathway plays an important role in the mortality and morbidity associated with severe SARS-CoV-2 infection, and that genetic regulation of inflammatory pathways by variants of IL1RN merits further evaluation in severe SARS-CoV-2 infection.
Collapse
|
42
|
Wang D, Xie K, Wang Y, Hu J, Li W, Yang A, Zhang Q, Ning C, Fan X. Cost-effectively dissecting the genetic architecture of complex wool traits in rabbits by low-coverage sequencing. Genet Sel Evol 2022; 54:75. [PMCID: PMC9673297 DOI: 10.1186/s12711-022-00766-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2022] [Accepted: 10/31/2022] [Indexed: 11/19/2022] Open
Abstract
Background Rabbit wool traits are important in fiber production and for model organism research on hair growth, but their genetic architecture remains obscure. In this study, we focused on wool characteristics in Angora rabbits, a breed well-known for the quality of its wool. Considering the cost to generate population-scale sequence data and the biased detection of variants using chip data, developing an effective genotyping strategy using low-coverage whole-genome sequencing (LCS) data is necessary to conduct genetic analyses. Results Different genotype imputation strategies (BaseVar + STITCH, Bcftools + Beagle4, and GATK + Beagle5), sequencing coverages (0.1X, 0.5X, 1.0X, 1.5X, and 2.0X), and sample sizes (100, 200, 300, 400, 500, and 600) were compared. Our results showed that using BaseVar + STITCH at a sequencing depth of 1.0X with a sample size larger than 300 resulted in the highest genotyping accuracy, with a genotype concordance higher than 98.8% and genotype accuracy higher than 0.97. We performed multivariate genome-wide association studies (GWAS), followed by conditional GWAS and estimation of the confidence intervals of quantitative trait loci (QTL) to investigate the genetic architecture of wool traits. Six QTL were detected, which explained 0.4 to 7.5% of the phenotypic variation. Gene-level mapping identified the fibroblast growth factor 10 (FGF10) gene as associated with fiber growth and diameter, which agrees with previous results from functional data analyses on the FGF gene family in other species, and is relevant for wool rabbit breeding. Conclusions We suggest that LCS followed by imputation can be a cost-effective alternative to array and high-depth sequencing for assessing common variants. GWAS combined with LCS can identify new QTL and candidate genes that are associated with quantitative traits. This study provides a cost-effective and powerful method for investigating the genetic architecture of complex traits, which will be useful for genomic breeding applications. Supplementary Information The online version contains supplementary material available at 10.1186/s12711-022-00766-y.
Collapse
Affiliation(s)
- Dan Wang
- grid.440622.60000 0000 9482 4676College of Animal Science and Veterinary Medicine, Shandong Agricultural University, Tai’an, China
| | - Kerui Xie
- grid.440622.60000 0000 9482 4676College of Animal Science and Veterinary Medicine, Shandong Agricultural University, Tai’an, China
| | - Yanyan Wang
- grid.440622.60000 0000 9482 4676College of Animal Science and Veterinary Medicine, Shandong Agricultural University, Tai’an, China
| | - Jiaqing Hu
- grid.440622.60000 0000 9482 4676College of Animal Science and Veterinary Medicine, Shandong Agricultural University, Tai’an, China
| | - Wenqiang Li
- grid.440622.60000 0000 9482 4676College of Animal Science and Veterinary Medicine, Shandong Agricultural University, Tai’an, China
| | - Aiguo Yang
- grid.440622.60000 0000 9482 4676College of Animal Science and Veterinary Medicine, Shandong Agricultural University, Tai’an, China
| | - Qin Zhang
- grid.440622.60000 0000 9482 4676College of Animal Science and Veterinary Medicine, Shandong Agricultural University, Tai’an, China
| | - Chao Ning
- grid.440622.60000 0000 9482 4676College of Animal Science and Veterinary Medicine, Shandong Agricultural University, Tai’an, China
| | - Xinzhong Fan
- grid.440622.60000 0000 9482 4676College of Animal Science and Veterinary Medicine, Shandong Agricultural University, Tai’an, China
| |
Collapse
|
43
|
Atkinson EG, Dalvie S, Pichkar Y, Kalungi A, Majara L, Stevenson A, Abebe T, Akena D, Alemayehu M, Ashaba FK, Atwoli L, Baker M, Chibnik LB, Creanza N, Daly MJ, Fekadu A, Gelaye B, Gichuru S, Injera WE, James R, Kariuki SM, Kigen G, Koen N, Koenen KC, Koenig Z, Kwobah E, Kyebuzibwa J, Musinguzi H, Mwema RM, Neale BM, Newman CP, Newton CRJC, Ongeri L, Ramachandran S, Ramesar R, Shiferaw W, Stein DJ, Stroud RE, Teferra S, Yohannes MT, Zingela Z, Martin AR. Genetic structure correlates with ethnolinguistic diversity in eastern and southern Africa. Am J Hum Genet 2022; 109:1667-1679. [PMID: 36055213 PMCID: PMC9502052 DOI: 10.1016/j.ajhg.2022.07.013] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2022] [Accepted: 07/28/2022] [Indexed: 12/22/2022] Open
Abstract
African populations are the most diverse in the world yet are sorely underrepresented in medical genetics research. Here, we examine the structure of African populations using genetic and comprehensive multi-generational ethnolinguistic data from the Neuropsychiatric Genetics of African Populations-Psychosis study (NeuroGAP-Psychosis) consisting of 900 individuals from Ethiopia, Kenya, South Africa, and Uganda. We find that self-reported language classifications meaningfully tag underlying genetic variation that would be missed with consideration of geography alone, highlighting the importance of culture in shaping genetic diversity. Leveraging our uniquely rich multi-generational ethnolinguistic metadata, we track language transmission through the pedigree, observing the disappearance of several languages in our cohort as well as notable shifts in frequency over three generations. We find suggestive evidence for the rate of language transmission in matrilineal groups having been higher than that for patrilineal ones. We highlight both the diversity of variation within Africa as well as how within-Africa variation can be informative for broader variant interpretation; many variants that are rare elsewhere are common in parts of Africa. The work presented here improves the understanding of the spectrum of genetic variation in African populations and highlights the enormous and complex genetic and ethnolinguistic diversity across Africa.
Collapse
Affiliation(s)
- Elizabeth G Atkinson
- Analytic and Translational Genetics Unit and Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA; Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
| | - Shareefa Dalvie
- Department of Psychiatry and Mental Health, University of Cape Town, Cape Town, South Africa; South African Medical Research Council (SAMRC) Unit on Risk and Resilience in Mental Disorders, Neuroscience Institute, University of Cape Town, Cape Town, South Africa
| | - Yakov Pichkar
- Department of Biological Sciences and Evolutionary Studies Initiative, Vanderbilt University, Nashville, TN, USA
| | - Allan Kalungi
- Department of Psychiatry, School of Medicine, College of Health Sciences, Makerere University, Kampala, Uganda; Mental Health Section of MRC/UVRI & LSHTM Uganda Research Unit, Entebbe, Uganda
| | - Lerato Majara
- Department of Psychiatry and Mental Health, University of Cape Town, Cape Town, South Africa; South African Medical Research Council (SAMRC) Human Genetics Research Unit, Division of Human Genetics, Institute of Infectious Disease and Molecular Medicine, University of Cape Town, Cape Town, South Africa
| | - Anne Stevenson
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA; Department of Epidemiology, Harvard T. H. Chan School of Public Health, Boston, MA, USA; Department of Psychiatry, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
| | - Tamrat Abebe
- Department of Microbiology, Immunology, and Parasitology, School of Medicine, College of Health Sciences, Addis Ababa University, Addis Ababa, Ethiopia
| | - Dickens Akena
- Department of Psychiatry, School of Medicine, College of Health Sciences, Makerere University, Kampala, Uganda
| | - Melkam Alemayehu
- Department of Psychiatry, School of Medicine, College of Health Sciences, Addis Ababa University, Addis Ababa, Ethiopia
| | - Fred K Ashaba
- Department of Immunology & Molecular Biology, College of Health Sciences, Makerere University, Kampala, Uganda
| | - Lukoye Atwoli
- Department of Mental Health, School of Medicine, Moi University College of Health Sciences, Eldoret, Kenya; Brain and Mind Institute and Department of Internal Medicine, Medical College East Africa, the Aga Khan University, Nairobi, Kenya
| | - Mark Baker
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Lori B Chibnik
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA; Department of Neurology, Massachusetts General Hospital, Boston, MA, USA
| | - Nicole Creanza
- Department of Biological Sciences and Evolutionary Studies Initiative, Vanderbilt University, Nashville, TN, USA
| | - Mark J Daly
- Analytic and Translational Genetics Unit and Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA; Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Abebaw Fekadu
- Department of Psychiatry, School of Medicine, College of Health Sciences, Addis Ababa University, Addis Ababa, Ethiopia; Centre for Innovative Drug Development & Therapeutic Trials for Africa, Addis Ababa University, Addis Ababa, Ethiopia
| | - Bizu Gelaye
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA; Department of Epidemiology, Harvard T. H. Chan School of Public Health, Boston, MA, USA
| | - Stella Gichuru
- Department of Mental Health, School of Medicine, Moi University College of Health Sciences, Eldoret, Kenya
| | - Wilfred E Injera
- Department of Immunology, School of Medicine, Moi University College of Health Sciences, Eldoret, Kenya
| | - Roxanne James
- Department of Psychiatry and Mental Health, University of Cape Town, Cape Town, South Africa
| | - Symon M Kariuki
- Neurosciences Unit, Clinical Department, KEMRI-Wellcome Trust Research Programme-Coast, Kilifi, Kenya; Department of Psychiatry, University of Oxford, Oxford, UK
| | - Gabriel Kigen
- Department of Pharmacology and Toxicology, School of Medicine, Moi University College of Health Sciences, Eldoret, Kenya
| | - Nastassja Koen
- Department of Psychiatry and Mental Health, University of Cape Town, Cape Town, South Africa; South African Medical Research Council (SAMRC) Unit on Risk and Resilience in Mental Disorders, Neuroscience Institute, University of Cape Town, Cape Town, South Africa
| | - Karestan C Koenen
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA; Department of Epidemiology, Harvard T. H. Chan School of Public Health, Boston, MA, USA
| | - Zan Koenig
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Edith Kwobah
- Department of Mental Health, School of Medicine, Moi University College of Health Sciences, Eldoret, Kenya
| | - Joseph Kyebuzibwa
- Department of Psychiatry, School of Medicine, College of Health Sciences, Makerere University, Kampala, Uganda
| | - Henry Musinguzi
- Department of Immunology & Molecular Biology, College of Health Sciences, Makerere University, Kampala, Uganda
| | - Rehema M Mwema
- Neurosciences Unit, Clinical Department, KEMRI-Wellcome Trust Research Programme-Coast, Kilifi, Kenya
| | - Benjamin M Neale
- Analytic and Translational Genetics Unit and Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA; Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Carter P Newman
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA; Department of Epidemiology, Harvard T. H. Chan School of Public Health, Boston, MA, USA
| | - Charles R J C Newton
- Neurosciences Unit, Clinical Department, KEMRI-Wellcome Trust Research Programme-Coast, Kilifi, Kenya; Department of Psychiatry, University of Oxford, Oxford, UK
| | - Linnet Ongeri
- Neurosciences Unit, Clinical Department, KEMRI-Wellcome Trust Research Programme-Coast, Kilifi, Kenya
| | - Sohini Ramachandran
- Department of Ecology and Evolutionary Biology and Center for Computational Molecular Biology, Brown University, Providence, RI, USA
| | - Raj Ramesar
- South African Medical Research Council (SAMRC) Unit on Risk and Resilience in Mental Disorders, Neuroscience Institute, University of Cape Town, Cape Town, South Africa
| | - Welelta Shiferaw
- Department of Psychiatry, School of Medicine, College of Health Sciences, Addis Ababa University, Addis Ababa, Ethiopia
| | - Dan J Stein
- Department of Psychiatry and Mental Health, University of Cape Town, Cape Town, South Africa; South African Medical Research Council (SAMRC) Unit on Risk and Resilience in Mental Disorders, Neuroscience Institute, University of Cape Town, Cape Town, South Africa
| | - Rocky E Stroud
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA; Department of Epidemiology, Harvard T. H. Chan School of Public Health, Boston, MA, USA
| | - Solomon Teferra
- Department of Psychiatry, School of Medicine, College of Health Sciences, Addis Ababa University, Addis Ababa, Ethiopia
| | - Mary T Yohannes
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Zukiswa Zingela
- Executive Dean's Office, Faculty of Health Sciences, Nelson Mandela University, Port Elizabeth, South Africa
| | - Alicia R Martin
- Analytic and Translational Genetics Unit and Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA; Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | | |
Collapse
|
44
|
Cheruiyot EK, Haile-Mariam M, Cocks BG, Pryce JE. Improving Genomic Selection for Heat Tolerance in Dairy Cattle: Current Opportunities and Future Directions. Front Genet 2022; 13:894067. [PMID: 35769985 PMCID: PMC9234448 DOI: 10.3389/fgene.2022.894067] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2022] [Accepted: 05/02/2022] [Indexed: 11/13/2022] Open
Abstract
Heat tolerance is the ability of an animal to maintain production and reproduction levels under hot and humid conditions and is now a trait of economic relevance in dairy systems worldwide because of an escalating warming climate. The Australian dairy population is one of the excellent study models for enhancing our understanding of the biology of heat tolerance because they are predominantly kept outdoors on pastures where they experience direct effects of weather elements (e.g., solar radiation). In this article, we focus on evidence from recent studies in Australia that leveraged large a dataset [∼40,000 animals with phenotypes and 15 million whole-genome sequence variants] to elucidate the genetic basis of thermal stress as a critical part of the strategy to breed cattle adapted to warmer environments. Genotype-by-environment interaction (i.e., G × E) due to temperature and humidity variation is increasing, meaning animals are becoming less adapted (i.e., more sensitive) to changing environments. There are opportunities to reverse this trend and accelerate adaptation to warming climate by 1) selecting robust or heat-resilient animals and 2) including resilience indicators in breeding goals. Candidate causal variants related to the nervous system and metabolic functions are relevant for heat tolerance and, therefore, key for improving this trait. This could include adding these variants in the custom SNP panels used for routine genomic evaluations or as the basis to design specific agonist or antagonist compounds for lowering core body temperature under heat stress conditions. Indeed, it was encouraging to see that adding prioritized functionally relevant variants into the 50k SNP panel (i.e., the industry panel used for genomic evaluation in Australia) increased the prediction accuracy of heat tolerance by up to 10% units. This gain in accuracy is critical because genetic improvement has a linear relationship with prediction accuracy. Overall, while this article used data mainly from Australia, this could benefit other countries that aim to develop breeding values for heat tolerance, considering that the warming climate is becoming a topical issue worldwide.
Collapse
Affiliation(s)
- Evans K. Cheruiyot
- School of Applied Systems Biology, La Trobe University, Bundoora, VIC, Australia
- Centre for AgriBiosciences, Agriculture Victoria Research, AgriBio, Bundoora, VIC, Australia
| | - Mekonnen Haile-Mariam
- Centre for AgriBiosciences, Agriculture Victoria Research, AgriBio, Bundoora, VIC, Australia
- *Correspondence: Mekonnen Haile-Mariam,
| | - Benjamin G. Cocks
- School of Applied Systems Biology, La Trobe University, Bundoora, VIC, Australia
- Centre for AgriBiosciences, Agriculture Victoria Research, AgriBio, Bundoora, VIC, Australia
| | - Jennie E. Pryce
- School of Applied Systems Biology, La Trobe University, Bundoora, VIC, Australia
- Centre for AgriBiosciences, Agriculture Victoria Research, AgriBio, Bundoora, VIC, Australia
| |
Collapse
|
45
|
Shan Y, Cole SA, Haack K, Melton PE, Best LG, Bizon C, Kobes S, Köroğlu Ç, Baier LJ, Hanson RL, Sanna S, Li Y, Franceschini N. Association of protein function-altering variants with cardiometabolic traits: the strong heart study. Sci Rep 2022; 12:9317. [PMID: 35665752 PMCID: PMC9167281 DOI: 10.1038/s41598-022-12866-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2022] [Accepted: 05/05/2022] [Indexed: 11/08/2022] Open
Abstract
Clinical and biomarker phenotypic associations for carriers of protein function-altering variants may help to elucidate gene function and health effects in populations. We genotyped 1127 Strong Heart Family Study participants for protein function-altering single nucleotide variants (SNV) and indels selected from a low coverage whole exome sequencing of American Indians. We tested the association of each SNV/indel with 35 cardiometabolic traits. Among 1206 variants (average minor allele count = 20, range of 1 to 1064), ~ 43% were not present in publicly available repositories. We identified seven SNV-trait significant associations including a missense SNV at ABCA10 (rs779392624, p = 8 × 10-9) associated with fasting triglycerides, which gene product is involved in macrophage lipid homeostasis. Among non-diabetic individuals, missense SNVs at four genes were associated with fasting insulin adjusted for BMI (PHIL, chr6:79,650,711, p = 2.1 × 10-6; TRPM3, rs760461668, p = 5 × 10-8; SPTY2D1, rs756851199, p = 1.6 × 10-8; and TSPO, rs566547284, p = 2.4 × 10-6). PHIL encoded protein is involved in pancreatic β-cell proliferation and survival, and TRPM3 protein mediates calcium signaling in pancreatic β-cells in response to glucose. A genetic risk score combining increasing insulin risk alleles of these four genes was associated with 53% (95% confidence interval 1.09, 2.15) increased odds of incident diabetes and 83% (95% confidence interval 1.35, 2.48) increased odds of impaired fasting glucose at follow-up. Our study uncovered novel gene-trait associations through the study of protein-coding variants and demonstrates the advantages of association screenings targeting diverse and high-risk populations to study variants absent in publicly available repositories.
Collapse
Affiliation(s)
- Yue Shan
- Department of Biostatistics, University of North Carolina, Chapel Hill, NC, USA
| | - Shelley A Cole
- Texas Biomedical Research Institute, San Antonio, TX, USA
| | - Karin Haack
- Texas Biomedical Research Institute, San Antonio, TX, USA
| | - Phillip E Melton
- The Curtin UWA Centre for Genetic Origins of Health and Disease, Faculty of Health Sciences, Curtin University and Faculty of Health and Medical Sciences, The University of Western Australia, Crawley, WA, Australia
- School of Pharmacy and Biomedical Sciences, Faculty of Health Sciences, Curtin University, Bentley, WA, Australia
- Menzies Medical Research Institute, University of Tasmania, Hobart, TAS, Australia
| | - Lyle G Best
- Missouri Breaks Industries Research Inc, Eagle Butte, SD, USA
| | - Christopher Bizon
- Renaissance Computing Institute, University of North Carolina, Chapel Hill, NC, USA
| | - Sayuko Kobes
- Phoenix Epidemiology and Clinical Research Branch, NIDDK, NIH, Bethesda, USA
| | - Çiğdem Köroğlu
- Phoenix Epidemiology and Clinical Research Branch, NIDDK, NIH, Bethesda, USA
| | - Leslie J Baier
- Phoenix Epidemiology and Clinical Research Branch, NIDDK, NIH, Bethesda, USA
| | - Robert L Hanson
- Phoenix Epidemiology and Clinical Research Branch, NIDDK, NIH, Bethesda, USA
| | - Serena Sanna
- Department of Genetics, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
- Istituto Di Ricerca Genetica E Biomedica (IRGB), Consiglio Nazionale Delle Ricerche (CNR), Monserrato, Italy
| | - Yun Li
- Department of Biostatistics, University of North Carolina, Chapel Hill, NC, USA
- Departments of Genetics and Computer Science, University of North Carolina, Chapel Hill, NC, USA
| | - Nora Franceschini
- Department of Epidemiology, University of North Carolina, Chapel Hill, NC, USA.
- Gillings School of Global Public Health, University of North Carolina, Chapel Hill, NC, USA.
| |
Collapse
|
46
|
Rare and population-specific functional variation across pig lines. Genet Sel Evol 2022; 54:39. [PMID: 35659233 PMCID: PMC9164375 DOI: 10.1186/s12711-022-00732-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2022] [Accepted: 05/17/2022] [Indexed: 01/09/2023] Open
Abstract
BACKGROUND It is expected that functional, mainly missense and loss-of-function (LOF), and regulatory variants are responsible for most phenotypic differences between breeds and genetic lines of livestock species that have undergone diverse selection histories. However, there is still limited knowledge about the existing missense and LOF variation in commercial livestock populations, in particular regarding population-specific variation and how it can affect applications such as across-breed genomic prediction. METHODS We re-sequenced the whole genome of 7848 individuals from nine commercial pig lines (average sequencing coverage: 4.1×) and imputed whole-genome genotypes for 440,610 pedigree-related individuals. The called variants were categorized according to predicted functional annotation (from LOF to intergenic) and prevalence level (number of lines in which the variant segregated; from private to widespread). Variants in each category were examined in terms of their distribution along the genome, alternative allele frequency, per-site Wright's fixation index (FST), individual load, and association to production traits. RESULTS Of the 46 million called variants, 28% were private (called in only one line) and 21% were widespread (called in all nine lines). Genomic regions with a low recombination rate were enriched with private variants. Low-prevalence variants (called in one or a few lines only) were enriched for lower allele frequencies, lower FST, and putatively functional and regulatory roles (including LOF and deleterious missense variants). On average, individuals carried fewer private deleterious missense alleles than expected compared to alleles with other predicted consequences. Only a small subset of the low-prevalence variants had intermediate allele frequencies and explained small fractions of phenotypic variance (up to 3.2%) of production traits. The significant low-prevalence variants had higher per-site FST than the non-significant ones. These associated low-prevalence variants were tagged by other more widespread variants in high linkage disequilibrium, including intergenic variants. CONCLUSIONS Most low-prevalence variants have low minor allele frequencies and only a small subset of low-prevalence variants contributed detectable fractions of phenotypic variance of production traits. Accounting for low-prevalence variants is therefore unlikely to noticeably benefit across-breed analyses, such as the prediction of genomic breeding values in a population using reference populations of a different genetic background.
Collapse
|
47
|
Nln I, Fernandez-Ruiz R, Muskardin TLW, Paredes JL, Blazer AD, Tuminello S, Attur M, Iturrate E, Petrilli CM, Abramson SB, Chakravarti A, Niewold TB. Interferon pathway lupus risk alleles modulate risk of death from acute COVID-19. Transl Res 2022; 244:47-55. [PMID: 35114420 PMCID: PMC8802623 DOI: 10.1016/j.trsl.2022.01.007] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/02/2021] [Revised: 01/27/2022] [Accepted: 01/28/2022] [Indexed: 12/15/2022]
Abstract
Type I interferon (IFN) is critical in our defense against viral infections. Increased type I IFN pathway activation is a genetic risk factor for systemic lupus erythematosus (SLE), and a number of common risk alleles contribute to the high IFN trait. We hypothesized that these common gain-of-function IFN pathway alleles may be associated with protection from mortality in acute COVID-19. We studied patients admitted with acute COVID-19 (756 European-American and 398 African-American ancestry). Ancestral backgrounds were analyzed separately, and mortality after acute COVID-19 was the primary outcome. In European-American ancestry, we found that a haplotype of interferon regulatory factor 5 (IRF5) and alleles of protein kinase cGMP-dependent 1 (PRKG1) were associated with mortality from COVID-19. Interestingly, these were much stronger risk factors in younger patients (OR = 29.2 for PRKG1 in ages 45-54). Variants in the IRF7 and IRF8 genes were associated with mortality from COVID-19 in African-American subjects, and these genetic effects were more pronounced in older subjects. Combining genetic information with blood biomarker data such as C-reactive protein, troponin, and D-dimer resulted in significantly improved predictive capacity, and in both ancestral backgrounds the risk genotypes were most relevant in those with positive biomarkers (OR for death between 14 and 111 in high risk genetic/biomarker groups). This study confirms the critical role of the IFN pathway in defense against COVID-19 and viral infections, and supports the idea that some common SLE risk alleles exert protective effects in antiviral immunity.
Collapse
Affiliation(s)
- Ilona Nln
- Colton Center for Autoimmunity, NYU Grossman School of Medicine, New York, New York
| | - Ruth Fernandez-Ruiz
- Colton Center for Autoimmunity, NYU Grossman School of Medicine, New York, New York
| | | | - Jacqueline L Paredes
- Colton Center for Autoimmunity, NYU Grossman School of Medicine, New York, New York
| | - Ashira D Blazer
- Colton Center for Autoimmunity, NYU Grossman School of Medicine, New York, New York
| | - Stephanie Tuminello
- Center for Human Genetics and Genomics, NYU Grossman School of Medicine, New York, New York
| | - Mukundan Attur
- Divison of Rheumatology, Department of Medicine, NYU Grossman School of Medicine, New York, New York
| | - Eduardo Iturrate
- Department of Medicine, NYU Grossman School of Medicine, New York, New York
| | | | - Steven B Abramson
- Department of Medicine, NYU Grossman School of Medicine, New York, New York
| | - Aravinda Chakravarti
- Center for Human Genetics and Genomics, NYU Grossman School of Medicine, New York, New York
| | - Timothy B Niewold
- Colton Center for Autoimmunity, NYU Grossman School of Medicine, New York, New York.
| |
Collapse
|
48
|
Martin AR, Stroud RE, Abebe T, Akena D, Alemayehu M, Atwoli L, Chapman SB, Flowers K, Gelaye B, Gichuru S, Kariuki SM, Kinyanjui S, Korte KJ, Koen N, Koenen KC, Newton CRJC, Olivares AM, Pollock S, Post K, Singh I, Stein DJ, Teferra S, Zingela Z, Chibnik LB. Increasing diversity in genomics requires investment in equitable partnerships and capacity building. Nat Genet 2022; 54:740-745. [PMID: 35668301 PMCID: PMC7613571 DOI: 10.1038/s41588-022-01095-y] [Citation(s) in RCA: 32] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Calls for diversity in genomics have motivated new global research collaborations across institutions with highly imbalanced resources. We describe practical lessons we have learned so far from designing multidisciplinary international research and capacity-building programs that prioritize equity in two intertwined programs — the NeuroGAP-Psychosis research study and GINGER training program — spanning institutions in Ethiopia, Kenya, South Africa, Uganda and the united States.
Collapse
Affiliation(s)
- Alicia R Martin
- Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA, USA.
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA.
- Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA, USA.
| | - Rocky E Stroud
- Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA, USA
- Department of Epidemiology, Harvard T. H. Chan School of Public Health, Boston, MA, USA
| | - Tamrat Abebe
- Department of Microbiology, Immunology, and Parasitology, School of Medicine, College of Health Sciences, Addis Ababa University, Addis Ababa, Ethiopia
| | - Dickens Akena
- Department of Psychiatry, School of Medicine, College of Health Sciences, Makerere University, Kampala, Uganda
| | - Melkam Alemayehu
- Department of Psychiatry, School of Medicine, College of Health Sciences, Addis Ababa University, Addis Ababa, Ethiopia
| | - Lukoye Atwoli
- Department of Mental Health, School of Medicine, Moi University College of Health Sciences, Eldoret, Kenya
- Brain and Mind Institute, Medical College East Africa, The Aga Khan University, Nairobi, Kenya
- Department of Internal Medicine, Medical College East Africa, The Aga Khan University, Nairobi, Kenya
| | - Sinéad B Chapman
- Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
| | - Katelyn Flowers
- Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA, USA
- Broad Genomics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Bizu Gelaye
- Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA, USA
- Department of Epidemiology, Harvard T. H. Chan School of Public Health, Boston, MA, USA
| | - Stella Gichuru
- Department of Mental Health, Moi Teaching and Referral Hospital, Eldoret, Kenya
| | - Symon M Kariuki
- Neurosciences Unit, Clinical Department, KEMRI-Wellcome Trust Research Programme-Coast, Kilifi, Kenya
- Department of Psychiatry, University of Oxford, Oxford, UK
| | - Sam Kinyanjui
- Centre for Geographic Medicine Research Coast, KEMRI-Wellcome Trust Research Programme-Coast, Kilifi, Kenya
- Nuffield Department of Medicine, Centre for Tropical Medicine and Global Health, University of Oxford, Oxford, UK
| | - Kristina J Korte
- Department of Epidemiology, Harvard T. H. Chan School of Public Health, Boston, MA, USA
- Department of Psychiatry, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
| | - Nastassja Koen
- Department of Psychiatry and Mental Health, University of Cape Town, Cape Town, South Africa
- SA MRC Unit on Risk & Resilience in Mental Disorders, University of Cape Town and Neuroscience Institute, Cape Town, South Africa
| | - Karestan C Koenen
- Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA, USA
- Department of Epidemiology, Harvard T. H. Chan School of Public Health, Boston, MA, USA
| | - Charles R J C Newton
- Neurosciences Unit, Clinical Department, KEMRI-Wellcome Trust Research Programme-Coast, Kilifi, Kenya
- Department of Psychiatry, University of Oxford, Oxford, UK
| | - Ana Maria Olivares
- Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA, USA
| | - Sam Pollock
- Broad Genomics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Kristianna Post
- Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA, USA
| | - Ilina Singh
- Department of Psychiatry and Wellcome Centre for Ethics and Humanities, University of Oxford, Oxford, UK
| | - Dan J Stein
- Department of Psychiatry and Mental Health, University of Cape Town, Cape Town, South Africa
- SA MRC Unit on Risk & Resilience in Mental Disorders, University of Cape Town and Neuroscience Institute, Cape Town, South Africa
| | - Solomon Teferra
- Department of Psychiatry, School of Medicine, College of Health Sciences, Addis Ababa University, Addis Ababa, Ethiopia
| | - Zukiswa Zingela
- Executive Dean's Office, Faculty of Health Sciences, Nelson Mandela University, Gqebera, South Africa
| | - Lori B Chibnik
- Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA, USA.
- Department of Epidemiology, Harvard T. H. Chan School of Public Health, Boston, MA, USA.
- Department of Neurology, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA.
| |
Collapse
|
49
|
Fan C, Mancuso N, Chiang CWK. A genealogical estimate of genetic relationships. Am J Hum Genet 2022; 109:812-824. [PMID: 35417677 PMCID: PMC9118131 DOI: 10.1016/j.ajhg.2022.03.016] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2021] [Accepted: 03/25/2022] [Indexed: 12/23/2022] Open
Abstract
The application of genetic relationships among individuals, characterized by a genetic relationship matrix (GRM), has far-reaching effects in human genetics. However, the current standard to calculate the GRM treats linked markers as independent and does not explicitly model the underlying genealogical history of the study sample. Here, we propose a coalescent-informed framework, namely the expected GRM (eGRM), to infer the expected relatedness between pairs of individuals given an ancestral recombination graph (ARG) of the sample. Through extensive simulations, we show that the eGRM is an unbiased estimate of latent pairwise genome-wide relatedness and is robust when computed with ARG inferred from incomplete genetic data. As a result, the eGRM better captures the structure of a population than the canonical GRM, even when using the same genetic information. More importantly, our framework allows a principled approach to estimate the eGRM at different time depths of the ARG, thereby revealing the time-varying nature of population structure in a sample. When applied to SNP array genotypes from a population sample from Northern and Eastern Finland, we find that clustering analysis with the eGRM reveals population structure driven by subpopulations that would not be apparent via the canonical GRM and that temporally the population model is consistent with recent divergence and expansion. Taken together, our proposed eGRM provides a robust tree-centric estimate of relatedness with wide application to genetic studies.
Collapse
Affiliation(s)
- Caoqi Fan
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA; Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA.
| | - Nicholas Mancuso
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA; Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA; Norris Comprehensive Cancer Center, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - Charleston W K Chiang
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA; Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA.
| |
Collapse
|
50
|
Morrill K, Hekman J, Li X, McClure J, Logan B, Goodman L, Gao M, Dong Y, Alonso M, Carmichael E, Snyder-Mackler N, Alonso J, Noh HJ, Johnson J, Koltookian M, Lieu C, Megquier K, Swofford R, Turner-Maier J, White ME, Weng Z, Colubri A, Genereux DP, Lord KA, Karlsson EK. Ancestry-inclusive dog genomics challenges popular breed stereotypes. Science 2022; 376:eabk0639. [PMID: 35482869 DOI: 10.1126/science.abk0639] [Citation(s) in RCA: 96] [Impact Index Per Article: 32.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Behavioral genetics in dogs has focused on modern breeds, which are isolated subgroups with distinctive physical and, purportedly, behavioral characteristics. We interrogated breed stereotypes by surveying owners of 18,385 purebred and mixed-breed dogs and genotyping 2155 dogs. Most behavioral traits are heritable [heritability (h2) > 25%], and admixture patterns in mixed-breed dogs reveal breed propensities. Breed explains just 9% of behavioral variation in individuals. Genome-wide association analyses identify 11 loci that are significantly associated with behavior, and characteristic breed behaviors exhibit genetic complexity. Behavioral loci are not unusually differentiated in breeds, but breed propensities align, albeit weakly, with ancestral function. We propose that behaviors perceived as characteristic of modern breeds derive from thousands of years of polygenic adaptation that predates breed formation, with modern breeds distinguished primarily by aesthetic traits.
Collapse
Affiliation(s)
- Kathleen Morrill
- Bioinformatics and Integrative Biology, University of Massachusetts Chan Medical School, Worcester, MA 01655, USA.,Morningside Graduate School of Biomedical Sciences, University of Massachusetts Chan Medical School, Worcester, MA 01655, USA.,Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Jessica Hekman
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Xue Li
- Bioinformatics and Integrative Biology, University of Massachusetts Chan Medical School, Worcester, MA 01655, USA.,Morningside Graduate School of Biomedical Sciences, University of Massachusetts Chan Medical School, Worcester, MA 01655, USA.,Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Jesse McClure
- Bioinformatics and Integrative Biology, University of Massachusetts Chan Medical School, Worcester, MA 01655, USA
| | - Brittney Logan
- Bioinformatics and Integrative Biology, University of Massachusetts Chan Medical School, Worcester, MA 01655, USA.,Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Linda Goodman
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA.,Fauna Bio Inc., Emeryville, CA 94608, USA
| | - Mingshi Gao
- Bioinformatics and Integrative Biology, University of Massachusetts Chan Medical School, Worcester, MA 01655, USA.,Morningside Graduate School of Biomedical Sciences, University of Massachusetts Chan Medical School, Worcester, MA 01655, USA
| | - Yinan Dong
- Bioinformatics and Integrative Biology, University of Massachusetts Chan Medical School, Worcester, MA 01655, USA.,Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Marjie Alonso
- The International Association of Animal Behavior Consultants, Cranberry Township, PA 16066, USA.,IAABC Foundation, Cranberry Township, PA 16066, USA
| | - Elena Carmichael
- Bioinformatics and Integrative Biology, University of Massachusetts Chan Medical School, Worcester, MA 01655, USA.,Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA.,Rice University, Houston, TX 77005, USA
| | - Noah Snyder-Mackler
- Center for Evolution and Medicine, Arizona State University, Tempe, AZ 85251, USA.,School for Human Evolution and Social Change, Arizona State University, Tempe, AZ 85251, USA.,School of Life Sciences, Arizona State University, Tempe, AZ 85251, USA
| | - Jacob Alonso
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Hyun Ji Noh
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Jeremy Johnson
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | | | - Charlie Lieu
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA.,Darwin's Ark Foundation, Seattle, WA 98026, USA
| | - Kate Megquier
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Ross Swofford
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | | | - Michelle E White
- Bioinformatics and Integrative Biology, University of Massachusetts Chan Medical School, Worcester, MA 01655, USA.,Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Zhiping Weng
- Bioinformatics and Integrative Biology, University of Massachusetts Chan Medical School, Worcester, MA 01655, USA
| | - Andrés Colubri
- Bioinformatics and Integrative Biology, University of Massachusetts Chan Medical School, Worcester, MA 01655, USA.,Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | | | - Kathryn A Lord
- Bioinformatics and Integrative Biology, University of Massachusetts Chan Medical School, Worcester, MA 01655, USA.,Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Elinor K Karlsson
- Bioinformatics and Integrative Biology, University of Massachusetts Chan Medical School, Worcester, MA 01655, USA.,Morningside Graduate School of Biomedical Sciences, University of Massachusetts Chan Medical School, Worcester, MA 01655, USA.,Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA.,Darwin's Ark Foundation, Seattle, WA 98026, USA.,Program in Molecular Medicine, University of Massachusetts Chan Medical School, Worcester, MA 01655, USA
| |
Collapse
|