Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Brzyski D, Peterson CB, Sobczyk P, Candès EJ, Bogdan M, Sabatti C. Controlling the Rate of GWAS False Discoveries. Genetics 2017;205:61-75. [PMID: 27784720 PMCID: PMC5223524 DOI: 10.1534/genetics.116.193987] [Citation(s) in RCA: 72] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2016] [Accepted: 10/11/2016] [Indexed: 01/13/2023] Open

For:	Brzyski D, Peterson CB, Sobczyk P, Candès EJ, Bogdan M, Sabatti C. Controlling the Rate of GWAS False Discoveries. Genetics 2017;205:61-75. [PMID: 27784720 PMCID: PMC5223524 DOI: 10.1534/genetics.116.193987] [Citation(s) in RCA: 72] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2016] [Accepted: 10/11/2016] [Indexed: 01/13/2023] Open

Number

Cited by Other Article(s)

John M, Korte A, Grimm DG. The benefits of permutation-based genome-wide association studies. JOURNAL OF EXPERIMENTAL BOTANY 2024;75:5377-5389. [PMID: 38954539 PMCID: PMC11389838 DOI: 10.1093/jxb/erae280] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/13/2023] [Accepted: 07/01/2024] [Indexed: 07/04/2024]

Tajerian A. Longitudinal study investigating the influence of COMT gene polymorphism on cortical thickness changes in Parkinson's disease over four years. Sci Rep 2024;14:9920. [PMID: 38689006 PMCID: PMC11061119 DOI: 10.1038/s41598-024-60828-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2023] [Accepted: 04/27/2024] [Indexed: 05/02/2024] Open

Abstract

Parkinson's disease (PD) is a progressive neurodegenerative disorder affecting over 3% of those over 65. It's caused by reduced dopaminergic neurons and Lewy bodies, leading to motor and non-motor symptoms. The relationship between COMT gene polymorphisms and PD is complex and not fully elucidated. Some studies have reported associations between certain COMT gene variants and PD risk, while others have not found significant associations. This study investigates how COMT gene variations impact cortical thickness changes in PD patients over time, aiming to link genetic factors, especially COMT gene variations, with PD progression. This study analyzed data from 44 PD patients with complete 4-year imaging follow-up from the Parkinson Progression Marker Initiative (PPMI) database. Magnetic resonance imaging (MRI) scans were acquired using consistent methods across 9 different MRI scanners. COMT single-nucleotide polymorphisms (SNPs) were assessed based on whole genome sequencing data. Longitudinal image analysis was conducted using FreeSurfer's processing pipeline. Linear mixed-effect models were employed to examine the interaction effect of genetic variations and time on cortical thickness, while controlling for covariates and subject-specific variations. The rs165599 SNP stands out as a potential contributor to alterations in cortical thickness, showing a significant reduction in overall mean cortical thickness in both hemispheres in homozygotes (Left: P = 0.023, Right: P = 0.028). The supramarginal, precentral, and superior frontal regions demonstrated significant bilateral alterations linked to rs165599. Our findings suggest that the rs165599 variant leads to earlier manifestation of cortical thinning during the course of the disease. However, it does not result in more severe cortical thinning outcomes over time. There is a need for larger cohorts and control groups to validate these findings and consider genetic variant interactions and clinical features to elucidate the specific mechanisms underlying COMT-related neurodegenerative processes in PD.

Collapse

Ghosal S, Schatz MC, Venkataraman A. BEATRICE: Bayesian Fine-mapping from Summary Data using Deep Variational Inference. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.03.24.534116. [PMID: 36993396 PMCID: PMC10055416 DOI: 10.1101/2023.03.24.534116] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]

Castro-Pearson S, Samorodnitsky S, Yang K, Lotfi-Emran S, Ingraham NE, Bramante C, Jones EK, Greising S, Yu M, Steffen BT, Svensson J, Åhlberg E, Österberg B, Wacker D, Guan W, Puskarich M, Smed-Sörensen A, Lusczek E, Safo SE, Tignanelli CJ. Development of a proteomic signature associated with severe disease for patients with COVID-19 using data from 5 multicenter, randomized, controlled, and prospective studies. Sci Rep 2023;13:20315. [PMID: 37985892 PMCID: PMC10661735 DOI: 10.1038/s41598-023-46343-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2023] [Accepted: 10/31/2023] [Indexed: 11/22/2023] Open

Abstract

Significant progress has been made in preventing severe COVID-19 disease through the development of vaccines. However, we still lack a validated baseline predictive biologic signature for the development of more severe disease in both outpatients and inpatients infected with SARS-CoV-2. The objective of this study was to develop and externally validate, via 5 international outpatient and inpatient trials and/or prospective cohort studies, a novel baseline proteomic signature, which predicts the development of moderate or severe (vs mild) disease in patients with COVID-19 from a proteomic analysis of 7000 + proteins. The secondary objective was exploratory, to identify (1) individual baseline protein levels and/or (2) protein level changes within the first 2 weeks of acute infection that are associated with the development of moderate/severe (vs mild) disease. For model development, samples collected from 2 randomized controlled trials were used. Plasma was isolated and the SomaLogic SomaScan platform was used to characterize protein levels for 7301 proteins of interest for all studies. We dichotomized 113 patients as having mild or moderate/severe COVID-19 disease. An elastic net approach was used to develop a predictive proteomic signature. For validation, we applied our signature to data from three independent prospective biomarker studies. We found 4110 proteins measured at baseline that significantly differed between patients with mild COVID-19 and those with moderate/severe COVID-19 after adjusting for multiple hypothesis testing. Baseline protein expression was associated with predicted disease severity with an error rate of 4.7% (AUC = 0.964). We also found that five proteins (Afamin, I-309, NKG2A, PRS57, LIPK) and patient age serve as a signature that separates patients with mild COVID-19 and patients with moderate/severe COVID-19 with an error rate of 1.77% (AUC = 0.9804). This panel was validated using data from 3 external studies with AUCs of 0.764 (Harvard University), 0.696 (University of Colorado), and 0.893 (Karolinska Institutet). In this study we developed and externally validated a baseline COVID-19 proteomic signature associated with disease severity for potential use in both outpatients and inpatients with COVID-19.

Collapse

Affiliation(s)

Sandra Castro-Pearson Division of Biostatistics, School of Public Health, University of Minnesota, Minneapolis, MN, USA
Sarah Samorodnitsky Division of Biostatistics, School of Public Health, University of Minnesota, Minneapolis, MN, USA
Kaifeng Yang Division of Biostatistics, School of Public Health, University of Minnesota, Minneapolis, MN, USA
Sahar Lotfi-Emran Department of Medicine, University of Minnesota, Minneapolis, MN, USA
Nicholas E Ingraham Department of Medicine, University of Minnesota, Minneapolis, MN, USA
Carolyn Bramante Department of Medicine, University of Minnesota, Minneapolis, MN, USA
Emma K Jones Department of Surgery, University of Minnesota, 420 Delaware St SE, Minneapolis, MN, 55455, USA
Sarah Greising School of Kinesiology, University of Minnesota, Minneapolis, MN, USA
Meng Yu Division of Immunology and Allergy, Department of Medicine Solna, Center for Molecular Medicine, Karolinska Institutet and Karolinska University Hospital, Stockholm, Sweden
Brian T Steffen Department of Surgery, University of Minnesota, 420 Delaware St SE, Minneapolis, MN, 55455, USA
Julia Svensson Division of Immunology and Allergy, Department of Medicine Solna, Center for Molecular Medicine, Karolinska Institutet and Karolinska University Hospital, Stockholm, Sweden
Eric Åhlberg Division of Immunology and Allergy, Department of Medicine Solna, Center for Molecular Medicine, Karolinska Institutet and Karolinska University Hospital, Stockholm, Sweden
Björn Österberg Division of Immunology and Allergy, Department of Medicine Solna, Center for Molecular Medicine, Karolinska Institutet and Karolinska University Hospital, Stockholm, Sweden
David Wacker Department of Medicine, University of Minnesota, Minneapolis, MN, USA
Weihua Guan Division of Biostatistics, School of Public Health, University of Minnesota, Minneapolis, MN, USA
Michael Puskarich Department of Emergency Medicine, University of Minnesota, Minneapolis, MN, USA Department of Emergency Medicine, Hennepin County Medical Center, Minneapolis, MN, USA
Anna Smed-Sörensen Division of Immunology and Allergy, Department of Medicine Solna, Center for Molecular Medicine, Karolinska Institutet and Karolinska University Hospital, Stockholm, Sweden
Elizabeth Lusczek Department of Surgery, University of Minnesota, 420 Delaware St SE, Minneapolis, MN, 55455, USA
Sandra E Safo Division of Biostatistics, School of Public Health, University of Minnesota, Minneapolis, MN, USA
Christopher J Tignanelli Department of Surgery, University of Minnesota, 420 Delaware St SE, Minneapolis, MN, 55455, USA. Institute for Health Informatics, University of Minnesota, Minneapolis, MN, USA.

Collapse

John M, Lencz T. Potential application of elastic nets for shared polygenicity detection with adapted threshold selection. Int J Biostat 2023;19:417-438. [PMID: 36327464 PMCID: PMC10154439 DOI: 10.1515/ijb-2020-0108] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2020] [Accepted: 10/05/2022] [Indexed: 11/06/2022]

Yang K, Kang Z, Guan W, Lotfi-Emran S, Mayer ZJ, Guerrero CR, Steffen BT, Puskarich MA, Tignanelli CJ, Lusczek E, Safo SE. Developing A Baseline Metabolomic Signature Associated with COVID-19 Severity: Insights from Prospective Trials Encompassing 13 U.S. Centers. Metabolites 2023;13:1107. [PMID: 37999202 PMCID: PMC10672920 DOI: 10.3390/metabo13111107] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2023] [Revised: 10/14/2023] [Accepted: 10/16/2023] [Indexed: 11/25/2023] Open

Urbut SM, Koyama S, Hornsby W, Bhukar R, Kheterpal S, Truong B, Selvaraj MS, Neale B, O’Donnell CJ, Peloso GM, Natarajan P. Bayesian multivariate genetic analysis improves translational insights. iScience 2023;26:107854. [PMID: 37766997 PMCID: PMC10520309 DOI: 10.1016/j.isci.2023.107854] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2023] [Revised: 05/15/2023] [Accepted: 09/05/2023] [Indexed: 09/29/2023] Open

Affiliation(s)

Sarah M. Urbut Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA 02114, USA Program in Medical and Population Genetics, Broad Institute, Cambridge, MA 02142, USA
Satoshi Koyama Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA 02114, USA Program in Medical and Population Genetics, Broad Institute, Cambridge, MA 02142, USA Department of Medicine Harvard Medical School, Boston, MA 02115, USA
Whitney Hornsby Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA 02114, USA Program in Medical and Population Genetics, Broad Institute, Cambridge, MA 02142, USA Department of Medicine Harvard Medical School, Boston, MA 02115, USA
Rohan Bhukar Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA 02114, USA Program in Medical and Population Genetics, Broad Institute, Cambridge, MA 02142, USA Department of Medicine Harvard Medical School, Boston, MA 02115, USA
Sumeet Kheterpal Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA 02114, USA Program in Medical and Population Genetics, Broad Institute, Cambridge, MA 02142, USA
Buu Truong Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA 02114, USA Program in Medical and Population Genetics, Broad Institute, Cambridge, MA 02142, USA Department of Medicine Harvard Medical School, Boston, MA 02115, USA
Margaret S. Selvaraj Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA 02114, USA Program in Medical and Population Genetics, Broad Institute, Cambridge, MA 02142, USA Department of Medicine Harvard Medical School, Boston, MA 02115, USA
Benjamin Neale Program in Medical and Population Genetics, Broad Institute, Cambridge, MA 02142, USA Department of Medicine Harvard Medical School, Boston, MA 02115, USA Analytic Translational and Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA
Christopher J. O’Donnell Department of Medicine Harvard Medical School, Boston, MA 02115, USA VA Boston Department of Veterans Affairs, Boston, MA 02130, USA
Gina M. Peloso Department of Biostatistics, Boston University School of Public Health, Boston, MA 02218, USA
Pradeep Natarajan Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA 02114, USA Program in Medical and Population Genetics, Broad Institute, Cambridge, MA 02142, USA Department of Medicine Harvard Medical School, Boston, MA 02115, USA

Collapse

Rani R, Raza G, Ashfaq H, Rizwan M, Razzaq MK, Waheed MQ, Shimelis H, Babar AD, Arif M. Genome-wide association study of soybean (Glycine max [L.] Merr.) germplasm for dissecting the quantitative trait nucleotides and candidate genes underlying yield-related traits. FRONTIERS IN PLANT SCIENCE 2023;14:1229495. [PMID: 37636105 PMCID: PMC10450938 DOI: 10.3389/fpls.2023.1229495] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/30/2023] [Accepted: 07/25/2023] [Indexed: 08/29/2023]

Abstract

Soybean (Glycine max [L.] Merr.) is one of the most significant crops in the world in terms of oil and protein. Owing to the rising demand for soybean products, there is an increasing need for improved varieties for more productive farming. However, complex correlation patterns among quantitative traits along with genetic interactions pose a challenge for soybean breeding. Association studies play an important role in the identification of accession with useful alleles by locating genomic sites associated with the phenotype in germplasm collections. In the present study, a genome-wide association study was carried out for seven agronomic and yield-related traits. A field experiment was conducted in 2015/2016 at two locations that include 155 diverse soybean germplasm. These germplasms were genotyped using SoySNP50K Illumina Infinium Bead-Chip. A total of 51 markers were identified for node number, plant height, pods per plant, seeds per plant, seed weight per plant, hundred-grain weight, and total yield using a multi-locus linear mixed model (MLMM) in FarmCPU. Among these significant SNPs, 18 were putative novel QTNs, while 33 co-localized with previously reported QTLs. A total of 2,356 genes were found in 250 kb upstream and downstream of significant SNPs, of which 17 genes were functional and the rest were hypothetical proteins. These 17 candidate genes were located in the region of 14 QTNs, of which ss715580365, ss715608427, ss715632502, and ss715620131 are novel QTNs for PH, PPP, SDPP, and TY respectively. Four candidate genes, Glyma.01g199200, Glyma.10g065700, Glyma.18g297900, and Glyma.14g009900, were identified in the vicinity of these novel QTNs, which encode lsd one like 1, Ergosterol biosynthesis ERG4/ERG24 family, HEAT repeat-containing protein, and RbcX2, respectively. Although further experimental validation of these candidate genes is required, several appear to be involved in growth and developmental processes related to the respective agronomic traits when compared with their homologs in Arabidopsis thaliana. This study supports the usefulness of association studies and provides valuable data for functional markers and investigating candidate genes within a diverse germplasm collection in future breeding programs.

Collapse

Wainberg M, Andrews SJ, Tripathy SJ. Shared genetic risk loci between Alzheimer's disease and related dementias, Parkinson's disease, and amyotrophic lateral sclerosis. Alzheimers Res Ther 2023;15:113. [PMID: 37328865 PMCID: PMC10273745 DOI: 10.1186/s13195-023-01244-3] [Citation(s) in RCA: 19] [Impact Index Per Article: 19.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2023] [Accepted: 05/16/2023] [Indexed: 06/18/2023]

Abstract

BACKGROUND

Genome-wide association studies (GWAS) have indicated moderate genetic overlap between Alzheimer's disease (AD) and related dementias (ADRD), Parkinson's disease (PD) and amyotrophic lateral sclerosis (ALS), neurodegenerative disorders traditionally considered etiologically distinct. However, the specific genetic variants and loci underlying this overlap remain almost entirely unknown.

METHODS

We leveraged state-of-the-art GWAS for ADRD, PD, and ALS. For each pair of disorders, we examined each of the GWAS hits for one disorder and tested whether they were also significant for the other disorder, applying Bonferroni correction for the number of variants tested. This approach rigorously controls the family-wise error rate for both disorders, analogously to genome-wide significance.

RESULTS

Eleven loci with GWAS hits for one disorder were also associated with one or both of the other disorders: one with all three disorders (the MAPT/KANSL1 locus), five with ADRD and PD (near LCORL, CLU, SETD1A/KAT8, WWOX, and GRN), three with ADRD and ALS (near GPX3, HS3ST5/HDAC2/MARCKS, and TSPOAP1), and two with PD and ALS (near GAK/TMEM175 and NEK1). Two of these loci (LCORL and NEK1) were associated with an increased risk of one disorder but decreased risk of another. Colocalization analysis supported a shared causal variant between ADRD and PD at the CLU, WWOX, and LCORL loci, between ADRD and ALS at the TSPOAP1 locus, and between PD and ALS at the NEK1 and GAK/TMEM175 loci. To address the concern that ADRD is an imperfect proxy for AD and that the ADRD and PD GWAS have overlapping participants (nearly all of which are from the UK Biobank), we confirmed that all our ADRD associations had nearly identical odds ratios in an AD GWAS that excluded the UK Biobank, and all but one remained nominally significant (p < 0.05) for AD.

CONCLUSIONS

In one of the most comprehensive investigations to date of pleiotropy between neurodegenerative disorders, we identify eleven genetic risk loci shared among ADRD, PD, and ALS. These loci support lysosomal/autophagic dysfunction (GAK/TMEM175, GRN, KANSL1), neuroinflammation/immunity (TSPOAP1), oxidative stress (GPX3, KANSL1), and the DNA damage response (NEK1) as transdiagnostic processes underlying multiple neurodegenerative disorders.

Collapse

Obry L, Dalmasso C. Weighted multiple testing procedures in genome-wide association studies. PeerJ 2023;11:e15369. [PMID: 37337586 PMCID: PMC10276986 DOI: 10.7717/peerj.15369] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2022] [Accepted: 04/17/2023] [Indexed: 06/21/2023] Open

Lyman GH, Msaouel P, Kuderer NM. Risk Model Development and Validation in Clinical Oncology: Lessons Learned. Cancer Invest 2023;41:1-11. [PMID: 36254812 DOI: 10.1080/07357907.2022.2137914] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]

Bogomolov M. Testing partial conjunction hypotheses under dependency, with applications to meta-analysis. Electron J Stat 2023. [DOI: 10.1214/22-ejs2100] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]

Prioritized candidate causal haplotype blocks in plant genome-wide association studies. PLoS Genet 2022;18:e1010437. [PMID: 36251695 PMCID: PMC9612827 DOI: 10.1371/journal.pgen.1010437] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2021] [Revised: 10/27/2022] [Accepted: 09/20/2022] [Indexed: 11/05/2022] Open

Abstract

Genome wide association studies (GWAS) can play an essential role in understanding genetic basis of complex traits in plants and animals. Conventional SNP-based linear mixed models (LMM) that marginally test single nucleotide polymorphisms (SNPs) have successfully identified many loci with major and minor effects in many GWAS. In plant, the relatively small population size in GWAS and the high genetic diversity found in many plant species can impede mapping efforts on complex traits. Here we present a novel haplotype-based trait fine-mapping framework, HapFM, to supplement current GWAS methods. HapFM uses genotype data to partition the genome into haplotype blocks, identifies haplotype clusters within each block, and then performs genome-wide haplotype fine-mapping to prioritize the candidate causal haplotype blocks of trait. We benchmarked HapFM, GEMMA, BSLMM, GMMAT, and BLINK in both simulated and real plant GWAS datasets. HapFM consistently resulted in higher mapping power than the other GWAS methods in high polygenicity simulation setting. Moreover, it resulted in smaller mapping intervals, especially in regions of high LD, achieved by prioritizing small candidate causal blocks in the larger haplotype blocks. In the Arabidopsis flowering time (FT10) datasets, HapFM identified four novel loci compared to GEMMA’s results, and the average mapping interval of HapFM was 9.6 times smaller than that of GEMMA. In conclusion, HapFM is tailored for plant GWAS to result in high mapping power on complex traits and improved on mapping resolution to facilitate crop improvement.

Genome-wide association studies (GWAS) are commonly used in human and plant studies to identify genetic variants responsible for the phenotype of interest and provide foundations for studying disease mechanisms and crop improvement. Most GWAS models are developed and optimized using human datasets. However, the difference between human and plant datasets essentially limits their applications in plant studies, especially when mapping complex traits such as drought resistance and yield. In this study, we present a novel GWAS method, HapFM, tailored for plant datasets to overcome the difficulties of many conventional GWAS methods. HapFM resulted in higher statistical power than conventional GWAS methods for mapping complex traits in our simulation and real dataset analyses. In addition, HapFM reduced the mapping interval by prioritizing candidate causal regions in the genome, which benefits the downstream experimental studies. Last but not least, HapFM can incorporate biological annotations to increase statistical power further. Overall, HapFM balances statistical power, result interpretability, and downstream experimental verifiability.

Collapse

Aboul-Naga AM, Alsamman AM, El Allali A, Elshafie MH, Abdelal ES, Abdelkhalek TM, Abdelsabour TH, Mohamed LG, Hamwieh A. Genome-wide analysis identified candidate variants and genes associated with heat stress adaptation in Egyptian sheep breeds. Front Genet 2022;13:898522. [PMID: 36263427 PMCID: PMC9574253 DOI: 10.3389/fgene.2022.898522] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2022] [Accepted: 09/05/2022] [Indexed: 11/24/2022] Open

Abstract

Heat stress caused by climatic changes is one of the most significant stresses on livestock in hot and dry areas. It has particularly adverse effects on the ability of the breed to maintain homeothermy. Developing countries are advised to protect and prepare their animal resources in the face of potential threats such as climate change. The current study was conducted in Egypt's three hot and dry agro-ecological zones. Three local sheep breeds (Saidi, Wahati, and Barki) were studied with a total of 206 ewes. The animals were exercised under natural heat stress. The heat tolerance index of the animals was calculated to identify animals with high and low heat tolerance based on their response to meteorological and physiological parameters. Genomic variation in these breeds was assessed using 64,756 single nucleotide polymorphic markers (SNPs). From the perspective of comparative adaptability to harsh conditions, our objective was to investigate the genomic structure that might control the adaptability of local sheep breeds to environmental stress under hot and dry conditions. In addition, indices of population structure and diversity of local breeds were examined. Measures of genetic diversity showed a significant influence of breed and location on populations. The standardized index of association (rbarD) ranged from 0.0012 (Dakhla) to 0.026 (Assuit), while for the breed, they ranged from 0.004 (Wahati) to 0.0103 (Saidi). The index of association analysis (Ia) ranged from 1.42 (Dakhla) to 35.88 (Assuit) by location and from 6.58 (Wahati) to 15.36 (Saidi) by breed. The most significant SNPs associated with heat tolerance were found in the MYO5A, PRKG1, GSTCD, and RTN1 genes (p ≤ 0.0001). MYO5A produces a protein widely distributed in the melanin-producing neural crest of the skin. Genetic association between genetic and phenotypic variations showed that OAR1_18300122.1, located in ST3GAL3, had the greatest positive effect on heat tolerance. Genome-wide association analysis identified SNPs associated with heat tolerance in the PLCB1, STEAP3, KSR2, UNC13C, PEBP4, and GPAT2 genes.

Collapse

Monti GS, Filzmoser P. A robust knockoff filter for sparse regression analysis of microbiome compositional data. Comput Stat 2022. [DOI: 10.1007/s00180-022-01268-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]

Detecting signatures of selection on gene expression. Nat Ecol Evol 2022;6:1035-1045. [PMID: 35551249 DOI: 10.1038/s41559-022-01761-8] [Citation(s) in RCA: 30] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2021] [Accepted: 04/01/2022] [Indexed: 12/15/2022]

Pudjihartono N, Fadason T, Kempa-Liehr AW, O'Sullivan JM. A Review of Feature Selection Methods for Machine Learning-Based Disease Risk Prediction. FRONTIERS IN BIOINFORMATICS 2022;2:927312. [PMID: 36304293 PMCID: PMC9580915 DOI: 10.3389/fbinf.2022.927312] [Citation(s) in RCA: 75] [Impact Index Per Article: 37.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2022] [Accepted: 06/03/2022] [Indexed: 01/14/2023] Open

Frommlet F, Szulc P, König F, Bogdan M. Selecting predictive biomarkers from genomic data. PLoS One 2022;17:e0269369. [PMID: 35709188 PMCID: PMC9202896 DOI: 10.1371/journal.pone.0269369] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2021] [Accepted: 05/13/2022] [Indexed: 11/18/2022] Open

Sutherland J, Bell T, Trexler RV, Carlson JE, Lasky JR. Host genomic influence on bacterial composition in the switchgrass rhizosphere. Mol Ecol 2022;31:3934-3950. [PMID: 35621390 PMCID: PMC10150372 DOI: 10.1111/mec.16549] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2021] [Revised: 05/20/2022] [Accepted: 05/24/2022] [Indexed: 11/28/2022]

Abstract

Host genetic variation can shape the diversity and composition of associated microbiomes, which may reciprocally influence host traits and performance. While the genetic basis of phenotypic diversity of plant populations in nature has been studied, comparatively little research has investigated the genetics of host effects on their associated microbiomes. Switchgrass (Panicum virgatum) is a highly outcrossing, perennial, grass species with substantial locally adaptive diversity across its native North American range. Here, we compared 383 switchgrass accessions in a common garden to determine the host genotypic influence on rhizosphere bacterial composition. We hypothesized that the composition and diversity of rhizosphere bacterial assemblages would differentiate due to genotypic differences between hosts (potentially due to root phenotypes and associated life history variation). We observed higher alpha diversity of bacteria associated with upland ecotypes and tetraploids, compared to lowland ecotypes and octoploids, respectively. Alpha diversity correlated negatively with flowering time and plant height, indicating that bacterial composition varies along switchgrass life history axes. Narrow-sense heritability (h² ) of the relative abundance of twenty-one core bacterial families was observed. Overall compositional differences among tetraploids, due to genetic variation, supports wide-spread genotypic influence on the rhizosphere microbiome. Tetraploids were only considered due to complexities associated with the octoploid genomes. Lastly, a genome-wide association study identified 1,861 single-nucleotide polymorphisms associated with 110 families and genes containing them related to potential regulatory functions. Our findings suggest that switchgrass genomic and life-history variation influences bacterial composition in the rhizosphere, potentially due to host adaptation to local environments.

Collapse

Fanter C, Madelaire C, Genereux DP, van Breukelen F, Levesque D, Hindle A. Epigenomics as a paradigm to understand the nuances of phenotypes. J Exp Biol 2022;225:274619. [PMID: 35258621 DOI: 10.1242/jeb.243411] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]

Sandoval-Castillo J, Beheregaray LB, Wellenreuther M. Genomic prediction of growth in a commercially, recreationally, and culturally important marine resource, the Australian snapper (Chrysophrys auratus). G3 (BETHESDA, MD.) 2022;12:jkac015. [PMID: 35100370 PMCID: PMC8896003 DOI: 10.1093/g3journal/jkac015] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/05/2021] [Accepted: 01/07/2022] [Indexed: 06/14/2023]

Abstract

Growth is one of the most important traits of an organism. For exploited species, this trait has ecological and evolutionary consequences as well as economical and conservation significance. Rapid changes in growth rate associated with anthropogenic stressors have been reported for several marine fishes, but little is known about the genetic basis of growth traits in teleosts. We used reduced genome representation data and genome-wide association approaches to identify growth-related genetic variation in the commercially, recreationally, and culturally important Australian snapper (Chrysophrys auratus, Sparidae). Based on 17,490 high-quality single-nucleotide polymorphisms and 363 individuals representing extreme growth phenotypes from 15,000 fish of the same age and reared under identical conditions in a sea pen, we identified 100 unique candidates that were annotated to 51 proteins. We documented a complex polygenic nature of growth in the species that included several loci with small effects and a few loci with larger effects. Overall heritability was high (75.7%), reflected in the high accuracy of the genomic prediction for the phenotype (small vs large). Although the single-nucleotide polymorphisms were distributed across the genome, most candidates (60%) clustered on chromosome 16, which also explains the largest proportion of heritability (16.4%). This study demonstrates that reduced genome representation single-nucleotide polymorphisms and the right bioinformatic tools provide a cost-efficient approach to identify growth-related loci and to describe genomic architectures of complex quantitative traits. Our results help to inform captive aquaculture breeding programs and are of relevance to monitor growth-related evolutionary shifts in wild populations in response to anthropogenic pressures.

Collapse

Navigating the pitfalls of applying machine learning in genomics. Nat Rev Genet 2022;23:169-181. [PMID: 34837041 DOI: 10.1038/s41576-021-00434-9] [Citation(s) in RCA: 83] [Impact Index Per Article: 41.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/28/2021] [Indexed: 11/08/2022]

Wang J, Patel A, Wason JM, Newcombe PJ. Two-stage penalized regression screening to detect biomarker-treatment interactions in randomized clinical trials. Biometrics 2022;78:141-150. [PMID: 33448327 PMCID: PMC7613856 DOI: 10.1111/biom.13424] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2020] [Revised: 12/16/2020] [Accepted: 12/31/2020] [Indexed: 12/30/2022]

SNP characteristics and validation success in genome wide association studies. Hum Genet 2022;141:229-238. [PMID: 34981173 PMCID: PMC8855685 DOI: 10.1007/s00439-021-02407-8] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2021] [Accepted: 11/27/2021] [Indexed: 02/03/2023]

Abstract

Genome wide association studies (GWASs) have identified tens of thousands of single nucleotide polymorphisms (SNPs) associated with human diseases and characteristics. A significant fraction of GWAS findings can be false positives. The gold standard for true positives is an independent validation. The goal of this study was to identify SNP features associated with validation success. Summary statistics from the Catalog of Published GWASs were used in the analysis. Since our goal was an analysis of reproducibility, we focused on the diseases/phenotypes targeted by at least 10 GWASs. GWASs were arranged in discovery-validation pairs based on the time of publication, with the discovery GWAS published before validation. We used four definitions of the validation success that differ by stringency. Associations of SNP features with validation success were consistent across the definitions. The strongest predictor of SNP validation was the level of statistical significance in the discovery GWAS. The magnitude of the effect size was associated with validation success in a non-linear manner. SNPs with risk allele frequencies in the range 30-70% showed a higher validation success rate compared to rarer or more common SNPs. Missense, 5'UTR, stop gained, and SNPs located in transcription factor binding sites had a higher validation success rate compared to intergenic, intronic and synonymous SNPs. There was a positive association between validation success and the level of evolutionary conservation of the sites. In addition, validation success was higher when discovery and validation GWASs targeted the same ethnicity. All predictors of validation success remained significant in a multivariate logistic regression model indicating their independent contribution. To conclude, we identified SNP features predicting validation success of GWAS hits. These features can be used to select SNPs for validation and downstream functional studies.

Collapse

Colombo M, Montazeaud G, Viader V, Ecarnot M, Prosperi J, David J, Fort F, Violle C, Freville H. A genome‐wide analysis suggests pleiotropic effects of Green Revolution genes on shade avoidance in wheat. Evol Appl 2022;15:1594-1604. [PMID: 36330302 PMCID: PMC9624089 DOI: 10.1111/eva.13349] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2021] [Revised: 01/19/2022] [Accepted: 01/20/2022] [Indexed: 11/26/2022] Open

Crosta M, Nazzicari N, Ferrari B, Pecetti L, Russi L, Romani M, Cabassi G, Cavalli D, Marocco A, Annicchiarico P. Pea Grain Protein Content Across Italian Environments: Genetic Relationship With Grain Yield, and Opportunities for Genome-Enabled Selection for Protein Yield. FRONTIERS IN PLANT SCIENCE 2022;12:718713. [PMID: 35046967 PMCID: PMC8761899 DOI: 10.3389/fpls.2021.718713] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/01/2021] [Accepted: 11/18/2021] [Indexed: 06/14/2023]

Abstract

Wider pea (Pisum sativum L.) cultivation has great interest for European agriculture, owing to its favorable environmental impact and provision of high-protein feedstuff. This work aimed to investigate the extent of genotype × environment interaction (GEI), genetically based trade-offs and polygenic control for crude protein content and grain yield of pea targeted to Italian environments, and to assess the efficiency of genomic selection (GS) as an alternative to phenotypic selection (PS) to increase protein yield per unit area. Some 306 genotypes belonging to three connected recombinant inbred line (RIL) populations derived from paired crosses between elite cultivars were genotyped through genotyping-by-sequencing and phenotyped for grain yield and protein content on a dry matter basis in three autumn-sown environments of northern or central Italy. Line variation for mean protein content ranged from 21.7 to 26.6%. Purely genetic effects, compared with GEI effects, were over two-fold larger for protein content, and over 2-fold smaller for grain and protein yield per unit area. Grain yield and protein content exhibited no inverse genetic correlation. A genome-wide association study revealed a definite polygenic control not only for grain yield but also for protein content, with small amounts of trait variation accounted for by individual loci. On average, the GS predictive ability for individual RIL populations based on the rrBLUP model (which was selected out of four tested models) using by turns two environments for selection and one for validation was moderately high for protein content (0.53) and moderate for grain yield (0.40) and protein yield (0.41). These values were about halved for inter-environment, inter-population predictions using one RIL population for model construction to predict data of the other populations. The comparison between GS and PS for protein yield based on predicted gains per unit time and similar evaluation costs indicated an advantage of GS for model construction including the target RIL population and, in case of multi-year PS, even for model training based on data of a non-target population. In conclusion, protein content is less challenging than grain yield for phenotypic or genome-enabled improvement, and GS is promising for the simultaneous improvement of both traits.

Collapse

Priyanatha C, Torkamaneh D, Rajcan I. Genome-Wide Association Study of Soybean Germplasm Derived From Canadian × Chinese Crosses to Mine for Novel Alleles to Improve Seed Yield and Seed Quality Traits. FRONTIERS IN PLANT SCIENCE 2022;13:866300. [PMID: 35419011 PMCID: PMC8996715 DOI: 10.3389/fpls.2022.866300] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/31/2022] [Accepted: 03/04/2022] [Indexed: 05/16/2023]

Siekmann D, Jansen G, Zaar A, Kilian A, Fromme FJ, Hackauf B. A Genome-Wide Association Study Pinpoints Quantitative Trait Genes for Plant Height, Heading Date, Grain Quality, and Yield in Rye (Secale cereale L.). FRONTIERS IN PLANT SCIENCE 2021;12:718081. [PMID: 34777409 PMCID: PMC8586073 DOI: 10.3389/fpls.2021.718081] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/31/2021] [Accepted: 09/22/2021] [Indexed: 06/03/2023]

Abstract

Rye is the only cross-pollinating Triticeae crop species. Knowledge of rye genes controlling complex-inherited traits is scarce, which, currently, largely disables the genomics assisted introgression of untapped genetic variation from self-incompatible germplasm collections in elite inbred lines for hybrid breeding. We report on the first genome-wide association study (GWAS) in rye based on the phenotypic evaluation of 526 experimental hybrids for plant height, heading date, grain quality, and yield in 2 years and up to 19 environments. We established a cross-validated NIRS calibration model as a fast, effective, and robust analytical method to determine grain quality parameters. We observed phenotypic plasticity in plant height and tiller number as a resource use strategy of rye under drought and identified increased grain arabinoxylan content as a striking phenotype in osmotically stressed rye. We used DArTseq™ as a genotyping-by-sequencing technology to reduce the complexity of the rye genome. We established a novel high-density genetic linkage map that describes the position of almost 19k markers and that allowed us to estimate a low genome-wide LD based on the assessed genetic diversity in elite germplasm. We analyzed the relationship between plant height, heading date, agronomic, as well as grain quality traits, and genotype based on 20k novel single-nucleotide polymorphism markers. In addition, we integrated the DArTseq™ markers in the recently established 'Lo7' reference genome assembly. We identified cross-validated SNPs in 'Lo7' protein-coding genes associated with all traits studied. These include associations of the WUSCHEL-related homeobox transcription factor DWT1 and grain yield, the DELLA protein gene SLR1 and heading date, the Ethylene overproducer 1-like protein gene ETOL1 and thousand-grain weight, protein and starch content, as well as the Lectin receptor kinase SIT2 and plant height. A Leucine-rich repeat receptor protein kinase and a Xyloglucan alpha-1,6-xylosyltransferase count among the cross-validated genes associated with water-extractable arabinoxylan content. This study demonstrates the power of GWAS, hybrid breeding, and the reference genome sequence in rye genetics research to dissect and identify the function of genes shaping genetic diversity in agronomic and grain quality traits of rye. The described links between genetic causes and phenotypic variation will accelerate genomics-enabled rye improvement.

Collapse

False discovery rate control in genome-wide association studies with population structure. Proc Natl Acad Sci U S A 2021;118:e2105841118. [PMID: 34580220 PMCID: PMC8501795 DOI: 10.1073/pnas.2105841118 10.1073/pnas.2105841118] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open

Sesia M, Bates S, Candès E, Marchini J, Sabatti C. False discovery rate control in genome-wide association studies with population structure. Proc Natl Acad Sci U S A 2021;118:e2105841118. [PMID: 34580220 PMCID: PMC8501795 DOI: 10.1073/pnas.2105841118] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/18/2021] [Indexed: 12/25/2022] Open

Wallin J, Bogdan M, Szulc PA, Doerge RW, Siegmund DO. Ghost QTL and hotspots in experimental crosses: novel approach for modeling polygenic effects. Genetics 2021;217:6067404. [PMID: 33789342 DOI: 10.1093/genetics/iyaa041] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2020] [Accepted: 12/10/2020] [Indexed: 11/14/2022] Open

BOGOMOLOV MARINA, PETERSON CHRISTINEB, BENJAMINI YOAV, SABATTI CHIARA. Hypotheses on a tree: new error rates and testing strategies. Biometrika 2021;108:575-590. [PMID: 36825068 PMCID: PMC9945647 DOI: 10.1093/biomet/asaa086] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

Panahabadi R, Ahmadikhah A, McKee LS, Ingvarsson PK, Farrokhi N. Genome-Wide Association Mapping of Mixed Linkage (1,3;1,4)-β-Glucan and Starch Contents in Rice Whole Grain. FRONTIERS IN PLANT SCIENCE 2021;12:665745. [PMID: 34512678 PMCID: PMC8424012 DOI: 10.3389/fpls.2021.665745] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/10/2021] [Accepted: 07/28/2021] [Indexed: 05/27/2023]

Katsevich E, Sabatti C, Bogomolov M. Filtering the rejection set while preserving false discovery rate control. J Am Stat Assoc 2021;118:165-176. [PMID: 37346227 PMCID: PMC10281705 DOI: 10.1080/01621459.2021.1920958] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2020] [Revised: 04/14/2021] [Accepted: 04/18/2021] [Indexed: 12/28/2022]

Lightfoot JT, Roth SM, Hubal MJ. Systems Exercise Genetics Research Design Standards. Med Sci Sports Exerc 2021;53:883-887. [PMID: 33844668 DOI: 10.1249/mss.0000000000002563] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]

Kafle OP, Cheng S, Ma M, Li P, Cheng B, Zhang L, Wen Y, Liang C, Qi X, Zhang F. Identifying insomnia-related chemicals through integrative analysis of genome-wide association studies and chemical-genes interaction information. Sleep 2021;43:5805199. [PMID: 32170308 DOI: 10.1093/sleep/zsaa042] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2019] [Revised: 03/02/2020] [Indexed: 12/30/2022] Open

Affiliation(s)

Om Prakash Kafle Key Laboratory of Trace Elements and Endemic Diseases of National Health and Family Planning Commission, School of Public Health, Health Science Center, Xi'an Jiaotong University, Xi'an, P. R. China
Shiqiang Cheng Key Laboratory of Trace Elements and Endemic Diseases of National Health and Family Planning Commission, School of Public Health, Health Science Center, Xi'an Jiaotong University, Xi'an, P. R. China
Mei Ma Key Laboratory of Trace Elements and Endemic Diseases of National Health and Family Planning Commission, School of Public Health, Health Science Center, Xi'an Jiaotong University, Xi'an, P. R. China
Ping Li Key Laboratory of Trace Elements and Endemic Diseases of National Health and Family Planning Commission, School of Public Health, Health Science Center, Xi'an Jiaotong University, Xi'an, P. R. China
Bolun Cheng Key Laboratory of Trace Elements and Endemic Diseases of National Health and Family Planning Commission, School of Public Health, Health Science Center, Xi'an Jiaotong University, Xi'an, P. R. China
Lu Zhang Key Laboratory of Trace Elements and Endemic Diseases of National Health and Family Planning Commission, School of Public Health, Health Science Center, Xi'an Jiaotong University, Xi'an, P. R. China
Yan Wen Key Laboratory of Trace Elements and Endemic Diseases of National Health and Family Planning Commission, School of Public Health, Health Science Center, Xi'an Jiaotong University, Xi'an, P. R. China
Chujun Liang Key Laboratory of Trace Elements and Endemic Diseases of National Health and Family Planning Commission, School of Public Health, Health Science Center, Xi'an Jiaotong University, Xi'an, P. R. China
Xin Qi Key Laboratory of Trace Elements and Endemic Diseases of National Health and Family Planning Commission, School of Public Health, Health Science Center, Xi'an Jiaotong University, Xi'an, P. R. China
Feng Zhang Key Laboratory of Trace Elements and Endemic Diseases of National Health and Family Planning Commission, School of Public Health, Health Science Center, Xi'an Jiaotong University, Xi'an, P. R. China

Collapse

Mai TT, Turner P, Corander J. Boosting heritability: estimating the genetic component of phenotypic variation with multiple sample splitting. BMC Bioinformatics 2021;22:164. [PMID: 33773584 PMCID: PMC8004405 DOI: 10.1186/s12859-021-04079-7] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2021] [Accepted: 03/15/2021] [Indexed: 11/29/2022] Open

Dang JT, Dang TT, Wine E, Dicken B, Madsen K, Laffin M. The Genetics of Postoperative Recurrence in Crohn Disease: A Systematic Review, Meta-analysis, and Framework for Future Work. CROHN'S & COLITIS 360 2021;3:otaa094. [PMID: 36778938 PMCID: PMC9802308 DOI: 10.1093/crocol/otaa094] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/23/2020] [Indexed: 12/12/2022] Open

Tibbs Cortes L, Zhang Z, Yu J. Status and prospects of genome-wide association studies in plants. THE PLANT GENOME 2021;14:e20077. [PMID: 33442955 DOI: 10.1002/tpg2.20077] [Citation(s) in RCA: 138] [Impact Index Per Article: 46.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/05/2020] [Accepted: 11/18/2020] [Indexed: 05/22/2023]

Chen Z, Boehnke M, Wen X, Mukherjee B. Revisiting the genome-wide significance threshold for common variant GWAS. G3 (BETHESDA, MD.) 2021;11:jkaa056. [PMID: 33585870 PMCID: PMC8022962 DOI: 10.1093/g3journal/jkaa056] [Citation(s) in RCA: 39] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/18/2020] [Accepted: 11/05/2020] [Indexed: 11/23/2022]

Identification and Characterization of Serum microRNAs as Biomarkers for Human Disc Degeneration: An RNA Sequencing Analysis. Diagnostics (Basel) 2020;10:diagnostics10121063. [PMID: 33302347 PMCID: PMC7762572 DOI: 10.3390/diagnostics10121063] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2020] [Revised: 12/01/2020] [Accepted: 12/02/2020] [Indexed: 12/03/2022] Open

Nunes JRS, Pértille F, Andrade SCS, Perazza CA, Villela PMS, Almeida-Val VMF, Gao ZX, Coutinho LL, Hilsdorf AWS. Genome-wide association study reveals genes associated with the absence of intermuscular bones in tambaqui (Colossoma macropomum). Anim Genet 2020;51:899-909. [PMID: 33006182 DOI: 10.1111/age.13001] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/24/2020] [Indexed: 01/21/2023]

Powell Doherty RD, Liao H, Satsangi JJ, Ternette N. Extended Analysis Identifies Drug-Specific Association of 2 Distinct HLA Class II Haplotypes for Development of Immunogenicity to Adalimumab and Infliximab. Gastroenterology 2020;159:784-787. [PMID: 32275970 DOI: 10.1053/j.gastro.2020.03.073] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/26/2020] [Revised: 03/26/2020] [Accepted: 03/30/2020] [Indexed: 01/07/2023]

Lees JA, Mai TT, Galardini M, Wheeler NE, Horsfield ST, Parkhill J, Corander J. Improved Prediction of Bacterial Genotype-Phenotype Associations Using Interpretable Pangenome-Spanning Regressions. mBio 2020;11:e01344-20. [PMID: 32636251 PMCID: PMC7343994 DOI: 10.1128/mbio.01344-20] [Citation(s) in RCA: 38] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2020] [Accepted: 06/05/2020] [Indexed: 12/19/2022] Open

Abstract

Discovery of genetic variants underlying bacterial phenotypes and the prediction of phenotypes such as antibiotic resistance are fundamental tasks in bacterial genomics. Genome-wide association study (GWAS) methods have been applied to study these relations, but the plastic nature of bacterial genomes and the clonal structure of bacterial populations creates challenges. We introduce an alignment-free method which finds sets of loci associated with bacterial phenotypes, quantifies the total effect of genetics on the phenotype, and allows accurate phenotype prediction, all within a single computationally scalable joint modeling framework. Genetic variants covering the entire pangenome are compactly represented by extended DNA sequence words known as unitigs, and model fitting is achieved using elastic net penalization, an extension of standard multiple regression. Using an extensive set of state-of-the-art bacterial population genomic data sets, we demonstrate that our approach performs accurate phenotype prediction, comparable to popular machine learning methods, while retaining both interpretability and computational efficiency. Compared to those of previous approaches, which test each genotype-phenotype association separately for each variant and apply a significance threshold, the variants selected by our joint modeling approach overlap substantially.IMPORTANCE Being able to identify the genetic variants responsible for specific bacterial phenotypes has been the goal of bacterial genetics since its inception and is fundamental to our current level of understanding of bacteria. This identification has been based primarily on painstaking experimentation, but the availability of large data sets of whole genomes with associated phenotype metadata promises to revolutionize this approach, not least for important clinical phenotypes that are not amenable to laboratory analysis. These models of phenotype-genotype association can in the future be used for rapid prediction of clinically important phenotypes such as antibiotic resistance and virulence by rapid-turnaround or point-of-care tests. However, despite much effort being put into adapting genome-wide association study (GWAS) approaches to cope with bacterium-specific problems, such as strong population structure and horizontal gene exchange, current approaches are not yet optimal. We describe a method that advances methodology for both association and generation of portable prediction models.

Collapse

Shi X, Jiao Y, Yang Y, Cheng CY, Yang C, Lin X, Liu J. VIMCO: variational inference for multiple correlated outcomes in genome-wide association studies. Bioinformatics 2020;35:3693-3700. [PMID: 30851102 DOI: 10.1093/bioinformatics/btz167] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2018] [Revised: 12/22/2018] [Accepted: 03/08/2019] [Indexed: 12/19/2022] Open

Beesley LJ, Salvatore M, Fritsche LG, Pandit A, Rao A, Brummett C, Willer CJ, Lisabeth LD, Mukherjee B. The emerging landscape of health research based on biobanks linked to electronic health records: Existing resources, statistical challenges, and potential opportunities. Stat Med 2020;39:773-800. [PMID: 31859414 PMCID: PMC7983809 DOI: 10.1002/sim.8445] [Citation(s) in RCA: 48] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2018] [Revised: 09/10/2019] [Accepted: 11/16/2019] [Indexed: 01/03/2023]

Sesia M, Katsevich E, Bates S, Candès E, Sabatti C. Multi-resolution localization of causal variants across the genome. Nat Commun 2020;11:1093. [PMID: 32107378 PMCID: PMC7046731 DOI: 10.1038/s41467-020-14791-2] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2019] [Accepted: 02/01/2020] [Indexed: 01/07/2023] Open

Potential of Genome-Wide Association Studies and Genomic Selection to Improve Productivity and Quality of Commercial Timber Species in Tropical Rainforest, a Case Study of Shorea platyclados. FORESTS 2020. [DOI: 10.3390/f11020239] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]

Renaux C, Buzdugan L, Kalisch M, Bühlmann P. Hierarchical inference for genome-wide association studies: a view on methodology with software. Comput Stat 2020. [DOI: 10.1007/s00180-019-00939-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]

Becker GM, Davenport KM, Burke JM, Lewis RM, Miller JE, Morgan JLM, Notter DR, Murdoch BM. Genome-wide association study to identify genetic loci associated with gastrointestinal nematode resistance in Katahdin sheep. Anim Genet 2020;51:330-335. [PMID: 31900974 PMCID: PMC7064973 DOI: 10.1111/age.12895] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/28/2019] [Indexed: 12/11/2022]