76
|
Sadler MC, Auwerx C, Deelen P, Kutalik Z. Multi-layered genetic approaches to identify approved drug targets. CELL GENOMICS 2023; 3:100341. [PMID: 37492104 PMCID: PMC10363916 DOI: 10.1016/j.xgen.2023.100341] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/27/2023] [Revised: 04/25/2023] [Accepted: 05/16/2023] [Indexed: 07/27/2023]
Abstract
Drugs targeting genes linked to disease via evidence from human genetics have increased odds of approval. Approaches to prioritize such genes include genome-wide association studies (GWASs), rare variant burden tests in exome sequencing studies (Exome), or integration of a GWAS with expression/protein quantitative trait loci (eQTL/pQTL-GWAS). Here, we compare gene-prioritization approaches on 30 clinically relevant traits and benchmark their ability to recover drug targets. Across traits, prioritized genes were enriched for drug targets with odds ratios (ORs) of 2.17, 2.04, 1.81, and 1.31 for the GWAS, eQTL-GWAS, Exome, and pQTL-GWAS methods, respectively. Adjusting for differences in testable genes and sample sizes, GWAS outperforms e/pQTL-GWAS, but not the Exome approach. Furthermore, performance increased through gene network diffusion, although the node degree, being the best predictor (OR = 8.7), revealed strong bias in literature-curated networks. In conclusion, we systematically assessed strategies to prioritize drug target genes, highlighting the promises and pitfalls of current approaches.
Collapse
|
77
|
Drown MK, Oleksiak MF, Crawford DL. Trans-Acting Genotypes Associted with mRNA Expression Affect Metabolic and Thermal Tolerance Traits. Genome Biol Evol 2023:evad123. [PMID: 37392472 PMCID: PMC10370451 DOI: 10.1093/gbe/evad123] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2023] [Revised: 06/23/2023] [Accepted: 06/28/2023] [Indexed: 07/03/2023] Open
Abstract
Evolutionary processes driving physiological trait variation depend on the underlying genomic mechanisms. Evolution of these mechanisms depends on the genetic complexity (involving many genes) and how gene expression impacting the traits is converted to phenotype. Yet, genomic mechanisms that impact physiological traits are diverse and context dependent (e.g., vary by environment, tissues), making them difficult to discern. We examine the relationships between genotype, mRNA expression, and physiological traits to discern the genetic complexity and whether the gene expression affecting the physiological traits is primarily cis or trans-acting. We use low-coverage whole genome sequencing and heart- or brain-specific mRNA expression to identify polymorphisms directly associated with physiological traits and expressed quantitative trait loci (eQTL) indirectly associated with variation in six temperature specific physiological traits (standard metabolic rate, thermal tolerance, and four substrate specific cardiac metabolic rates). Focusing on a select set of mRNAs belonging to co-expression modules that explain up to 82% of temperature specific traits, we identified hundreds of significant eQTL for mRNA whose expression affects physiological traits. Surprisingly, most eQTL (97.4% for heart and 96.7% for brain) were trans-acting. This could be due to higher effect size of trans versus cis-acting eQTL for mRNAs that are central to co-expression modules. That is, we may have enhanced the identification of trans-acting factors by looking for SNPs associated with mRNAs in co-expression modules that broadly influence gene expression patterns. Overall, these data indicate that the genomic mechanism driving physiological variation across environments is driven by trans-acting heart- or brain-specific mRNA expression.
Collapse
|
78
|
Wang YH, Luo PP, Geng AY, Li X, Liu TH, He YJ, Huang L, Tang YQ. Identification of highly reliable risk genes for Alzheimer's disease through joint-tissue integrative analysis. Front Aging Neurosci 2023; 15:1183119. [PMID: 37416324 PMCID: PMC10320295 DOI: 10.3389/fnagi.2023.1183119] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2023] [Accepted: 05/30/2023] [Indexed: 07/08/2023] Open
Abstract
Numerous genetic variants associated with Alzheimer's disease (AD) have been identified through genome-wide association studies (GWAS), but their interpretation is hindered by the strong linkage disequilibrium (LD) among the variants, making it difficult to identify the causal variants directly. To address this issue, the transcriptome-wide association study (TWAS) was employed to infer the association between gene expression and a trait at the genetic level using expression quantitative trait locus (eQTL) cohorts. In this study, we applied the TWAS theory and utilized the improved Joint-Tissue Imputation (JTI) approach and Mendelian Randomization (MR) framework (MR-JTI) to identify potential AD-associated genes. By integrating LD score, GTEx eQTL data, and GWAS summary statistic data from a large cohort using MR-JTI, a total of 415 AD-associated genes were identified. Then, 2873 differentially expressed genes from 11 AD-related datasets were used for the Fisher test of these AD-associated genes. We finally obtained 36 highly reliable AD-associated genes, including APOC1, CR1, ERBB2, and RIN3. Moreover, the GO and KEGG enrichment analysis revealed that these genes are primarily involved in antigen processing and presentation, amyloid-beta formation, tau protein binding, and response to oxidative stress. The identification of these potential AD-associated genes not only provides insights into the pathogenesis of AD but also offers biomarkers for early diagnosis of the disease.
Collapse
|
79
|
Gedik H, Nguyen TH, Peterson RE, Chatzinakos C, Vladimirov VI, Riley BP, Bacanu SA. Identifying potential risk genes and pathways for neuropsychiatric and substance use disorders using intermediate molecular mediator information. Front Genet 2023; 14:1191264. [PMID: 37415601 PMCID: PMC10320396 DOI: 10.3389/fgene.2023.1191264] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2023] [Accepted: 05/23/2023] [Indexed: 07/08/2023] Open
Abstract
Neuropsychiatric and substance use disorders (NPSUDs) have a complex etiology that includes environmental and polygenic risk factors with significant cross-trait genetic correlations. Genome-wide association studies (GWAS) of NPSUDs yield numerous association signals. However, for most of these regions, we do not yet have a firm understanding of either the specific risk variants or the effects of these variants. Post-GWAS methods allow researchers to use GWAS summary statistics and molecular mediators (transcript, protein, and methylation abundances) infer the effect of these mediators on risk for disorders. One group of post-GWAS approaches is commonly referred to as transcriptome/proteome/methylome-wide association studies, which are abbreviated as T/P/MWAS (or collectively as XWAS). Since these approaches use biological mediators, the multiple testing burden is reduced to the number of genes (∼20,000) instead of millions of GWAS SNPs, which leads to increased signal detection. In this work, our aim is to uncover likely risk genes for NPSUDs by performing XWAS analyses in two tissues-blood and brain. First, to identify putative causal risk genes, we performed an XWAS using the Summary-data-based Mendelian randomization, which uses GWAS summary statistics, reference xQTL data, and a reference LD panel. Second, given the large comorbidities among NPSUDs and the shared cis-xQTLs between blood and the brain, we improved XWAS signal detection for underpowered analyses by performing joint concordance analyses between XWAS results i) across the two tissues and ii) across NPSUDs. All XWAS signals i) were adjusted for heterogeneity in dependent instruments (HEIDI) (non-causality) p-values and ii) used to test for pathway enrichment. The results suggest that there were widely shared gene/protein signals within the major histocompatibility complex region on chromosome 6 (BTN3A2 and C4A) and elsewhere in the genome (FURIN, NEK4, RERE, and ZDHHC5). The identification of putative molecular genes and pathways underlying risk may offer new targets for therapeutic development. Our study revealed an enrichment of XWAS signals in vitamin D and omega-3 gene sets. So, including vitamin D and omega-3 in treatment plans may have a modest but beneficial effect on patients with bipolar disorder.
Collapse
|
80
|
Advani J, Corso-Diaz X, Kwicklis M, van Asten F, Ratnapriya R, Mehta P, Hamel A, Mahrotra S, Segrè A, Kiel C, Strunz T, Weber B, Chew E, Hernandez D, Montezuma S, Ferrington D, Swaroop A. QTL mapping of human retina DNA methylation identifies 87 gene-epigenome interactions in age-related macular degeneration. RESEARCH SQUARE 2023:rs.3.rs-3011096. [PMID: 37398472 PMCID: PMC10312909 DOI: 10.21203/rs.3.rs-3011096/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/04/2023]
Abstract
DNA methylation (DNAm) provides a crucial epigenetic mark linking genetic variations to environmental influence. We analyzed array-based DNAm profiles of 160 human retinas with co-measured RNA-seq and > 8 million genetic variants, uncovering sites of genetic regulation in cis (37,453 mQTLs and 12,505 eQTLs) and 13,747 eQTMs (DNAm loci affecting gene expression), with over one-third specific to the retina. mQTLs and eQTMs show non-random distribution and enrichment of biological processes related to synapse, mitochondria, and catabolism. Summary data-based Mendelian randomization and colocalization analyses identify 87 target genes where methylation and gene-expression changes likely mediate the genotype effect on age-related macular degeneration (AMD). Integrated pathway analysis reveals epigenetic regulation of immune response and metabolism including the glutathione pathway and glycolysis. Our study thus defines key roles of genetic variations driving methylation changes, prioritizes epigenetic control of gene expression, and suggests frameworks for regulation of AMD pathology by genotype-environment interaction in retina.
Collapse
|
81
|
Gjorgjieva T, Chaloemtoem A, Shahin T, Bayaraa O, Dieng MM, Alshaikh M, Abdalbaqi M, Del Monte J, Begum G, Leonor C, Manikandan V, Drou N, Arshad M, Arnoux M, Kumar N, Jabari A, Abdulle A, ElGhazali G, Ali R, Shaheen SY, Abdalla J, Piano F, Gunsalus KC, Daggag H, Al Nahdi H, Abuzeid H, Idaghdour Y. Systems genetics identifies miRNA-mediated regulation of host response in COVID-19. Hum Genomics 2023; 17:49. [PMID: 37303042 DOI: 10.1186/s40246-023-00494-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2023] [Accepted: 05/10/2023] [Indexed: 06/13/2023] Open
Abstract
BACKGROUND Individuals infected with SARS-CoV-2 vary greatly in their disease severity, ranging from asymptomatic infection to severe disease. The regulation of gene expression is an important mechanism in the host immune response and can modulate the outcome of the disease. miRNAs play important roles in post-transcriptional regulation with consequences on downstream molecular and cellular host immune response processes. The nature and magnitude of miRNA perturbations associated with blood phenotypes and intensive care unit (ICU) admission in COVID-19 are poorly understood. RESULTS We combined multi-omics profiling-genotyping, miRNA and RNA expression, measured at the time of hospital admission soon after the onset of COVID-19 symptoms-with phenotypes from electronic health records to understand how miRNA expression contributes to variation in disease severity in a diverse cohort of 259 unvaccinated patients in Abu Dhabi, United Arab Emirates. We analyzed 62 clinical variables and expression levels of 632 miRNAs measured at admission and identified 97 miRNAs associated with 8 blood phenotypes significantly associated with later ICU admission. Integrative miRNA-mRNA cross-correlation analysis identified multiple miRNA-mRNA-blood endophenotype associations and revealed the effect of miR-143-3p on neutrophil count mediated by the expression of its target gene BCL2. We report 168 significant cis-miRNA expression quantitative trait loci, 57 of which implicate miRNAs associated with either ICU admission or a blood endophenotype. CONCLUSIONS This systems genetics study has given rise to a genomic picture of the architecture of whole blood miRNAs in unvaccinated COVID-19 patients and pinpoints post-transcriptional regulation as a potential mechanism that impacts blood traits underlying COVID-19 severity. The results also highlight the impact of host genetic regulatory control of miRNA expression in early stages of COVID-19 disease.
Collapse
|
82
|
Pudjihartono N, Ho D, Golovina E, Fadason T, Kempa-Liehr AW, O'Sullivan JM. Juvenile idiopathic arthritis-associated genetic loci exhibit spatially constrained gene regulatory effects across multiple tissues and immune cell types. J Autoimmun 2023; 138:103046. [PMID: 37229810 DOI: 10.1016/j.jaut.2023.103046] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2023] [Revised: 04/04/2023] [Accepted: 04/15/2023] [Indexed: 05/27/2023]
Abstract
Juvenile idiopathic arthritis (JIA) is an autoimmune, inflammatory joint disease with complex genetic etiology. Previous GWAS have found many genetic loci associated with JIA. However, the biological mechanism behind JIA remains unknown mainly because most risk loci are located in non-coding genetic regions. Interestingly, increasing evidence has found that regulatory elements in the non-coding regions can regulate the expression of distant target genes through spatial (physical) interactions. Here, we used information on the 3D genome organization (Hi-C data) to identify target genes that physically interact with SNPs within JIA risk loci. Subsequent analysis of these SNP-gene pairs using data from tissue and immune cell type-specific expression quantitative trait loci (eQTL) databases allowed the identification of risk loci that regulate the expression of their target genes. In total, we identified 59 JIA-risk loci that regulate the expression of 210 target genes across diverse tissues and immune cell types. Functional annotation of spatial eQTLs within JIA risk loci identified significant overlap with gene regulatory elements (i.e., enhancers and transcription factor binding sites). We found target genes involved in immune-related pathways such as antigen processing and presentation (e.g., ERAP2, HLA class I and II), the release of pro-inflammatory cytokines (e.g., LTBR, TYK2), proliferation and differentiation of specific immune cell types (e.g., AURKA in Th17 cells), and genes involved in physiological mechanisms related to pathological joint inflammation (e.g., LRG1 in arteries). Notably, many of the tissues where JIA-risk loci act as spatial eQTLs are not classically considered central to JIA pathology. Overall, our findings highlight the potential tissue and immune cell type-specific regulatory changes contributing to JIA pathogenesis. Future integration of our data with clinical studies can contribute to the development of improved JIA therapy.
Collapse
|
83
|
Le NN, Tran TQB, du Toit C, Gill D, Padmanabhan S. Establishing plausibility of cardiovascular adverse effects of immunotherapies using Mendelian randomisation. Front Cardiovasc Med 2023; 10:1116799. [PMID: 37273876 PMCID: PMC10235787 DOI: 10.3389/fcvm.2023.1116799] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2022] [Accepted: 04/17/2023] [Indexed: 06/06/2023] Open
Abstract
Immune checkpoint inhibitors (ICIs) and Janus kinase inhibitors (JAKis) have raised concerns over serious unexpected cardiovascular adverse events. The widespread pleiotropy in genome-wide association studies offers an opportunity to identify cardiovascular risks from in-development drugs to help inform appropriate trial design and pharmacovigilance strategies. This study uses the Mendelian randomization (MR) approach to study the causal effects of 9 cardiovascular risk factors on ischemic stroke risk both independently and by mediation, followed by an interrogation of the implicated expression quantitative trait loci (eQTLs) to determine if the enriched pathways can explain the adverse stroke events observed with ICI or JAKi treatment. Genetic predisposition to higher systolic blood pressure (SBP), diastolic blood pressure (DBP), body mass index (BMI), waist-to-hip ratio (WHR), low-density lipoprotein cholesterol (LDL), triglycerides (TG), type 2 diabetes (T2DM), and smoking index were associated with higher ischemic stroke risk. The associations of genetically predicted BMI, WHR, and TG on the outcome were attenuated after adjusting for genetically predicted T2DM [BMI: 53.15% mediated, 95% CI 17.21%-89.10%; WHR: 42.92% (4.17%-81.67%); TG: 72.05% (10.63%-133.46%)]. JAKis, programmed cell death protein 1 and programmed death ligand 1 inhibitors were implicated in the pathways enriched by the genes related to the instruments for each of SBP, DBP, WHR, T2DM, and LDL. Overall, MR mediation analyses support the role of T2DM in mediating the effects of BMI, WHR, and TG on ischemic stroke risk and follow-up pathway enrichment analysis highlights the utility of this approach in the early identification of potential harm from drugs.
Collapse
|
84
|
Gibbs KD, Wang L, Yang Z, Anderson CE, Bourgeois JS, Cao Y, Gaggioli MR, Biel M, Puertollano R, Chen CC, Ko DC. Human variation impacting MCOLN2 restricts Salmonella Typhi replication by magnesium deprivation. CELL GENOMICS 2023; 3:100290. [PMID: 37228749 PMCID: PMC10203047 DOI: 10.1016/j.xgen.2023.100290] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/02/2022] [Revised: 01/24/2023] [Accepted: 02/27/2023] [Indexed: 05/27/2023]
Abstract
Human genetic diversity can reveal critical factors in host-pathogen interactions. This is especially useful for human-restricted pathogens like Salmonella enterica serovar Typhi (S. Typhi), the cause of typhoid fever. One key defense during bacterial infection is nutritional immunity: host cells attempt to restrict bacterial replication by denying bacteria access to key nutrients or supplying toxic metabolites. Here, a cellular genome-wide association study of intracellular replication by S. Typhi in nearly a thousand cell lines from around the world-and extensive follow-up using intracellular S. Typhi transcriptomics and manipulation of magnesium availability-demonstrates that the divalent cation channel mucolipin-2 (MCOLN2 or TRPML2) restricts S. Typhi intracellular replication through magnesium deprivation. Mg2+ currents, conducted through MCOLN2 and out of endolysosomes, were measured directly using patch-clamping of the endolysosomal membrane. Our results reveal Mg2+ limitation as a key component of nutritional immunity against S. Typhi and as a source of variable host resistance.
Collapse
|
85
|
Lee YL, Bosse M, Takeda H, Moreira GCM, Karim L, Druet T, Oget-Ebrad C, Coppieters W, Veerkamp RF, Groenen MAM, Georges M, Bouwman AC, Charlier C. High-resolution structural variants catalogue in a large-scale whole genome sequenced bovine family cohort data. BMC Genomics 2023; 24:225. [PMID: 37127590 PMCID: PMC10152703 DOI: 10.1186/s12864-023-09259-8] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2022] [Accepted: 03/20/2023] [Indexed: 05/03/2023] Open
Abstract
BACKGROUND Structural variants (SVs) are chromosomal segments that differ between genomes, such as deletions, duplications, insertions, inversions and translocations. The genomics revolution enabled the discovery of sub-microscopic SVs via array and whole-genome sequencing (WGS) data, paving the way to unravel the functional impact of SVs. Recent human expression QTL mapping studies demonstrated that SVs play a disproportionally large role in altering gene expression, underlining the importance of including SVs in genetic analyses. Therefore, this study aimed to generate and explore a high-quality bovine SV catalogue exploiting a unique cattle family cohort data (total 266 samples, forming 127 trios). RESULTS We curated 13,731 SVs segregating in the population, consisting of 12,201 deletions, 1,509 duplications, and 21 multi-allelic CNVs (> 50-bp). Of these, we validated a subset of copy number variants (CNVs) utilising a direct genotyping approach in an independent cohort, indicating that at least 62% of the CNVs are true variants, segregating in the population. Among gene-disrupting SVs, we prioritised two likely high impact duplications, encompassing ORM1 and POPDC3 genes, respectively. Liver expression QTL mapping results revealed that these duplications are likely causing altered gene expression, confirming the functional importance of SVs. Although most of the accurately genotyped CNVs are tagged by single nucleotide polymorphisms (SNPs) ascertained in WGS data, most CNVs were not captured by individual SNPs obtained from a 50K genotyping array. CONCLUSION We generated a high-quality SV catalogue exploiting unique whole genome sequenced bovine family cohort data. Two high impact duplications upregulating the ORM1 and POPDC3 are putative candidates for postpartum feed intake and hoof health traits, thus warranting further investigation. Generally, CNVs were in low LD with SNPs on the 50K array. Hence, it remains crucial to incorporate CNVs via means other than tagging SNPs, such as investigation of tagging haplotypes, direct imputation of CNVs, or direct genotyping as done in the current study. The SV catalogue and the custom genotyping array generated in the current study will serve as valuable resources accelerating utilisation of full spectrum of genetic variants in bovine genomes.
Collapse
|
86
|
Wagner M, Sobczyński M, Jasek M, Pawełczyk K, Porębska I, Kuśnierczyk P, Wiśniewski A. Down-regulation of ERAP1 mRNA expression in non-small cell lung cancer. BMC Cancer 2023; 23:383. [PMID: 37101107 PMCID: PMC10134604 DOI: 10.1186/s12885-023-10785-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2022] [Accepted: 03/28/2023] [Indexed: 04/28/2023] Open
Abstract
BACKGROUND ERAP1 is a major aminopeptidase that serves as an editor of the peptide repertoire by trimming N-terminal residues of antigenic peptides, creating a pool of peptides with the optimal length for MHC-I binding. As an important component of the antigen processing and presenting machinery - APM, ERAP1 is frequently down-regulated in many cancers. Since ERAP1 expression has not yet been thoroughly investigated in non-small cell lung cancer (NSCLC), we decided to analyze ERAP1 mRNA levels in tissues collected from NSCLC patients. METHODS Using real-time qPCR, we evaluated ERAP1 mRNA expression in samples of tumor and adjacent non-tumor tissue (serving as control tissue) from 61 NSCLC patients. RESULTS We observed a significantly lower level of ERAP1 mRNA expression in tumor tissue (MedTumor = 0.75) in comparison to non-tumor tissue (MedNon-tumor = 1.1), p = 0.008. One of the five tested polymorphisms, namely rs26653, turned out to be significantly associated with ERAP1 expression in non-tumor tissue (difference [d] = 0.59 CI95% (0.14;1.05), p = 0.0086), but not in tumor tissue. The levels of ERAP1 mRNA expression did not affect the overall survival of NSCLC patients, either in the case of the tumor (p = 0.788) or in non-tumor (p = 0.298) tissue. We did not detect any association between mRNA ERAP1 expression level in normal tissue and: (i) age at diagnosis (p = 0.8386), (ii) patient's sex (p = 0.3616), (iii) histological type of cancer (p = 0.7580) and (iv) clinical stage of NSCLC (p = 0.7549). Furthermore, in the case of tumor tissue none of the abovementioned clinical parameters were associated with ERAP1 expression (p = 0.76). CONCLUSION Down-regulation of ERAP1 mRNA observed in NSCLC tissue may be related to tumor immune evasion strategy. The rs26653 polymorphism can be considered an expression quantitative trait locus (eQTL) associated with ERAP1 expression in normal lung tissue.
Collapse
|
87
|
Zhang C, Li X, Zhao L, Guo W, Deng W, Wang Q, Hu X, Du X, Sham PC, Luo X, Li T. Brain transcriptome-wide association study implicates novel risk genes underlying schizophrenia risk. Psychol Med 2023:1-11. [PMID: 37092861 DOI: 10.1017/s0033291723000417] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 04/25/2023]
Abstract
BACKGROUND To identify risk genes whose expression are regulated by the reported risk variants and to explore the potential regulatory mechanism in schizophrenia (SCZ). METHODS We systematically integrated three independent brain expression quantitative traits (eQTLs) (CommonMind, GTEx, and BrainSeq Phase 2, a total of 1039 individuals) and GWAS data (56 418 cases and 78 818 controls), with the use of transcriptome-wide association study (TWAS). Diffusion magnetic resonance imaging was utilized to quantify the integrity of white matter bundles and determine whether polygenic risk of novel genes linked to brain structure was present in patients with first-episode antipsychotic SCZ. RESULTS TWAS showed that eight risk genes (CORO7, DDAH2, DDHD2, ELAC2, GLT8D1, PCDHA8, THOC7, and TYW5) reached transcriptome-wide significance (TWS) level. These findings were confirmed by an independent integrative approach (i.e. Sherlock). We further conducted conditional analyses and identified the potential risk genes that driven the TWAS association signal in each locus. Gene expression analysis showed that several TWS genes (including CORO7, DDAH2, DDHD2, ELAC2, GLT8D1, THOC7 and TYW5) were dysregulated in the dorsolateral prefrontal cortex of SCZ cases compared with controls. TWS genes were mainly expressed on the surface of glutamatergic neurons, GABAergic neurons, and microglia. Finally, SCZ cases had a substantially greater TWS genes-based polygenic risk (PRS) compared to controls, and we showed that fractional anisotropy of the cingulum-hippocampus mediates the influence of TWS genes PRS on SCZ. CONCLUSIONS Our findings identified novel SCZ risk genes and highlighted the importance of the TWS genes in frontal-limbic dysfunctions in SCZ, indicating possible therapeutic targets.
Collapse
|
88
|
Li S, Schmid KT, de Vries DH, Korshevniuk M, Losert C, Oelen R, van Blokland IV, Groot HE, Swertz MA, van der Harst P, Westra HJ, van der Wijst MGP, Heinig M, Franke L. Identification of genetic variants that impact gene co-expression relationships using large-scale single-cell data. Genome Biol 2023; 24:80. [PMID: 37072791 PMCID: PMC10111756 DOI: 10.1186/s13059-023-02897-x] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2022] [Accepted: 03/16/2023] [Indexed: 04/20/2023] Open
Abstract
BACKGROUND Expression quantitative trait loci (eQTL) studies show how genetic variants affect downstream gene expression. Single-cell data allows reconstruction of personalized co-expression networks and therefore the identification of SNPs altering co-expression patterns (co-expression QTLs, co-eQTLs) and the affected upstream regulatory processes using a limited number of individuals. RESULTS We conduct a co-eQTL meta-analysis across four scRNA-seq peripheral blood mononuclear cell datasets using a novel filtering strategy followed by a permutation-based multiple testing approach. Before the analysis, we evaluate the co-expression patterns required for co-eQTL identification using different external resources. We identify a robust set of cell-type-specific co-eQTLs for 72 independent SNPs affecting 946 gene pairs. These co-eQTLs are replicated in a large bulk cohort and provide novel insights into how disease-associated variants alter regulatory networks. One co-eQTL SNP, rs1131017, that is associated with several autoimmune diseases, affects the co-expression of RPS26 with other ribosomal genes. Interestingly, specifically in T cells, the SNP additionally affects co-expression of RPS26 and a group of genes associated with T cell activation and autoimmune disease. Among these genes, we identify enrichment for targets of five T-cell-activation-related transcription factors whose binding sites harbor rs1131017. This reveals a previously overlooked process and pinpoints potential regulators that could explain the association of rs1131017 with autoimmune diseases. CONCLUSION Our co-eQTL results highlight the importance of studying context-specific gene regulation to understand the biological implications of genetic variation. With the expected growth of sc-eQTL datasets, our strategy and technical guidelines will facilitate future co-eQTL identification, further elucidating unknown disease mechanisms.
Collapse
|
89
|
Zeng L, White CC, Bennett DA, Klein HU, De Jager PL. Genetic insights into the association between inflammatory bowel disease and Alzheimer's disease. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.04.17.23286845. [PMID: 37131588 PMCID: PMC10153331 DOI: 10.1101/2023.04.17.23286845] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
Background Myeloid cells, including monocytes, macrophages, microglia, dendritic cells and neutrophils are a part of innate immunity, playing a major role in orchestrating innate and adaptive immune responses. Microglia are the resident myeloid cells of the central nervous system, and many Alzheimer's disease (AD) risk loci are found in or near genes that are highly or sometimes uniquely expressed in myeloid cells. Similarly, inflammatory bowel disease (IBD) loci are also enriched for genes expressed by myeloid cells. However, the extent to which there is overlap between the effects of AD and IBD susceptibility loci in myeloid cells remains poorly described, and the substantial IBD genetic maps may help to accelerate AD research. Methods Here, we leveraged summary statistics from large-scale genome-wide association studies (GWAS) to investigate the causal effect of IBD (including ulcerative colitis and Crohn's disease) variants on AD and AD endophenotypes. Microglia and monocyte expression Quantitative Trait Locus (eQTLs) were used to examine the functional consequences of IBD and AD risk variants enrichment in two different myeloid cell subtypes. Results Our results showed that, while PTK2B is implicated in both diseases and both sets of risk loci are enriched for myeloid genes, AD and IBD susceptibility loci largely implicate distinct sets of genes and pathways. AD loci are significantly more enriched for microglial eQTLs than IBD. We also found that genetically determined IBD is associated with a lower risk of AD, which may driven by a negative effect on the accumulation of neurofibrillary tangles (beta=-1.04, p=0.013). In addition, IBD displayed a significant positive genetic correlation with psychiatric disorders and multiple sclerosis, while AD showed a significant positive genetic correlation with amyotrophic lateral sclerosis. Conclusion To our knowledge, this is the first study to systematically contrast the genetic association between IBD and AD, our findings highlight a possible genetically protective effect of IBD on AD even if the majority of effects on myeloid cell gene expression by the two sets of disease variants are distinct. Thus, IBD myeloid studies may not help to accelerate AD functional studies, but our observation reinforces the role of myeloid cells in the accumulation of tau proteinopathy and provides a new avenue for discovering a protective factor.
Collapse
|
90
|
Aydin S, Pham DT, Zhang T, Keele GR, Skelly DA, Paulo JA, Pankratz M, Choi T, Gygi SP, Reinholdt LG, Baker CL, Churchill GA, Munger SC. Genetic dissection of the pluripotent proteome through multi-omics data integration. CELL GENOMICS 2023; 3:100283. [PMID: 37082146 PMCID: PMC10112288 DOI: 10.1016/j.xgen.2023.100283] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/26/2022] [Revised: 09/12/2022] [Accepted: 02/27/2023] [Indexed: 04/22/2023]
Abstract
Genetic background drives phenotypic variability in pluripotent stem cells (PSCs). Most studies to date have used transcript abundance as the primary molecular readout of cell state in PSCs. We performed a comprehensive proteogenomics analysis of 190 genetically diverse mouse embryonic stem cell (mESC) lines. The quantitative proteome is highly variable across lines, and we identified pluripotency-associated pathways that were differentially activated in the proteomics data that were not evident in transcriptome data from the same lines. Integration of protein abundance to transcript levels and chromatin accessibility revealed broad co-variation across molecular layers as well as shared and unique drivers of quantitative variation in pluripotency-associated pathways. Quantitative trait locus (QTL) mapping localized the drivers of these multi-omic signatures to genomic hotspots. This study reveals post-transcriptional mechanisms and genetic interactions that underlie quantitative variability in the pluripotent proteome and provides a regulatory map for mESCs that can provide a basis for future mechanistic studies.
Collapse
|
91
|
Li Z, Zhang B, Liu Q, Tao Z, Ding L, Guo B, Zhang E, Zhang H, Meng Z, Guo S, Chen Y, Peng J, Li J, Wang C, Huang Y, Xu H, Wu Y. Genetic association of lipids and lipid-lowering drug target genes with non-alcoholic fatty liver disease. EBioMedicine 2023; 90:104543. [PMID: 37002989 PMCID: PMC10070091 DOI: 10.1016/j.ebiom.2023.104543] [Citation(s) in RCA: 29] [Impact Index Per Article: 29.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2022] [Revised: 03/09/2023] [Accepted: 03/13/2023] [Indexed: 04/03/2023] Open
Abstract
BACKGROUND Some observational studies found that dyslipidaemia is a risk factor for non-alcoholic fatty liver disease (NAFLD), and lipid-lowering drugs may lower NAFLD risk. However, it remains unclear whether dyslipidaemia is causative for NAFLD. This Mendelian randomisation (MR) study aimed to explore the causal role of lipid traits in NAFLD and evaluate the potential effect of lipid-lowering drug targets on NAFLD. METHODS Genetic variants associated with lipid traits and variants of genes encoding lipid-lowering drug targets were extracted from the Global Lipids Genetics Consortium genome-wide association study (GWAS). Summary statistics for NAFLD were obtained from two independent GWAS datasets. Lipid-lowering drug targets that reached significance were further tested using expression quantitative trait loci data in relevant tissues. Colocalisation and mediation analyses were performed to validate the robustness of the results and explore potential mediators. FINDINGS No significant effect of lipid traits and eight lipid-lowering drug targets on NAFLD risk was found. Genetic mimicry of lipoprotein lipase (LPL) enhancement was associated with lower NAFLD risks in two independent datasets (OR1 = 0.60 [95% CI 0.50-0.72], p1 = 2.07 × 10-8; OR2 = 0.57 [95% CI 0.39-0.82], p2 = 3.00 × 10-3). A significant MR association (OR = 0.71 [95% CI, 0.58-0.87], p = 1.20 × 10-3) and strong colocalisation association (PP.H4 = 0.85) with NAFLD were observed for LPL expression in subcutaneous adipose tissue. Fasting insulin and type 2 diabetes mediated 7.40% and 9.15%, respectively, of the total effect of LPL on NAFLD risk. INTERPRETATION Our findings do not support dyslipidaemia as a causal factor for NAFLD. Among nine lipid-lowering drug targets, LPL is a promising candidate drug target in NAFLD. The mechanism of action of LPL in NAFLD may be independent of its lipid-lowering effects. FUNDING Capital's Funds for Health Improvement and Research (2022-4-4037). CAMS Innovation Fund for Medical Sciences (CIFMS, grant number: 2021-I2M-C&T-A-010).
Collapse
|
92
|
Murani E, Hadlich F. Exploration of genotype-by-environment interactions affecting gene expression responses in porcine immune cells. Front Genet 2023; 14:1157267. [PMID: 37007953 PMCID: PMC10061014 DOI: 10.3389/fgene.2023.1157267] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2023] [Accepted: 03/06/2023] [Indexed: 03/18/2023] Open
Abstract
As one of the keys to healthy performance, robustness of farm animals is gaining importance, and with this comes increasing interest in genetic dissection of genotype-by-environment interactions (G×E). Changes in gene expression are among the most sensitive responses conveying adaptation to environmental stimuli. Environmentally responsive regulatory variation thus likely plays a central role in G×E. In the present study, we set out to detect action of environmentally responsive cis-regulatory variation by the analysis of condition-dependent allele specific expression (cd-ASE) in porcine immune cells. For this, we harnessed mRNA-sequencing data of peripheral blood mononuclear cells (PBMCs) stimulated in vitro with lipopolysaccharide, dexamethasone, or their combination. These treatments mimic common challenges such as bacterial infection or stress, and induce vast transcriptome changes. About two thirds of the examined loci showed significant ASE in at least one treatment, and out of those about ten percent exhibited cd-ASE. Most of the ASE variants were not yet reported in the PigGTEx Atlas. Genes showing cd-ASE were enriched in cytokine signaling in immune system and include several key candidates for animal health. In contrast, genes showing no ASE featured cell-cycle related functions. We confirmed LPS-dependent ASE for one of the top candidates, SOD2, which ranks among the major response genes in LPS-stimulated monocytes. The results of the present study demonstrate the potential of in vitro cell models coupled with cd-ASE analysis for the investigation of G×E in farm animals. The identified loci may benefit efforts to unravel the genetic basis of robustness and improvement of health and welfare in pigs.
Collapse
|
93
|
Huang M, Coral D, Ardalani H, Spegel P, Saadat A, Claussnitzer M, Mulder H, Franks PW, Kalamajski S. Identification of a weight loss-associated causal eQTL in MTIF3 and the effects of MTIF3 deficiency on human adipocyte function. eLife 2023; 12:84168. [PMID: 36876906 PMCID: PMC10023155 DOI: 10.7554/elife.84168] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2022] [Accepted: 03/05/2023] [Indexed: 03/07/2023] Open
Abstract
Genetic variation at the MTIF3 (Mitochondrial Translational Initiation Factor 3) locus has been robustly associated with obesity in humans, but the functional basis behind this association is not known. Here, we applied luciferase reporter assay to map potential functional variants in the haplotype block tagged by rs1885988 and used CRISPR-Cas9 to edit the potential functional variants to confirm the regulatory effects on MTIF3 expression. We further conducted functional studies on MTIF3-deficient differentiated human white adipocyte cell line (hWAs-iCas9), generated through inducible expression of CRISPR-Cas9 combined with delivery of synthetic MTIF3-targeting guide RNA. We demonstrate that rs67785913-centered DNA fragment (in LD with rs1885988, r2 > 0.8) enhances transcription in a luciferase reporter assay, and CRISPR-Cas9-edited rs67785913 CTCT cells show significantly higher MTIF3 expression than rs67785913 CT cells. Perturbed MTIF3 expression led to reduced mitochondrial respiration and endogenous fatty acid oxidation, as well as altered expression of mitochondrial DNA-encoded genes and proteins, and disturbed mitochondrial OXPHOS complex assembly. Furthermore, after glucose restriction, the MTIF3 knockout cells retained more triglycerides than control cells. This study demonstrates an adipocyte function-specific role of MTIF3, which originates in the maintenance of mitochondrial function, providing potential explanations for why MTIF3 genetic variation at rs67785913 is associated with body corpulence and response to weight loss interventions.
Collapse
|
94
|
van Wijk MH, Riksen JAG, Elvin M, Poulin GB, Maulana MI, Kammenga JE, Snoek BL, Sterken MG. Cryptic genetic variation of eQTL architecture revealed by genetic perturbation in C. elegans. G3 (BETHESDA, MD.) 2023; 13:7067301. [PMID: 36861370 PMCID: PMC10151397 DOI: 10.1093/g3journal/jkad050] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/27/2022] [Revised: 12/27/2022] [Accepted: 02/07/2023] [Indexed: 03/03/2023]
Abstract
Genetic perturbation in different genetic backgrounds can cause a range of phenotypes within a species. These phenotypic differences can be the result of the interaction between the genetic background and the perturbation. Previously we reported that perturbation of gld-1, an important player in developmental control of C. elegans, released cryptic genetic variation affecting fitness in different genetic backgrounds. Here we investigated the change in transcriptional architecture. We found 414 genes with a cis-eQTL and 991 genes with a trans-eQTL that were specifically found in the gld-1 RNAi treatment. In total, we detected 16 eQTL-hotspots, of which 7 were only found in the gld-1 RNAi treatment. Enrichment analysis of those 7 hotspots showed that the regulated genes were associated with neurons and the pharynx. Furthermore, we found evidence of accelerated transcriptional aging in the gld-1 RNAi treated nematodes. Overall, our results illustrate that studying CGV leads to the discovery of hidden polymorphic regulators.
Collapse
|
95
|
Yermakovich D, Pankratov V, Võsa U, Yunusbayev B, Dannemann M. Long-range regulatory effects of Neandertal DNA in modern humans. Genetics 2023; 223:6957427. [PMID: 36560850 PMCID: PMC9991505 DOI: 10.1093/genetics/iyac188] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2022] [Revised: 10/13/2022] [Accepted: 12/16/2022] [Indexed: 12/24/2022] Open
Abstract
The admixture between modern humans and Neandertals has resulted in ∼2% of the genomes of present-day non-Africans being composed of Neandertal DNA. Introgressed Neandertal DNA has been demonstrated to significantly affect the transcriptomic landscape in people today and via this molecular mechanism influence phenotype variation as well. However, little is known about how much of that regulatory impact is mediated through long-range regulatory effects that have been shown to explain ∼20% of expression variation. Here we identified 60 transcription factors (TFs) with their top cis-eQTL SNP in GTEx being of Neandertal ancestry and predicted long-range Neandertal DNA-induced regulatory effects by screening for the predicted target genes of those TFs. We show that the TFs form a significantly connected protein-protein interaction network. Among them are JUN and PRDM5, two brain-expressed TFs that have their predicted target genes enriched in regions devoid of Neandertal DNA. Archaic cis-eQTLs for the 60 TFs include multiple candidates for local adaptation, some of which show significant allele frequency increases over the last ∼10,000 years. A large proportion of the cis-eQTL-associated archaic SNPs have additional associations with various immune traits, schizophrenia, blood cell type composition and anthropometric measures. Finally, we demonstrate that our results are consistent with those of Neandertal DNA-associated empirical trans-eQTLs. Our results suggest that Neandertal DNA significantly influences regulatory networks, that its regulatory reach goes beyond the 40% of genomic sequence it still covers in present-day non-Africans and that via the investigated mechanism Neandertal DNA influences the phenotypic variation in people today.
Collapse
|
96
|
Hoellinger T, Mestre C, Aschard H, Le Goff W, Foissac S, Faraut T, Djebali S. Enhancer/gene relationships: Need for more reliable genome-wide reference sets. FRONTIERS IN BIOINFORMATICS 2023; 3:1092853. [PMID: 36909938 PMCID: PMC9999192 DOI: 10.3389/fbinf.2023.1092853] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2022] [Accepted: 02/07/2023] [Indexed: 02/26/2023] Open
Abstract
Differences in cells' functions arise from differential activity of regulatory elements, including enhancers. Enhancers are cis-regulatory elements that cooperate with promoters through transcription factors to activate the expression of one or several genes by getting physically close to them in the 3D space of the nucleus. There is increasing evidence that genetic variants associated with common diseases are enriched in enhancers active in cell types relevant to these diseases. Identifying the enhancers associated with genes and conversely, the sets of genes activated by each enhancer (the so-called enhancer/gene or E/G relationships) across cell types, can help understanding the genetic mechanisms underlying human diseases. There are three broad approaches for the genome-wide identification of E/G relationships in a cell type: 1) genetic link methods or eQTL, 2) functional link methods based on 1D functional data such as open chromatin, histone mark or gene expression and 3) spatial link methods based on 3D data such as HiC. Since 1) and 3) are costly, the current strategy is to develop functional link methods and to use data from 1) and 3) as reference to evaluate them. However, there is still no consensus on the best functional link method to date, and method comparison remain seldom. Here, we compared the relative performances of three recent methods for the identification of enhancer-gene links, TargetFinder, Average-Rank, and the ABC model, using the three latest benchmarks from the field: a reference that combines 3D and eQTL data, called BENGI, and two genetic screening references, called CRiFF and CRiSPRi. Overall, none of the three methods performed best on the three references. CRiFF and CRISPRi reference sets are likely more reliable, but CRiFF is not genome-wide and CRiFF and CRISPRi are mostly available on the K562 cancer cell line. The BENGI reference set is genome-wide but likely contains many false positives. This study therefore calls for new reliable and genome-wide E/G reference data rather than new functional link E/G identification methods.
Collapse
|
97
|
Kelly DE, Ramdas S, Ma R, Rawlings-Goss RA, Grant GR, Ranciaro A, Hirbo JB, Beggs W, Yeager M, Chanock S, Nyambo TB, Omar SA, Woldemeskel D, Belay G, Li H, Brown CD, Tishkoff SA. The genetic and evolutionary basis of gene expression variation in East Africans. Genome Biol 2023; 24:35. [PMID: 36829244 PMCID: PMC9951478 DOI: 10.1186/s13059-023-02874-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2022] [Accepted: 02/13/2023] [Indexed: 02/26/2023] Open
Abstract
BACKGROUND Mapping of quantitative trait loci (QTL) associated with molecular phenotypes is a powerful approach for identifying the genes and molecular mechanisms underlying human traits and diseases, though most studies have focused on individuals of European descent. While important progress has been made to study a greater diversity of human populations, many groups remain unstudied, particularly among indigenous populations within Africa. To better understand the genetics of gene regulation in East Africans, we perform expression and splicing QTL mapping in whole blood from a cohort of 162 diverse Africans from Ethiopia and Tanzania. We assess replication of these QTLs in cohorts of predominantly European ancestry and identify candidate genes under selection in human populations. RESULTS We find the gene regulatory architecture of African and non-African populations is broadly shared, though there is a considerable amount of variation at individual loci across populations. Comparing our analyses to an equivalently sized cohort of European Americans, we find that QTL mapping in Africans improves the detection of expression QTLs and fine-mapping of causal variation. Integrating our QTL scans with signatures of natural selection, we find several genes related to immunity and metabolism that are highly differentiated between Africans and non-Africans, as well as a gene associated with pigmentation. CONCLUSION Extending QTL mapping studies beyond European ancestry, particularly to diverse indigenous populations, is vital for a complete understanding of the genetic architecture of human traits and can reveal novel functional variation underlying human traits and disease.
Collapse
|
98
|
Zhong L, Zheng M, Huang Y, Jiang T, Yang B, Huang L, Ma J. An atlas of expression quantitative trait loci of microRNA in longissimus muscle of eight-way crossbred pigs. J Genet Genomics 2023:S1673-8527(23)00046-2. [PMID: 36822265 DOI: 10.1016/j.jgg.2023.02.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2022] [Revised: 02/03/2023] [Accepted: 02/05/2023] [Indexed: 02/24/2023]
Abstract
MicroRNAs (miRNAs) are key regulators of myocyte development and traits, yet insight into the genetic basis of variation in miRNA expression is still limited. Here, we present a systematic analysis of expression quantitative trait loci (eQTL) for miRNA profiling in longissimus muscle of pigs from an eight-breed crossed heterogeneous population. By integrating whole-genome sequencing and miRNAomics data, we map 54 cis- and 292 trans-eQTLs at high resolution that are associated with the expression of 54 and 92 miRNAs, respectively. Twenty-three trans-acting loci are identified to affect the expression of nine myomiRs (known muscle-specific miRNAs). MiRNAs in mammalian conserved miRNA clusters are found to be subjected to regulation by shared cis-eQTLs, while the expression of mature miRNA-5p/-3p counterparts is more likely to be regulated by different cis-eQTLs. Fine mapping and bioinformatics analyses pinpoint the peak cis-eSNP of miR-4331-5p, rs344650810, which is located in its seed region, as a causal variant for the changes in expression and function of this miRNA. Additionally, rs344650810 is significantly (P < 0.01) correlated with the density and percentage of type I muscle fibers. Altogether, this study provides a comprehensive atlas of miRNA-eQTLs in porcine skeletal muscle and new insights into regulatory mechanisms of miRNA expression.
Collapse
|
99
|
Behera S, LeFaive J, Orchard P, Mahmoud M, Paulin LF, Farek J, Soto DC, Parker SCJ, Smith AV, Dennis MY, Zook JM, Sedlazeck FJ. FixItFelix: improving genomic analysis by fixing reference errors. Genome Biol 2023; 24:31. [PMID: 36810122 PMCID: PMC9942314 DOI: 10.1186/s13059-023-02863-7] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2022] [Accepted: 01/20/2023] [Indexed: 02/23/2023] Open
Abstract
The current version of the human reference genome, GRCh38, contains a number of errors including 1.2 Mbp of falsely duplicated and 8.04 Mbp of collapsed regions. These errors impact the variant calling of 33 protein-coding genes, including 12 with medical relevance. Here, we present FixItFelix, an efficient remapping approach, together with a modified version of the GRCh38 reference genome that improves the subsequent analysis across these genes within minutes for an existing alignment file while maintaining the same coordinates. We showcase these improvements over multi-ethnic control samples, demonstrating improvements for population variant calling as well as eQTL studies.
Collapse
|
100
|
Wu Y, Zhang CY, Wang L, Li Y, Xiao X. Genetic Insights of Schizophrenia via Single Cell RNA-Sequencing Analyses. Schizophr Bull 2023:7048714. [PMID: 36805283 DOI: 10.1093/schbul/sbad002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 02/22/2023]
Abstract
BACKGROUND Schizophrenia is a complex and heterogeneous disorder involving multiple regions and types of cells in the brain. Despite rapid progress made by genome-wide association studies (GWAS) of schizophrenia, the mechanisms of the illness underlying the GWAS significant loci remain less clear. STUDY DESIGN We investigated schizophrenia risk genes using summary-data-based Mendelian randomization based on single-cell sequencing data, and explored the types of brain cells involved in schizophrenia through the expression weighted cell-type enrichment analysis. RESULTS We identified 54 schizophrenia risk genes (two-thirds of these genes were not identified using sequencing data of bulk tissues) using single-cell RNA-sequencing data. Further cell type enrichment analysis showed that schizophrenia risk genes were highly expressed in excitatory neurons and caudal ganglionic eminence interneurons, suggesting putative roles of these cells in the pathogenesis of schizophrenia. We also found that these risk genes identified using single-cell sequencing results could form a large protein-protein interaction network with genes affected by disease-causing rare variants. CONCLUSIONS Through integrative analyses using expression data at single-cell levels, we identified 54 risk genes associated with schizophrenia. Notably, many of these genes were only identified using single-cell RNA-sequencing data, and their altered expression levels in particular types of cells, rather than in the bulk tissues, were related to the increased risk of schizophrenia. Our results provide novel insight into the biological mechanisms of schizophrenia, and future single-cell studies are necessary to further facilitate the understanding of the disorder.
Collapse
|