1
|
Genome-wide association meta-analysis of spontaneous coronary artery dissection identifies risk variants and genes related to artery integrity and tissue-mediated coagulation. Nat Genet 2023; 55:964-972. [PMID: 37248441 PMCID: PMC10260398 DOI: 10.1038/s41588-023-01410-1] [Citation(s) in RCA: 12] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2022] [Accepted: 04/26/2023] [Indexed: 05/31/2023]
Abstract
Spontaneous coronary artery dissection (SCAD) is an understudied cause of myocardial infarction primarily affecting women. It is not known to what extent SCAD is genetically distinct from other cardiovascular diseases, including atherosclerotic coronary artery disease (CAD). Here we present a genome-wide association meta-analysis (1,917 cases and 9,292 controls) identifying 16 risk loci for SCAD. Integrative functional annotations prioritized genes that are likely to be regulated in vascular smooth muscle cells and artery fibroblasts and implicated in extracellular matrix biology. One locus containing the tissue factor gene F3, which is involved in blood coagulation cascade initiation, appears to be specific for SCAD risk. Several associated variants have diametrically opposite associations with CAD, suggesting that shared biological processes contribute to both diseases, but through different mechanisms. We also infer a causal role for high blood pressure in SCAD. Our findings provide novel pathophysiological insights involving arterial integrity and tissue-mediated coagulation in SCAD and set the stage for future specific therapeutics and preventions.
Collapse
|
2
|
Exploring the Genetic Architecture of Spontaneous Coronary Artery Dissection Using Whole-Genome Sequencing. Circ Genom Precis Med 2022; 15:e003527. [PMID: 35583931 PMCID: PMC9388555 DOI: 10.1161/circgen.121.003527] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Background:
Spontaneous coronary artery dissection (SCAD) is a cause of acute coronary syndrome that predominantly affects women. Its pathophysiology remains unclear but connective tissue disorders (CTD) and other vasculopathies have been observed in many SCAD patients. A genetic component for SCAD is increasingly appreciated, although few genes have been robustly implicated. We sought to clarify the genetic cause of SCAD using targeted and genome-wide methods in a cohort of sporadic cases to identify both common and rare disease-associated variants.
Methods:
A cohort of 91 unrelated sporadic SCAD cases was investigated for rare, deleterious variants in genes associated with either SCAD or CTD, while new candidate genes were sought using rare variant collapsing analysis and identification of novel loss-of-function variants in genes intolerant to such variation. Finally, 2 SCAD polygenic risk scores were applied to assess the contribution of common variants.
Results:
We identified 10 cases with at least one rare, likely disease-causing variant in CTD-associated genes, although only one had a CTD phenotype. No genes were significantly associated with SCAD from genome-wide collapsing analysis, however, enrichment for TGF (transforming growth factor)-β signaling pathway genes was found with analysis of 24 genes harboring novel loss-of-function variants. Both polygenic risk scores demonstrated that sporadic SCAD cases have a significantly elevated genetic SCAD risk compared with controls.
Conclusions:
SCAD shares some genetic overlap with CTD, even in the absence of any major CTD phenotype. Consistent with a complex genetic architecture, SCAD patients also have a higher burden of common variants than controls.
Collapse
|
3
|
Contribution of NOTCH1 genetic variants to bicuspid aortic valve and other congenital lesions. Heart 2022; 108:1114-1120. [PMID: 35288444 PMCID: PMC9240330 DOI: 10.1136/heartjnl-2021-320428] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/10/2021] [Accepted: 01/18/2022] [Indexed: 12/05/2022] Open
Abstract
INTRODUCTION Bicuspid aortic valve (BAV) affects 1% of the general population. NOTCH1 was the first gene associated with BAV. The proportion of familial and sporadic BAV disease attributed to NOTCH1 mutations has not been estimated. AIM The aim of our study was to provide an estimate of familial and sporadic BAV disease attributable to NOTCH1 mutations. METHODS The population of our study consisted of participants of the University of Leicester Bicuspid aoRtic vAlVe gEnetic research-8 pedigrees with multiple affected family members and 381 sporadic patients. All subjects underwent NOTCH1 sequencing. A systematic literature search was performed in the NCBI PubMed database to identify publications reporting NOTCH1 sequencing in context of congenital heart disease. RESULTS NOTCH1 sequencing in 36 subjects from 8 pedigrees identified one variant c.873C>G/p.Tyr291* meeting the American College of Medical Genetics and Genomics criteria for pathogenicity. No pathogenic or likely pathogenic NOTCH1 variants were identified in 381 sporadic patients. Literature review identified 64 relevant publication reporting NOTCH1 sequencing in 528 pedigrees and 9449 sporadic subjects. After excluding families with syndromic disease pathogenic and likely pathogenic NOTCH1 variants were detected in 9/435 (2.1%; 95% CI: 0.7% to 3.4%) of pedigrees and between 0.05% (95% CI: 0.005% to 0.10%) and 0.08% (95% CI: 0.02% to 0.13%) of sporadic patients. Incomplete penetrance of definitely pathogenic NOTCH1 mutations was observed in almost half of reported pedigrees. CONCLUSIONS Pathogenic and likely pathogenic NOTCH1 genetic variants explain 2% of familial and <0.1% of sporadic BAV disease and are more likely to associate with tetralogy of Fallot and hypoplastic left heart.
Collapse
|
4
|
Polygenic basis and biomedical consequences of telomere length variation. Nat Genet 2021; 53:1425-1433. [PMID: 34611362 PMCID: PMC8492471 DOI: 10.1038/s41588-021-00944-6] [Citation(s) in RCA: 116] [Impact Index Per Article: 38.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2021] [Accepted: 08/18/2021] [Indexed: 02/08/2023]
Abstract
Telomeres, the end fragments of chromosomes, play key roles in cellular proliferation and senescence. Here we characterize the genetic architecture of naturally occurring variation in leukocyte telomere length (LTL) and identify causal links between LTL and biomedical phenotypes in 472,174 well-characterized UK Biobank participants. We identified 197 independent sentinel variants associated with LTL at 138 genomic loci (108 new). Genetically determined differences in LTL were associated with multiple biological traits, ranging from height to bone marrow function, as well as several diseases spanning neoplastic, vascular and inflammatory pathologies. Finally, we estimated that, at the age of 40 years, people with an LTL >1 s.d. shorter than the population mean had a 2.5-year-lower life expectancy compared with the group with ≥1 s.d. longer LDL. Overall, we furnish new insights into the genetic regulation of LTL, reveal wide-ranging influences of LTL on physiological traits, diseases and longevity, and provide a powerful resource available to the global research community.
Collapse
|
5
|
Spontaneous Coronary Artery Dissection: Insights on Rare Genetic Variation From Genome Sequencing. CIRCULATION-GENOMIC AND PRECISION MEDICINE 2020; 13:e003030. [PMID: 33125268 PMCID: PMC7748045 DOI: 10.1161/circgen.120.003030] [Citation(s) in RCA: 38] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Supplemental Digital Content is available in the text. Spontaneous coronary artery dissection (SCAD) occurs when an epicardial coronary artery is narrowed or occluded by an intramural hematoma. SCAD mainly affects women and is associated with pregnancy and systemic arteriopathies, particularly fibromuscular dysplasia. Variants in several genes, such as those causing connective tissue disorders, have been implicated; however, the genetic architecture is poorly understood. Here, we aim to better understand the diagnostic yield of rare variant genetic testing among a cohort of SCAD survivors and to identify genes or gene sets that have a significant enrichment of rare variants.
Collapse
|
6
|
Novel loss of function mutation in NOTCH1 in a family with bicuspid aortic valve, ventricular septal defect, thoracic aortic aneurysm, and aortic valve stenosis. Mol Genet Genomic Med 2020; 8:e1437. [PMID: 32720365 PMCID: PMC7549557 DOI: 10.1002/mgg3.1437] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2020] [Revised: 07/01/2020] [Accepted: 07/09/2020] [Indexed: 12/18/2022] Open
Abstract
Background Bicuspid aortic valve is the most common congenital valvular heart defect in the general population. BAV is associated with significant morbidity due to valve failure, formation of thoracic aortic aneurysm, and increased risk of infective endocarditis and aortic dissection. Loss of function mutations in NOTCH1 (OMIM 190198) has previously been associated with congenital heart disease involving the aortic valve, left ventricle outflow tract, and mitral valve that segregates in affected pedigrees as an autosomal dominant trait with variable expressivity. Methods We performed whole‐exome sequencing in four members of a three‐generational family (three affected and one unaffected subject) with clinical phenotypes including aortic valve stenosis, thoracic aortic aneurysm, and ventricular septal defect. Results We identified 16 potentially damaging genetic variants (one stop variant, one splice variant, and 14 missense variants) cosegregating with the phenotype. Of these variants, the nonsense mutation (p.Tyr291*) in NOTCH1 was the most deleterious variant identified and the most likely variant causing the disease. Conclusion Inactivating NOTCH1 mutations are a rare cause of familial heart disease involving predominantly left ventricular outflow tract lesions and characterized by the heterogeneity of clinical phenotype.
Collapse
|
7
|
Rare loss-of-function mutations of PTGIR are enriched in fibromuscular dysplasia. Cardiovasc Res 2020; 117:1154-1165. [PMID: 32531060 PMCID: PMC7983006 DOI: 10.1093/cvr/cvaa161] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/22/2020] [Revised: 05/14/2020] [Accepted: 06/07/2020] [Indexed: 12/12/2022] Open
Abstract
Aims Fibromuscular dysplasia (FMD) and spontaneous coronary artery dissection (SCAD) are related, non-atherosclerotic arterial diseases mainly affecting middle-aged women. Little is known about their physiopathological mechanisms. We aimed to identify rare genetic causes to elucidate molecular mechanisms implicated in FMD and SCAD. Methods and results We analysed 29 exomes that included familial and sporadic FMD. We identified one rare loss-of-function variant (LoF) (frequencygnomAD = 0.000075) shared by two FMD sisters in the prostaglandin I2 receptor gene (PTGIR), a key player in vascular remodelling. Follow-up was conducted by targeted or Sanger sequencing (1071 FMD and 363 SCAD patients) or lookups in exome (264 FMD) or genome sequences (480 SCAD), all independent and unrelated. It revealed four additional LoF allele carriers, in addition to several rare missense variants, among FMD patients, and two LoF allele carriers among SCAD patients, including one carrying a rare splicing mutation (c.768 + 1C>G). We used burden test to test for enrichment in patients compared to gnomAD controls, which detected a putative enrichment in FMD (PTRAPD = 8 × 10−4), but not a significant enrichment (PTRAPD = 0.12) in SCAD. The biological effects of variants on human prostaclycin receptor (hIP) signalling and protein expression were characterized using transient overexpression in human cells. We confirmed the LoFs (Q163X and P17RfsX6) and one missense (L67P), identified in one FMD and one SCAD patient, to severely impair hIP function in vitro. Conclusions Our study shows that rare genetic mutations in PTGIR are enriched among FMD patients and found in SCAD patients, suggesting a role for prostacyclin signalling in non-atherosclerotic stenosis and dissection.
Collapse
|
8
|
Genome-wide Association Analysis in Humans Links Nucleotide Metabolism to Leukocyte Telomere Length. Am J Hum Genet 2020; 106:389-404. [PMID: 32109421 PMCID: PMC7058826 DOI: 10.1016/j.ajhg.2020.02.006] [Citation(s) in RCA: 98] [Impact Index Per Article: 24.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2019] [Accepted: 02/10/2020] [Indexed: 01/02/2023] Open
Abstract
Leukocyte telomere length (LTL) is a heritable biomarker of genomic aging. In this study, we perform a genome-wide meta-analysis of LTL by pooling densely genotyped and imputed association results across large-scale European-descent studies including up to 78,592 individuals. We identify 49 genomic regions at a false dicovery rate (FDR) < 0.05 threshold and prioritize genes at 31, with five highlighting nucleotide metabolism as an important regulator of LTL. We report six genome-wide significant loci in or near SENP7, MOB1B, CARMIL1, PRRC2A, TERF2, and RFWD3, and our results support recently identified PARP1, POT1, ATM, and MPHOSPH6 loci. Phenome-wide analyses in >350,000 UK Biobank participants suggest that genetically shorter telomere length increases the risk of hypothyroidism and decreases the risk of thyroid cancer, lymphoma, and a range of proliferative conditions. Our results replicate previously reported associations with increased risk of coronary artery disease and lower risk for multiple cancer types. Our findings substantially expand current knowledge on genes that regulate LTL and their impact on human health and disease.
Collapse
|
9
|
|
10
|
157 Myeloid expression of trib1 regulates the polarisation state of tissue resident macrophages that has consequences on plasma lipid and metabolic homeostasis. BRITISH HEART JOURNAL 2017. [DOI: 10.1136/heartjnl-2017-311726.156] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
|
11
|
Meta-Analysis of Genome-Wide Association Studies for Abdominal Aortic Aneurysm Identifies Four New Disease-Specific Risk Loci. Circ Res 2016; 120:341-353. [PMID: 27899403 PMCID: PMC5253231 DOI: 10.1161/circresaha.116.308765] [Citation(s) in RCA: 130] [Impact Index Per Article: 16.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/23/2016] [Revised: 10/28/2016] [Accepted: 11/21/2016] [Indexed: 02/06/2023]
Abstract
Supplemental Digital Content is available in the text. Rationale: Abdominal aortic aneurysm (AAA) is a complex disease with both genetic and environmental risk factors. Together, 6 previously identified risk loci only explain a small proportion of the heritability of AAA. Objective: To identify additional AAA risk loci using data from all available genome-wide association studies. Methods and Results: Through a meta-analysis of 6 genome-wide association study data sets and a validation study totaling 10 204 cases and 107 766 controls, we identified 4 new AAA risk loci: 1q32.3 (SMYD2), 13q12.11 (LINC00540), 20q13.12 (near PCIF1/MMP9/ZNF335), and 21q22.2 (ERG). In various database searches, we observed no new associations between the lead AAA single nucleotide polymorphisms and coronary artery disease, blood pressure, lipids, or diabetes mellitus. Network analyses identified ERG, IL6R, and LDLR as modifiers of MMP9, with a direct interaction between ERG and MMP9. Conclusions: The 4 new risk loci for AAA seem to be specific for AAA compared with other cardiovascular diseases and related traits suggesting that traditional cardiovascular risk factor management may only have limited value in preventing the progression of aneurysmal disease.
Collapse
|
12
|
Prediction of Causal Candidate Genes in Coronary Artery Disease Loci. Arterioscler Thromb Vasc Biol 2015; 35:2207-17. [PMID: 26293461 DOI: 10.1161/atvbaha.115.306108] [Citation(s) in RCA: 78] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2015] [Accepted: 05/05/2015] [Indexed: 12/29/2022]
Abstract
OBJECTIVE Genome-wide association studies have to date identified 159 significant and suggestive loci for coronary artery disease (CAD). We now report comprehensive bioinformatics analyses of sequence variation in these loci to predict candidate causal genes. APPROACH AND RESULTS All annotated genes in the loci were evaluated with respect to protein-coding single-nucleotide polymorphism and gene expression parameters. The latter included expression quantitative trait loci, tissue specificity, and miRNA binding. High priority candidate genes were further identified based on literature searches and our experimental data. We conclude that the great majority of causal variations affecting CAD risk occur in noncoding regions, with 41% affecting gene expression robustly versus 6% leading to amino acid changes. Many of these genes differed from the traditionally annotated genes, which was usually based on proximity to the lead single-nucleotide polymorphism. Indeed, we obtained evidence that genetic variants at CAD loci affect 98 genes which had not been linked to CAD previously. CONCLUSIONS Our results substantially revise the list of likely candidates for CAD and suggest that genome-wide association studies efforts in other diseases may benefit from similar bioinformatics analyses.
Collapse
|
13
|
Abstract
BACKGROUND The nature and underlying mechanisms of an inverse association between adult height and the risk of coronary artery disease (CAD) are unclear. METHODS We used a genetic approach to investigate the association between height and CAD, using 180 height-associated genetic variants. We tested the association between a change in genetically determined height of 1 SD (6.5 cm) with the risk of CAD in 65,066 cases and 128,383 controls. Using individual-level genotype data from 18,249 persons, we also examined the risk of CAD associated with the presence of various numbers of height-associated alleles. To identify putative mechanisms, we analyzed whether genetically determined height was associated with known cardiovascular risk factors and performed a pathway analysis of the height-associated genes. RESULTS We observed a relative increase of 13.5% (95% confidence interval [CI], 5.4 to 22.1; P<0.001) in the risk of CAD per 1-SD decrease in genetically determined height. There was a graded relationship between the presence of an increased number of height-raising variants and a reduced risk of CAD (odds ratio for height quartile 4 versus quartile 1, 0.74; 95% CI, 0.68 to 0.84; P<0.001). Of the 12 risk factors that we studied, we observed significant associations only with levels of low-density lipoprotein cholesterol and triglycerides (accounting for approximately 30% of the association). We identified several overlapping pathways involving genes associated with both development and atherosclerosis. CONCLUSIONS There is a primary association between a genetically determined shorter height and an increased risk of CAD, a link that is partly explained by the association between shorter height and an adverse lipid profile. Shared biologic processes that determine achieved height and the development of atherosclerosis may explain some of the association. (Funded by the British Heart Foundation and others.).
Collapse
|
14
|
Screening in silico predicted remotely acting NF1 gene regulatory elements for mutations in patients with neurofibromatosis type 1. Hum Genomics 2013; 7:18. [PMID: 23947441 PMCID: PMC3750751 DOI: 10.1186/1479-7364-7-18] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2013] [Accepted: 08/11/2013] [Indexed: 11/10/2022] Open
Abstract
Neurofibromatosis type 1 (NF1), a neuroectodermal disorder, is caused by germline mutations in the NF1 gene. NF1 affects approximately 1/3,000 individuals worldwide, with about 50% of cases representing de novo mutations. Although the NF1 gene was identified in 1990, the underlying gene mutations still remain undetected in a small but obdurate minority of NF1 patients. We postulated that in these patients, hitherto undetected pathogenic mutations might occur in regulatory elements far upstream of the NF1 gene. In an attempt to identify such remotely acting regulatory elements, we reasoned that some of them might reside within DNA sequences that (1) have the potential to interact at distance with the NF1 gene and (2) lie within a histone H3K27ac-enriched region, a characteristic of active enhancers. Combining Hi-C data, obtained by means of the chromosome conformation capture technique, with data on the location and level of histone H3K27ac enrichment upstream of the NF1 gene, we predicted in silico the presence of two remotely acting regulatory regions, located, respectively, approximately 600 kb and approximately 42 kb upstream of the NF1 gene. These regions were then sequenced in 47 NF1 patients in whom no mutations had been found in either the NF1 or SPRED1 gene regions. Five patients were found to harbour DNA sequence variants in the distal H3K27ac-enriched region. Although these variants are of uncertain pathological significance and still remain to be functionally characterized, this approach promises to be of general utility for the detection of mutations underlying other inherited disorders that may be caused by mutations in remotely acting regulatory elements.
Collapse
|
15
|
Comparative analysis of genome sequences covering the seven cronobacter species. PLoS One 2012; 7:e49455. [PMID: 23166675 PMCID: PMC3500316 DOI: 10.1371/journal.pone.0049455] [Citation(s) in RCA: 93] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2012] [Accepted: 10/09/2012] [Indexed: 11/30/2022] Open
Abstract
BACKGROUND Species of Cronobacter are widespread in the environment and are occasional food-borne pathogens associated with serious neonatal diseases, including bacteraemia, meningitis, and necrotising enterocolitis. The genus is composed of seven species: C. sakazakii, C. malonaticus, C. turicensis, C. dublinensis, C. muytjensii, C. universalis, and C. condimenti. Clinical cases are associated with three species, C. malonaticus, C. turicensis and, in particular, with C. sakazakii multilocus sequence type 4. Thus, it is plausible that virulence determinants have evolved in certain lineages. METHODOLOGY/PRINCIPAL FINDINGS We generated high quality sequence drafts for eleven Cronobacter genomes representing the seven Cronobacter species, including an ST4 strain of C. sakazakii. Comparative analysis of these genomes together with the two publicly available genomes revealed Cronobacter has over 6,000 genes in one or more strains and over 2,000 genes shared by all Cronobacter. Considerable variation in the presence of traits such as type six secretion systems, metal resistance (tellurite, copper and silver), and adhesins were found. C. sakazakii is unique in the Cronobacter genus in encoding genes enabling the utilization of exogenous sialic acid which may have clinical significance. The C. sakazakii ST4 strain 701 contained additional genes as compared to other C. sakazakii but none of them were known specific virulence-related genes. CONCLUSIONS/SIGNIFICANCE Genome comparison revealed that pair-wise DNA sequence identity varies between 89 and 97% in the seven Cronobacter species, and also suggested various degrees of divergence. Sets of universal core genes and accessory genes unique to each strain were identified. These gene sequences can be used for designing genus/species specific detection assays. Genes encoding adhesins, T6SS, and metal resistance genes as well as prophages are found in only subsets of genomes and have contributed considerably to the variation of genomic content. Differences in gene content likely contribute to differences in the clinical and environmental distribution of species and sequence types.
Collapse
|
16
|
Exploring the somatic NF1 mutational spectrum associated with NF1 cutaneous neurofibromas. Eur J Hum Genet 2011; 20:411-9. [PMID: 22108604 DOI: 10.1038/ejhg.2011.207] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Neurofibromatosis type-1 (NF1), caused by heterozygous inactivation of the NF1 tumour suppressor gene, is associated with the development of benign and malignant peripheral nerve sheath tumours (MPNSTs). Although numerous germline NF1 mutations have been identified, relatively few somatic NF1 mutations have been described in neurofibromas. Here we have screened 109 cutaneous neurofibromas, excised from 46 unrelated NF1 patients, for somatic NF1 mutations. NF1 mutation screening (involving loss-of-heterozygosity (LOH) analysis, multiplex ligation-dependent probe amplification and DNA sequencing) identified 77 somatic NF1 point mutations, of which 53 were novel. LOH spanning the NF1 gene region was evident in 25 neurofibromas, but in contrast to previous data from MPNSTs, it was absent at the TP53, CDKN2A and RB1 gene loci. Analysis of DNA/RNA from neurofibroma-derived Schwann cell cultures revealed NF1 mutations in four tumours whose presence had been overlooked in the tumour DNA. Bioinformatics analysis suggested that four of seven novel somatic NF1 missense mutations (p.A330T, p.Q519P, p.A776T, p.S1463F) could be of functional/clinical significance. Functional analysis confirmed this prediction for p.S1463F, located within the GTPase-activating protein-related domain, as this mutation resulted in a 150-fold increase in activated GTP-bound Ras. Comparison of the relative frequencies of the different types of somatic NF1 mutation observed with those of their previously reported germline counterparts revealed significant (P=0.001) differences. Although non-identical somatic mutations involving either the same or adjacent nucleotides were identified in three pairs of tumours from the same patients (P<0.0002), no association was noted between the type of germline and somatic NF1 lesion within the same individual.
Collapse
|
17
|
A meta-analysis of single base-pair substitutions in translational termination codons ('nonstop' mutations) that cause human inherited disease. Hum Genomics 2011; 5:241-64. [PMID: 21712188 PMCID: PMC3525242 DOI: 10.1186/1479-7364-5-4-241] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
'Nonstop' mutations are single base-pair substitutions that occur within translational termination (stop) codons and which can lead to the continued and inappropriate translation of the mRNA into the 3'-untranslated region. We have performed a meta-analysis of the 119 nonstop mutations (in 87 different genes) known to cause human inherited disease, examining the sequence context of the mutated stop codons and the average distance to the next alternative in-frame stop codon downstream, in comparison with their counterparts from control (non-mutated) gene sequences. A paucity of alternative in-frame stop codons was noted in the immediate vicinity (0-49 nucleotides downstream) of the mutated stop codons as compared with their control counterparts (p = 7.81 × 10-4). This implies that at least some nonstop mutations with alternative stop codons in close proximity will not have come to clinical attention, possibly because they will have given rise to stable mRNAs (not subject to nonstop mRNA decay) that are translatable into proteins of near-normal length and biological function. A significant excess of downstream in-frame stop codons was, however, noted in the range 150-199 nucleotides from the mutated stop codon (p = 8.55 × 10-4). We speculate that recruitment of an alternative stop codon at greater distance from the mutated stop codon may trigger nonstop mRNA decay, thereby decreasing the amount of protein product and yielding a readily discernible clinical phenotype. Confirmation or otherwise of this postulate must await the emergence of a clearer understanding of the mechanism of nonstop mRNA decay in mammalian cells.
Collapse
|
18
|
In Silico identification of pathogenic strains of Cronobacter from Biochemical data reveals association of inositol fermentation with pathogenicity. BMC Microbiol 2011; 11:204. [PMID: 21933417 PMCID: PMC3188490 DOI: 10.1186/1471-2180-11-204] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2011] [Accepted: 09/20/2011] [Indexed: 11/11/2022] Open
Abstract
Background Cronobacter, formerly known as Enterobacter sakazakii, is a food-borne pathogen known to cause neonatal meningitis, septicaemia and death. Current diagnostic tests for identification of Cronobacter do not differentiate between species, necessitating time consuming 16S rDNA gene sequencing or multilocus sequence typing (MLST). The organism is ubiquitous, being found in the environment and in a wide range of foods, although there is variation in pathogenicity between Cronobacter isolates and between species. Therefore to be able to differentiate between the pathogenic and non-pathogenic strains is of interest to the food industry and regulators. Results Here we report the use of Expectation Maximization clustering to categorise 98 strains of Cronobacter as pathogenic or non-pathogenic based on biochemical test results from standard diagnostic test kits. Pathogenicity of a strain was postulated on the basis of either pathogenic symptoms associated with strain source or corresponding MLST sequence types, allowing the clusters to be labelled as containing either pathogenic or non-pathogenic strains. The resulting clusters gave good differentiation of strains into pathogenic and non-pathogenic groups, corresponding well to isolate source and MLST sequence type. The results also revealed a potential association between pathogenicity and inositol fermentation. An investigation of the genomes of Cronobacter sakazakii and C. turicensis revealed the gene for inositol monophosphatase is associated with putative virulence factors in pathogenic strains of Cronobacter. Conclusions We demonstrated a computational approach allowing existing diagnostic kits to be used to identify pathogenic strains of Cronobacter. The resulting clusters correlated well with MLST sequence types and revealed new information about the pathogenicity of Cronobacter species.
Collapse
|
19
|
Comparative analysis of germline and somatic microlesion mutational spectra in 17 human tumor suppressor genes. Hum Mutat 2011; 32:620-32. [PMID: 21432943 DOI: 10.1002/humu.21483] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2010] [Accepted: 02/07/2011] [Indexed: 12/17/2022]
Abstract
Mutations associated with tumorigenesis may either arise somatically or can be inherited through the germline. We performed a comparison of somatic, germline, shared (found in both soma and germline) and somatic recurrent mutational spectra for 17 human tumor suppressor genes, which focused upon missense single base-pair substitutions and microdeletions/microinsertions. Somatic and germline mutational spectra were similar in relation to C.G>T.A transitions but differed with respect to the frequency of A.T>G.C, A.T>T.A, and C.G>A.T substitutions. Shared missense mutations were characterized by higher mutability rates, greater physicochemical differences between wild-type and mutant residues, and a tendency to occur in evolutionarily conserved residues and within CpG/CpHpG oligonucleotides. Mononucleotide runs (≥4 bp) were identified as hotspots for shared microdeletions/microinsertions. Both germline and somatic microdeletions/microinsertions were found to be significantly overrepresented within the "indel-hotspot" motif, GTAAGT. Using a naïve Bayes' classifier trained to discriminate between five missense mutation groups, 63% of mutations in our dataset were on average correctly recognized. Applying this classifier to an independent dataset of probable driver mutations, we concluded that ∼50% of these somatic missense mutations possess features consistent with their being either shared or recurrent, suggesting that a disproportionate number of such lesions are likely to be drivers of tumorigenesis.
Collapse
|
20
|
Prediction of glycosylation sites using random forests. BMC Bioinformatics 2008; 9:500. [PMID: 19038042 PMCID: PMC2651179 DOI: 10.1186/1471-2105-9-500] [Citation(s) in RCA: 158] [Impact Index Per Article: 9.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2008] [Accepted: 11/27/2008] [Indexed: 12/22/2022] Open
Abstract
BACKGROUND Post translational modifications (PTMs) occur in the vast majority of proteins and are essential for function. Prediction of the sequence location of PTMs enhances the functional characterisation of proteins. Glycosylation is one type of PTM, and is implicated in protein folding, transport and function. RESULTS We use the random forest algorithm and pairwise patterns to predict glycosylation sites. We identify pairwise patterns surrounding glycosylation sites and use an odds ratio to weight their propensity of association with modified residues. Our prediction program, GPP (glycosylation prediction program), predicts glycosylation sites with an accuracy of 90.8% for Ser sites, 92.0% for Thr sites and 92.8% for Asn sites. This is significantly better than current glycosylation predictors. We use the trepan algorithm to extract a set of comprehensible rules from GPP, which provide biological insight into all three major glycosylation types. CONCLUSION We have created an accurate predictor of glycosylation sites and used this to extract comprehensible rules about the glycosylation process. GPP is available online at http://comp.chem.nottingham.ac.uk/glyco/.
Collapse
|