1
|
Postnikova O, Poliakov E, Golestaneh N, Rogozin IB, Redmond TM. Stable Intronic Sequences and Exon Skipping Events in the Human RPE65 Gene: Analysis of Expression in Retinal Pigment Epithelium Cells and Cell Culture Models. Front Genet 2019; 10:634. [PMID: 31379919 PMCID: PMC6658614 DOI: 10.3389/fgene.2019.00634] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2019] [Accepted: 06/17/2019] [Indexed: 12/04/2022] Open
Abstract
Currently, there is much interest in intronic sequence-containing long non-coding RNAs and the role of intronic transcription in regulation of cellular metabolism and fate. Several stable intronic sequence RNAs (sisRNAs) were recently implicated in regulation of parental genes. To investigate transcription from introns of the RPE65 gene, we analyzed RNA-seq and Nanopore sequencing data from different cell models of human retinal pigment epithelium (RPE) and native bovine RPE. We discovered putative stable poly-adenylated transcripts with sequences corresponding to intronic regions of the RPE65 gene in the cytoplasm of RPE cells. These stable intronic sequences could be important for RPE65 transcription, splicing or translation. We also analyzed alternative splicing events in RPE65. Frequent exon skipping events involving exons 2, 3, and 7 were detected. The rate of these events was much higher in human RPE cell cultures compared with native RPE , consistent with lack of translation of RPE65 mRNA in cell cultures.
Collapse
Affiliation(s)
- Olga Postnikova
- Laboratory of Retinal Cell & Molecular Biology, National Eye Institute, NIH, Bethesda, MD, United States
| | - Eugenia Poliakov
- Laboratory of Retinal Cell & Molecular Biology, National Eye Institute, NIH, Bethesda, MD, United States
| | - Nady Golestaneh
- Departments of Ophthalmology, Neurology, Biochemistry and Molecular & Cellular Biology, Georgetown University Medical Center, Washington, DC, United States
| | - Igor B Rogozin
- National Center for Biotechnology Information, National Library of Medicine, NIH, Bethesda, MD, United States
| | - T Michael Redmond
- Laboratory of Retinal Cell & Molecular Biology, National Eye Institute, NIH, Bethesda, MD, United States
| |
Collapse
|
2
|
Rogozin IB, Roche-Lima A, Lada AG, Belinky F, Sidorenko IA, Glazko GV, Babenko VN, Cooper DN, Pavlov YI. Nucleotide Weight Matrices Reveal Ubiquitous Mutational Footprints of AID/APOBEC Deaminases in Human Cancer Genomes. Cancers (Basel) 2019; 11:cancers11020211. [PMID: 30759888 PMCID: PMC6406962 DOI: 10.3390/cancers11020211] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2019] [Revised: 01/30/2019] [Accepted: 01/30/2019] [Indexed: 01/08/2023] Open
Abstract
Cancer genomes accumulate nucleotide sequence variations that number in the tens of thousands per genome. A prominent fraction of these mutations is thought to arise as a consequence of the off-target activity of DNA/RNA editing cytosine deaminases. These enzymes, collectively called activation induced deaminase (AID)/APOBECs, deaminate cytosines located within defined DNA sequence contexts. The resulting changes of the original C:G pair in these contexts (mutational signatures) provide indirect evidence for the participation of specific cytosine deaminases in a given cancer type. The conventional method used for the analysis of mutable motifs is the consensus approach. Here, for the first time, we have adopted the frequently used weight matrix (sequence profile) approach for the analysis of mutagenesis and provide evidence for this method being a more precise descriptor of mutations than the sequence consensus approach. We confirm that while mutational footprints of APOBEC1, APOBEC3A, APOBEC3B, and APOBEC3G are prominent in many cancers, mutable motifs characteristic of the action of the humoral immune response somatic hypermutation enzyme, AID, are the most widespread feature of somatic mutation spectra attributable to deaminases in cancer genomes. Overall, the weight matrix approach reveals that somatic mutations are significantly associated with at least one AID/APOBEC mutable motif in all studied cancers.
Collapse
Affiliation(s)
- Igor B Rogozin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894-6075, USA.
| | - Abiel Roche-Lima
- Center for Collaborative Research in Health Disparities⁻RCMI Program, Medical Sciences Campus, University of Puerto Rico, San Juan, Puerto Rico 00936-5067.
| | - Artem G Lada
- Department Microbiology and Molecular Genetics, University of California, Davis, CA 95616, USA.
| | - Frida Belinky
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894-6075, USA.
| | | | - Galina V Glazko
- Department of Biomedical Informatics, University of Arkansas for Medical Sciences, Little Rock, AR 72205, USA.
| | | | - David N Cooper
- Institute of Medical Genetics, Cardiff University, Cardiff CF14 4AY, UK.
| | - Youri I Pavlov
- Departments of Microbiology and Pathology; Biochemistry and Molecular Biology; Genetics, Cell Biology and Anatomy, University of Nebraska Medical Center, Omaha, NE 68198, USA.
- Eppley Institute for Research in Cancer and Allied Diseases, Omaha, NE 68198, USA.
| |
Collapse
|
3
|
Cheung PPH, Rogozin IB, Choy KT, Ng HY, Peiris JSM, Yen HL. Comparative mutational analyses of influenza A viruses. RNA (NEW YORK, N.Y.) 2015; 21:36-47. [PMID: 25404565 PMCID: PMC4274636 DOI: 10.1261/rna.045369.114] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/12/2023]
Abstract
The error-prone RNA-dependent RNA polymerase (RdRP) and external selective pressures are the driving forces for RNA viral diversity. When confounded by selective pressures, it is difficult to assess if influenza A viruses (IAV) that have a wide host range possess comparable or distinct spontaneous mutational frequency in their RdRPs. We used in-depth bioinformatics analyses to assess the spontaneous mutational frequencies of two RdRPs derived from human seasonal (A/Wuhan/359/95; Wuhan) and H5N1 (A/Vietnam/1203/04; VN1203) viruses using the mini-genome system with a common firefly luciferase reporter serving as the template. High-fidelity reverse transcriptase was applied to generate high-quality mutational spectra which allowed us to assess and compare the mutational frequencies and mutable motifs along a target sequence of the two RdRPs of two different subtypes. We observed correlated mutational spectra (τ correlation P < 0.0001), comparable mutational frequencies (H3N2:5.8 ± 0.9; H5N1:6.0 ± 0.5), and discovered a highly mutable motif "(A)AAG" for both Wuhan and VN1203 RdRPs. Results were then confirmed with two recombinant A/Puerto Rico/8/34 (PR8) viruses that possess RdRP derived from Wuhan or VN1203 (RG-PR8×Wuhan(PB2, PB1, PA, NP) and RG-PR8×VN1203(PB2, PB1, PA, NP)). Applying novel bioinformatics analysis on influenza mutational spectra, we provide a platform for a comprehensive analysis of the spontaneous mutation spectra for an RNA virus.
Collapse
Affiliation(s)
- Peter Pak-Hang Cheung
- School of Public Health, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Pokfulam, Hong Kong
| | - Igor B Rogozin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894-6075, USA
| | - Ka-Tim Choy
- School of Public Health, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Pokfulam, Hong Kong
| | - Hoi Yee Ng
- School of Public Health, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Pokfulam, Hong Kong
| | - Joseph Sriyal Malik Peiris
- School of Public Health, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Pokfulam, Hong Kong
| | - Hui-Ling Yen
- School of Public Health, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Pokfulam, Hong Kong
| |
Collapse
|
4
|
Hashimoto K, Rogozin IB, Panchenko AR. Oncogenic potential is related to activating effect of cancer single and double somatic mutations in receptor tyrosine kinases. Hum Mutat 2012; 33:1566-75. [PMID: 22753356 PMCID: PMC3465464 DOI: 10.1002/humu.22145] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2012] [Accepted: 05/29/2012] [Indexed: 01/16/2023]
Abstract
Aberrant activation of receptor tyrosine kinases (RTKs) is a common feature of many cancer cells. It was previously suggested that the mechanisms of kinase activation in cancer might be linked to transitions between active and inactive states. Here, we estimate the effects of single and double cancer mutations on the stability of active and inactive states of the kinase domains from different RTKs. We show that singleton cancer mutations destabilize active and inactive states; however, inactive states are destabilized more than the active ones, leading to kinase activation. We show that there exists a relationship between the estimate of oncogenic potential of cancer mutation and kinase activation. Namely, more frequent mutations have a higher activating effect, which might allow us to predict the activating effect of the mutations from the mutation spectra. Independent evolutionary analysis of mutation spectra complements this observation and finds the same frequency threshold defining mutation hotspots. We analyze double mutations and report a positive epistasis and additional advantage of doublets with respect to cancer cell fitness. The activation mechanisms of double mutations differ from those of single mutations and double mutation spectrum is found to be dissimilar to the mutation spectrum of singletons.
Collapse
Affiliation(s)
| | - Igor B. Rogozin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Anna R. Panchenko
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| |
Collapse
|
5
|
Duvvuri B, Duvvuri VRSK, Grigull J, Martin A, Pan-Hammarström Q, Wu GE, Larijani M. Altered spectrum of somatic hypermutation in common variable immunodeficiency disease characteristic of defective repair of mutations. Immunogenetics 2010; 63:1-11. [PMID: 20938659 DOI: 10.1007/s00251-010-0483-7] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2010] [Accepted: 09/25/2010] [Indexed: 01/08/2023]
Abstract
Pathogenic common variable immunodeficiency diseases (CVID) are genetic, usually inherited diseases for which a limited number of genetic defects have been implicated. As CVID presents with a wide range of clinical characteristics, there are likely diverse and for the most part unidentified genetic causes. In some individuals, defects in somatic hypermutation (SHM) have been suggested as the underlying cause of CVID. To address the mechanisms of SHM defects in CVID, we conducted a comprehensive mutational analysis of immunoglobulin heavy chain sequences from CVID patients. We identified several remarkably specific alterations in the spectra of SHM in comparison to healthy individuals. We provide evidence that some CVID cases are associated with defective repair of AID-induced mutations by the DNA mismatch repair (MMR) machinery. Our findings together with reports of increased chromosomal radiosensitivity and associated lymphoproliferative disorders amongst CVID patients, suggest that altered DNA damage repair may be a cause of CVID.
Collapse
Affiliation(s)
- Bhargavi Duvvuri
- The School of Kinesiology and Health Science, Faculty of Health, York University, Toronto, Ontario, Canada, M3J 1P3.
| | | | | | | | | | | | | |
Collapse
|
6
|
Asymptotic analysis of Bayesian generalization error with Newton diagram. Neural Netw 2010; 23:35-43. [DOI: 10.1016/j.neunet.2009.07.029] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2008] [Revised: 07/28/2009] [Accepted: 07/28/2009] [Indexed: 11/21/2022]
|
7
|
Rotskaya UN, Rogozin IB, Vasyunina EA, Kolosova NG, Malyarchuk BA, Nevinsky GA, Sinitsyna OI. Analysis of mitochondrial DNA somatic mutations in OXYS and Wistar strain rats. BIOCHEMISTRY (MOSCOW) 2009; 74:430-7. [DOI: 10.1134/s0006297909040117] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
|
8
|
Basu MK, Carmel L, Rogozin IB, Koonin EV. Evolution of protein domain promiscuity in eukaryotes. Genome Res 2008; 18:449-61. [PMID: 18230802 DOI: 10.1101/gr.6943508] [Citation(s) in RCA: 135] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Numerous eukaryotic proteins contain multiple domains. Certain domains show a tendency to occur in diverse domain architectures and can be considered "promiscuous." These promiscuous domains are, typically, involved in protein-protein interactions and play crucial roles in interaction networks, particularly those that contribute to signal transduction. A systematic comparative-genomic analysis of promiscuous domains in eukaryotes is described. Two quantitative measures of domain promiscuity are introduced and applied to the analysis of 28 genomes of diverse eukaryotes. Altogether, 215 domains are identified as strongly promiscuous. The fraction of promiscuous domains in animals is shown to be significantly greater than that in fungi or plants. Evolutionary reconstructions indicate that domain promiscuity is a volatile, relatively fast-changing feature of eukaryotic proteins, with few domains remaining promiscuous throughout the evolution of eukaryotes. Some domains appear to have attained promiscuity independently in different lineages, for example, animals and plants. It is proposed that promiscuous domains persist within a relatively small pool of evolutionarily stable domain combinations from which numerous rare architectures emerge during evolution. Domain promiscuity positively correlates with the number of experimentally detected domain interactions and with the strength of purifying selection affecting a domain. Thus, evolution of promiscuous domains seems to be constrained by the diversity of their interaction partners. The set of promiscuous domains is enriched for domains mediating protein-protein interactions that are involved in various forms of signal transduction, especially in the ubiquitin system and in chromatin. Thus, a limited repertoire of promiscuous domains makes a major contribution to the diversity and evolvability of eukaryotic proteomes and signaling networks.
Collapse
Affiliation(s)
- Malay Kumar Basu
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894, USA
| | | | | | | |
Collapse
|
9
|
Rogozin IB, Pavlov YI. The cytidine deaminase AID exhibits similar functional properties in yeast and mammals. Mol Immunol 2005; 43:1481-4. [PMID: 16219354 DOI: 10.1016/j.molimm.2005.09.002] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2005] [Indexed: 11/27/2022]
Abstract
A recent work published in Molecular Immunology examined the editing activity of activation-induced deaminase (AID) in yeast (Krause, K., Marcu, K.B., Greeve, J., 2006. The cytidine deaminases AID and APOBEC-1 exhibit distinct functional properties in a novel yeast selectable system. Mol. Immunol.). It was proposed that expression of AID in yeast is not sufficient for the generation of point mutations in a highly transcribed gene due to the lack of cofactors for AID-induced somatic hypermutation, which are unique to B cells. It was suggested that, on its own, AID does not have an intrinsic specificity for its target sequences. However, it has been shown previously that expression of the human AID gene in yeast was moderately mutagenic in a wild-type strain and highly mutagenic in an ung1 uracil-DNA glycosylase-deficient strain (Mayorov, V.I., Rogozin, I.B., Adkison, L.R., Frahm, C.R., Kunkel T.A., Pavlov Y. I., 2005. Expression of human AID in yeast induces mutations in context similar to the context of somatic hypermutation at G-C pairs in immunoglobulin genes. BMC Immunol. 6, 10; Poltoratsky, V.P., Wilson, S.H., Kunkel, T.A., Pavlov, Y.I., 2004. Recombinogenic phenotype of human activation-induced cytosine deaminase. J. Immunol. 172, 4308-4313). The vast majority of mutations were at G-C pairs. Mutations showed a clear DNA sequence context specificity which resembled the specificity of somatic hypermutation at G-C pairs in immunoglobulin genes and AID mutation specificity in vitro. The inability to detect mutator effects of AID by Krause et al. is likely to be caused by the use of the wild-type yeast strain and a small sample of clones examined for the presence of mutations. In addition, we show that non-uniformity of the mutation hotspot distribution is a factor potentially decreasing the chances of detecting mutations.
Collapse
Affiliation(s)
- Igor B Rogozin
- National Center for Biotechnology Information NLM, National Institutes of Health, 8600 Rockville Pike, Bldg. 38A/room 5N505A, Bethesda, MD 20894, USA.
| | | |
Collapse
|
10
|
Mayorov VI, Rogozin IB, Adkison LR, Gearhart PJ. DNA polymerase eta contributes to strand bias of mutations of A versus T in immunoglobulin genes. THE JOURNAL OF IMMUNOLOGY 2005; 174:7781-6. [PMID: 15944281 DOI: 10.4049/jimmunol.174.12.7781] [Citation(s) in RCA: 65] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Abstract
DNA polymerase (pol) eta participates in hypermutation of A:T bases in Ig genes because humans deficient for the polymerase have fewer substitutions of these bases. To determine whether polymerase eta is also responsible for the well-known preference for mutations of A vs T on the nontranscribed strand, we sequenced variable regions from three patients with xeroderma pigmentosum variant (XP-V) disease, who lack polymerase eta. The frequency of mutations in the intronic region downstream of rearranged J(H)4 gene segments was similar between XP-V and control clones; however, there were fewer mutations of A:T bases and correspondingly more substitutions of C:G bases in the XP-V clones (p < 10(-7)). There was significantly less of a bias for mutations of A compared with T nucleotides in the XP-V clones compared with control clones, whereas the frequencies for mutations of C and G were identical in both groups. An analysis of mutations in the WA sequence motif suggests that polymerase eta generates more mutations of A than T on the nontranscribed strand. This in vivo data from polymerase eta-deficient B cells correlates well with the in vitro specificity of the enzyme. Because polymerase eta inserts more mutations opposite template T than template A, it would generate more substitutions of A on the newly synthesized strand.
Collapse
Affiliation(s)
- Vladimir I Mayorov
- Department of Basic Medical Sciences, Mercer University School of Medicine, Macon, GA 31207, USA
| | | | | | | |
Collapse
|
11
|
Mayorov VI, Rogozin IB, Adkison LR, Frahm C, Kunkel TA, Pavlov YI. Expression of human AID in yeast induces mutations in context similar to the context of somatic hypermutation at G-C pairs in immunoglobulin genes. BMC Immunol 2005; 6:10. [PMID: 15949042 PMCID: PMC1180437 DOI: 10.1186/1471-2172-6-10] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2005] [Accepted: 06/10/2005] [Indexed: 12/02/2022] Open
Abstract
Background Antibody genes are diversified by somatic hypermutation (SHM), gene conversion and class-switch recombination. All three processes are initiated by the activation-induced deaminase (AID). According to a DNA deamination model of SHM, AID converts cytosine to uracil in DNA sequences. The initial deamination of cytosine leads to mutation and recombination in pathways involving replication, DNA mismatch repair and possibly base excision repair. The DNA sequence context of mutation hotspots at G-C pairs during SHM is DGYW/WRCH (G-C is a hotspot position, R = A/G, Y = T/C, W = A/T, D = A/G/T). Results To investigate the mechanisms of AID-induced mutagenesis in a model system, we studied the genetic consequences of AID expression in yeast. We constructed a yeast vector with an artificially synthesized human AID gene insert using codons common to highly expressed yeast genes. We found that expression of the artificial hAIDSc gene was moderately mutagenic in a wild-type strain and highly mutagenic in an ung1 uracil-DNA glycosylase-deficient strain. A majority of mutations were at G-C pairs. In the ung1 strain, C-G to T-A transitions were found almost exclusively, while a mixture of transitions with 12% transversions was characteristic in the wild-type strain. In the ung1 strain mutations that could have originated from deamination of the transcribed stand were found more frequently. In the wild-type strain, the strand bias was reversed. DGYW/WRCH motifs were preferential sites of mutations. Conclusion The results are consistent with the hypothesis that AID-mediated deamination of DNA is a major cause of mutations at G-C base pairs in immunoglobulin genes during SHM. The sequence contexts of mutations in yeast induced by AID and those of somatic mutations at G-C pairs in immunoglobulin genes are significantly similar. This indicates that the intrinsic substrate specificity of AID itself is a primary determinant of mutational hotspots at G-C base pairs during SHM.
Collapse
Affiliation(s)
| | - Igor B Rogozin
- National Center for Biotechnology Information NLM, National Institutes of Health, Bethesda MD 20894, USA
- Institute of Cytology and Genetics SD RAS, Novosibirsk 630090, Russia
| | | | - Christin Frahm
- Eppley Institute for Research in Cancer, University of Nebraska Medical Center, 986805 Nebraska Medical Center, Omaha, NE 68198, USA
| | - Thomas A Kunkel
- Laboratory of Structural Biology, National Institute of Environmental Health Sciences, National Institutes of Health, Research Triangle Park, NC 27709, USA
| | - Youri I Pavlov
- Eppley Institute for Research in Cancer, University of Nebraska Medical Center, 986805 Nebraska Medical Center, Omaha, NE 68198, USA
- Department of Biochemistry and Molecular Biology, University of Nebraska Medical Center, 986805 Nebraska Medical Center, Omaha, NE 68198, USA
- Department of Pathology and Microbiology, University of Nebraska Medical Center, 986805 Nebraska Medical Center, Omaha, NE 68198, USA
| |
Collapse
|
12
|
Malyarchuk BA, Rogozin IB. Mutagenesis by transient misalignment in the human mitochondrial DNA control region. Ann Hum Genet 2005; 68:324-39. [PMID: 15225158 DOI: 10.1046/j.1529-8817.2004.00099.x] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Abstract
To study spontaneous base substitutions in human mitochondrial DNA (mtDNA), we reconstructed the mutation spectra of the hypervariable segments I and II (HVS I and II) using published data on polymorphisms from various human populations. Classification analysis revealed numerous mutation hotspots in HVS I and II mutation spectra. Statistical analysis suggested that strand dislocation mutagenesis, operating in monotonous runs of nucleotides, plays an important role in generating base substitutions in the mtDNA control region. The frequency of mutations compatible with the primer strand dislocation in the HVS I region was almost twice as high as that for template strand dislocation. Frequencies of mutations compatible with the primer and template strand dislocation models are almost equal in the HVS II region. Further analysis of strand dislocation models suggested that an excess of pyrimidine transitions in mutation spectra, reconstructed on the basis of the L-strand sequence, is caused by an excess of both L-strand pyrimidine transitions and H-strand purine transitions. In general, no significant bias toward parent H-strand-specific dislocation mutagenesis was found in the HVS I and II regions.
Collapse
Affiliation(s)
- B A Malyarchuk
- Institute of Biological Problems of the North, Far-East Branch of the Russian Academy of Sciences, Portovaya str. 18, 68500 Magadan, Russia.
| | | |
Collapse
|
13
|
Hale ML, Borland AM, Gustafsson MHG, Wolff K. Causes of size homoplasy among chloroplast microsatellites in closely related Clusia species. J Mol Evol 2004; 58:182-90. [PMID: 15042338 DOI: 10.1007/s00239-003-2540-4] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2003] [Accepted: 08/07/2003] [Indexed: 11/28/2022]
Abstract
Chloroplast DNA sequences and microsatellites are useful tools for phylogenetic as well as population genetic analyses of plants. Chloroplast microsatellites tend to be less variable than nuclear microsatellites and therefore they may not be as powerful as nuclear microsatellites for within-species population analysis. However, chloroplast microsatellites may be useful for phylogenetic analysis between closely related taxa when more conventional loci, such as ITS or chloroplast sequence data, are not variable enough to resolve phylogenetic relationships in all clades. To determine the limits of chloroplast microsatellites as tools in phylogenetic analyses, we need to understand their evolution. Thus, we examined and compared phylogenetic relationships of species within the genus Clusia, using both chloroplast sequence data and variation at seven chloroplast microsatellite loci. Neither ITS nor chloroplast sequences were variable enough to resolve relationships within some sections of the genus, yet chloroplast microsatellite loci were too variable to provide any useful phylogenetic information. Size homoplasy was apparent, caused by base substitutions within the microsatellite, base substitutions in the flanking regions, indels in the flanking regions, multiple microsatellites within a fragment, and forward/reverse mutations of repeat length resulting in microsatellites of identical base composition that were not identical by descent.
Collapse
Affiliation(s)
- Marie L Hale
- School of Biology, University of Newcastle-upon-Tyne, NE1 7RU, UK,
| | | | | | | |
Collapse
|
14
|
Glazko GV, Koonin EV, Rogozin IB. Mutation hotspots in the p53 gene in tumors of different origin: correlation with evolutionary conservation and signs of positive selection. ACTA ACUST UNITED AC 2004; 1679:95-106. [PMID: 15297143 DOI: 10.1016/j.bbaexp.2004.05.004] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2004] [Revised: 04/28/2004] [Accepted: 05/14/2004] [Indexed: 01/13/2023]
Abstract
We present a classification analysis of the mutation spectra of the p53 gene and construct maps of hotspots for the germline (Li-Fraumein syndrome), different types of tumors and their derived cell lines. While spectra from solid tumors share common hotspots with the germline spectrum, they also contain unique sets of somatic hotspots that are not observed in the germline. All these hotspots correspond to amino acid replacements in the DNA-binding interface of p53. The mutation spectra of lymphomas and cell lines derived from lymphomas and lung cancers contained few hotspots compared to solid tumors. Thus, the distribution of hotspots in the p53 gene appears to depend on the tumor type and cell growth conditions; this specificity is missed by the bulk hotspot analysis. A negative correlation was detected between the amino acid replacement propensity in tumors and evolutionary variability: the hotspots are located in the positions that are highly conserved in p53 and its paralogs, p63 and p73. In all the mutation spectra, substitutions leading to amino acid replacements strongly dominate over silent substitutions, indicating that functional sites evolving under strong purifying selection are subject to intensive positive selection in p53-dependent tumors. These results are compatible with the gain-of-function concept of the role of p53 in tumorigenesis.
Collapse
Affiliation(s)
- Galina V Glazko
- Stowers Institute for Medical Research, Kansas City, MO 64110, USA.
| | | | | |
Collapse
|
15
|
Abstract
We studied the dependence of the rate of short deletions and insertions on their contexts using the data on mutations within coding exons at 19 human loci that cause mendelian diseases. We confirm that periodic sequences consisting of three to five or more nucleotides are mutagenic. Mutability of sequences with strongly biased nucleotide composition is also elevated, even when mutations within homonucleotide runs longer than three nucleotides are ignored. In contrast, no elevated mutation rates have been detected for imperfect direct or inverted repeats. Among known candidate contexts, the indel context GTAAGT and regions with purine-pyrimidine imbalance between the two DNA strands are mutagenic in our sample, and many others are not mutagenic. Data on mutation hot spots suggest two novel contexts that increase the deletion rate. Comprehensive analysis of mutability of all possible contexts of lengths four, six, and eight indicates a substantially elevated deletion rate within YYYTG and similar sequences, which is one of the two contexts revealed by the hot spots. Possible contexts that increase the insertion rate (AT(A/C)(A/C)GCC and TACCRC) and decrease deletion (TATCGC) or insertion (GCGG) rates have also been identified. Two-thirds of deletions remove a repeat, and over 80% of insertions create a repeat, i.e., they are duplications.
Collapse
Affiliation(s)
- Alexey S Kondrashov
- National Center for Biotechnology Information, National Institutes of Health, Bethesda, Maryland
| | - Igor B Rogozin
- National Center for Biotechnology Information, National Institutes of Health, Bethesda, Maryland
| |
Collapse
|
16
|
Rogozin IB, Diaz M. Cutting Edge: DGYW/WRCH Is a Better Predictor of Mutability at G:C Bases in Ig Hypermutation Than the Widely Accepted RGYW/WRCY Motif and Probably Reflects a Two-Step Activation-Induced Cytidine Deaminase-Triggered Process. THE JOURNAL OF IMMUNOLOGY 2004; 172:3382-4. [PMID: 15004135 DOI: 10.4049/jimmunol.172.6.3382] [Citation(s) in RCA: 160] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Abstract
A feature of Ig hypermutation is the presence of hypermutable DNA sequences that are preferentially found in the V regions of Ig genes. Among these, RGYW/WRCY is the most pronounced motif (G:C is a mutable position; R=A/G, Y=C/T, and W=A/T). However, a molecular basis for the high mutability of RGYW was not known until recently. The discovery that activation-induced cytidine deaminase targets the DNA encoding V regions, has enabled the analysis of its targeting properties when expressed outside of the context of hypermutation. We analyzed these data and found evidence that activation-induced cytidine deaminase is the major source of the RGYW mutable motif, but with a new twist: DGYW/WRCH (G:C is the mutable position; D=A/G/T, H=T/C/A) is a better descriptor of the Ig mutation hotspot than RGYW/WRCY. We also found evidence that a DNA repair enzyme may play a role in modifying the sequence of hypermutation hotspots.
Collapse
Affiliation(s)
- Igor B Rogozin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA.
| | | |
Collapse
|
17
|
|
18
|
Berikov V, Litvinenko A. The influence of prior knowledge on the expected performance of a classifier. Pattern Recognit Lett 2003. [DOI: 10.1016/s0167-8655(03)00099-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
19
|
Shcherbakova PV, Pavlov YI, Chilkova O, Rogozin IB, Johansson E, Kunkel TA. Unique error signature of the four-subunit yeast DNA polymerase epsilon. J Biol Chem 2003; 278:43770-80. [PMID: 12882968 DOI: 10.1074/jbc.m306893200] [Citation(s) in RCA: 99] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
We have purified wild type and exonuclease-deficient four-subunit DNA polymerase epsilon (Pol epsilon) complex from Saccharomyces cerevisiae and analyzed the fidelity of DNA synthesis by the two enzymes. Wild type Pol epsilon synthesizes DNA accurately, generating single-base substitutions and deletions at average error rates of </=2 x 10-5 and </=5 x 10-7, respectively. Pol epsilon lacking 3' --> 5' exonuclease activity is less accurate to a degree suggesting that wild type Pol epsilon proofreads at least 92% of base substitution errors and at least 99% of frameshift errors made by the polymerase. Surprisingly the base substitution fidelity of exonuclease-deficient Pol epsilon is severalfold lower than that of proofreading-deficient forms of other replicative polymerases. Moreover the spectrum of errors shows a feature not seen with other A, B, C, or X family polymerases: a high proportion of transversions resulting from T.dTTP, T.dCTP, and C.dTTP mispairs. This unique error specificity and amino acid sequence alignments suggest that the structure of the polymerase active site of Pol epsilon differs from those of other B family members. We observed both similarities and differences between the spectrum of substitutions generated by proofreading-deficient Pol epsilon in vitro and substitutions occurring in vivo in a yeast strain defective in Pol epsilon proofreading and DNA mismatch repair. We discuss the implications of these findings for the role of Pol epsilon polymerase activity in DNA replication.
Collapse
Affiliation(s)
- Polina V Shcherbakova
- Laboratory of Molecular Genetics, NIEHS, National Institutes of Health, North Carolina 27709, USA
| | | | | | | | | | | |
Collapse
|
20
|
Rogozin IB, Pavlov YI. Theoretical analysis of mutation hotspots and their DNA sequence context specificity. Mutat Res 2003; 544:65-85. [PMID: 12888108 DOI: 10.1016/s1383-5742(03)00032-2] [Citation(s) in RCA: 123] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
Mutation frequencies vary significantly along nucleotide sequences such that mutations often concentrate at certain positions called hotspots. Mutation hotspots in DNA reflect intrinsic properties of the mutation process, such as sequence specificity, that manifests itself at the level of interaction between mutagens, DNA, and the action of the repair and replication machineries. The hotspots might also reflect structural and functional features of the respective DNA sequences. When mutations in a gene are identified using a particular experimental system, resulting hotspots could reflect the properties of the gene product and the mutant selection scheme. Analysis of the nucleotide sequence context of hotspots can provide information on the molecular mechanisms of mutagenesis. However, the determinants of mutation frequency and specificity are complex, and there are many analytical methods for their study. Here we review computational approaches for analyzing mutation spectra (distribution of mutations along the target genes) that include many mutable (detectable) positions. The following methods are reviewed: derivation of a consensus sequence, application of regression approaches to correlate nucleotide sequence features with mutation frequency, mutation hotspot prediction, analysis of oligonucleotide composition of regions containing mutations, pairwise comparison of mutation spectra, analysis of multiple spectra, and analysis of "context-free" characteristics. The advantages and pitfalls of these methods are discussed and illustrated by examples from the literature. The most reliable analyses were obtained when several methods were combined and information from theoretical analysis and experimental observations was considered simultaneously. Simple, robust approaches should be used with small samples of mutations, whereas combinations of simple and complex approaches may be required for large samples. We discuss several well-documented studies where analysis of mutation spectra has substantially contributed to the current understanding of molecular mechanisms of mutagenesis. The nucleotide sequence context of mutational hotspots is a fingerprint of interactions between DNA and DNA repair, replication, and modification enzymes, and the analysis of hotspot context provides evidence of such interactions.
Collapse
Affiliation(s)
- Igor B Rogozin
- Institute of Cytology and Genetics, Russian Academy of Sciences, Novosibirsk, Russia
| | | |
Collapse
|
21
|
Quievryn G, Peterson E, Messer J, Zhitkovich A. Genotoxicity and mutagenicity of chromium(VI)/ascorbate-generated DNA adducts in human and bacterial cells. Biochemistry 2003; 42:1062-70. [PMID: 12549927 DOI: 10.1021/bi0271547] [Citation(s) in RCA: 124] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Reduction of carcinogenic Cr(VI) by vitamin C generates ascorbate-Cr(III)-DNA cross-links, binary Cr(III)-DNA adducts, and can potentially cause oxidative DNA damage by intermediate reaction products. Here, we examined the mutational spectrum and the importance of different forms of DNA damage in genotoxicity and mutagenicity of Cr(VI) activated by physiological concentrations of ascorbate. Reduction of Cr(VI) led to a dose-dependent formation of both mutagenic and replication-blocking DNA lesions as detected by propagation of the pSP189 plasmids in human fibroblasts. Disruption of Cr-DNA binding abolished mutagenic responses and normalized the yield of replicated plasmids, indicating that Cr-DNA adducts were responsible for both mutagenicity and genotoxicity of Cr(VI). The absence of DNA breaks and abasic sites confirmed the lack of a significant production of hydroxyl radicals and Cr(V)-peroxo complexes in Cr(VI)-ascorbate reactions. Ascorbate-Cr(III)-DNA cross-links were much more mutagenic than smaller Cr(III)-DNA adducts and accounted for more than 90% of Cr(VI) mutagenicity. Ternary adducts were also several times more potent in the inhibition of replication than binary complexes. The Cr(VI)-induced mutational spectrum consisted of an approximately equal number of deletions and G/C-targeted point mutations (51% G/C --> T/A and 30% G/C --> A/T). In Escherichia coli cells, Cr(VI)-induced DNA adducts were only highly genotoxic but not mutagenic under either normal or SOS-induced conditions. Lower toxicity and high mutagenicity of ascorbate-Cr(III)-DNA adducts in human cells may result from the recruitment of an error-prone bypass DNA polymerase(s) to the stalled replication forks. Our results suggest that phosphotriester-type DNA adducts could play a more important role in human than bacterial mutagenesis.
Collapse
Affiliation(s)
- George Quievryn
- Brown University, Department of Pathology and Laboratory Medicine, Providence, Rhode Island 02912, USA
| | | | | | | |
Collapse
|
22
|
Fijal BA, Idury RM, Witte JS. Analysis of mutational spectra: locating hotspots and clusters of mutations using recursive segmentation. Stat Med 2002; 21:1867-85. [PMID: 12111894 DOI: 10.1002/sim.1145] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
Mutations within different regions of disease-causing genes can vary in their impact on disease initiation and progression. Determining how individual mutations within such genes affect disease risk and progression can improve the accuracy of prognoses and help guide treatment selection. Estimates of mutation-specific risks can be poor, however, when genes have a large number of distinct mutations, and data for any given mutation is sparse. To address this problem, we present here a method of analysing the spectrum of mutations observed across a gene that pools together mutations that appear to have similar effects on disease. One of the assumptions underlying the analysis of mutational spectra created in this manner is that the frequency of the mutation in the sample reflects the degree of its effect on disease development. Additionally, mutations that disrupt the same functionally important region of the gene are expected to have a similar impact on disease development. These mutations tend to form a cluster within the spectrum. Therefore, we developed an algorithm that segments a spectrum into regions containing sites with similar mutational frequencies, and have derived by simulation equations that allow one to evaluate whether segmentation is needed. We used this approach to investigate the spectrum of mutations observed in the p53 tumour suppressor gene in colorectal cancer tumours. Here, recursive segmentation identified the boundaries of apparent clusters better than did other methods, and this approach could identify clusters of mutations which corresponded to biologically important regions of the p53 protein.
Collapse
Affiliation(s)
- Bonnie A Fijal
- Department of Epidemiology and Biostatistics, Case Western Reserve University, 2109 Adelbert Road, Cleveland, OH 44106-4945, USA
| | | | | |
Collapse
|
23
|
Berikov VB. An approach to the evaluation of the performance of a discrete classifier. Pattern Recognit Lett 2002. [DOI: 10.1016/s0167-8655(01)00119-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
24
|
Abstract
Several lines of research are now converging towards an integrated understanding of mutational mechanisms and their evolutionary implications. Experimentally, crystal structures reveal the effect of sequence context on polymerase fidelity; large-scale sequencing projects generate vast amounts of sequence polymorphism data; and locus-specific databases are being constructed. Computationally, software and analytical tools have been developed to analyze mutational data, to identify mutational hot spots, and to compare the signatures of mutagenic agents.
Collapse
Affiliation(s)
- M Zavolan
- Laboratory of Computational Genomics, The Rockefeller University, 1230 York Avenue, New York, New York 10021, USA.
| | | |
Collapse
|
25
|
Piegorsch WW, Richwine KA. Large-sample pairwise comparisons among multinomial proportions with an application to analysis of mutant spectra. JOURNAL OF AGRICULTURAL BIOLOGICAL AND ENVIRONMENTAL STATISTICS 2001. [DOI: 10.1198/108571101317096532] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|
26
|
Rogozin IB, Pavlov YI, Bebenek K, Matsuda T, Kunkel TA. Somatic mutation hotspots correlate with DNA polymerase eta error spectrum. Nat Immunol 2001; 2:530-6. [PMID: 11376340 DOI: 10.1038/88732] [Citation(s) in RCA: 256] [Impact Index Per Article: 11.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Mutational spectra analysis of 15 immunoglobulin genes suggested that consensus motifs RGYW and WA were universal descriptors of somatic hypermutation. Highly mutable sites, "hotspots", that matched WA were preferentially found in one DNA strand and RGYW hotspots were found in both strands. Analysis of base-substitution hotspots in DNA polymerase error spectra showed that 33 of 36 hotspots in the human polymerase eta spectrum conformed to the WA consensus. This and four other characteristics of polymerase eta substitution specificity suggest that errors introduced by this enzyme during synthesis of the nontranscribed DNA strand in variable regions may contribute to strand-specific somatic hypermutagenesis of immunoglobulin genes at A-T base pairs.
Collapse
Affiliation(s)
- I B Rogozin
- Institute of Cytology and Genetics, Siberian Branch of Russian Academy of Sciences, Novosibirsk 630090, Russia
| | | | | | | | | |
Collapse
|
27
|
Abstract
The study and comparison of mutation(al) spectra is an important problem in molecular biology, because these spectra often reflect on important features of mutations and their fixation. Such features include the interaction of DNA with various mutagens, the function of repair/replication enzymes, and properties of target proteins. It is known that mutability varies significantly along nucleotide sequences, such that mutations often concentrate at certain positions, called "hotspots," in a sequence. In this paper, we discuss in detail two approaches for mutation spectra analysis: the comparison of mutation spectra with a HG-PUBL program, (FTP: sunsite.unc.edu/pub/academic/biology/dna-mutations/hyperg) and hotspot prediction with the CLUSTERM program (www.itba.mi.cnr.it/webmutation; ftp.bionet.nsc.ru/pub/biology/dbms/clusterm.zip). Several other approaches for mutational spectra analysis, such as the analysis of a target protein structure, hotspot context revealing, multiple spectra comparisons, as well as a number of mutation databases are briefly described. Mutation spectra in the lacI gene of E. coli and the human p53 gene are used for illustration of various difficulties of such analysis.
Collapse
Affiliation(s)
- I Rogozin
- Institute of Cytology and Genetics, Russian Academy of Sciences, Novosibirsk, Russia.
| | | | | |
Collapse
|
28
|
Khromov-Borisov NN, Rogozin IB, Pêgas Henriques JA, de Serres FJ. Similarity pattern analysis in mutational distributions. Mutat Res 1999; 430:55-74. [PMID: 10592318 DOI: 10.1016/s0027-5107(99)00148-7] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
The validity and applicability of the statistical procedure - similarity pattern analysis (SPAN) - to the study of mutational distributions (MDs) was demonstrated with two sets of data. The first was mutational spectra (MS) for 697 GC to AT transitions produced with eight alkylating agents (AAs) in the lacI gene of Escherichia coli. The second was a recently summarized data on the distributions of 11562 spontaneous, radiation- and chemical-induced forward mutations in the ad-3 region of heterokaryon 12 of Neurospora crassa. They were analyzed as large two-way contingency tables (CTs) where two kinds of profiles were compared: site (or genotypic class) profiles and origin (or mutagen) profiles. To measure similarity (homogeneity) between any pair of profiles, the relevant sufficient statistics, Kastenbaum-Hirotsu squared distance (KHi(2)), was used. Collapsing the similar profiles into distinct internally homogeneous clusters named 'collapsets' revealed their similarity pattern. To facilitate the procedure, the computer program, COLLAPSE, was elaborated. The results of SPAN for the lacI spectra were found comparable with the results of their previous analysis with two multivariate statistical methods, the factor and cluster analyses. In the ad-3 data set, five collapsets were revealed among origin profiles (OPs): (I) ENU = 4NQO = 4HAQO = FANFT = SQ18506; (II) AF-2 = EI = MMS = DEP; (III) ETO = UV; (IV) AHA = PROCARB; and (V) He ions = protons. Moreover, the previous observation that MDs are dose-dependent was confirmed for X-ray-induced MDs. Profiles induced with the low doses of X-rays are similar to that induced with 85Sr, and profiles induced with the medium X-ray doses to those induced with protons and He ions. Evaluated similarities appear to be rather reasonable: mutagens with similar mode of action induce similar MDs. Similarity pattern revealed among genotypic class profiles (GCPs) seems to be also interpretable. When supplemented with descriptive cluster analysis, SPAN appears to be a fruitful methodology in MS analysis.
Collapse
Affiliation(s)
- N N Khromov-Borisov
- Institute of Cytology and Genetics, Russian Academy of Sciences, Novosibirsk, Russia.
| | | | | | | |
Collapse
|