1
|
The Baikal subtype of tick-borne encephalitis virus is evident of recombination between Siberian and Far-Eastern subtypes. PLoS Negl Trop Dis 2023; 17:e0011141. [PMID: 36972237 PMCID: PMC10079218 DOI: 10.1371/journal.pntd.0011141] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2022] [Revised: 04/06/2023] [Accepted: 02/06/2023] [Indexed: 03/29/2023] Open
Abstract
Tick-borne encephalitis virus (TBEV) is a flavivirus which causes an acute or sometimes chronic infection that frequently has severe neurological consequences, and is a major public health threat in Eurasia. TBEV is genetically classified into three distinct subtypes; however, at least one group of isolates, the Baikal subtype, also referred to as “886-84-like”, challenges this classification. Baikal TBEV is a persistent group which has been repeatedly isolated from ticks and small mammals in the Buryat Republic, Irkutsk and Trans-Baikal regions of Russia for several decades. One case of meningoencephalitis with a lethal outcome caused by this subtype has been described in Mongolia in 2010. While recombination is frequent in Flaviviridae, its role in the evolution of TBEV has not been established. Here, we isolate and sequence four novel Baikal TBEV samples obtained in Eastern Siberia. Using a set of methods for inference of recombination events, including a newly developed phylogenetic method allowing for formal statistical testing for such events in the past, we find robust support for a difference in phylogenetic histories between genomic regions, indicating recombination at origin of the Baikal TBEV. This finding extends our understanding of the role of recombination in the evolution of this human pathogen.
Collapse
|
2
|
The Rise and Spread of the SARS-CoV-2 AY.122 Lineage in Russia. Virus Evol 2022; 8:veac017. [PMID: 35371558 PMCID: PMC8966696 DOI: 10.1093/ve/veac017] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2021] [Revised: 02/15/2022] [Accepted: 03/04/2022] [Indexed: 11/14/2022] Open
Abstract
Delta has outcompeted most preexisting variants of SARS-CoV-2, becoming the globally predominant lineage by mid-2021. Its subsequent evolution has led to the emergence of multiple sublineages, most of which are well-mixed between countries. By contrast, here we show that nearly the entire Delta epidemic in Russia has probably descended from a single import event, or from multiple closely timed imports from a single poorly sampled geographic location. Indeed, over 90 per cent of Delta samples in Russia are characterized by the nsp2:K81N + ORF7a:P45L pair of mutations which is rare outside Russia, putting them in the AY.122 sublineage. The AY.122 lineage was frequent in Russia among Delta samples from the start, and has not increased in frequency in other countries where it has been observed, suggesting that its high prevalence in Russia has probably resulted from a random founder effect rather than a transmission advantage. The apartness of the genetic composition of the Delta epidemic in Russia makes Russia somewhat unusual, although not exceptional, among other countries.
Collapse
|
3
|
The rise and spread of the SARS-CoV-2 AY.122 lineage in Russia. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2021:2021.12.02.21267168. [PMID: 34909799 PMCID: PMC8669866 DOI: 10.1101/2021.12.02.21267168] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
BACKGROUND Delta has outcompeted most preexisting variants of SARS-CoV-2, becoming the globally predominant lineage by mid-2021. Its subsequent evolution has led to emergence of multiple sublineages, many of which are well-mixed between countries. AIM Here, we aim to study the emergence and spread of the Delta lineage in Russia. METHODS We use a phylogeographic approach to infer imports of Delta sublineages into Russia, and phylodynamic models to assess the rate of their spread. RESULTS We show that nearly the entire Delta epidemic in Russia has probably descended from a single import event despite genetic evidence of multiple Delta imports. Indeed, over 90% of Delta samples in Russia are characterized by the nsp2:K81N+ORF7a:P45L pair of mutations which is rare outside Russia, putting them in the AY.122 sublineage. The AY.122 lineage was frequent in Russia among Delta samples from the start, and has not increased in frequency in other countries where it has been observed, suggesting that its high prevalence in Russia has probably resulted from a random founder effect. CONCLUSION The apartness of the genetic composition of the Delta epidemic in Russia makes Russia somewhat unusual, although not exceptional, among other countries.
Collapse
|
4
|
Episodic evolution of coadapted sets of amino acid sites in mitochondrial proteins. PLoS Genet 2021; 17:e1008711. [PMID: 33493156 PMCID: PMC7861529 DOI: 10.1371/journal.pgen.1008711] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2020] [Revised: 02/04/2021] [Accepted: 12/07/2020] [Indexed: 11/19/2022] Open
Abstract
The rate of evolution differs between protein sites and changes with time. However, the link between these two phenomena remains poorly understood. Here, we design a phylogenetic approach for distinguishing pairs of amino acid sites that evolve concordantly, i.e., such that substitutions at one site trigger subsequent substitutions at the other; and also pairs of sites that evolve discordantly, so that substitutions at one site impede subsequent substitutions at the other. We distinguish groups of amino acid sites that undergo coordinated evolution and evolve discordantly from other such groups. In mitochondrion-encoded proteins of metazoans and fungi, we show that concordantly evolving sites are clustered in protein structures. By analysing the phylogenetic patterns of substitutions at concordantly and discordantly evolving site pairs, we find that concordant evolution has two distinct causes: epistatic interactions between amino acid substitutions and episodes of selection independently affecting substitutions at different sites. The rate of substitutions at concordantly evolving groups of protein sites changes in the course of evolution, indicating episodes of selection limited to some of the lineages. The phylogenetic positions of these changes are consistent between proteins, suggesting common selective forces underlying them. The mode and rate of evolution of a protein site depends on the effect of its mutations on protein fitness. The fitness effect of a mutation itself can change in the course of evolution for at least two reasons. First, it can be modulated by substitutions occurring at other sites, a phenomenon called epistasis. Second, changes in selection can be non-epistatic, affecting sites independently of one another. Here, we analyse substitutions accumulated by the evolving lineages of the five proteins encoded by the mitochondrial genomes of thousands of species of metazoans and fungi. We show that substitutions at different amino acid sites occur in a coordinated fashion, and this coordination is caused both by epistasis and by episodes of selection affecting groups of sites. We partition each protein into several groups of concordantly evolving sites such that evolution of sites from different groups is discordant, and show that the proteins encoded by the mitochondrial genome consist of coevolving structural blocks. Some of these blocks have a clear functional specialization, e.g. are associated with interfaces between proteins composing respiratory complexes. Together, our results reveal a previously unrecognized complexity in the causes of variation in evolutionary rates between protein sites.
Collapse
|
5
|
Abstract
Amino acid propensities at a site change in the course of protein evolution. This may happen for two reasons. Changes may be triggered by substitutions at epistatically interacting sites elsewhere in the genome. Alternatively, they may arise due to environmental changes that are external to the genome. Here, we design a framework for distinguishing between these alternatives. Using analytical modelling and simulations, we show that they cause opposite dynamics of the fitness of the allele currently occupying the site: it tends to increase with the time since its origin due to epistasis ("entrenchment"), but to decrease due to random environmental fluctuations ("senescence"). By analysing the genomes of vertebrates and insects, we show that the amino acids originating at negatively selected sites experience strong entrenchment. By contrast, the amino acids originating at positively selected sites experience senescence. We propose that senescence of the current allele is a cause of adaptive evolution.
Collapse
|
6
|
Genetic diversity of Kemerovo virus and phylogenetic relationships within the Great Island virus genetic group. Ticks Tick Borne Dis 2019; 11:101333. [PMID: 31787560 DOI: 10.1016/j.ttbdis.2019.101333] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2019] [Revised: 10/29/2019] [Accepted: 11/15/2019] [Indexed: 11/27/2022]
Abstract
Kemerovo virus (KEMV) is a member of the Great Island virus genetic group, belonging to the tick-borne arboviruses of the genus Orbivirus within the family Reoviridae. Nine strains of KEMV, which were isolated from various locations in Russia, were sequenced by high-throughput sequencing to study their intraspecific diversity and the interspecific relationships of viruses within the Great Island genetic group. For the first time, multiple reassortment within KEMV was reliably demonstrated. Different types of independently emerged alternative reading frames in segment 9 and heterogeneity of the viral population in one of the KEMV strains were found. The hypothesis of the role of an alternative open reading frame (ORF) in segment 9 in KEMV cellular tropism was not confirmed in this study.
Collapse
|
7
|
Abstract
Characterization of the within-host genetic diversity of viral pathogens is required for selection of effective treatment of some important viral infections, e.g. HIV, HBV and HCV. Despite the technical ability of detection, there are conflicting data regarding the clinical significance of low-frequency variants, partially because of the difficulty of their distinguishing from experimental artifacts. The issue of cross-contamination is relevant for all highly sensitive techniques, including deep sequencing: even trace contamination leads to a significant increase of false positives in identified SNVs. Determination of infections by multiple genotypes of some viruses, the incidence of which can be considerable, especially in risk groups, is also clinically significant in some cases. We developed a new viral reference-guided assembler, VirGenA, that can separate mixtures of strains of different intraspecies genetic groups (genotypes, subtypes, clades, etc.) and assemble a separate consensus sequence for each group in a mixture. It produced long assemblies for mixture components of extremely low frequencies (<1%) allowing detection of cross-contamination of samples by divergent genotypes. We tested VirGenA on both clinical and simulated data. On both types of data, VirGenA shows better or similar results than the existing de novo assemblers. Cross-platform implementation (including source code) is freely available at https://github.com/gFedonin/VirGenA/releases.
Collapse
|
8
|
[Resistance to protease inhibitors and efficiency of antiviral therapy in patients with chronic hepatitis C]. TERAPEVT ARKH 2016; 88:62-67. [PMID: 27030332 DOI: 10.17116/terarkh201688362-67] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
AIM To clarify the role of virus resistance in the efficiency of antiviral therapy with protease inhibitors (PIs) chronic hepatitis C (CHC) patients, with moderate sensitivity to interferon-α. MATERIALS AND METHODS Eight Caucasian patients (4 men and 4 women) aged 21 to 65 years (median 52.5 years) with genotype 1b hepatitis C virus (HCV) infection were included in the study. Two patients were diagnosed with liver cirrhosis; 4 had been ineffectively treated with peginterferon in combination with ribavirin. None of the patients had obesity and/or insulin resistance. All the 8 patients received triple therapy with PIs (boceprevir (n=3), telaprevir (n=4), and simeprevir (n=1)) and as a result failed to achieve a sustained virologic response. All the participants were studied to identify mutations in HCV NS3/4A region. RESULTS Five of the 8 patients were found to have mutations in HCV NS3/4A region (substantially reducing drug susceptibility in 3 cases). CONCLUSION In CHC patients who are moderately sensitive to interferon-α and receive therapy with PIs, resistance to the latter is critically important for the efficiency of therapy and the timely identification of resistance mutations can contribute to the choice of an optimal treatment policy.
Collapse
|
9
|
Abstract
The surface proteins hemagglutinin (HA) and neuraminidase (NA) of human influenza A virus evolve under selection pressures to escape adaptive immune responses and antiviral drug treatments. In addition to these external selection pressures, some mutations in HA are known to affect the adaptive landscape of NA, and vice versa, because these two proteins are physiologically interlinked. However, the extent to which evolution of one protein affects the evolution of the other one is unknown. Here we develop a novel phylogenetic method for detecting the signatures of such genetic interactions between mutations in different genes – that is, inter-gene epistasis. Using this method, we show that influenza surface proteins evolve in a coordinated way, with mutations in HA affecting subsequent spread of mutations in NA and vice versa, at many sites. Of particular interest is our finding that the oseltamivir-resistance mutations in NA in subtype H1N1 were likely facilitated by prior mutations in HA. Our results illustrate that the adaptive landscape of a viral protein is remarkably sensitive to its genomic context and, more generally, that the evolution of any single protein must be understood within the context of the entire evolving genome. The fitness of an organism depends on the coordinated function of many genes. Thus, how a mutation in one gene affects fitness often depends on what mutations are present in other genes. This dependence is called “genetic interaction” or “epistasis”. The prevalence and type of such interactions are not well understood. Epistasis can be inferred from time-series sequencing data when a mutation in one gene is observed to facilitate the spread of a mutation in another gene. However, the situation is much more complicated when new combinations of genes are formed by processes such as recombination or reassortment. In such cases, deducing the time and order of genetic changes is difficult. Here, we devise a method to infer pairs of mutations in different genes which closely follow one another in the presence of reassortment. We apply it to evolution of two surface proteins of influenza A virus, hemagglutinin and neuraminidase, which are important targets for the human immune system and drugs. We show that mutations in one of these proteins are often facilitated by prior mutations, or compensated by subsequent mutations, in the other protein. In particular, drug-resistance mutations in neuraminidase were likely made possible by prior mutation in hemagglutinin. Knowledge of such interactions is necessary to fully understand and predict evolution.
Collapse
|
10
|
Development of Molecular Technique for Deep Sequencing of Almost Full HIV Genome. AIDS Res Hum Retroviruses 2014. [DOI: 10.1089/aid.2014.5388.abstract] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
|
11
|
Intrasubtype reassortments cause adaptive amino acid replacements in H3N2 influenza genes. PLoS Genet 2014; 10:e1004037. [PMID: 24415946 PMCID: PMC3886890 DOI: 10.1371/journal.pgen.1004037] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2013] [Accepted: 11/01/2013] [Indexed: 01/15/2023] Open
Abstract
Reassortments and point mutations are two major contributors to diversity of Influenza A virus; however, the link between these two processes is unclear. It has been suggested that reassortments provoke a temporary increase in the rate of amino acid changes as the viral proteins adapt to new genetic environment, but this phenomenon has not been studied systematically. Here, we use a phylogenetic approach to infer the reassortment events between the 8 segments of influenza A H3N2 virus since its emergence in humans in 1968. We then study the amino acid replacements that occurred in genes encoded in each segment subsequent to reassortments. In five out of eight genes (NA, M1, HA, PB1 and NS1), the reassortment events led to a transient increase in the rate of amino acid replacements on the descendant phylogenetic branches. In NA and HA, the replacements following reassortments were enriched with parallel and/or reversing replacements; in contrast, the replacements at sites responsible for differences between antigenic clusters (in HA) and at sites under positive selection (in NA) were underrepresented among them. Post-reassortment adaptive walks contribute to adaptive evolution in Influenza A: in NA, an average reassortment event causes at least 2.1 amino acid replacements in a reassorted gene, with, on average, 0.43 amino acid replacements per evolving post-reassortment lineage; and at least ~9% of all amino acid replacements are provoked by reassortments.
Collapse
|
12
|
Base-calling algorithm with vocabulary (BCV) method for analyzing population sequencing chromatograms. PLoS One 2013; 8:e54835. [PMID: 23382983 PMCID: PMC3557274 DOI: 10.1371/journal.pone.0054835] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2012] [Accepted: 12/19/2012] [Indexed: 02/01/2023] Open
Abstract
Sanger sequencing is a common method of reading DNA sequences. It is less expensive than high-throughput methods, and it is appropriate for numerous applications including molecular diagnostics. However, sequencing mixtures of similar DNA of pathogens with this method is challenging. This is important because most clinical samples contain such mixtures, rather than pure single strains. The traditional solution is to sequence selected clones of PCR products, a complicated, time-consuming, and expensive procedure. Here, we propose the base-calling with vocabulary (BCV) method that computationally deciphers Sanger chromatograms obtained from mixed DNA samples. The inputs to the BCV algorithm are a chromatogram and a dictionary of sequences that are similar to those we expect to obtain. We apply the base-calling function on a test dataset of chromatograms without ambiguous positions, as well as one with 3-14% sequence degeneracy. Furthermore, we use BCV to assemble a consensus sequence for an HIV genome fragment in a sample containing a mixture of viral DNA variants and to determine the positions of the indels. Finally, we detect drug-resistant Mycobacterium tuberculosis strains carrying frameshift mutations mixed with wild-type bacteria in the pncA gene, and roughly characterize bacterial communities in clinical samples by direct 16S rRNA sequencing.
Collapse
|
13
|
Conserved and species-specific alternative splicing in mammalian genomes. BMC Evol Biol 2007; 7:249. [PMID: 18154685 PMCID: PMC2231371 DOI: 10.1186/1471-2148-7-249] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2007] [Accepted: 12/22/2007] [Indexed: 11/30/2022] Open
Abstract
Background Alternative splicing has been shown to be one of the major evolutionary mechanisms for protein diversification and proteome expansion, since a considerable fraction of alternative splicing events appears to be species- or lineage-specific. However, most studies were restricted to the analysis of cassette exons in pairs of genomes and did not analyze functionality of the alternative variants. Results We analyzed conservation of human alternative splice sites and cassette exons in the mouse and dog genomes. Alternative exons, especially minor-isofom ones, were shown to be less conserved than constitutive exons. Frame-shifting alternatives in the protein-coding regions are less conserved than frame-preserving ones. Similarly, the conservation of alternative sites is highest for evenly used alternatives, and higher when the distance between the sites is divisible by three. The rate of alternative-exon and site loss in mouse is slightly higher than in dog, consistent with faster evolution of the former. The evolutionary dynamics of alternative sites was shown to be consistent with the model of random activation of cryptic sites. Conclusion Consistent with other studies, our results show that minor cassette exons are less conserved than major-alternative and constitutive exons. However, our study provides evidence that this is caused not only by exon birth, but also lineage-specific loss of alternative exons and sites, and it depends on exon functionality.
Collapse
|
14
|
[EDAS, databases of alternatively spliced human genes]. BIOFIZIKA 2006; 51:589-92. [PMID: 16909834] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 05/11/2023]
Abstract
EDAS, a database of alternatively spliced human genes, contains data on the alignment of proteins, mRNAs, and EST. It contains information on all exons and introns observed, as well as elementary alternatives formed from them. The database makes it possible to filter the output data by changing the cut-off threshold by the significance level. The database is accessible at http://www.gene-bee.msu.ru/edas/.
Collapse
|
15
|
Abstract
Background Alternative splicing is a major mechanism of generating protein diversity in higher eukaryotes. Although at least half, and probably more, of mammalian genes are alternatively spliced, it was not clear, whether the frequency of alternative splicing is the same in different functional categories. The problem is obscured by uneven coverage of genes by ESTs and a large number of artifacts in the EST data. Results We have developed a method that generates possible mRNA isoforms for human genes contained in the EDAS database, taking into account the effects of nonsense-mediated decay and translation initiation rules, and a procedure for offsetting the effects of uneven EST coverage. Then we computed the number of mRNA isoforms for genes from different functional categories. Genes encoding ribosomal proteins and genes in the category "Small GTPase-mediated signal transduction" tend to have fewer isoforms than the average, whereas the genes in the category "DNA replication and chromosome cycle" have more isoforms than the average. Genes encoding proteins involved in protein-protein interactions tend to be alternatively spliced more often than genes encoding non-interacting proteins, although there is no significant difference in the number of isoforms of alternatively spliced genes. Conclusion Filtering for functional isoforms satisfying biological constraints and accountung for uneven EST coverage allowed us to describe differences in alternative splicing of genes from different functional categories. The observations seem to be consistent with expectations based on current biological knowledge: less isoforms for ribosomal and signal transduction proteins, and more alternative splicing of interacting and cell cycle proteins.
Collapse
|