76
|
Bailes E, Gao F, Bibollet-Ruche F, Courgnaud V, Peeters M, Marx PA, Hahn BH, Sharp PM. Hybrid origin of SIV in chimpanzees. Science 2003; 300:1713. [PMID: 12805540 DOI: 10.1126/science.1080657] [Citation(s) in RCA: 309] [Impact Index Per Article: 14.7] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
|
77
|
Markine-Goriaynoff N, Georgin JP, Goltz M, Zimmermann W, Broll H, Wamwayi HM, Pastoret PP, Sharp PM, Vanderplasschen A. The core 2 beta-1,6-N-acetylglucosaminyltransferase-mucin encoded by bovine herpesvirus 4 was acquired from an ancestor of the African buffalo. J Virol 2003; 77:1784-92. [PMID: 12525612 PMCID: PMC140990 DOI: 10.1128/jvi.77.3.1784-1792.2003] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
The Bo17 gene of bovine herpesvirus 4 (BoHV-4) is the only viral gene known to date that encodes a homologue of the cellular core 2 beta-1,6-N-acetylglucosaminyltransferase-mucin type (C2GnT-M). To investigate the origin and evolution of the Bo17 gene, we analyzed its distribution among BoHV-4 strains and determined the sequences of Bo17 from nine representative strains and of the C2GnT-M gene from six species of ruminants expected to encompass the group within which the gene acquisition occurred. Of 34 strains of BoHV-4, isolated from four different continents, all were found to contain the Bo17 gene. Phylogenetic analyses indicated that Bo17 was acquired from a recent ancestor of the African buffalo, implying that cattle subsequently acquired BoHV-4 by cross-species transmission. The rate of synonymous nucleotide substitution in Bo17 was estimated at 5 x 10(-8) to 6 x 10(-8) substitutions/site/year, consistent with previous estimates made under the assumption that herpesviruses have cospeciated with their hosts. The Bo17 gene acquisition was dated to around 1.5 million years ago. Bo17 sequences from BoHV-4 strains from African buffalo and from cattle formed two separate clades, estimated to have split about 700,000 years ago. Analysis of the ratio of nonsynonymous to synonymous nucleotide substitutions revealed a burst of amino acid replacements subsequent to the transfer of the cellular gene to the viral genome, followed by a return to a strong constraint on nonsynonymous changes during the divergence of contemporary BoHV-4 strains. The Bo17 gene represents the most recent of the known herpesvirus gene acquisitions and provides the best opportunity for learning more about this important process of viral evolution.
Collapse
|
78
|
Santiago ML, Bibollet-Ruche F, Bailes E, Kamenya S, Muller MN, Lukasik M, Pusey AE, Collins DA, Wrangham RW, Goodall J, Shaw GM, Sharp PM, Hahn BH. Amplification of a complete simian immunodeficiency virus genome from fecal RNA of a wild chimpanzee. J Virol 2003; 77:2233-42. [PMID: 12525658 PMCID: PMC140993 DOI: 10.1128/jvi.77.3.2233-2242.2003] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Current knowledge of the genetic diversity of simian immunodeficiency virus (SIVcpz) infection of wild chimpanzees (Pan troglodytes) is incomplete since few isolates, mostly from captive apes from Cameroon and Gabon, have been characterized; yet this information is critical for understanding the origins of human immunodeficiency virus type 1 (HIV-1) and the circumstances leading to the HIV-1 pandemic. Here, we report the first full-length SIVcpz sequence (TAN1) from a wild chimpanzee (Pan troglodytes schweinfurthii) from Gombe National Park (Tanzania), which was obtained noninvasively by amplification of virion RNA from fecal samples collected under field conditions. Using reverse transcription-PCR and a combination of generic and strain-specific primers, we amplified 13 subgenomic fragments which together spanned the entire TAN1 genome (9,326 bp). Distance and phylogenetic tree analyses identified TAN1 unambiguously as a member of the HIV-1/SIVcpz group of viruses but also revealed an extraordinary degree of divergence from all previously characterized SIVcpz and HIV-1 strains. In Gag, Pol, and Env proteins, TAN1 differed from west-central African SIVcpz and HIV-1 strains on average by 36, 30, and 51% of amino acid sequences, respectively, approaching distance values typically found for SIVs from different primate species. The closest relative was SIVcpzANT, also from a P. t. schweinfurthii ape, which differed by 30, 25, and 44%, respectively, in these same protein sequences but clustered with TAN1 in all major coding regions in a statistically highly significant manner. These data indicate that east African chimpanzees, like those from west-central Africa, are naturally infected by SIVcpz but that their viruses comprise a second, divergent SIVcpz lineage which appears to have evolved in relative isolation for an extended period of time. Our data also demonstrate that noninvasive molecular epidemiological studies of SIVcpz in wild chimpanzees are feasible and that such an approach may prove essential for unraveling the evolutionary history of SIVcpz/HIV-1 as well as that of other pathogens naturally infecting wild primate populations.
Collapse
|
79
|
Abstract
Pseudomonas aeruginosa PA01 has a large (6.7 Mbp) genome with a high (67%) G+C content. Codon usage in this species is dominated by this compositional bias, with the average G+C content at synonymously variable third positions of codons being 83%. Nevertheless, there is some variation of synonymous codon usage among genes. The nature and causes of this variation were investigated using multivariate statistical analyses. Three trends were identified. The major source of variation was attributable to genes with unusually low G+C content that are probably due to horizontal transfer. A lesser trend among genes was associated with the preferential use of putatively translationally optimal codons in genes expressed at high levels. In addition, genes on the leading strand of replication were on average more G+T-rich. Our findings contradict the results of two previous analyses, and the reasons for the discrepancies are discussed.
Collapse
|
80
|
Abstract
The diverse consequences of different viral infections on the human population reflect both the history of their origins and the nature of their ongoing evolution. Advances in the ease of finding and characterizing viral genome sequences are having a major impact on our understanding of the diversity of human viruses and of their relatives infecting other species.
Collapse
|
81
|
Santiago ML, Rodenburg CM, Kamenya S, Bibollet-Ruche F, Gao F, Bailes E, Meleth S, Soong SJ, Kilby JM, Moldoveanu Z, Fahey B, Muller MN, Ayouba A, Nerrienet E, McClure HM, Heeney JL, Pusey AE, Collins DA, Boesch C, Wrangham RW, Goodall J, Sharp PM, Shaw GM, Hahn BH. SIVcpz in wild chimpanzees. Science 2002; 295:465. [PMID: 11799233 DOI: 10.1126/science.295.5554.465] [Citation(s) in RCA: 193] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
|
82
|
Sharp PM, Bailes E, Chaudhuri RR, Rodenburg CM, Santiago MO, Hahn BH. The origins of acquired immune deficiency syndrome viruses: where and when? Philos Trans R Soc Lond B Biol Sci 2001; 356:867-76. [PMID: 11405934 PMCID: PMC1088480 DOI: 10.1098/rstb.2001.0863] [Citation(s) in RCA: 156] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
In the absence of direct epidemiological evidence, molecular evolutionary studies of primate lentiviruses provide the most definitive information about the origins of human immunodeficiency virus (HIV)-1 and HIV-2. Related lentiviruses have been found infecting numerous species of primates in sub-Saharan Africa. The only species naturally infected with viruses closely related to HIV-2 is the sooty mangabey (Cercocebus atys) from western Africa, the region where HIV-2 is known to be endemic. Similarly, the only viruses very closely related to HIV-1 have been isolated from chimpanzees (Pan troglodytes), and in particular those from western equatorial Africa, again coinciding with the region that appears to be the hearth of the HIV-1 pandemic. HIV-1 and HIV-2 have each arisen several times: in the case of HIV-1, the three groups (M, N and O) are the result of independent cross-species transmission events. Consistent with the phylogenetic position of a 'fossil' virus from 1959, molecular clock analyses using realistic models of HIV-1 sequence evolution place the last common ancestor of the M group prior to 1940, and several lines of evidence indicate that the jump from chimpanzees to humans occurred before then. Both the inferred geographical origin of HIV-1 and the timing of the cross-species transmission are inconsistent with the suggestion that oral polio vaccines, putatively contaminated with viruses from chimpanzees in eastern equatorial Africa in the late 1950s, could be responsible for the origin of acquired immune deficiency syndrome.
Collapse
|
83
|
Peeters M, Sharp PM. Genetic diversity of HIV-1: the moving target. AIDS 2001; 14 Suppl 3:S129-40. [PMID: 11086856] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/18/2023]
|
84
|
Grocock RJ, Sharp PM. Synonymous codon usage in Cryptosporidium parvum: identification of two distinct trends among genes. Int J Parasitol 2001; 31:402-12. [PMID: 11306119 DOI: 10.1016/s0020-7519(01)00129-1] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
The usage of alternative synonymous codons in the apicomplexan Cryptosporidium parvum has been investigated. A data set of 54 genes was analysed. Overall, A- and U-ending codons predominate, as expected in an A+T-rich genome. Two trends of codon usage variation among genes were identified using correspondence analysis. The primary trend is in the extent of usage of a subset of presumably translationally optimal codons, that are used at significantly higher frequencies in genes expected to be expressed at high levels. Fifteen of the 18 codons identified as optimal are more G+C-rich than the otherwise common codons, so that codon selection associated with translation opposes the general mutation bias. Among 40 genes with lower frequencies of these optimal codons, a secondary trend in G+C content was identified. In these genes, G+C content at synonymously variable third positions of codons is correlated with that in 5' and 3' flanking sequences, indicative of regional variation in G+C content, perhaps reflecting regional variation in mutational biases.
Collapse
|
85
|
Abstract
More than 20 near full-length genome sequences have been reported for human polyomavirus JC (JCV). These have previously been classified into seven genotypes, and additional subtypes, which exhibit geographical associations. One of these genotypes, Type 4, has been suggested to be a recombinant of Types 1 and 3. We have investigated the pattern of diversity, and evolutionary relationships, among these sequences. In direct contradiction of a recent report, we found that different phylogenetic methods gave consistent results for the phylogenetic relationships among strains. The single known strain representing Type 5 was shown to be a mosaic of sequences from Types 2 and 6, although whether this recombination occurred in vivo or in vitro is not clear. In contrast, there was no substantial evidence that Type 4 strains are recombinant; rather they seem to be simply divergent examples of Type 1. On the assumption that the major genotypes of JCV diverged with human populations, the rate of synonymous nucleotide substitution was estimated to be around 4x10(-7) per site per year, about 10 times higher than a previous estimate for primate polyomaviruses.
Collapse
|
86
|
Robertson DL, Anderson JP, Bradac JA, Carr JK, Foley B, Funkhouser RK, Gao F, Hahn BH, Kalish ML, Kuiken C, Learn GH, Leitner T, McCutchan F, Osmanov S, Peeters M, Pieniazek D, Salminen M, Sharp PM, Wolinsky S, Korber B. HIV-1 nomenclature proposal. Science 2000; 288:55-6. [PMID: 10766634 DOI: 10.1126/science.288.5463.55d] [Citation(s) in RCA: 614] [Impact Index Per Article: 25.6] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
|
87
|
Lafay B, Atherton JC, Sharp PM. Absence of translationally selected synonymous codon usage bias in Helicobacter pylori. MICROBIOLOGY (READING, ENGLAND) 2000; 146 ( Pt 4):851-860. [PMID: 10784043 DOI: 10.1099/00221287-146-4-851] [Citation(s) in RCA: 80] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Synonymous codon usage in the complete genome of Helicobacter pylori was investigated. The moderate A+T-richness of the genome (G+C=39 mol%) is reflected in the overall synonymous codon usage but the frequencies of some codons cannot be explained by simple mutational biases. A low level of heterogeneity among genes was observed, but this does not appear to be due to varying mutational bias or translational selection. Some of the heterogeneity was due to amino acid composition variation among the encoded proteins, and some may be attributable to recent acquisition of genes from other species. Since Hel. pylori codon usage is not dominated by biased mutation patterns, the absence of evidence for translationally mediated selection among synonymous codons is striking. This has implications with regard to the life history of this species, and in particular suggests that Hel. pylori strains are not subject to periods of competitive exponential growth. Despite the lack of selected codon usage, base composition immediately after the translation initiation site is skewed, consistent with selection against secondary structure formation in this region.
Collapse
|
88
|
Beer BE, Bailes E, Dapolito G, Campbell BJ, Goeken RM, Axthelm MK, Markham PD, Bernard J, Zagury D, Franchini G, Sharp PM, Hirsch VM. Patterns of genomic sequence diversity among their simian immunodeficiency viruses suggest that L'Hoest monkeys (Cercopithecus lhoesti) are a natural lentivirus reservoir. J Virol 2000; 74:3892-8. [PMID: 10729165 PMCID: PMC111899 DOI: 10.1128/jvi.74.8.3892-3898.2000] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Recently, we described a novel simian immunodeficiency virus (SIVlhoest) from a wild-caught L'Hoest monkey (Cercopithecus lhoesti) from a North American zoo. To investigate whether L'Hoest monkeys are the natural host for these viruses, we have screened blood samples from 14 wild animals from the Democratic Republic of Congo. Eight (57%) were found to be seropositive for SIV. Nearly full-length genome sequences were obtained for SIV isolates from three of these monkeys and compared to the original isolate and to other SIVs. The four samples of SIVlhoest formed a distinct cluster in phylogenetic trees. Two of these isolates differed on average at only about 5% of nucleotides, suggesting that they were epidemiologically linked; otherwise, the SIVlhoest isolates differed on average by 18%. Both the level of diversity and the pattern of its variation along the genome were very similar to those seen among isolates of SIVagm from vervet monkeys, pointing to similarities in the nature of, and constraints on, SIV evolution in these two species. Discordant phylogenetic relationships among the SIVlhoest isolates for different genomic regions indicated that mosaic viruses have been generated by recombination, implying that individual monkeys have been coinfected by more than one strain of SIV. Taken together, these observations provide strong evidence that L'Hoest monkeys constitute a natural reservoir for SIV.
Collapse
|
89
|
Averof M, Rokas A, Wolfe KH, Sharp PM. Evidence for a high frequency of simultaneous double-nucleotide substitutions. Science 2000; 287:1283-6. [PMID: 10678838 DOI: 10.1126/science.287.5456.1283] [Citation(s) in RCA: 94] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
Abstract
Point mutations are generally assumed to involve changes of single nucleotides. Nevertheless, the nature and known mechanisms of mutation do not exclude the possibility that several adjacent nucleotides may change simultaneously in a single mutational event. Two independent approaches are used here to estimate the frequency of simultaneous double-nucleotide substitutions. The first examines switches between TCN and AGY (where N is any nucleotide and Y is a pyrimidine) codons encoding absolutely conserved serine residues in a number of proteins from diverse organisms. The second reveals double-nucleotide substitutions in primate noncoding sequences. These two complementary approaches provide similar high estimates for the rate of doublet substitutions, on the order of 0.1 per site per billion years.
Collapse
|
90
|
Sharp PM, Bailes E, Gao F, Beer BE, Hirsch VM, Hahn BH. Origins and evolution of AIDS viruses: estimating the time-scale. Biochem Soc Trans 2000; 28:275-82. [PMID: 10816142 DOI: 10.1042/bst0280275] [Citation(s) in RCA: 123] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
The primate lentiviruses comprise SIV strains from various host species, as well as two viruses, HIV-1 and HIV-2, that cause AIDS in humans. The origins of HIV-1 and HIV-2 have been traced to cross-species transmissions from chimpanzees and sooty mangabey monkeys respectively. Two approaches have been taken to estimate the time-scale of the evolution of these viruses. Certain groups of SIV strains appear to have evolved in a host-dependent manner, implying a time-scale of many thousands or even millions of years. In stark contrast, molecular clock calculations have previously been used to estimate a time-scale of only tens or hundreds of years. Those calculations largely ignored heterogeneity of evolutionary rates across different sites within sequences. In fact, the distribution of rates at different sites seems extremely skewed in HIV-1, and so the time-depth of the primate lentivirus evolutionary tree may have been underestimated by at least a factor of ten. However, these date estimates still seem to be far too recent to be consistent with host-dependent evolution.
Collapse
|
91
|
Abstract
Evidence of simian immunodeficiency virus (SIV) infection has been reported for 26 different species of African nonhuman primates. Two of these viruses, SIVcpz from chimpanzees and SIVsm from sooty mangabeys, are the cause of acquired immunodeficiency syndrome (AIDS) in humans. Together, they have been transmitted to humans on at least seven occasions. The implications of human infection by a diverse set of SIVs and of exposure to a plethora of additional human immunodeficiency virus-related viruses are discussed.
Collapse
|
92
|
Abstract
The pattern of codon usage in the amitochondriate diplomonad Giardia lamblia has been investigated. Very extensive heterogeneity was evident among a sample of 65 genes. A discrete group of genes featured unusual codon usage due to the amino acid composition of their products: these variant surface proteins (VSPs) are unusually rich in Cys and, to a lesser extent, Gly and Thr. Among the remaining 50 genes, correspondence analysis revealed a single major source of variation in synonymous codon usage. This trend was related to the extent of use of a particular subset of 21 codons which are inferred to be those which are optimal for translation; at one end of this trend were genes expected to be expressed at low levels with near random codon usage, while at the other extreme were genes expressed at high levels in which these optimal codons are used almost exclusively. These optimal codons all end in C or G so G + C content at silent sites varies enormously among genes, from values around 40%, expected to reflect the background level of the genome, up to nearly 100%. Although VSP genes are occasionally extremely highly expressed, they do not, in general, have high frequencies of optimal codons, presumably because their high expression is only intermittent. These results indicate that natural selection has been very effective in shaping codon usage in G. lamblia. These analyses focused on sequences from strains placed within G. lamblia "assemblage A"; a few sequences from other strains revealed extensive divergence at silent sites, including some divergence in the pattern of codon usage.
Collapse
|
93
|
Atherton JC, Sharp PM, Cover TL, Gonzalez-Valencia G, Peek RM, Thompson SA, Hawkey CJ, Blaser MJ. Vacuolating cytotoxin (vacA) alleles of Helicobacter pylori comprise two geographically widespread types, m1 and m2, and have evolved through limited recombination. Curr Microbiol 1999; 39:211-8. [PMID: 10486057 PMCID: PMC2762107 DOI: 10.1007/s002849900447] [Citation(s) in RCA: 46] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Abstract
Vacuolating cytotoxin (vacA) alleles of Helicobacter pylori vary, particularly in their mid region (which may be type m1 or m2) and their signal peptide coding region (type s1 or s2). We investigated nucleotide diversity among vacA alleles in strains from several locales in Asia, South America, and the USA. Phylogenetic analysis of vacA mid region sequences from 18 strains validated the division into two main groups (m1 and m2) and showed further significant divisions within these groups. Informative site analysis demonstrated one example of recombination between m1 and m2 alleles, and several examples of recombination among alleles within these groups. Recombination was not sufficiently extensive to destroy phylogenetic structure entirely. Synonymous nucleotide substitution rates were markedly different between regions of vacA, suggesting different evolutionary divergence times and implying horizontal transfer of genetic elements within vacA. Non-synonymous/synonymous rate ratios were greater between m1 and m2 sequences than among m1 sequences, consistent with m1 and m2 alleles encoding functions fitting strains for slightly different ecological niches.
Collapse
|
94
|
Beer BE, Bailes E, Goeken R, Dapolito G, Coulibaly C, Norley SG, Kurth R, Gautier JP, Gautier-Hion A, Vallet D, Sharp PM, Hirsch VM. Simian immunodeficiency virus (SIV) from sun-tailed monkeys (Cercopithecus solatus): evidence for host-dependent evolution of SIV within the C. lhoesti superspecies. J Virol 1999; 73:7734-44. [PMID: 10438863 PMCID: PMC104300 DOI: 10.1128/jvi.73.9.7734-7744.1999] [Citation(s) in RCA: 79] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/1999] [Accepted: 05/27/1999] [Indexed: 11/20/2022] Open
Abstract
Recently we reported the characterization of simian immunodeficiency virus (SIVlhoest) from a central African l'hoest monkey (Cercopithecus lhoesti lhoesti) that revealed a distant relationship to SIV isolated from a mandrill (SIVmnd). The present report describes a novel SIV (SIVsun) isolated from a healthy, wild-caught sun-tailed monkey (Cercopithecus lhoesti solatus), another member of the l'hoest superspecies. SIVsun replicated in a variety of human T-cell lines and in peripheral blood mononuclear cells of macaques (Macaca spp.) and patas monkeys (Erythrocebus patas). A full-length infectious clone of SIVsun was derived, and genetic analysis revealed that SIVsun was most closely related to SIVlhoest, with an amino acid identity of 71% in Gag, 73% in Pol, and 67% in Env. This degree of similarity is reminiscent of that observed between SIVagm isolates from vervet, grivet, and tantalus species of African green monkeys. The close relationship between SIVsun and SIVlhoest, despite their geographically distinct habitats, is consistent with evolution from a common ancestor, providing further evidence for the ancient nature of the primate lentivirus family. In addition, this observation leads us to suggest that the SIVmnd lineage should be designated the SIVlhoest lineage.
Collapse
|
95
|
Abstract
BACKGROUND Nucleotide substitution rates and G + C content vary considerably among mammalian genes. It has been proposed that the mammalian genome comprises a mosaic of regions - termed isochores - with differing G + C content. The regional variation in gene G + C content might therefore be a reflection of the isochore structure of chromosomes, but the factors influencing the variation of nucleotide substitution rate are still open to question. RESULTS To examine whether nucleotide substitution rates and gene G + C content are influenced by the chromosomal location of genes, we compared human and murid (mouse or rat) orthologues known to belong to one of the chromosomal (autosomal) segments conserved between these species. Multiple members of gene families were excluded from the dataset. Sets of neighbouring genes were defined as those lying within 1 centiMorgan (cM) of each other on the mouse genetic map. For both synonymous substitution rates and G + C content at silent sites, neighbouring genes were found to be significantly more similar to each other than sets of genes randomly drawn from the dataset. Moreover, we demonstrated that the regional similarities in G + C content (isochores) and synonymous substitution rate were independent of each other. CONCLUSIONS Our results provide the first substantial statistical evidence for the existence of a regional variation in the synonymous substitution rate within the mammalian genome, indicating that different chromosomal regions evolve at different rates. This regional phenomenon which shapes gene evolution could reflect the existence of 'evolutionary rate units' along the chromosome.
Collapse
|
96
|
Ball JK, Curran R, Berridge S, Grabowska AM, Jameson CL, Thomson BJ, Irving WL, Sharp PM. TT virus sequence heterogeneity in vivo: evidence for co-infection with multiple genetic types. J Gen Virol 1999; 80 ( Pt 7):1759-1768. [PMID: 10423145 DOI: 10.1099/0022-1317-80-7-1759] [Citation(s) in RCA: 64] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
TT virus (TTV) is a newly described DNA virus of humans that exhibits an unusually high degree of genetic heterogeneity. We have performed extensive analysis of the TTV populations present in samples, taken over a period of 2 to 6 years, from three individuals with persistent TTV infection. TTV DNA titres estimated for sequential samples were found to be quite stable over the entire study period in two patients, but fluctuated considerably in the third. DNA sequence analysis revealed different genetic diversity among TTV populations from samples from the three patients. In one case, absolute sequence homogeneity was observed among samples over a 3 year period. In a second, a limited amount of heterogeneity was found, including one sequence exhibiting G-->A hypermutation. TTV DNA sequences from the third patient exhibited quite remarkable genetic heterogeneity: evidence was found of seven distinct infecting viruses, representing four of the six TTV genotypes that have been described. In addition, minor variants of three of these seven sequences were observed. The heterogeneity of the viral population in this individual declined steadily over a 6 year period. This patient infected with a genetically diverse TTV population had the highest viral DNA titre.
Collapse
|
97
|
Sharp PM, Bailes E, Robertson DL, Gao F, Hahn BH. Origins and evolution of AIDS viruses. THE BIOLOGICAL BULLETIN 1999; 196:338-342. [PMID: 10390833 DOI: 10.2307/1542965] [Citation(s) in RCA: 59] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
|
98
|
Bradnam KR, Seoighe C, Sharp PM, Wolfe KH. G+C content variation along and among Saccharomyces cerevisiae chromosomes. Mol Biol Evol 1999; 16:666-75. [PMID: 10335660 DOI: 10.1093/oxfordjournals.molbev.a026149] [Citation(s) in RCA: 42] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Past analyses of the genome of the yeast Saccharomyces cerevisiae have revealed substantial regional variation in G+C content. Important questions remain, though, as to the origin, nature, significance, and generality of this variation. We conducted an extensive analysis of the yeast genome to try to answer these questions. Our results indicate that open reading frames (ORFs) with similar G+C contents at silent codon positions are significantly clustered on chromosomes. This clustering can be explained by very short range correlations of silent-site G+C contents at neighboring ORFs. ORFs of high silent-site G+C content are disproportionately concentrated on shorter chromosomes, which causes a negative relationship between chromosome length and G+C content. Contrary to previous reports, there is no correlation between gene density and silent-site G+C content in yeast. Chromosome III is atypical in many regards, and possible reasons for this are discussed.
Collapse
|
99
|
Lafay B, Lloyd AT, McLean MJ, Devine KM, Sharp PM, Wolfe KH. Proteome composition and codon usage in spirochaetes: species-specific and DNA strand-specific mutational biases. Nucleic Acids Res 1999; 27:1642-9. [PMID: 10075995 PMCID: PMC148367 DOI: 10.1093/nar/27.7.1642] [Citation(s) in RCA: 114] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The genomes of the spirochaetes Borrelia burgdorferi and Treponema pallidum show strong strand-specific skews in nucleotide composition, with the leading strand in replication being richer in G and T than the lagging strand in both species. This mutation bias results in codon usage and amino acid composition patterns that are significantly different between genes encoded on the two strands, in both species. There are also substantial differences between the species, with T.pallidum having a much higher G+C content than B. burgdorferi. These changes in amino acid and codon compositions represent neutral sequence change that has been caused by strong strand- and species-specific mutation pressures. Genes that have been relocated between the leading and lagging strands since B. burgdorferi and T.pallidum diverged from a common ancestor now show codon and amino acid compositions typical of their current locations. There is no evidence that translational selection operates on codon usage in highly expressed genes in these species, and the primary influence on codon usage is whether a gene is transcribed in the same direction as replication, or opposite to it. The dnaA gene in both species has codon usage patterns distinctive of a lagging strand gene, indicating that the origin of replication lies downstream of this gene, possibly within dnaN. Our findings strongly suggest that gene-finding algorithms that ignore variability within the genome may be flawed.
Collapse
|
100
|
Gao F, Bailes E, Robertson DL, Chen Y, Rodenburg CM, Michael SF, Cummins LB, Arthur LO, Peeters M, Shaw GM, Sharp PM, Hahn BH. Origin of HIV-1 in the chimpanzee Pan troglodytes troglodytes. Nature 1999; 397:436-41. [PMID: 9989410 DOI: 10.1038/17130] [Citation(s) in RCA: 914] [Impact Index Per Article: 36.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The human AIDS viruses human immunodeficiency virus type 1 (HIV-1) and type 2 (HIV-2) represent cross-species (zoonotic) infections. Although the primate reservoir of HIV-2 has been clearly identified as the sooty mangabey (Cercocebus atys), the origin of HIV-1 remains uncertain. Viruses related to HIV-1 have been isolated from the common chimpanzee (Pan troglodytes), but only three such SIVcpz infections have been documented, one of which involved a virus so divergent that it might represent a different primate lentiviral lineage. In a search for the HIV-1 reservoir, we have now sequenced the genome of a new SIVcpzstrain (SIVcpzUS) and have determined, by mitochondrial DNA analysis, the subspecies identity of all known SIVcpz-infected chimpanzees. We find that two chimpanzee subspecies in Africa, the central P. t. troglodytes and the eastern P. t. schweinfurthii, harbour SIVcpz and that their respective viruses form two highly divergent (but subspecies-specific) phylogenetic lineages. All HIV-1 strains known to infect man, including HIV-1 groups M, N and O, are closely related to just one of these SIVcpz lineages, that found in P. t. troglodytes. Moreover, we find that HIV-1 group N is a mosaic of SIVcpzUS- and HIV-1-related sequences, indicating an ancestral recombination event in a chimpanzee host. These results, together with the observation that the natural range of P. t. troglodytes coincides uniquely with areas of HIV-1 group M, N and O endemicity, indicate that P. t. troglodytes is the primary reservoir for HIV-1 and has been the source of at least three independent introductions of SIVcpz into the human population.
Collapse
|