1
|
Kapli P, Kotari I, Telford MJ, Goldman N, Yang Z. DNA Sequences Are as Useful as Protein Sequences for Inferring Deep Phylogenies. Syst Biol 2023; 72:1119-1135. [PMID: 37366056 PMCID: PMC10627555 DOI: 10.1093/sysbio/syad036] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2022] [Indexed: 06/28/2023] Open
Abstract
Inference of deep phylogenies has almost exclusively used protein rather than DNA sequences based on the perception that protein sequences are less prone to homoplasy and saturation or to issues of compositional heterogeneity than DNA sequences. Here, we analyze a model of codon evolution under an idealized genetic code and demonstrate that those perceptions may be misconceptions. We conduct a simulation study to assess the utility of protein versus DNA sequences for inferring deep phylogenies, with protein-coding data generated under models of heterogeneous substitution processes across sites in the sequence and among lineages on the tree, and then analyzed using nucleotide, amino acid, and codon models. Analysis of DNA sequences under nucleotide-substitution models (possibly with the third codon positions excluded) recovered the correct tree at least as often as analysis of the corresponding protein sequences under modern amino acid models. We also applied the different data-analysis strategies to an empirical dataset to infer the metazoan phylogeny. Our results from both simulated and real data suggest that DNA sequences may be as useful as proteins for inferring deep phylogenies and should not be excluded from such analyses. Analysis of DNA data under nucleotide models has a major computational advantage over protein-data analysis, potentially making it feasible to use advanced models that account for among-site and among-lineage heterogeneity in the nucleotide-substitution process in inference of deep phylogenies.
Collapse
Affiliation(s)
- Paschalia Kapli
- Department of Genetics, University College London, Gower Street, London WC1E 6BT, UK
| | - Ioanna Kotari
- Department of Genetics, University College London, Gower Street, London WC1E 6BT, UK
- Institut für Populationsgenetik, Vetmeduni Vienna, Vienna, 1210, Austria
| | - Maximilian J Telford
- Department of Genetics, University College London, Gower Street, London WC1E 6BT, UK
| | - Nick Goldman
- European Molecular Biology Laboratory - European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Ziheng Yang
- Department of Genetics, University College London, Gower Street, London WC1E 6BT, UK
| |
Collapse
|
2
|
Panova VV, Dolinnaya NG, Novoselov KA, Savitskaya VY, Chernykh IS, Kubareva EA, Alexeevski AV, Zvereva MI. Conserved G-Quadruplex-Forming Sequences in Mammalian TERT Promoters and Their Effect on Mutation Frequency. Life (Basel) 2023; 13:1478. [PMID: 37511853 PMCID: PMC10381784 DOI: 10.3390/life13071478] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2023] [Revised: 06/24/2023] [Accepted: 06/26/2023] [Indexed: 07/30/2023] Open
Abstract
Somatic mutations in the promoter region of the human telomerase reverse transcriptase (hTERT) gene have been identified in many types of cancer. The hTERT promoter is known to be enriched with sequences that enable the formation of G-quadruplex (G4) structures, whose presence is associated with elevated mutagenicity and genome instability. Here, we used a bioinformatics tool (QGRS mapper) to search for G4-forming sequences (G4 motifs) in the 1000 bp TERT promoter regions of 141 mammalian species belonging to 20 orders, 5 of which, including primates and predators, contain more than 10 species. Groups of conserved G4 motifs and single-nucleotide variants within these groups were discovered using a block alignment approach (based on the Nucleotide PanGenome explorer). It has been shown that: (i) G4 motifs are predominantly located in the region proximal to the transcription start site (up to 400 bp) and are over-represented on the non-coding strand of the TERT promoters, (ii) 11 to 22% of the G4 motifs found are evolutionarily conserved across the related organisms, and (iii) a statistically significant higher frequency of nucleotide substitutions in the conserved G4 motifs compared to the surrounding regions was confirmed only for the order Primates. These data support the assumption that G4s can interfere with the DNA repair process and affect the evolutionary adaptation of organisms and species.
Collapse
Affiliation(s)
- Vera V Panova
- Department of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Leninskie Gory 1, Moscow 119234, Russia
| | - Nina G Dolinnaya
- Department of Chemistry, Lomonosov Moscow State University, Leninskie Gory 1, Moscow 119991, Russia
| | - Kirill A Novoselov
- Department of Chemistry, Lomonosov Moscow State University, Leninskie Gory 1, Moscow 119991, Russia
| | - Viktoriia Yu Savitskaya
- Department of Chemistry, Lomonosov Moscow State University, Leninskie Gory 1, Moscow 119991, Russia
| | - Ivan S Chernykh
- Department of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Leninskie Gory 1, Moscow 119234, Russia
| | - Elena A Kubareva
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Leninskie Gory 1, Moscow 119992, Russia
| | - Andrei V Alexeevski
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Leninskie Gory 1, Moscow 119992, Russia
- Department of Mathematics, Scientific Research Institute for System Studies, Russian Academy of Sciences, Nakhimovskii Prospekt 36-1, Moscow 117218, Russia
| | - Maria I Zvereva
- Department of Chemistry, Lomonosov Moscow State University, Leninskie Gory 1, Moscow 119991, Russia
| |
Collapse
|
3
|
Sen S, Rathi S, Sahu J, Mandal SC, Ray S, Slama P, Roychoudhury S. In Silico Mining and Characterization of High-Quality SNP/Indels in Some Agro-Economically Important Species Belonging to the Family Euphorbiaceae. Genes (Basel) 2023; 14. [PMID: 36833259 DOI: 10.3390/genes14020332] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2022] [Revised: 01/07/2023] [Accepted: 01/20/2023] [Indexed: 01/31/2023] Open
Abstract
(1) Background: To assess the genetic makeup among the agro-economically important members of Euphorbiaceae, the present study was conducted to identify and characterize high-quality single-nucleotide polymorphism (SNP) markers and their comparative distribution in exonic and intronic regions from the publicly available expressed sequence tags (ESTs). (2) Methods: Quality sequences obtained after pre-processing by an EG assembler were assembled into contigs using the CAP3 program at 95% identity; the mining of SNP was performed by QualitySNP; GENSCAN (standalone) was used for detecting the distribution of SNPs in the exonic and intronic regions. (3) Results: A total of 25,432 potential SNPs (pSNP) and 14,351 high-quality SNPs (qSNP), including 2276 indels, were detected from 260,479 EST sequences. The ratio of quality SNP to potential SNP ranged from 0.22 to 0.75. A higher frequency of transitions and transversions was observed more in the exonic than the intronic region, while indels were present more in the intronic region. C↔T (transition) was the most dominant nucleotide substitution, while in transversion, A↔T was the dominant nucleotide substitution, and in indel, A/- was dominant. (4) Conclusions: Detected SNP markers may be useful for linkage mapping; marker-assisted breeding; studying genetic diversity; mapping important phenotypic traits, such as adaptation or oil production; or disease resistance by targeting and screening mutations in important genes.
Collapse
|
4
|
Morton BR. Do Noncoding and Coding Sites in Angiosperm Chloroplast DNA Have Different Mutation Processes? Genes (Basel) 2023; 14. [PMID: 36672890 DOI: 10.3390/genes14010148] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2022] [Revised: 12/30/2022] [Accepted: 01/03/2023] [Indexed: 01/09/2023] Open
Abstract
Fourfold degenerate sites within coding regions and intergenic sites have both been used as estimates of neutral evolution. In chloroplast DNA, the pattern of substitution at intergenic sites is strongly dependent on the composition of the surrounding hexanucleotide composed of the three base pairs on each side, which suggests that the mutation process is highly context-dependent in this genome. This study examines the context-dependency of substitutions at fourfold degenerate sites in protein-coding regions and compares the pattern to what has been observed at intergenic sites. Overall, there is strong similarity between the two types of sites, but there are some intriguing differences. One of these is that substitutions of G and C are significantly higher at fourfold degenerate sites across a range of contexts. In fact, A → T and T → A substitutions are the only substitution types that occur at a lower rate at fourfold degenerate sites. The data are not consistent with selective constraints being responsible for the difference in substitution patterns between intergenic and fourfold degenerate sites. Rather, it is suggested that the difference may be a result of different epigenetic modifications that result in slightly different mutation patterns in coding and intergenic DNA.
Collapse
|
5
|
Zhao F, Zhou H, Li S, An Q, Fang Q, Luo Y, Hickford JGH. Variation in the Ovine Glycogen Synthase Kinase 3 Beta-Interaction Protein Gene ( GSKIP) Affects Carcass and Growth Traits in Romney Sheep. Animals (Basel) 2021; 11:2690. [PMID: 34573656 DOI: 10.3390/ani11092690] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2021] [Revised: 09/09/2021] [Accepted: 09/10/2021] [Indexed: 11/23/2022] Open
Abstract
Simple Summary The glycogen synthase kinase 3 beta (GSK3β)-interacting protein plays a role in regulating glycogen metabolism, protein synthesis, the cell cycle, and in regulating gene expression. To date, physiological function research into the GSK3β-interacting protein has been focused on cell lines, gene ‘knockout’ models, and over-expression studies, and to our knowledge, there have been no reports on how variation in the GSK3β-interacting protein gene (GSKIP) may affect phenotypic traits. In this study, PCR-SSCP methods were used to screen for variation in exon 1 and exon 2 of GSKIP in 840 New Zealand (NZ) Romney sheep. Two variant sequences were identified in exon 1 and this variation in GSKIP was associated with variation in lamb birth weight, hot carcass weight, and fat depth at the 12th rib. Abstract The glycogen synthase kinase 3 beta (GSK3β)-interacting protein (encoded by the gene GSKIP) is a small A-kinase anchoring protein, which complexes with GSK3βand protein kinase A (PKA) and acts synergistically with cAMP/PKA signaling to inhibit GSK3β activity. The protein plays a role in regulating glycogen metabolism, protein synthesis, the cell cycle, and in regulating gene expression. In this study, PCR-single strand conformation polymorphism (PCR-SSCP) analyses were used to screen for variation in exon 1 and exon 2 of GSKIP in 840 New Zealand (NZ) Romney sheep. Two SSCP banding patterns representing two different nucleotide variants (A and B) were detected in an exon 1 region, whereas in an exon 2 region only one pattern was detected. Variants A and B of exon 1 had one non-synonymous nucleotide difference c.37A/G (p.Met13Val). The birthweight of sheep of genotype AA (5.9 ± 0.06 kg) was different (p = 0.023) to sheep of genotype AB (5.7 ± 0.06 kg) and BB (5.7 ± 0.06 kg). The hot carcass weight (HCW) of sheep of genotype AA (17.2 ± 0.22 kg) was different (p = 0.012) to sheep of genotype AB (17.6 ± 0.22 kg) and BB (18.0 ± 0.29 kg), and the fat depth at the 12th rib (V-GR) of sheep of genotype AA (7.7 ± 0.31 mm) was different (p = 0.016) to sheep of genotype AB (8.3 ± 0.30 mm) and BB (8.5 ± 0.39 mm). The results suggest that the c.37A/G substitution in ovine GSKIP may affect sheep growth and carcass traits.
Collapse
|
6
|
Mohanta TK, Mishra AK, Khan A, Hashem A, Abd_Allah EF, Al-Harrasi A. Gene Loss and Evolution of the Plastome. Genes (Basel) 2020; 11:E1133. [PMID: 32992972 PMCID: PMC7650654 DOI: 10.3390/genes11101133] [Citation(s) in RCA: 38] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2020] [Revised: 09/07/2020] [Accepted: 09/14/2020] [Indexed: 12/13/2022] Open
Abstract
Chloroplasts are unique organelles within the plant cells and are responsible for sustaining life forms on the earth due to their ability to conduct photosynthesis. Multiple functional genes within the chloroplast are responsible for a variety of metabolic processes that occur in the chloroplast. Considering its fundamental role in sustaining life on the earth, it is important to identify the level of diversity present in the chloroplast genome, what genes and genomic content have been lost, what genes have been transferred to the nuclear genome, duplication events, and the overall origin and evolution of the chloroplast genome. Our analysis of 2511 chloroplast genomes indicated that the genome size and number of coding DNA sequences (CDS) in the chloroplasts genome of algae are higher relative to other lineages. Approximately 10.31% of the examined species have lost the inverted repeats (IR) in the chloroplast genome that span across all the lineages. Genome-wide analyses revealed the loss of the Rbcl gene in parasitic and heterotrophic plants occurred approximately 56 Ma ago. PsaM, Psb30, ChlB, ChlL, ChlN, and Rpl21 were found to be characteristic signature genes of the chloroplast genome of algae, bryophytes, pteridophytes, and gymnosperms; however, none of these genes were found in the angiosperm or magnoliid lineage which appeared to have lost them approximately 203-156 Ma ago. A variety of chloroplast-encoded genes were lost across different species lineages throughout the evolutionary process. The Rpl20 gene, however, was found to be the most stable and intact gene in the chloroplast genome and was not lost in any of the analyzed species, suggesting that it is a signature gene of the plastome. Our evolutionary analysis indicated that chloroplast genomes evolved from multiple common ancestors ~1293 Ma ago and have undergone vivid recombination events across different taxonomic lineages.
Collapse
Affiliation(s)
- Tapan Kumar Mohanta
- Biotech and Omics Laboratory, Natural and Medical Sciences Research Centre, University of Nizwa, Nizwa 616, Oman;
| | | | - Adil Khan
- Biotech and Omics Laboratory, Natural and Medical Sciences Research Centre, University of Nizwa, Nizwa 616, Oman;
| | - Abeer Hashem
- Botany and Microbiology Department, College of Science, King Saud University, Riyadh 11451, Saudi Arabia;
- Mycology and Plant Disease Survey Department, Plant Pathology Research Institute, Giza 12511, Egypt
| | - Elsayed Fathi Abd_Allah
- Plant Production Department, College of Food and Agricultural Sciences, King Saud University, P.O. Box. 2460, Riyadh 11451, Saudi Arabia;
| | - Ahmed Al-Harrasi
- Natural Product Laboratory, Natural and Medical Sciences Research Centre, University of Nizwa, Nizwa 616, Oman
| |
Collapse
|
7
|
Kabir MH, Takenouchi A, Haqani MI, Nakamura Y, Takeuchi S, Tsudzuki M. Discovery of a new nucleotide substitution in the MC1R gene and haplotype distribution in native and non-Japanese chicken breeds. Anim Genet 2020; 51:235-248. [PMID: 31977074 DOI: 10.1111/age.12906] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2019] [Revised: 11/13/2019] [Accepted: 12/17/2019] [Indexed: 01/07/2023]
Abstract
Melanocortin 1-receptor (MC1R) is one of the major genes that controls chicken plumage colour. In this study, we investigated the sequence and haplotype distribution of the MC1R gene in native Japanese chickens, along with non-Japanese chicken breeds. In total, 732 and 155 chickens from 30 Japanese and eight non-Japanese breeds respectively were used. Three synonymous and 11 non-synonymous nucleotide substitutions were detected, resulting in 15 haplotypes (H0-H14). Of these, three were newly found haplotypes (H9, H13 and H14), of which one (H9) was composed of known substitutions C69T, T212C, G274A and G636A. The second one (H13) possessed newly found non-synonymous substitution C919G, apart from the known substitutions C69T, G178A, G274A, G636A and T637C. The third one (H14) comprised a newly discovered substitution C919G in addition to the known C69T, G274A and G409A substitutions. The homozygote for this new haplotype exhibited wt like plumage despite the presence of G274A. In addition to discovering a new nucleotide substitution (C919G) and three new haplotypes, we defined the plumage colour of the bird that was homozygous for the A644C substitution (H5 haplotype) as wheaten-like for the first time; although the substitution has been already reported, its effect was not revealed. Besides detecting the new plumage colour, we also confirmed that the A427G and G274A substitutions contribute in expressing brownish and black plumage colour respectively, as reported by the previous studies. Moreover, we confirmed that the buttercup allele does not express black plumage despite possessing a G274A substitution, under the suppression effect of A644C. In contrast, the birds homozygous for the birchen allele presented solid black plumage, which was contradictory to the previous reports. In conclusion, we revealed a large diversity in the MC1R gene of native Japanese chicken breeds, along with the discovery of a new non-synonymous nucleotide substitution (C919G) and three novel haplotypes (H9, H13 and H14).
Collapse
Affiliation(s)
- M H Kabir
- Graduate School of Biosphere Science, Hiroshima University, Higashi-Hiroshima, Hiroshima, 739-8528, Japan
| | - A Takenouchi
- Graduate School of Integrated Sciences for Life, Hiroshima University, Higashi-Hiroshima, Hiroshima, 739-8528, Japan.,Japanese Avian Bioresource Project Research Center, Hiroshima University, Higashi-Hiroshima, Hiroshima, 739-8528, Japan
| | - M I Haqani
- Graduate School of Biosphere Science, Hiroshima University, Higashi-Hiroshima, Hiroshima, 739-8528, Japan
| | - Y Nakamura
- Graduate School of Integrated Sciences for Life, Hiroshima University, Higashi-Hiroshima, Hiroshima, 739-8528, Japan.,Japanese Avian Bioresource Project Research Center, Hiroshima University, Higashi-Hiroshima, Hiroshima, 739-8528, Japan
| | - S Takeuchi
- Japanese Avian Bioresource Project Research Center, Hiroshima University, Higashi-Hiroshima, Hiroshima, 739-8528, Japan.,Graduate School of Natural Science and Technology, Okayama University, Okayama, Okayama, 700-8530, Japan
| | - M Tsudzuki
- Graduate School of Integrated Sciences for Life, Hiroshima University, Higashi-Hiroshima, Hiroshima, 739-8528, Japan.,Japanese Avian Bioresource Project Research Center, Hiroshima University, Higashi-Hiroshima, Hiroshima, 739-8528, Japan
| |
Collapse
|
8
|
Abstract
Chlorarachniophyte and cryptophyte algae are unique among plastid-containing species in that they have a nucleomorph genome: a compact, highly reduced nuclear genome from a photosynthetic eukaryotic endosymbiont. Despite their independent origins, the nucleomorph genomes of these two lineages have similar genomic architectures, but little is known about the evolutionary pressures impacting nucleomorph DNA, particularly how their rates of evolution compare to those of the neighboring genetic compartments (the mitochondrion, plastid, and nucleus). Here, we use synonymous substitution rates to estimate relative mutation rates in the four genomes of nucleomorph-bearing algae. We show that the relative mutation rates of the host versus endosymbiont nuclear genomes are similar in both chlorarachniophytes and cryptophytes, despite the fact that nucleomorph gene sequences are notoriously highly divergent. There is some evidence, however, for slightly elevated mutation rates in the nucleomorph DNA of chlorarachniophytes-a feature not observed in that of cryptophytes. For both lineages, relative mutation rates in the plastid appear to be lower than those in the nucleus and nucleomorph (and, in one case, the mitochondrion), which is consistent with studies of other plastid-bearing protists. Given the divergent nature of nucleomorph genes, our finding of relatively low evolutionary rates in these genomes suggests that for both lineages a burst of evolutionary change and/or decreased selection pressures likely occurred early in the integration of the secondary endosymbiont.
Collapse
Affiliation(s)
- Cameron J Grisdale
- Department of Biochemistry and Molecular Biology, Dalhousie University, Halifax, Nova Scotia, Canada.,Centre for Comparative Genomics and Evolutionary Bioinformatics, Dalhousie University, Halifax, Nova Scotia, Canada
| | - David R Smith
- Department of Biology, University of Western Ontario, London, Ontario, Canada
| | - John M Archibald
- Department of Biochemistry and Molecular Biology, Dalhousie University, Halifax, Nova Scotia, Canada.,Centre for Comparative Genomics and Evolutionary Bioinformatics, Dalhousie University, Halifax, Nova Scotia, Canada
| |
Collapse
|
9
|
Vaulin OV, Karagodin DA, Baricheva EM, Zakharov IK. Estimated resistance of the malaria mosquito Anopheles messeae s.l. to the insecticide malathion. J Vector Ecol 2019; 44:48-56. [PMID: 31124233 DOI: 10.1111/jvec.12328] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/11/2018] [Accepted: 12/17/2018] [Indexed: 06/09/2023]
Abstract
Resistance to agricultural pesticides is an important and insufficiently studied concern for pest and disease vector research. We determined the malathion resistance of species in the Anopheles maculipennis mosquito group in a habitat near Novosibirsk, Russia. Most of the 851 individuals we measured were members of the Anopheles messeae s.l. complex (An. messeae and An. daciae species). The LC50 value for malathion was 0.052 mg/L for the mixed specimens, and we failed to find any differences between species. The LC50 value was within the range of values for malathion resistance of Anopheles stephensi and Culex quinquefasciatus. As the main resistance mechanism to organophosphate and carbamate insecticides is a single mononucleotide substitution in the ace-1 gene, we searched for this mutation in An. messeae s.l. and An. beklemishevi by restriction analysis. This mutation was not found in 347 of the specimens. We sequenced the ace-1 gene fragment for 24 specimens from four species of the Anopheles maculipennis group, including An. messeae, An. daciae, An. atroparvus, and An. beklemishevi. These specimens harbored a nucleotide substitution in the triplet where a mutation can lead to insecticide resistance, but this substitution would make it difficult for the resistance to develop. Since the studied specimens belong to branches of the Palearctic portion of the Anopheles maculipennis group, we suspect that all other Palearctic species of this group would have difficulties harboring the ace-1 mutation that would lead to organophosphate and carbamate resistance.
Collapse
Affiliation(s)
- Oleg V Vaulin
- Federal Research Center Institute of Cytology and Genetics, Siberian Branch of Russian Academy of Sciences, Novosibirsk, Russia
| | - Dmitry A Karagodin
- Federal Research Center Institute of Cytology and Genetics, Siberian Branch of Russian Academy of Sciences, Novosibirsk, Russia
| | - Elina M Baricheva
- Federal Research Center Institute of Cytology and Genetics, Siberian Branch of Russian Academy of Sciences, Novosibirsk, Russia
| | - Ilya K Zakharov
- Federal Research Center Institute of Cytology and Genetics, Siberian Branch of Russian Academy of Sciences, Novosibirsk, Russia
| |
Collapse
|
10
|
Matsumoto T, Akashi H. Distinguishing Among Evolutionary Forces Acting on Genome-Wide Base Composition: Computer Simulation Analysis of Approximate Methods for Inferring Site Frequency Spectra of Derived Mutations. G3 (Bethesda) 2018; 8:1755-69. [PMID: 29588382 DOI: 10.1534/g3.117.300512] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
Inferred ancestral nucleotide states are increasingly employed in analyses of within- and between -species genome variation. Although numerous studies have focused on ancestral inference among distantly related lineages, approaches to infer ancestral states in polymorphism data have received less attention. Recently developed approaches that employ complex transition matrices allow us to infer ancestral nucleotide sequence in various evolutionary scenarios of base composition. However, the requirement of a single gene tree to calculate a likelihood is an important limitation for conducting ancestral inference using within-species variation in recombining genomes. To resolve this problem, and to extend the applicability of ancestral inference in studies of base composition evolution, we first evaluate three previously proposed methods to infer ancestral nucleotide sequences among within- and between-species sequence variation data. The methods employ a single allele, bifurcating tree, or a star tree for within-species variation data. Using simulated nucleotide sequences, we employ ancestral inference to infer fixations and polymorphisms. We find that all three methods show biased inference. We modify the bifurcating tree method to include weights to adjust for an expected site frequency spectrum, “bifurcating tree with weighting” (BTW). Our simulation analysis show that the BTW method can substantially improve the reliability and robustness of ancestral inference in a range of scenarios that include non-neutral and/or non-stationary base composition evolution.
Collapse
|
11
|
Kuramitsu M, Sekizuka T, Yamochi T, Firouzi S, Sato T, Umeki K, Sasaki D, Hasegawa H, Kubota R, Sobata R, Matsumoto C, Kaneko N, Momose H, Araki K, Saito M, Nosaka K, Utsunomiya A, Koh KR, Ogata M, Uchimaru K, Iwanaga M, Sagara Y, Yamano Y, Okayama A, Miura K, Satake M, Saito S, Itabashi K, Yamaguchi K, Kuroda M, Watanabe T, Okuma K, Hamaguchi I. Proviral Features of Human T Cell Leukemia Virus Type 1 in Carriers with Indeterminate Western Blot Analysis Results. J Clin Microbiol 2017; 55:2838-49. [PMID: 28701419 DOI: 10.1128/JCM.00659-17] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2017] [Accepted: 07/07/2017] [Indexed: 12/17/2022] Open
Abstract
Western blotting (WB) for human T cell leukemia virus type 1 (HTLV-1) is performed to confirm anti-HTLV-1 antibodies detected at the initial screening of blood donors and in pregnant women. However, the frequent occurrence of indeterminate results is a problem with this test. We therefore assessed the cause of indeterminate WB results by analyzing HTLV-1 provirus genomic sequences. A quantitative PCR assay measuring HTLV-1 provirus in WB-indeterminate samples revealed that the median proviral load was approximately 100-fold lower than that of WB-positive samples (0.01 versus 0.71 copy/100 cells). Phylogenic analysis of the complete HTLV-1 genomes of WB-indeterminate samples did not identify any specific phylogenetic groups. When we analyzed the nucleotide changes in 19 HTLV-1 isolates from WB-indeterminate samples, we identified 135 single nucleotide substitutions, composed of four types, G to A (29%), C to T (19%), T to C (19%), and A to G (16%). In the most frequent G-to-A substitution, 64% occurred at GG dinucleotides, indicating that APOBEC3G is responsible for mutagenesis in WB-indeterminate samples. Moreover, interestingly, five WB-indeterminate isolates had nonsense mutations in Pol and/or Tax, Env, p12, and p30. These findings suggest that WB-indeterminate carriers have low production of viral antigens because of a combination of a low proviral load and mutations in the provirus, which may interfere with host recognition of HTLV-1 antigens.
Collapse
|
12
|
Koh SH, Li H, Sivasithamparam K, Admiraal R, Jones MGK, Wylie SJ. Evolution of a wild-plant tobamovirus passaged through an exotic host: Fixation of mutations and increased replication. Virus Evol 2017; 3:vex001. [PMID: 28458912 PMCID: PMC5399921 DOI: 10.1093/ve/vex001] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023] Open
Abstract
Tobamovirus is a group of viruses that have become serious pathogens of crop plants. As part of a study informing risk of wild plant virus spill over to crops, we investigated the capacity of a solanaceous-infecting tobamovirus from an isolated indigenous flora to adapt to new exotic hosts. Yellow tailflower mild mottle virus (YTMMV) (genus Tobamovirus, family Virgaviridae) was isolated from a wild plant of yellow tailflower (Anthocercis littoria, family Solanaceae) and initially passaged through a plant of Nicotiana benthamiana, then one of Nicotiana glutinosa where a single local lesion was used to inoculate a N. benthamiana plant. Sap from this plant was used as starting material for nine serial passages through three plant species. The virus titre was recorded periodically, and 85% of the virus genome was sequenced at each passage for each host. Six polymorphic sites were found in the YTMMV genome across all hosts and passages. At five of these, the alternate alleles became fixed in the viral genome until the end of the experiment. Of these five alleles, one was a non-synonymous mutation (U1499C) that occurred only when the virus replicated in tomato. The mutant isolate harbouring U1499C, designated YTMMV-δ, increased its titre over passages in tomato and outcompeted the wild-type isolate when both were co-inoculated to tomato. That YTMMV-δ had greater reproductive fitness in an exotic host than did the wild type isolate suggests YTMMV evolution is influenced by host changes.
Collapse
Affiliation(s)
- Shu Hui Koh
- Plant Biotechnology Group - Plant Virology, Western Australian State Agricultural Biotechnology Centre, School of Veterinary and Life Sciences, Murdoch University, Perth, WA 6150, Australia
| | - Hua Li
- Plant Biotechnology Group - Plant Virology, Western Australian State Agricultural Biotechnology Centre, School of Veterinary and Life Sciences, Murdoch University, Perth, WA 6150, Australia
| | - Krishnapillai Sivasithamparam
- Plant Biotechnology Group - Plant Virology, Western Australian State Agricultural Biotechnology Centre, School of Veterinary and Life Sciences, Murdoch University, Perth, WA 6150, Australia
| | - Ryan Admiraal
- School of Engineering and Information Technology, Mathematics & Statistics, Murdoch University, Perth, WA 6150, Australia
| | - Michael G K Jones
- Plant Biotechnology Group - Plant Virology, Western Australian State Agricultural Biotechnology Centre, School of Veterinary and Life Sciences, Murdoch University, Perth, WA 6150, Australia
| | - Stephen J Wylie
- Plant Biotechnology Group - Plant Virology, Western Australian State Agricultural Biotechnology Centre, School of Veterinary and Life Sciences, Murdoch University, Perth, WA 6150, Australia
| |
Collapse
|
13
|
Yang Y, Zhou T, Duan D, Yang J, Feng L, Zhao G. Comparative Analysis of the Complete Chloroplast Genomes of Five Quercus Species. Front Plant Sci 2016; 7:959. [PMID: 27446185 PMCID: PMC4923075 DOI: 10.3389/fpls.2016.00959] [Citation(s) in RCA: 144] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/18/2016] [Accepted: 06/15/2016] [Indexed: 05/08/2023]
Abstract
Quercus is considered economically and ecologically one of the most important genera in the Northern Hemisphere. Oaks are taxonomically perplexing because of shared interspecific morphological traits and intraspecific morphological variation, which are mainly attributed to hybridization. Universal plastid markers cannot provide a sufficient number of variable sites to explore the phylogeny of this genus, and chloroplast genome-scale data have proven to be useful in resolving intractable phylogenetic relationships. In this study, the complete chloroplast genomes of four Quercus species were sequenced, and one published chloroplast genome of Quercus baronii was retrieved for comparative analyses. The five chloroplast genomes ranged from 161,072 bp (Q. baronii) to 161,237 bp (Q. dolicholepis) in length, and their gene organization and order, and GC content, were similar to those of other Fagaceae species. We analyzed nucleotide substitutions, indels, and repeats in the chloroplast genomes, and found 19 relatively highly variable regions that will potentially provide plastid markers for further taxonomic and phylogenetic studies within Quercus. We observed that four genes (ndhA, ndhK, petA, and ycf1) were subject to positive selection. The phylogenetic relationships of the Quercus species inferred from the chloroplast genomes obtained moderate-to-high support, indicating that chloroplast genome data may be useful in resolving relationships in this genus.
Collapse
Affiliation(s)
| | | | | | | | | | - Guifang Zhao
- Key Laboratory of Resource Biology and Biotechnology in Western China (Ministry of Education), College of Life Sciences, Northwest UniversityXi’an, China
| |
Collapse
|
14
|
Silva FJ, Santos-Garcia D. Slow and Fast Evolving Endosymbiont Lineages: Positive Correlation between the Rates of Synonymous and Non-Synonymous Substitution. Front Microbiol 2015; 6:1279. [PMID: 26617602 PMCID: PMC4643148 DOI: 10.3389/fmicb.2015.01279] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2015] [Accepted: 10/31/2015] [Indexed: 12/18/2022] Open
Abstract
The availability of complete genome sequences of bacterial endosymbionts with strict vertical transmission to the host progeny opens the possibility to estimate molecular evolutionary rates in different lineages and understand the main biological mechanisms influencing these rates. We have compared the rates of evolution for non-synonymous and synonymous substitutions in nine bacterial endosymbiont lineages, belonging to four clades (Baumannia, Blochmannia, Portiera, and Sulcia). The main results are the observation of a positive correlation between both rates with differences among lineages of up to three orders of magnitude and that the substitution rates decrease over long endosymbioses. To explain these results we propose three mechanisms. The first, variations in the efficiencies of DNA replication and DNA repair systems, is unable to explain most of the observed differences. The second, variations in the generation time among bacterial lineages, would be based on the accumulation of fewer DNA replication errors per unit time in organisms with longer generation times. The third, a potential control of the endosymbiont DNA replication and repair systems through the transfer of nuclear-encoded proteins, could explain the lower rates in long-term obligate endosymbionts. Because the preservation of the genomic integrity of the harbored obligate endosymbiont would be advantageous for the insect host, biological mechanisms producing a general reduction in the rates of nucleotide substitution per unit of time would be a target for natural selection.
Collapse
Affiliation(s)
- Francisco J Silva
- Institut Cavanilles de Biodiversitat i Biologia Evolutiva, Universitat de València València, Spain ; Unidad Mixta de Investigación en Genómica y Salud, Fundación para el Fomento de la Investigación Sanitaria y Biomédica de la Comunitat Valenciana, Salud Pública/Institut Cavanilles, Universitat de València València, Spain
| | | |
Collapse
|
15
|
Matsumoto T, Akashi H, Yang Z. Evaluation of Ancestral Sequence Reconstruction Methods to Infer Nonstationary Patterns of Nucleotide Substitution. Genetics 2015; 200:873-90. [PMID: 25948563 DOI: 10.1534/genetics.115.177386] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2015] [Accepted: 04/28/2015] [Indexed: 01/07/2023] Open
Abstract
Inference of gene sequences in ancestral species has been widely used to test hypotheses concerning the process of molecular sequence evolution. However, the approach may produce spurious results, mainly because using the single best reconstruction while ignoring the suboptimal ones creates systematic biases. Here we implement methods to correct for such biases and use computer simulation to evaluate their performance when the substitution process is nonstationary. The methods we evaluated include parsimony and likelihood using the single best reconstruction (SBR), averaging over reconstructions weighted by the posterior probabilities (AWP), and a new method called expected Markov counting (EMC) that produces maximum-likelihood estimates of substitution counts for any branch under a nonstationary Markov model. We simulated base composition evolution on a phylogeny for six species, with different selective pressures on G+C content among lineages, and compared the counts of nucleotide substitutions recorded during simulation with the inference by different methods. We found that large systematic biases resulted from (i) the use of parsimony or likelihood with SBR, (ii) the use of a stationary model when the substitution process is nonstationary, and (iii) the use of the Hasegawa-Kishino-Yano (HKY) model, which is too simple to adequately describe the substitution process. The nonstationary general time reversible (GTR) model, used with AWP or EMC, accurately recovered the substitution counts, even in cases of complex parameter fluctuations. We discuss model complexity and the compromise between bias and variance and suggest that the new methods may be useful for studying complex patterns of nucleotide substitution in large genomic data sets.
Collapse
|
16
|
Bhattacharjee MJ, Ghosh SK. Design of mini-barcode for catfishes for assessment of archival biodiversity. Mol Ecol Resour 2013; 14:469-77. [PMID: 24314114 DOI: 10.1111/1755-0998.12198] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2013] [Revised: 10/26/2013] [Accepted: 10/28/2013] [Indexed: 11/28/2022]
Abstract
Recovery of DNA barcode sequences is often challenging from the archived specimens. However, short fragments of DNA may be recovered, which would significantly improve many unresolved taxonomic conflicts. Here, we designed a mini-barcode for catfishes comprising several species and many cryptic taxa. We analysed a data set of 3048 publicly available COI barcode sequences representing 547 worldwide catfish species and performed 152 628 interspecies comparisons. A significantly more positively correlated interspecies distance was detected with transversion (0.78, P < 0.001) than with transition (0.70, P < 0.001). This suggested that transversions were better diagnostics for species identification. In the aligned data set, two transversion-rich fragments (53 bp and 119 bp) were identified. Transition/transversion bias value was 1.04 in 53-bp fragment, 1.23 in 119-bp fragment and 1.50 in full-length barcode. The interspecies distance with full-length barcode was 0.212 ± 0.037, while that with 53-bp and 119-bp fragments was 0.325 ± 0.039 and 0.218 ± 0.045, respectively. Survey of 53-bp fragment showed a possibility of only 1144 barcodes, while that of 119-bp fragment showed >4 million barcodes. Thus, the 119-bp fragment is a viable mini-barcode for catfishes comprising >3000 extant species. Experiment with 82 archived catfishes showed successful recovery of this mini-barcode using the designed primer. The mini-barcode sequences showed species-specific similarity in the range of 98-100% with the global database. Therefore, survey of a transversion-rich fragment within the full-length barcode would be an ideal approach of mini-barcode design for biodiversity assessment.
Collapse
|
17
|
Zhao M, Du J, Lin F, Tong C, Yu J, Huang S, Wang X, Liu S, Ma J. Shifts in the evolutionary rate and intensity of purifying selection between two Brassica genomes revealed by analyses of orthologous transposons and relics of a whole genome triplication. Plant J 2013; 76:211-22. [PMID: 23869625 DOI: 10.1111/tpj.12291] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/17/2013] [Revised: 06/28/2013] [Accepted: 07/04/2013] [Indexed: 05/23/2023]
Abstract
Recent sequencing of the Brassica rapa and Brassica oleracea genomes revealed extremely contrasting genomic features such as the abundance and distribution of transposable elements between the two genomes. However, whether and how these structural differentiations may have influenced the evolutionary rates of the two genomes since their split from a common ancestor are unknown. Here, we investigated and compared the rates of nucleotide substitution between two long terminal repeats (LTRs) of individual orthologous LTR-retrotransposons, the rates of synonymous and non-synonymous substitution among triplicated genes retained in both genomes from a shared whole genome triplication event, and the rates of genetic recombination estimated/deduced by the comparison of physical and genetic distances along chromosomes and ratios of solo LTRs to intact elements. Overall, LTR sequences and genic sequences showed more rapid nucleotide substitution in B. rapa than in B. oleracea. Synonymous substitution of triplicated genes retained from a shared whole genome triplication was detected at higher rates in B. rapa than in B. oleracea. Interestingly, non-synonymous substitution was observed at lower rates in the former than in the latter, indicating shifted densities of purifying selection between the two genomes. In addition to evolutionary asymmetry, orthologous genes differentially regulated and/or disrupted by transposable elements between the two genomes were also characterized. Our analyses suggest that local genomic and epigenomic features, such as recombination rates and chromatin dynamics reshaped by independent proliferation of transposable elements and elimination between the two genomes, are perhaps partially the causes and partially the outcomes of the observed inter-specific asymmetric evolution.
Collapse
Affiliation(s)
- Meixia Zhao
- Oil Crops Research Institute, Chinese Academy of Agricultural Sciences, Wuhan, 430062, China; Department of Agronomy, Purdue University, West Lafayette, 47907, IN, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
18
|
Habibi I, Sfar I, Ben Alaya W, Methlouthi J, Ayadi A, Brahim M, Blouin J, Dhagbouj R, Ben Rhomdhane T, Makhlouf M, Aouadi H, Ayed-Jendoubi S, Fremeaux-Bacchi V, Sfar T, Ben Abdallah T, Ayed K, Gorgi Y. Atypical hemolytic uremic syndrome and mutation analysis of factor H gene in two Tunisian families. Int J Nephrol Renovasc Dis 2010; 3:85-92. [PMID: 21694933 PMCID: PMC3108774 DOI: 10.2147/ijnrd.s8373] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2010] [Indexed: 11/23/2022] Open
Abstract
We carried out a protein and genetic investigation of the factor H gene mutations within two families presenting with a diagnostic suspicion of atypical hemolytic uremic syndrome (aHUS). The results within the patients of the first family revealed a factor H-deficiency. Direct sequencing allowed the detection of a 4-nucleotide deletion in the factor H gene. This deletion was found as the homozygote form in the proband and as the heterozygote form in the parents. Protein and functional analyses of the complement system were normal in all members of the second family. However, the molecular investigation for the father showed the presence of an amino acid substitution in the FH gene. Unfortunately, his two affected children died without being investigated for mutations. The functional consequences of these abnormal proteins are still to be demonstrated.
Collapse
Affiliation(s)
- Imen Habibi
- Laboratory of Immunology, Charles Nicolle Hospital, Tunis, Tunisia
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
19
|
Abstract
The mitochondrial genome of Physarum polycephalum encodes five tRNAs, four of which are edited by nucleotide insertion. Two of these tRNAs, tRNA(met1) and tRNA(met2), contain predicted mismatches at the beginning (proximal end) of the acceptor stem. In addition, the putative 5' end of tRNA(met2) overlaps the 3' end of a small, abundant, noncoding RNA, which we term ppoRNA. These anomalies led us to hypothesize that these two Physarum mitochondrial tRNAs undergo additional editing events. Here, we show that tRNA(met1) and tRNA(met2) each has a nonencoded G at its 5' end. In contrast to the other nucleotides that are added to Physarum mitochondrial RNAs, these extra G residues are likely added post-transcriptionally based on (1) the absence of added G in precursor transcripts containing inserted C and AA residues, (2) the presence of potential intermediates characteristic of 5' replacement editing, and (3) preferential incorporation of GTP into tRNA molecules under conditions that do not support transcription. This is the first report of both post-transcriptional nucleotide insertions and the addition of single Gs in P. polycephalum mitochondrial transcripts. We postulate that tRNA(met1) and tRNA(met2) are acted upon by an activity similar to that present in the mitochondria of certain other amoebozoons and chytrid fungi, suggesting that enzymes that repair the 5' end of tRNAs may be widespread.
Collapse
Affiliation(s)
- Jonatha M Gott
- Center for RNA Molecular Biology, Case Western Reserve University, 10900 Euclid Avenue, Cleveland, OH 44106, USA.
| | | | | |
Collapse
|
20
|
Hughes AL, Friedman R, Rivailler P, French JO. Synonymous and nonsynonymous polymorphisms versus divergences in bacterial genomes. Mol Biol Evol 2008; 25:2199-209. [PMID: 18667439 PMCID: PMC2734133 DOI: 10.1093/molbev/msn166] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/24/2008] [Indexed: 11/14/2022] Open
Abstract
Comparison of the ratio of nonsynonymous to synonymous polymorphisms within species with the ratio of nonsynonymous to synonymous substitutions between species has been widely used as a supposed indicator of positive Darwinian selection, with the ratio of these 2 ratios being designated as a neutrality index (NI). Comparison of genome-wide polymorphism within 12 species of bacteria with divergence from an outgroup species showed substantial differences in NI among taxa. A low level of nonsynonymous polymorphism at a locus was the best predictor of NI < 1, rather than a high level of nonsynonymous substitution between species. Moreover, genes with NI < 1 showed a strong tendency toward the occurrence of rare nonsynonymous polymorphisms, as expected under the action of ongoing purifying selection. Thus, our results are more consistent with the hypothesis that a high relative rate of between-species nonsynonymous substitution reflects mainly the action of purifying selection within species to eliminate slightly deleterious mutations rather than positive selection between species. This conclusion is consistent with previous results highlighting an important role of slightly deleterious variants in bacterial evolution and suggests caution in the use of the McDonald-Kreitman test and related statistics as tests of positive selection.
Collapse
Affiliation(s)
- Austin L Hughes
- Department of Biological Sciences, University of South Carolina, USA
| | | | | | | |
Collapse
|
21
|
Nakashima H. Characteristics of human and mouse orthologous protein-coding nucleotide sequences with large g+c content variations. Curr Genomics 2007; 8:445-52. [PMID: 19412417 PMCID: PMC2647157 DOI: 10.2174/138920207783591708] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2007] [Revised: 11/04/2007] [Accepted: 11/06/2007] [Indexed: 11/22/2022] Open
Abstract
Characteristics of human and mouse orthologous gene sequences which have large G+C content variations were investigated in this study. The orthologous gene pairs were classified into two groups according to the deviation between human and mouse G+C content at the third codon position (GC3) and were subsequently analyzed. In one group, mouse genes had higher GC3 than the corresponding human genes and in another group, human genes had higher GC3 than mouse. Furthermore, the orthologous pairs were separated based on the deviation between human or mouse GC3 and the G+C content at the third codon position of identical codons (IC3), to examine the effect of increased or decreased G+C content in human or mouse sequences. The nucleotide substitution patterns between human and mouse sequences in the two groups were remarkably distinct, and consistent with the state of G+C-rich or G+C-poor sequences. The effect of increase or decrease of G+C content in human or mouse sequences was not clear in the nucleotide substitution patterns. The chromosomal locations of human and mouse orthologous gene pairs were different between the two groups. The genes located on an identical syntenic segment showed the trend of having similar G+C content. Moreover, the same gene order of some genes on different chromosomes of both species demonstrated the gene rearrangements between human and mouse. Our study indicated that the chromosomal locations and rearrangements are associated with the GC3 variation between human and mouse sequences.
Collapse
Affiliation(s)
- Hiroshi Nakashima
- Department of Clinical Laboratory Science, Graduate Course of Medical Science and Technology, Division of Health Science, Kanazawa University, 5-11-80 Kodatsuno, Kanazawa 920-0942, Japan
| |
Collapse
|
22
|
Gatto L, Catanzaro D, Milinkovitch MC. Assessing the applicability of the GTR nucleotide substitution model through simulations. Evol Bioinform Online 2007; 2:145-55. [PMID: 19455208 PMCID: PMC2674669] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022] Open
Abstract
The General Time Reversible (GTR) model of nucleotide substitution is at the core of many distance-based and character-based phylogeny inference methods. The procedure described by Waddell and Steel (1997), for estimating distances and instantaneous substitution rate matrices, R, under the GTR model, is known to be inapplicable under some conditions, ie, it leads to the inapplicability of the GTR model. Here, we simulate the evolution of DNA sequences along 12 trees characterized by different combinations of tree length, (non-)homogeneity of the substitution rate matrix R, and sequence length. We then evaluate both the frequency of the GTR model inapplicability for estimating distances and the accuracy of inferred alignments. Our results indicate that, inapplicability of the Waddel and Steel's procedure can be considered a real practical issue, and illustrate that the probability of this inapplicability is a function of substitution rates and sequence length.We also discuss the implications of our results on the current implementations of maximum likelihood and Bayesian methods.
Collapse
|
23
|
Abstract
Centromeres have many unusual biological properties, including kinetochore attachment and severe repression of local meiotic recombination. These properties are partly an outcome, partly a cause, of unusual DNA structure in the centromeric region. Although several plant and animal genomes have been sequenced, most centromere sequences have not been completed or analyzed in depth. To shed light on the unique organization, variability, and evolution of centromeric DNA, detailed analysis of a 1.97-Mb sequence that includes centromere 8 (CEN8) of japonica rice was undertaken. Thirty-three long-terminal repeat (LTR)-retrotransposon families (including 11 previously unknown) were identified in the CEN8 region, totaling 245 elements and fragments that account for 67% of the region. The ratio of solo LTRs to intact elements in the CEN8 region is approximately 0.9:1, compared with approximately 2.2:1 in noncentromeric regions of rice. However, the ratio of solo LTRs to intact elements in the core of the CEN8 region ( approximately 2.5:1) is higher than in any other region investigated in rice, suggesting a hotspot for unequal recombination. Comparison of the CEN8 region of japonica and its orthologous segments from indica rice indicated that approximately 15% of the intact retrotransposons and solo LTRs were inserted into CEN8 after the divergence of japonica and indica from a common ancestor, compared with approximately 50% for previously studied euchromatic regions. Frequent DNA rearrangements were observed in the CEN8 region, including a 212-kb subregion that was found to be composed of three rearranged tandem repeats. Phylogenetic analysis also revealed recent segmental duplication and extensive rearrangement and reshuffling of the CentO satellite repeats.
Collapse
Affiliation(s)
- Jianxin Ma
- Department of Genetics, University of Georgia, Athens, GA 30602, USA
| | | |
Collapse
|