1
|
Yeo H, Harjoko DN, Rheindt FE. Double trouble: untangling mixed sequence signals in bird samples with avian haemosporidian co-infections. Parasitology 2022; 149:1-12. [PMID: 35343423 DOI: 10.1017/s0031182022000245] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
Blood parasites comprise some of the most prevalent pathogens in nature, and their detection and identification are major objectives in varied fields such as ecology and biomedicine. Two approaches were compared, one based on Sanger sequencing and the other next-generation sequencing (NGS) based, in terms of their performance in detecting avian blood parasites across tropical Southeast Asian birds. Across a panel of 528 bird individuals, 43 birds were ascertained to be infected with avian haemosporidians using a polymerase chain reaction-based detection method. Among these samples, NGS-based barcoding confirmed co-infections by multiple blood parasites in all eight cases where Sanger sequencing produced double peaks. Importantly however, the NGS-based method produced another five diagnoses of co-infections (62.5%) in which Sanger-based barcoding remained equivocal. In contrast to Sanger sequencing, the NGS-based method was able to identify co-infecting haemosporidian lineages via their barcodes. The accuracy of avian haemosporidian lineage identification was not compromised by the shorter length of NGS sequences, with ~94% of NGS barcodes producing matches identical to those of the Sanger barcodes. The application of NGS-based barcoding methods promises to enhance parasite identification and reduce erroneous inferences based on artefacts.
Collapse
Affiliation(s)
- Huiqing Yeo
- Department of Biological Sciences, National University of Singapore, 16 Science Drive 4, Block S3 Level 4, 117558Singapore, Singapore
| | - Denise Nastaya Harjoko
- Department of Biological Sciences, National University of Singapore, 16 Science Drive 4, Block S3 Level 4, 117558Singapore, Singapore
| | - Frank E Rheindt
- Department of Biological Sciences, National University of Singapore, 16 Science Drive 4, Block S3 Level 4, 117558Singapore, Singapore
| |
Collapse
|
2
|
Bruzzese DJ, Schuler H, Wolfe TM, Glover MM, Mastroni JV, Doellman MM, Tait C, Yee WL, Rull J, Aluja M, Hood GR, Goughnour RB, Stauffer C, Nosil P, Feder JL. Testing the potential contribution of Wolbachia to speciation when cytoplasmic incompatibility becomes associated with host-related reproductive isolation. Mol Ecol 2021; 31:2935-2950. [PMID: 34455644 PMCID: PMC9290789 DOI: 10.1111/mec.16157] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2021] [Revised: 08/18/2021] [Accepted: 08/25/2021] [Indexed: 01/04/2023]
Abstract
Endosymbiont‐induced cytoplasmic incompatibility (CI) may play an important role in arthropod speciation. However, whether CI consistently becomes associated or coupled with other host‐related forms of reproductive isolation (RI) to impede the transfer of endosymbionts between hybridizing populations and further the divergence process remains an open question. Here, we show that varying degrees of pre‐ and postmating RI exist among allopatric populations of two interbreeding cherry‐infesting tephritid fruit flies (Rhagoletis cingulata and R. indifferens) across North America. These flies display allochronic and sexual isolation among populations, as well as unidirectional reductions in egg hatch in hybrid crosses involving southwestern USA males. All populations are infected by a Wolbachia strain, wCin2, whereas a second strain, wCin3, only co‐infects flies from the southwest USA and Mexico. Strain wCin3 is associated with a unique mitochondrial DNA haplotype and unidirectional postmating RI, implicating the strain as the cause of CI. When coupled with nonendosymbiont RI barriers, we estimate the strength of CI associated with wCin3 would not prevent the strain from introgressing from infected southwestern to uninfected populations elsewhere in the USA if populations were to come into secondary contact and hybridize. In contrast, cytoplasmic–nuclear coupling may impede the transfer of wCin3 if Mexican and USA populations were to come into contact. We discuss our results in the context of the general paucity of examples demonstrating stable Wolbachia hybrid zones and whether the spread of Wolbachia among taxa can be constrained in natural hybrid zones long enough for the endosymbiont to participate in speciation.
Collapse
Affiliation(s)
- Daniel J Bruzzese
- Department of Biological Sciences, University of Notre Dame, Notre Dame, IN, USA
| | - Hannes Schuler
- Faculty of Science and Technology, Free University of Bozen-Bolzano, Bozen-Bolzano, Italy.,Competence Centre for Plant Health, Free University of Bozen-Bolzano, Bozen-Bolzano, Italy
| | - Thomas M Wolfe
- Department of Forest and Soil Sciences, Boku, University of Natural Resources and Life Sciences, Vienna, Austria
| | - Mary M Glover
- Department of Biological Sciences, University of Notre Dame, Notre Dame, IN, USA
| | - Joseph V Mastroni
- Department of Biological Sciences, University of Notre Dame, Notre Dame, IN, USA
| | - Meredith M Doellman
- Department of Biological Sciences, University of Notre Dame, Notre Dame, IN, USA
| | - Cheyenne Tait
- Department of Biological Sciences, University of Notre Dame, Notre Dame, IN, USA
| | - Wee L Yee
- United States Department of Agriculture, Temperate Tree Fruit & Vegetable Research Unit, Agricultural Research Service, Wapato, WA, USA
| | - Juan Rull
- Instituto de Ecología A.C., Xalapa, México.,LIEMEN-División Control Biológico de Plagas, PROIMI Biotecnología-CONICET, Tucumán, Argentina
| | | | - Glen Ray Hood
- Department of Biological Sciences, Wayne State University, Detroit, MI, USA
| | | | - Christian Stauffer
- Department of Forest and Soil Sciences, Boku, University of Natural Resources and Life Sciences, Vienna, Austria
| | - Patrik Nosil
- CEFE, University Montpellier, CNRS, EPHE, IRD, University Paul Valéry Montpellier 3, Montpellier, France.,Department of Biology, Utah State University, UT, USA
| | - Jeffery L Feder
- Department of Biological Sciences, University of Notre Dame, Notre Dame, IN, USA
| |
Collapse
|
3
|
Kim YG, Kim MJ, Lee JS, Lee JA, Song JY, Cho SI, Park SS, Seong MW. SnackVar: An Open-Source Software for Sanger Sequencing Analysis Optimized for Clinical Use. J Mol Diagn 2020; 23:140-148. [PMID: 33246077 DOI: 10.1016/j.jmoldx.2020.11.001] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2020] [Revised: 10/23/2020] [Accepted: 11/10/2020] [Indexed: 01/03/2023] Open
Abstract
Despite the wide application of next-generation sequencing, Sanger sequencing still plays a necessary role in clinical laboratories. However, recent developments in the field of bioinformatics have focused mostly on next-generation sequencing, while tools for Sanger sequencing have shown little progress. In this study, SnackVar (https://github.com/Young-gonKim/SnackVar, last accessed June 22, 2020), a novel graphical user interface-based software for Sanger sequencing, was developed. All types of variants, including heterozygous insertion/deletion variants, can be identified by SnackVar with minimal user effort. The featured reference sequences of all of the genes are prestored in SnackVar, allowing for detected variants to be precisely described based on coding DNA references according to the nomenclature of the Human Genome Variation Society. Among 88 previously reported variants from four insertion/deletion-rich genes (BRCA1, APC, CALR, and CEBPA), the result of SnackVar agreed with reported results in 87 variants [98.9% (93.0%; 99.9%)]. The cause of one incorrect variant calling was proven to be erroneous base callings from poor-quality trace files. Compared with commercial software, SnackVar required less than one-half of the time taken for the analysis of a selected set of test cases. We expect SnackVar to be a cost-effective option for clinical laboratories performing Sanger sequencing.
Collapse
Affiliation(s)
- Young-Gon Kim
- Department of Laboratory Medicine, Seoul National University Hospital, Seoul, Republic of Korea
| | - Man Jin Kim
- Department of Laboratory Medicine, Seoul National University Hospital, Seoul, Republic of Korea
| | - Jee-Soo Lee
- Department of Laboratory Medicine, Seoul National University Hospital, Seoul, Republic of Korea
| | - Jung Ae Lee
- Department of Laboratory Medicine, Seoul National University Hospital, Seoul, Republic of Korea
| | - Ji Yun Song
- Department of Laboratory Medicine, Seoul National University Hospital, Seoul, Republic of Korea
| | - Sung Im Cho
- Department of Laboratory Medicine, Seoul National University Hospital, Seoul, Republic of Korea
| | - Sung-Sup Park
- Department of Laboratory Medicine, Seoul National University Hospital, Seoul, Republic of Korea
| | - Moon-Woo Seong
- Department of Laboratory Medicine, Seoul National University Hospital, Seoul, Republic of Korea.
| |
Collapse
|
4
|
Neuenschwander SM, Terrazos Miani MA, Amlang H, Perroulaz C, Bittel P, Casanova C, Droz S, Flandrois JP, Leib SL, Suter-Riniker F, Ramette A. A Sample-to-Report Solution for Taxonomic Identification of Cultured Bacteria in the Clinical Setting Based on Nanopore Sequencing. J Clin Microbiol 2020; 58:e00060-20. [PMID: 32229603 PMCID: PMC7269405 DOI: 10.1128/jcm.00060-20] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2020] [Accepted: 03/25/2020] [Indexed: 12/20/2022] Open
Abstract
Amplicon sequencing of the 16S rRNA gene is commonly used for the identification of bacterial isolates in diagnostic laboratories and mostly relies on the Sanger sequencing method. The latter, however, suffers from a number of limitations, with the most significant being the inability to resolve mixed amplicons when closely related species are coamplified from a mixed culture. This often leads to either increased turnaround time or absence of usable sequence data. Short-read next-generation sequencing (NGS) technologies could solve the mixed amplicon issue but would lack both cost efficiency at low throughput and fast turnaround times. Nanopore sequencing developed by Oxford Nanopore Technologies (ONT) could solve those issues by enabling a flexible number of samples per run and an adjustable sequencing time. Here, we report on the development of a standardized laboratory workflow combined with a fully automated analysis pipeline LORCAN (long read consensus analysis), which together provide a sample-to-report solution for amplicon sequencing and taxonomic identification of the resulting consensus sequences. Validation of the approach was conducted on a panel of reference strains and on clinical samples consisting of single or mixed rRNA amplicons associated with various bacterial genera by direct comparison to the corresponding Sanger sequences. Additionally, simulated read and amplicon mixtures were used to assess LORCAN's behavior when dealing with samples with known cross-contamination levels. We demonstrate that by combining ONT amplicon sequencing results with LORCAN, the accuracy of Sanger sequencing can be closely matched (>99.6% sequence identity) and that mixed samples can be resolved at the single-base resolution level. The presented approach has the potential to significantly improve the flexibility, reliability, and availability of amplicon sequencing in diagnostic settings.
Collapse
Affiliation(s)
| | | | - Heiko Amlang
- University of Bern, Institute for Infectious Diseases, Bern, Switzerland
| | - Carmen Perroulaz
- University of Bern, Institute for Infectious Diseases, Bern, Switzerland
| | - Pascal Bittel
- University of Bern, Institute for Infectious Diseases, Bern, Switzerland
| | - Carlo Casanova
- University of Bern, Institute for Infectious Diseases, Bern, Switzerland
| | - Sara Droz
- University of Bern, Institute for Infectious Diseases, Bern, Switzerland
| | - Jean-Pierre Flandrois
- University of Lyon, CNRS, UMR 5558, Laboratoire de Biométrie et Biologie Evolutive, Villeurbanne, France
| | - Stephen L Leib
- University of Bern, Institute for Infectious Diseases, Bern, Switzerland
| | | | - Alban Ramette
- University of Bern, Institute for Infectious Diseases, Bern, Switzerland
| |
Collapse
|
5
|
Köbölkuti ZA, Cseke K, Benke A, Báder M, Borovics A, Németh R. Allelic variation in candidate genes associated with wood properties of cultivated poplars (Populus). Biol Futur 2019; 70:286-294. [PMID: 34554544 DOI: 10.1556/019.70.2019.32] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2019] [Accepted: 10/26/2019] [Indexed: 11/19/2022]
Abstract
INTRODUCTION Since Populus has veritable value as timber, plywood, pulp, and paper, genomic research should create the sound basis for further breeding toward desirable wood quality attributes. MATERIALS AND METHODS In this study, we addressed the need for a research methodology that initially identifies and then characterize candidate genes encoding enzymes with wood property phenotypic traits, toward the aim of developing a genomics-based breeding technology. RESULTS On 23 different poplar species/hybrid samples, we successfully amplified 55 primers designed on Populus trichocarpa L. Considering the number of polymorphic sites, out of 73,206 bp, 51 SNPs and 31 indel events were found. Non-synonymous single base mutations could be detected in number of 30, 21 out of 164 sequences were the number of minimum recombination events and 41 significant pairwise comparisons between loci could be detected. DISCUSSION AND CONCLUSION Our results provide a roadmap for a future association genetic study between nucleotide diversity and precise evaluation of phenotype.
Collapse
Affiliation(s)
- Zoltán Attila Köbölkuti
- Department of Tree Breeding, National Agricultural Research and Innovation Centre, Forest Research Institute, Várkerulet 30/A, Sárvár, 9600, Hungary.
| | - Klára Cseke
- Department of Tree Breeding, National Agricultural Research and Innovation Centre, Forest Research Institute, Várkerulet 30/A, Sárvár, 9600, Hungary
| | - Attila Benke
- Department of Tree Breeding, National Agricultural Research and Innovation Centre, Forest Research Institute, Várkerulet 30/A, Sárvár, 9600, Hungary
| | - Mátyás Báder
- Simonyi Karoly Faculty of Engineering, Wood Sciences and Applied Arts, University of Sopron, Bajcsy Zs. u. 4, 9400, Sopron, Hungary
| | - Attila Borovics
- Department of Tree Breeding, National Agricultural Research and Innovation Centre, Forest Research Institute, Várkerulet 30/A, Sárvár, 9600, Hungary
| | - Róbert Németh
- Simonyi Karoly Faculty of Engineering, Wood Sciences and Applied Arts, University of Sopron, Bajcsy Zs. u. 4, 9400, Sopron, Hungary
| |
Collapse
|
6
|
Chen K, Dai X, Wu J. Alternative splicing: An important mechanism in stem cell biology. World J Stem Cells 2015; 7:1-10. [PMID: 25621101 PMCID: PMC4300919 DOI: 10.4252/wjsc.v7.i1.1] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/26/2014] [Revised: 09/22/2014] [Accepted: 10/27/2014] [Indexed: 02/06/2023] Open
Abstract
Alternative splicing (AS) is an essential mechanism in post-transcriptional regulation and leads to protein diversity. It has been shown that AS is prevalent in metazoan genomes, and the splicing pattern is dynamically regulated in different tissues and cell types, including embryonic stem cells. These observations suggest that AS may play critical roles in stem cell biology. Since embryonic stem cells and induced pluripotent stem cells have the ability to give rise to all types of cells and tissues, they hold the promise of future cell-based therapy. Many efforts have been devoted to understanding the mechanisms underlying stem cell self-renewal and differentiation. However, most of the studies focused on the expression of a core set of transcription factors and regulatory RNAs. The role of AS in stem cell differentiation was not clear. Recent advances in high-throughput technologies have allowed the profiling of dynamic splicing patterns and cis-motifs that are responsible for AS at a genome-wide scale, and provided novel insights in a number of studies. In this review, we discuss some recent findings involving AS and stem cells. An emerging picture from these findings is that AS is integrated in the transcriptional and post-transcriptional networks and together they control pluripotency maintenance and differentiation of stem cells.
Collapse
|
7
|
Hill JT, Demarest BL, Bisgrove BW, Su YC, Smith M, Yost HJ. Poly peak parser: Method and software for identification of unknown indels using sanger sequencing of polymerase chain reaction products. Dev Dyn 2014; 243:1632-6. [PMID: 25160973 DOI: 10.1002/dvdy.24183] [Citation(s) in RCA: 170] [Impact Index Per Article: 15.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2014] [Revised: 08/04/2014] [Accepted: 08/22/2014] [Indexed: 01/01/2023] Open
Abstract
BACKGROUND Genome editing techniques, including ZFN, TALEN, and CRISPR, have created a need to rapidly screen many F1 individuals to identify carriers of indels and determine the sequences of the mutations. Current techniques require multiple clones of the targeted region to be sequenced for each individual, which is inefficient when many individuals must be analyzed. Direct Sanger sequencing of a polymerase chain reaction (PCR) amplified region surrounding the target site is efficient, but Sanger sequencing genomes heterozygous for an indel results in a string of "double peaks" due to the mismatched region. RESULTS To facilitate indel identification, we developed an online tool called Poly Peak Parser (available at http://yost.genetics.utah.edu/software.php) that is able to separate chromatogram data containing ambiguous base calls into wild-type and mutant allele sequences. This tool allows the nature of the indel to be determined from a single sequencing run per individual performed directly on a PCR product spanning the targeted site, without cloning. CONCLUSIONS The method and algorithm described here facilitate rapid identification and sequence characterization of heterozygous mutant carriers generated by genome editing. Although designed for screening F1 individuals, this tool can also be used to identify heterozygous indels in many contexts.
Collapse
Affiliation(s)
- Jonathon T Hill
- Molecular Medicine Program and Department of Neurobiology & Anatomy, University of Utah School of Medicine, Salt Lake City, Utah
| | | | | | | | | | | |
Collapse
|
8
|
Zhidkov I, Cohen R, Geifman N, Mishmar D, Rubin E. CHILD: a new tool for detecting low-abundance insertions and deletions in standard sequence traces. Nucleic Acids Res 2011; 39:e47. [PMID: 21278161 PMCID: PMC3074157 DOI: 10.1093/nar/gkq1354] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open
Abstract
Several methods have been proposed for detecting insertion/deletions (indels) from chromatograms generated by Sanger sequencing. However, most such methods are unsuitable when the mutated and normal variants occur at unequal ratios, such as is expected to be the case in cancer, with organellar DNA or with alternatively spliced RNAs. In addition, the current methods do not provide robust estimates of the statistical confidence of their results, and the sensitivity of this approach has not been rigorously evaluated. Here, we present CHILD, a tool specifically designed for indel detection in mixtures where one variant is rare. CHILD makes use of standard sequence alignment statistics to evaluate the significance of the results. The sensitivity of CHILD was tested by sequencing controlled mixtures of deleted and undeleted plasmids at various ratios. Our results indicate that CHILD can identify deleted molecules present as just 5% of the mixture. Notably, the results were plasmid/primer-specific; for some primers and/or plasmids, the deleted molecule was only detected when it comprised 10% or more of the mixture. The false positive rate was estimated to be lower than 0.4%. CHILD was implemented as a user-oriented web site, providing a sensitive and experimentally validated method for the detection of rare indel-carrying molecules in common Sanger sequence reads.
Collapse
Affiliation(s)
- Ilia Zhidkov
- National Institute for Biotechnology in the Negev, Dept. of Life Sciences, Dept. of Computer Sciences and Shraga Segal Dept. of Microbiology and Immunology, Ben Gurion University of the Negev, Beer Sheva 84105, Israel
| | - Raphael Cohen
- National Institute for Biotechnology in the Negev, Dept. of Life Sciences, Dept. of Computer Sciences and Shraga Segal Dept. of Microbiology and Immunology, Ben Gurion University of the Negev, Beer Sheva 84105, Israel
| | - Nophar Geifman
- National Institute for Biotechnology in the Negev, Dept. of Life Sciences, Dept. of Computer Sciences and Shraga Segal Dept. of Microbiology and Immunology, Ben Gurion University of the Negev, Beer Sheva 84105, Israel
| | - Dan Mishmar
- National Institute for Biotechnology in the Negev, Dept. of Life Sciences, Dept. of Computer Sciences and Shraga Segal Dept. of Microbiology and Immunology, Ben Gurion University of the Negev, Beer Sheva 84105, Israel
| | - Eitan Rubin
- National Institute for Biotechnology in the Negev, Dept. of Life Sciences, Dept. of Computer Sciences and Shraga Segal Dept. of Microbiology and Immunology, Ben Gurion University of the Negev, Beer Sheva 84105, Israel
- *To whom correspondence should be addressed. Tel: +972 8 6477180; Fax: +972 8 6479197;
| |
Collapse
|
9
|
O'Malley RC, Ecker JR. Linking genotype to phenotype using the Arabidopsis unimutant collection. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2010; 61:928-40. [PMID: 20409268 DOI: 10.1111/j.1365-313x.2010.04119.x] [Citation(s) in RCA: 132] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/18/2023]
Abstract
The large collections of Arabidopsis thaliana sequence-indexed T-DNA insertion mutants are among the most important resources to emerge from the sequencing of the genome. Several laboratories around the world have used the Arabidopsis reference genome sequence to map T-DNA flanking sequence tags (FST) for over 325,000 T-DNA insertion lines. Over the past decade, phenotypes identified with T-DNA-induced mutants have played a critical role in advancing both basic and applied plant research. These widely used mutants are an invaluable tool for direct interrogation of gene function. However, most lines are hemizygous for the insertion, necessitating a genotyping step to identify homozygous plants for the quantification of phenotypes. This situation has limited the application of these collections for genome-wide screens. Isolating multiple homozygous insert lines for every gene in the genome would make it possible to systematically test the phenotypic consequence of gene loss under a wide variety of conditions. One major obstacle to achieving this goal is that 12% of genes have no insertion and 8% are only represented by a single allele. Generation of additional mutations to achieve full genome coverage has been slow and expensive since each insertion is sequenced one at a time. Recent advances in high-throughput sequencing technology open up a potentially faster and cost-effective means to create new, very large insertion mutant populations for plants or animals. With the combination of new tools for genome-wide studies and emerging phenotyping platforms, these sequence-indexed mutant collections are poised to have a larger impact on our understanding of gene function.
Collapse
Affiliation(s)
- Ronan C O'Malley
- Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92307, USA
| | | |
Collapse
|
10
|
Decoding of superimposed traces produced by direct sequencing of heterozygous indels. PLoS Comput Biol 2008; 4:e1000113. [PMID: 18654614 PMCID: PMC2429969 DOI: 10.1371/journal.pcbi.1000113] [Citation(s) in RCA: 113] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2008] [Accepted: 06/04/2008] [Indexed: 01/22/2023] Open
Abstract
Direct Sanger sequencing of a diploid template containing a heterozygous insertion or deletion results in a difficult-to-interpret mixed trace formed by two allelic traces superimposed onto each other. Existing computational methods for deconvolution of such traces require knowledge of a reference sequence or the availability of both direct and reverse mixed sequences of the same template. We describe a simple yet accurate method, which uses dynamic programming optimization to predict superimposed allelic sequences solely from a string of letters representing peaks within an individual mixed trace. We used the method to decode 104 human traces (mean length 294 bp) containing heterozygous indels 5 to 30 bp with a mean of 99.1% bases per allelic sequence reconstructed correctly and unambiguously. Simulations with artificial sequences have demonstrated that the method yields accurate reconstructions when (1) the allelic sequences forming the mixed trace are sufficiently similar, (2) the analyzed fragment is significantly longer than the indel, and (3) multiple indels, if present, are well-spaced. Because these conditions occur in most encountered DNA sequences, the method is widely applicable. It is available as a free Web application Indelligent at http://ctap.inhs.uiuc.edu/dmitriev/indel.asp.
Collapse
|
11
|
FLOT JEANFRANÇOIS. champuru 1.0: a computer software for unraveling mixtures of two DNA sequences of unequal lengths. ACTA ACUST UNITED AC 2007. [DOI: 10.1111/j.1471-8286.2007.01857.x] [Citation(s) in RCA: 92] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
|
12
|
Seroussi Y, Seroussi E. TraceHaplotyper: using direct sequencing to determine the phase of an indel followed by biallelic SNPs. Biotechniques 2007; 43:452, 454, 456. [PMID: 18019335 DOI: 10.2144/000112552] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
Sequencing of a diploid PCR product that is heterozygous for an indel mutation and a downstream single nucleotide polymorphism (SNP) allows determination of haplotype phase. Manual determination of phase from trace files of this type is tedious; therefore, we introduce the Perl script TraceHaplotyper, which works from the expected sequence to analyze a sequencing trace file for the sites that are informative for the genotyping and outputs the two underlying haplotypes.
Collapse
|