1
|
Transcriptomic evidence that von Economo neurons are regionally specialized extratelencephalic-projecting excitatory neurons. Nat Commun 2020; 11:1172. [PMID: 32127543 PMCID: PMC7054400 DOI: 10.1038/s41467-020-14952-3] [Citation(s) in RCA: 48] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2019] [Accepted: 01/31/2020] [Indexed: 12/12/2022] Open
Abstract
von Economo neurons (VENs) are bipolar, spindle-shaped neurons restricted to layer 5 of human frontoinsula and anterior cingulate cortex that appear to be selectively vulnerable to neuropsychiatric and neurodegenerative diseases, although little is known about other VEN cellular phenotypes. Single nucleus RNA-sequencing of frontoinsula layer 5 identifies a transcriptomically-defined cell cluster that contained VENs, but also fork cells and a subset of pyramidal neurons. Cross-species alignment of this cell cluster with a well-annotated mouse classification shows strong homology to extratelencephalic (ET) excitatory neurons that project to subcerebral targets. This cluster also shows strong homology to a putative ET cluster in human temporal cortex, but with a strikingly specific regional signature. Together these results suggest that VENs are a regionally distinctive type of ET neuron. Additionally, we describe the first patch clamp recordings of VENs from neurosurgically-resected tissue that show distinctive intrinsic membrane properties relative to neighboring pyramidal neurons.
Collapse
|
2
|
Single-nucleus and single-cell transcriptomes compared in matched cortical cell types. PLoS One 2018; 13:e0209648. [PMID: 30586455 PMCID: PMC6306246 DOI: 10.1371/journal.pone.0209648] [Citation(s) in RCA: 291] [Impact Index Per Article: 48.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2018] [Accepted: 12/10/2018] [Indexed: 12/21/2022] Open
Abstract
Transcriptomic profiling of complex tissues by single-nucleus RNA-sequencing (snRNA-seq) affords some advantages over single-cell RNA-sequencing (scRNA-seq). snRNA-seq provides less biased cellular coverage, does not appear to suffer cell isolation-based transcriptional artifacts, and can be applied to archived frozen specimens. We used well-matched snRNA-seq and scRNA-seq datasets from mouse visual cortex to compare cell type detection. Although more transcripts are detected in individual whole cells (~11,000 genes) than nuclei (~7,000 genes), we demonstrate that closely related neuronal cell types can be similarly discriminated with both methods if intronic sequences are included in snRNA-seq analysis. We estimate that the nuclear proportion of total cellular mRNA varies from 20% to over 50% for large and small pyramidal neurons, respectively. Together, these results illustrate the high information content of nuclear RNA for characterization of cellular diversity in brain tissues.
Collapse
|
3
|
Transcriptomic and morphophysiological evidence for a specialized human cortical GABAergic cell type. Nat Neurosci 2018; 21:1185-1195. [PMID: 30150662 PMCID: PMC6130849 DOI: 10.1038/s41593-018-0205-2] [Citation(s) in RCA: 153] [Impact Index Per Article: 25.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2017] [Accepted: 06/14/2018] [Indexed: 11/29/2022]
Abstract
We describe convergent evidence from transcriptomics, morphology, and physiology for a specialized GABAergic neuron subtype in human cortex. Using unbiased single-nucleus RNA sequencing, we identify ten GABAergic interneuron subtypes with combinatorial gene signatures in human cortical layer 1 and characterize a group of human interneurons with anatomical features never described in rodents, having large 'rosehip'-like axonal boutons and compact arborization. These rosehip cells show an immunohistochemical profile (GAD1+CCK+, CNR1-SST-CALB2-PVALB-) matching a single transcriptomically defined cell type whose specific molecular marker signature is not seen in mouse cortex. Rosehip cells in layer 1 make homotypic gap junctions, predominantly target apical dendritic shafts of layer 3 pyramidal neurons, and inhibit backpropagating pyramidal action potentials in microdomains of the dendritic tuft. These cells are therefore positioned for potent local control of distal dendritic computation in cortical pyramidal neurons.
Collapse
|
4
|
Cell type discovery using single-cell transcriptomics: implications for ontological representation. Hum Mol Genet 2018; 27:R40-R47. [PMID: 29590361 PMCID: PMC5946857 DOI: 10.1093/hmg/ddy100] [Citation(s) in RCA: 38] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2017] [Revised: 03/14/2018] [Accepted: 03/16/2018] [Indexed: 12/20/2022] Open
Abstract
Cells are fundamental function units of multicellular organisms, with different cell types playing distinct physiological roles in the body. The recent advent of single-cell transcriptional profiling using RNA sequencing is producing 'big data', enabling the identification of novel human cell types at an unprecedented rate. In this review, we summarize recent work characterizing cell types in the human central nervous and immune systems using single-cell and single-nuclei RNA sequencing, and discuss the implications that these discoveries are having on the representation of cell types in the reference Cell Ontology (CL). We propose a method, based on random forest machine learning, for identifying sets of necessary and sufficient marker genes, which can be used to assemble consistent and reproducible cell type definitions for incorporation into the CL. The representation of defined cell type classes and their relationships in the CL using this strategy will make the cell type classes being identified by high-throughput/high-content technologies findable, accessible, interoperable and reusable (FAIR), allowing the CL to serve as a reference knowledgebase of information about the role that distinct cellular phenotypes play in human health and disease.
Collapse
|
5
|
Author Correction: L1-associated genomic regions are deleted in somatic cells of the healthy human brain. Nat Neurosci 2018; 21:1016. [PMID: 29703932 DOI: 10.1038/s41593-018-0131-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
In the version of this article initially published, NIH grant U01 MH106882 to F.H.G. was missing from the Acknowledgments. The error has been corrected in the HTML and PDF versions of the article.
Collapse
|
6
|
Abstract
Background A fundamental characteristic of multicellular organisms is the specialization of functional cell types through the process of differentiation. These specialized cell types not only characterize the normal functioning of different organs and tissues, they can also be used as cellular biomarkers of a variety of different disease states and therapeutic/vaccine responses. In order to serve as a reference for cell type representation, the Cell Ontology has been developed to provide a standard nomenclature of defined cell types for comparative analysis and biomarker discovery. Historically, these cell types have been defined based on unique cellular shapes and structures, anatomic locations, and marker protein expression. However, we are now experiencing a revolution in cellular characterization resulting from the application of new high-throughput, high-content cytometry and sequencing technologies. The resulting explosion in the number of distinct cell types being identified is challenging the current paradigm for cell type definition in the Cell Ontology. Results In this paper, we provide examples of state-of-the-art cellular biomarker characterization using high-content cytometry and single cell RNA sequencing, and present strategies for standardized cell type representations based on the data outputs from these cutting-edge technologies, including “context annotations” in the form of standardized experiment metadata about the specimen source analyzed and marker genes that serve as the most useful features in machine learning-based cell type classification models. We also propose a statistical strategy for comparing new experiment data to these standardized cell type representations. Conclusion The advent of high-throughput/high-content single cell technologies is leading to an explosion in the number of distinct cell types being identified. It will be critical for the bioinformatics community to develop and adopt data standard conventions that will be compatible with these new technologies and support the data representation needs of the research community. The proposals enumerated here will serve as a useful starting point to address these challenges.
Collapse
|
7
|
Corrigendum: L1-associated genomic regions are deleted in somatic cells of the healthy human brain. Nat Neurosci 2017; 20:1427. [PMID: 28949329 DOI: 10.1038/nn1017-1427a] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
8
|
A Whole Blood Molecular Signature for Acute Myocardial Infarction. Sci Rep 2017; 7:12268. [PMID: 28947747 PMCID: PMC5612952 DOI: 10.1038/s41598-017-12166-0] [Citation(s) in RCA: 50] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2017] [Accepted: 09/01/2017] [Indexed: 12/21/2022] Open
Abstract
Chest pain is a leading reason patients seek medical evaluation. While assays to detect myocyte death are used to diagnose a heart attack (acute myocardial infarction, AMI), there is no biomarker to indicate an impending cardiac event. Transcriptional patterns present in circulating endothelial cells (CEC) may provide a window into the plaque rupture process and identify a proximal biomarker for AMI. Thus, we aimed to identify a transcriptomic signature of AMI present in whole blood, but derived from CECs. Candidate genes indicative of AMI were nominated from microarray of enriched CEC samples, and then verified for detectability and predictive potential via qPCR in whole blood. This signature was validated in an independent cohort. Our findings suggest that a whole blood CEC-derived molecular signature identifies patients with AMI and sets the framework to potentially identify the earlier stages of an impending cardiac event when used in concert with clinical history and other diagnostics where conventional biomarkers indicative of myonecrosis remain undetected.
Collapse
|
9
|
The metabolic potential of the single cell genomes obtained from the Challenger Deep, Mariana Trench within the candidate superphylum Parcubacteria (OD1). Environ Microbiol 2017; 19:2769-2784. [PMID: 28474498 DOI: 10.1111/1462-2920.13789] [Citation(s) in RCA: 39] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2016] [Revised: 04/27/2017] [Accepted: 04/27/2017] [Indexed: 11/28/2022]
Abstract
Candidate phyla (CP) are broad phylogenetic clusters of organisms that lack cultured representatives. Included in this fraction is the candidate Parcubacteria superphylum. Specific characteristics that have been ascribed to the Parcubacteria include reduced genome size, limited metabolic potential and exclusive reliance on fermentation for energy acquisition. The study of new environmental niches, such as the marine versus terrestrial subsurface, often expands the understanding of the genetic potential of taxonomic groups. For this reason, we analyzed 12 Parcubacteria single amplified genomes (SAGs) from sediment samples collected within the Challenger Deep of the Mariana Trench, obtained during the Deepsea Challenge (DSC) Expedition. Many of these SAGs are closely related to environmental sequences obtained from deep-sea environments based on 16S rRNA gene similarity and BLAST matches to predicted proteins. DSC SAGs encode features not previously identified in Parcubacteria obtained from other habitats. These include adaptation to oxidative stress, polysaccharide modification and genes associated with respiratory nitrate reduction. The DSC SAGs are also distinguished by relative greater abundance of genes for nucleotide and amino acid biosynthesis, repair of alkylated DNA and the synthesis of mechanosensitive ion channels. These results present an expanded view of the Parcubacteria, among members residing in an ultra-deep hadal environment.
Collapse
|
10
|
Pan-cancer analysis reveals technical artifacts in TCGA germline variant calls. BMC Genomics 2017; 18:458. [PMID: 28606096 PMCID: PMC5467262 DOI: 10.1186/s12864-017-3770-y] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2017] [Accepted: 05/07/2017] [Indexed: 02/08/2023] Open
Abstract
BACKGROUND Cancer research to date has largely focused on somatically acquired genetic aberrations. In contrast, the degree to which germline, or inherited, variation contributes to tumorigenesis remains unclear, possibly due to a lack of accessible germline variant data. Here we called germline variants on 9618 cases from The Cancer Genome Atlas (TCGA) database representing 31 cancer types. RESULTS We identified batch effects affecting loss of function (LOF) variant calls that can be traced back to differences in the way the sequence data were generated both within and across cancer types. Overall, LOF indel calls were more sensitive to technical artifacts than LOF Single Nucleotide Variant (SNV) calls. In particular, whole genome amplification of DNA prior to sequencing led to an artificially increased burden of LOF indel calls, which confounded association analyses relating germline variants to tumor type despite stringent indel filtering strategies. The samples affected by these technical artifacts include all acute myeloid leukemia and practically all ovarian cancer samples. CONCLUSIONS We demonstrate how technical artifacts induced by whole genome amplification of DNA can lead to false positive germline-tumor type associations and suggest TCGA whole genome amplified samples be used with caution. This study draws attention to the need to be sensitive to problems associated with a lack of uniformity in data generation in TCGA data.
Collapse
|
11
|
PRODUCTION OF A PRELIMINARY QUALITY CONTROL PIPELINE FOR SINGLE NUCLEI RNA-SEQ AND ITS APPLICATION IN THE ANALYSIS OF CELL TYPE DIVERSITY OF POST-MORTEM HUMAN BRAIN NEOCORTEX. PACIFIC SYMPOSIUM ON BIOCOMPUTING. PACIFIC SYMPOSIUM ON BIOCOMPUTING 2017; 22:564-575. [PMID: 27897007 PMCID: PMC5338304 DOI: 10.1142/9789813207813_0052] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
Next generation sequencing of the RNA content of single cells or single nuclei (sc/nRNA-seq) has become a powerful approach to understand the cellular complexity and diversity of multicellular organisms and environmental ecosystems. However, the fact that the procedure begins with a relatively small amount of starting material, thereby pushing the limits of the laboratory procedures required, dictates that careful approaches for sample quality control (QC) are essential to reduce the impact of technical noise and sample bias in downstream analysis applications. Here we present a preliminary framework for sample level quality control that is based on the collection of a series of quantitative laboratory and data metrics that are used as features for the construction of QC classification models using random forest machine learning approaches. We've applied this initial framework to a dataset comprised of 2272 single nuclei RNA-seq results and determined that ~79% of samples were of high quality. Removal of the poor quality samples from downstream analysis was found to improve the cell type clustering results. In addition, this approach identified quantitative features related to the proportion of unique or duplicate reads and the proportion of reads remaining after quality trimming as useful features for pass/fail classification. The construction and use of classification models for the identification of poor quality samples provides for an objective and scalable approach to sc/nRNA-seq quality control.
Collapse
|
12
|
L1-associated genomic regions are deleted in somatic cells of the healthy human brain. Nat Neurosci 2016; 19:1583-1591. [PMID: 27618310 DOI: 10.1038/nn.4388] [Citation(s) in RCA: 121] [Impact Index Per Article: 15.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2015] [Accepted: 08/09/2016] [Indexed: 02/08/2023]
Abstract
The healthy human brain is a mosaic of varied genomes. Long interspersed element-1 (LINE-1 or L1) retrotransposition is known to create mosaicism by inserting L1 sequences into new locations of somatic cell genomes. Using a machine learning-based, single-cell sequencing approach, we discovered that somatic L1-associated variants (SLAVs) are composed of two classes: L1 retrotransposition insertions and retrotransposition-independent L1-associated variants. We demonstrate that a subset of SLAVs comprises somatic deletions generated by L1 endonuclease cutting activity. Retrotransposition-independent rearrangements in inherited L1s resulted in the deletion of proximal genomic regions. These rearrangements were resolved by microhomology-mediated repair, which suggests that L1-associated genomic regions are hotspots for somatic copy number variants in the brain and therefore a heritable genetic contributor to somatic mosaicism. We demonstrate that SLAVs are present in crucial neural genes, such as DLG2 (also called PSD93), and affect 44-63% of cells of the cells in the healthy brain.
Collapse
|
13
|
Abstract
Genomic sequencing from single cells is a powerful tool in microbiology and holds great promise for infectious disease research. Vast numbers of uncultivable species and pathogens that persist at low abundance in environmental reservoirs are now accessible for genomic analysis.
Collapse
|
14
|
NeatFreq: reference-free data reduction and coverage normalization for De Novo sequence assembly. BMC Bioinformatics 2014; 15:357. [PMID: 25407910 PMCID: PMC4245761 DOI: 10.1186/s12859-014-0357-3] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2013] [Accepted: 10/22/2014] [Indexed: 11/10/2022] Open
Abstract
Background Deep shotgun sequencing on next generation sequencing (NGS) platforms has contributed significant amounts of data to enrich our understanding of genomes, transcriptomes, amplified single-cell genomes, and metagenomes. However, deep coverage variations in short-read data sets and high sequencing error rates of modern sequencers present new computational challenges in data interpretation, including mapping and de novo assembly. New lab techniques such as multiple displacement amplification (MDA) of single cells and sequence independent single primer amplification (SISPA) allow for sequencing of organisms that cannot be cultured, but generate highly variable coverage due to amplification biases. Results Here we introduce NeatFreq, a software tool that reduces a data set to more uniform coverage by clustering and selecting from reads binned by their median kmer frequency (RMKF) and uniqueness. Previous algorithms normalize read coverage based on RMKF, but do not include methods for the preferred selection of (1) extremely low coverage regions produced by extremely variable sequencing of random-primed products and (2) 2-sided paired-end sequences. The algorithm increases the incorporation of the most unique, lowest coverage, segments of a genome using an error-corrected data set. NeatFreq was applied to bacterial, viral plaque, and single-cell sequencing data. The algorithm showed an increase in the rate at which the most unique reads in a genome were included in the assembled consensus while also reducing the count of duplicative and erroneous contigs (strings of high confidence overlaps) in the deliverable consensus. The results obtained from conventional Overlap-Layout-Consensus (OLC) were compared to simulated multi-de Bruijn graph assembly alternatives trained for variable coverage input using sequence before and after normalization of coverage. Coverage reduction was shown to increase processing speed and reduce memory requirements when using conventional bacterial assembly algorithms. Conclusions The normalization of deep coverage spikes, which would otherwise inhibit consensus resolution, enables High Throughput Sequencing (HTS) assembly projects to consistently run to completion with existing assembly software. The NeatFreq software package is free, open source and available at https://github.com/bioh4x/NeatFreq. Electronic supplementary material The online version of this article (doi:10.1186/s12859-014-0357-3) contains supplementary material, which is available to authorized users.
Collapse
|
15
|
Abstract
We used single-cell genomic approaches to map DNA copy number variation (CNV) in neurons obtained from human induced pluripotent stem cell (hiPSC) lines and postmortem human brains. We identified aneuploid neurons, as well as numerous subchromosomal CNVs in euploid neurons. Neurotypic hiPSC-derived neurons had larger CNVs than fibroblasts, and several large deletions were found in hiPSC-derived neurons but not in matched neural progenitor cells. Single-cell sequencing of endogenous human frontal cortex neurons revealed that 13 to 41% of neurons have at least one megabase-scale de novo CNV, that deletions are twice as common as duplications, and that a subset of neurons have highly aberrant genomes marked by multiple alterations. Our results show that mosaic CNV is abundant in human neurons.
Collapse
|
16
|
Candidate phylum TM6 genome recovered from a hospital sink biofilm provides genomic insights into this uncultivated phylum. Proc Natl Acad Sci U S A 2013; 110:E2390-9. [PMID: 23754396 PMCID: PMC3696752 DOI: 10.1073/pnas.1219809110] [Citation(s) in RCA: 152] [Impact Index Per Article: 13.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
The "dark matter of life" describes microbes and even entire divisions of bacterial phyla that have evaded cultivation and have yet to be sequenced. We present a genome from the globally distributed but elusive candidate phylum TM6 and uncover its metabolic potential. TM6 was detected in a biofilm from a sink drain within a hospital restroom by analyzing cells using a highly automated single-cell genomics platform. We developed an approach for increasing throughput and effectively improving the likelihood of sampling rare events based on forming small random pools of single-flow-sorted cells, amplifying their DNA by multiple displacement amplification and sequencing all cells in the pool, creating a "mini-metagenome." A recently developed single-cell assembler, SPAdes, in combination with contig binning methods, allowed the reconstruction of genomes from these mini-metagenomes. A total of 1.07 Mb was recovered in seven contigs for this member of TM6 (JCVI TM6SC1), estimated to represent 90% of its genome. High nucleotide identity between a total of three TM6 genome drafts generated from pools that were independently captured, amplified, and assembled provided strong confirmation of a correct genomic sequence. TM6 is likely a Gram-negative organism and possibly a symbiont of an unknown host (nonfree living) in part based on its small genome, low-GC content, and lack of biosynthesis pathways for most amino acids and vitamins. Phylogenomic analysis of conserved single-copy genes confirms that TM6SC1 is a deeply branching phylum.
Collapse
|
17
|
Isolation and genome analysis of single virions using 'single virus genomics'. J Vis Exp 2013:e3899. [PMID: 23728084 DOI: 10.3791/3899] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/31/2022] Open
Abstract
Whole genome amplification and sequencing of single microbial cells enables genomic characterization without the need of cultivation (1-3). Viruses, which are ubiquitous and the most numerous entities on our planet (4) and important in all environments (5), have yet to be revealed via similar approaches. Here we describe an approach for isolating and characterizing the genomes of single virions called 'Single Virus Genomics' (SVG). SVG utilizes flow cytometry to isolate individual viruses and whole genome amplification to obtain high molecular weight genomic DNA (gDNA) that can be used in subsequent sequencing reactions.
Collapse
|
18
|
Nearly finished genomes produced using gel microdroplet culturing reveal substantial intraspecies genomic diversity within the human microbiome. Genome Res 2013; 23:878-88. [PMID: 23493677 PMCID: PMC3638143 DOI: 10.1101/gr.142208.112] [Citation(s) in RCA: 50] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]
Abstract
The majority of microbial genomic diversity remains unexplored. This is largely due to our inability to culture most microorganisms in isolation, which is a prerequisite for traditional genome sequencing. Single-cell sequencing has allowed researchers to circumvent this limitation. DNA is amplified directly from a single cell using the whole-genome amplification technique of multiple displacement amplification (MDA). However, MDA from a single chromosome copy suffers from amplification bias and a large loss of specificity from even very small amounts of DNA contamination, which makes assembling a genome difficult and completely finishing a genome impossible except in extraordinary circumstances. Gel microdrop cultivation allows culturing of a diverse microbial community and provides hundreds to thousands of genetically identical cells as input for an MDA reaction. We demonstrate the utility of this approach by comparing sequencing results of gel microdroplets and single cells following MDA. Bias is reduced in the MDA reaction and genome sequencing, and assembly is greatly improved when using gel microdroplets. We acquired multiple near-complete genomes for two bacterial species from human oral and stool microbiome samples. A significant amount of genome diversity, including single nucleotide polymorphisms and genome recombination, is discovered. Gel microdroplets offer a powerful and high-throughput technology for assembling whole genomes from complex samples and for probing the pan-genome of naturally occurring populations.
Collapse
|
19
|
Abstract
There is increasing evidence that the phenotypic effects of genomic sequence variants are best understood in terms of variant haplotypes rather than as isolated polymorphisms. Haplotype analysis is also critically important for uncovering population histories and for the study of evolutionary genetics. Although the sequencing of individual human genomes to reveal personal collections of sequence variants is now well established, there has been slower progress in the phasing of these variants into pairs of haplotypes along each pair of chromosomes. Here, we have developed a distinct approach to haplotyping that can yield chromosome-length haplotypes, including the vast majority of heterozygous single-nucleotide polymorphisms (SNPs) in an individual human genome. This approach exploits the haploid nature of sperm cells and employs a combination of genotyping and low-coverage sequencing on a short-read platform. In addition to generating chromosome-length haplotypes, the approach can directly identify recombination events (averaging 1.1 per chromosome) with a median resolution of <100 kb.
Collapse
|
20
|
The green monster process for the generation of yeast strains carrying multiple gene deletions. J Vis Exp 2012:e4072. [PMID: 23271437 DOI: 10.3791/4072] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/31/2022] Open
Abstract
Phenotypes for a gene deletion are often revealed only when the mutation is tested in a particular genetic background or environmental condition(1,2). There are examples where many genes need to be deleted to unmask hidden gene functions(3,4). Despite the potential for important discoveries, genetic interactions involving three or more genes are largely unexplored. Exhaustive searches of multi-mutant interactions would be impractical due to the sheer number of possible combinations of deletions. However, studies of selected sets of genes, such as sets of paralogs with a greater a priori chance of sharing a common function, would be informative. In the yeast Saccharomyces cerevisiae, gene knockout is accomplished by replacing a gene with a selectable marker via homologous recombination. Because the number of markers is limited, methods have been developed for removing and reusing the same marker(5,6,7,8,9,10). However, sequentially engineering multiple mutations using these methods is time-consuming because the time required scales linearly with the number of deletions to be generated. Here we describe the Green Monster method for routinely engineering multiple deletions in yeast(11). In this method, a green fluorescent protein (GFP) reporter integrated into deletions is used to quantitatively label strains according to the number of deletions contained in each strain (Figure 1). Repeated rounds of assortment of GFP-marked deletions via yeast mating and meiosis coupled with flow-cytometric enrichment of strains carrying more of these deletions lead to the accumulation of deletions in strains (Figure 2). Performing multiple processes in parallel, with each process incorporating one or more deletions per round, reduces the time required for strain construction. The first step is to prepare haploid single-mutants termed 'ProMonsters,' each of which carries a GFP reporter in a deleted locus and one of the 'toolkit' loci-either Green Monster GMToolkit-a or GMToolkit-α at the can1Δ locus (Figure 3). Using strains from the yeast deletion collection(12), GFP-marked deletions can be conveniently generated by replacing the common KanMX4 cassette existing in these strains with a universal GFP-URA3 fragment. Each GMToolkit contains: either the a- or α-mating-type-specific haploid selection marker(1) and exactly one of the two markers that, when both GMToolkits are present, collectively allow for selection of diploids. The second step is to carry out the sexual cycling through which deletion loci can be combined within a single cell by the random assortment and/or meiotic recombination that accompanies each cycle of mating and sporulation.
Collapse
|
21
|
Abstract
Bacteria in the 16S rRNA clade SAR86 are among the most abundant uncultivated constituents of microbial assemblages in the surface ocean for which little genomic information is currently available. Bioinformatic techniques were used to assemble two nearly complete genomes from marine metagenomes and single-cell sequencing provided two more partial genomes. Recruitment of metagenomic data shows that these SAR86 genomes substantially increase our knowledge of non-photosynthetic bacteria in the surface ocean. Phylogenomic analyses establish SAR86 as a basal and divergent lineage of γ-proteobacteria, and the individual genomes display a temperature-dependent distribution. Modestly sized at 1.25-1.7 Mbp, the SAR86 genomes lack several pathways for amino-acid and vitamin synthesis as well as sulfate reduction, trends commonly observed in other abundant marine microbes. SAR86 appears to be an aerobic chemoheterotroph with the potential for proteorhodopsin-based ATP generation, though the apparent lack of a retinal biosynthesis pathway may require it to scavenge exogenously-derived pigments to utilize proteorhodopsin. The genomes contain an expanded capacity for the degradation of lipids and carbohydrates acquired using a wealth of tonB-dependent outer membrane receptors. Like the abundant planktonic marine bacterial clade SAR11, SAR86 exhibits metabolic streamlining, but also a distinct carbon compound specialization, possibly avoiding competition.
Collapse
|
22
|
Efficient de novo assembly of single-cell bacterial genomes from short-read data sets. Nat Biotechnol 2011; 29:915-21. [PMID: 21926975 PMCID: PMC3558281 DOI: 10.1038/nbt.1966] [Citation(s) in RCA: 160] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2010] [Accepted: 08/09/2011] [Indexed: 11/09/2022]
Abstract
Whole genome amplification by the multiple displacement amplification (MDA) method allows sequencing of DNA from single cells of bacteria that cannot be cultured. Assembling a genome is challenging, however, because MDA generates highly nonuniform coverage of the genome. Here we describe an algorithm tailored for short-read data from single cells that improves assembly through the use of a progressively increasing coverage cutoff. Assembly of reads from single Escherichia coli and Staphylococcus aureus cells captures >91% of genes within contigs, approaching the 95% captured from an assembly based on many E. coli cells. We apply this method to assemble a genome from a single cell of an uncultivated SAR324 clade of Deltaproteobacteria, a cosmopolitan bacterial lineage in the global ocean. Metabolic reconstruction suggests that SAR324 is aerobic, motile and chemotaxic. Our approach enables acquisition of genome assemblies for individual uncultivated bacteria using only short reads, providing cell-specific genetic information absent from metagenomic studies.
Collapse
|
23
|
Multiple displacement amplification as an adjunct to PCR-based detection of Staphylococcus aureus in synovial fluid. BMC Res Notes 2010; 3:259. [PMID: 20942932 PMCID: PMC2967558 DOI: 10.1186/1756-0500-3-259] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2009] [Accepted: 10/13/2010] [Indexed: 12/04/2022] Open
Abstract
Background Detection of bacterial nucleic acids in synovial fluid following total joint arthroplasty with suspected infection can be difficult; among other technical challenges, inhibitors in the specimens require extensive sample preparation and can diminish assay sensitivity even using polymerase chain reaction (PCR)-based methods. To address this problem a simple protocol for prior use of multiple displacement amplification (MDA) as an adjunct to PCR was established and tested on both purified S. aureus DNA as well as on clinical samples known to contain S. aureus nucleic acids. Findings A single round of MDA on purified nucleic acids resulted in a > 300 thousand-fold increase in template DNA on subsequent quantitative PCR (qPCR) analysis. MDA use on clinical samples resulted in at least a 100-fold increase in sensitivity on subsequent qPCR and required no sample preparation other than a simple alkali/heat lysis step. Mixed samples of S. aureus DNA with a 103 - 104-fold excess of human genomic DNA still allowed for MDA amplification of the minor bacterial component to the threshold of detectability. Conclusion MDA is a promising technique that may serve to significantly enhance the sensitivity of molecular assays in cases of suspected joint infection while simultaneously reducing the specimen handling required.
Collapse
|
24
|
Transcriptomics from single cells. Clin Cancer Res 2010. [DOI: 10.1158/diag-10-pl2-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Abstract
New technologies are making it possible to measure gene expression from single cells. Reverse transcriptase is used to convert the few picograms of mRNA in a cell to cDNA. Next generation sequencing methods, such as SOLiD sequencing, provide a low cost means to sequence the cDNA allowing quantification as well as genotyping. Unlike RT-PCR which enables single cell measurement of one or a limited number of transcripts, the entire transcriptome can be accessed. It will be possible to discover new correlations between genes and their role in diseases. We have begun testing this method on a variety of cell types including cultured cancer cells and human stem cells. Methods are in development to test circulating cancer cells. Progress has also been made on a different technology that addresses genotyping of the genomic DNA from one cell. Whole genome amplification by a method called Multiple Displacement Amplification (MDA) provides sufficient DNA for use in sequencing and genotyping. This enables analysis of unexpressed regions of the genome which are lost in transcriptional studies. Together, single cell methods for transcriptomics and genotyping of the genomic DNA will be useful in studying disease processes at the level of the individual cell. It will be possible to isolated specific cell types based on their phenotypic characteristics and then determine their genetic and transcriptional signatures.
Collapse
|
25
|
Genomic sequencing of single microbial cells from environmental samples. Curr Opin Microbiol 2008; 11:198-204. [PMID: 18550420 DOI: 10.1016/j.mib.2008.05.006] [Citation(s) in RCA: 92] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2008] [Revised: 04/30/2008] [Accepted: 05/07/2008] [Indexed: 10/22/2022]
Abstract
Recently developed techniques allow genomic DNA sequencing from single microbial cells [Lasken RS: Single-cell genomic sequencing using multiple displacement amplification. Curr Opin Microbiol 2007, 10:510-516]. Here, we focus on research strategies for putting these methods into practice in the laboratory setting. An immediate consequence of single-cell sequencing is that it provides an alternative to culturing organisms as a prerequisite for genomic sequencing. The microgram amounts of DNA required as template are amplified from a single bacterium by a method called multiple displacement amplification (MDA) avoiding the need to grow cells. The ability to sequence DNA from individual cells will likely have an immense impact on microbiology considering the vast numbers of novel organisms, which have been inaccessible unless culture-independent methods could be used. However, special approaches have been necessary to work with amplified DNA. MDA may not recover the entire genome from the single copy present in most bacteria. Also, some sequence rearrangements can occur during the DNA amplification reaction. Over the past two years many research groups have begun to use MDA, and some practical approaches to single-cell sequencing have been developed. We review the consensus that is emerging on optimum methods, reliability of amplified template, and the proper interpretation of 'composite' genomes which result from the necessity of combining data from several single-cell MDA reactions in order to complete the assembly. Preferred laboratory methods are considered on the basis of experience at several large sequencing centers where >70% of genomes are now often recovered from single cells. Methods are reviewed for preparation of bacterial fractions from environmental samples, single-cell isolation, DNA amplification by MDA, and DNA sequencing.
Collapse
|
26
|
Something from (almost) nothing: the impact of multiple displacement amplification on microbial ecology. ISME JOURNAL 2008; 2:233-41. [PMID: 18256705 DOI: 10.1038/ismej.2008.10] [Citation(s) in RCA: 130] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Microbial ecology is a field that applies molecular techniques to analyze genes and communities associated with a plethora of unique environments on this planet. In the past, low biomass and the predominance of a few abundant community members have impeded the application of techniques such as PCR, microarray analysis and metagenomics to complex microbial populations. In the absence of suitable cultivation methods, it was not possible to obtain DNA samples from individual microorganisms. Recently, a method called multiple displacement amplification (MDA) has been used to circumvent these limitations by amplifying DNA from microbial communities in low-biomass environments, individual cells from uncultivated microbial species and active organisms obtained through stable isotope probing incubations. This review describes the development and applications of MDA, discusses its strengths and limitations and highlights the impact of MDA on the field of microbial ecology. Whole genome amplification via MDA has increased access to the genomic DNA of uncultivated microorganisms and low-biomass environments and represents a 'power tool' in the molecular toolbox of microbial ecologists.
Collapse
|
27
|
Insights into the genome of large sulfur bacteria revealed by analysis of single filaments. PLoS Biol 2007; 5:e230. [PMID: 17760503 PMCID: PMC1951784 DOI: 10.1371/journal.pbio.0050230] [Citation(s) in RCA: 136] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2007] [Accepted: 06/26/2007] [Indexed: 11/19/2022] Open
Abstract
Marine sediments are frequently covered by mats of the filamentous Beggiatoa and other large nitrate-storing bacteria that oxidize hydrogen sulfide using either oxygen or nitrate, which they store in intracellular vacuoles. Despite their conspicuous metabolic properties and their biogeochemical importance, little is known about their genetic repertoire because of the lack of pure cultures. Here, we present a unique approach to access the genome of single filaments of Beggiatoa by combining whole genome amplification, pyrosequencing, and optical genome mapping. Sequence assemblies were incomplete and yielded average contig sizes of approximately 1 kb. Pathways for sulfur oxidation, nitrate and oxygen respiration, and CO2 fixation confirm the chemolithoautotrophic physiology of Beggiatoa. In addition, Beggiatoa potentially utilize inorganic sulfur compounds and dimethyl sulfoxide as electron acceptors. We propose a mechanism of vacuolar nitrate accumulation that is linked to proton translocation by vacuolar-type ATPases. Comparative genomics indicates substantial horizontal gene transfer of storage, metabolic, and gliding capabilities between Beggiatoa and cyanobacteria. These capabilities enable Beggiatoa to overcome non-overlapping availabilities of electron donors and acceptors while gliding between oxic and sulfidic zones. The first look into the genome of these filamentous sulfur-oxidizing bacteria substantially deepens the understanding of their evolution and their contribution to sulfur and nitrogen cycling in marine sediments.
Collapse
|
28
|
Nanoliter reactors improve multiple displacement amplification of genomes from single cells. PLoS Genet 2007; 3:1702-8. [PMID: 17892324 PMCID: PMC1988849 DOI: 10.1371/journal.pgen.0030155] [Citation(s) in RCA: 231] [Impact Index Per Article: 13.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2007] [Accepted: 07/26/2007] [Indexed: 01/27/2023] Open
Abstract
Since only a small fraction of environmental bacteria are amenable to laboratory culture, there is great interest in genomic sequencing directly from single cells. Sufficient DNA for sequencing can be obtained from one cell by the Multiple Displacement Amplification (MDA) method, thereby eliminating the need to develop culture methods. Here we used a microfluidic device to isolate individual Escherichia coli and amplify genomic DNA by MDA in 60-nl reactions. Our results confirm a report that reduced MDA reaction volume lowers nonspecific synthesis that can result from contaminant DNA templates and unfavourable interaction between primers. The quality of the genome amplification was assessed by qPCR and compared favourably to single-cell amplifications performed in standard 50-μl volumes. Amplification bias was greatly reduced in nanoliter volumes, thereby providing a more even representation of all sequences. Single-cell amplicons from both microliter and nanoliter volumes provided high-quality sequence data by high-throughput pyrosequencing, thereby demonstrating a straightforward route to sequencing genomes from single cells. It is often challenging to manipulate or analyze the genetic material or genome of an individual cell. Biochemical DNA amplification technologies can be used to make many copies of the genome from a single cell, and in this paper we investigated how well such amplification works as a function of the reaction volume. We found that single-cell genome amplification in nanoliter volumes is much more effective than in microliter volumes, providing better representation of the starting genome with less bias in the product. It should therefore be possible to obtain high-quality genome sequences from single cells. This is useful because very few microbes can be obtained in pure culture, and are therefore only amenable to single-cell analysis.
Collapse
|
29
|
Single-cell genomic sequencing using Multiple Displacement Amplification. Curr Opin Microbiol 2007; 10:510-6. [PMID: 17923430 DOI: 10.1016/j.mib.2007.08.005] [Citation(s) in RCA: 161] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2007] [Revised: 08/24/2007] [Accepted: 08/29/2007] [Indexed: 12/01/2022]
Abstract
Single microbial cells can now be sequenced using DNA amplified by the Multiple Displacement Amplification (MDA) reaction. The few femtograms of DNA in a bacterium are amplified into micrograms of high molecular weight DNA suitable for DNA library construction and Sanger sequencing. The MDA-generated DNA also performs well when used directly as template for pyrosequencing by the 454 Life Sciences method. While MDA from single cells loses some of the genomic sequence, this approach will greatly accelerate the pace of sequencing from uncultured microbes. The genetically linked sequences from single cells are also a powerful tool to be used in guiding genomic assembly of shotgun sequences of multiple organisms from environmental DNA extracts (metagenomic sequences).
Collapse
|
30
|
Abstract
We have developed a new method for identifying specific single- or double-stranded DNA sequences called nicking endonuclease signal amplification (NESA). A probe and target DNA anneal to create a restriction site that is recognized by a strand-specific endonuclease that cleaves the probe into two pieces leaving the target DNA intact. The target DNA can then act as a template for fresh probe and the process of hybridization, cleavage and dissociation repeats. Laser-induced fluorescence coupled with capillary electrophoresis was used to measure the probe cleavage products. The reaction is rapid; full cleavage of probe occurs within one minute under ideal conditions. The reaction is specific since it requires complete complementarity between the oligonucleotide and the template at the restriction site and sufficient complementarity overall to allow hybridization. We show that both Bacillus subtilis and B. anthracis genomic DNA can be detected and specifically differentiated from DNA of other Bacillus species. When combined with multiple displacement amplification, detection of a single copy target from less than 30 cfu is possible. This method should be applicable whenever there is a requirement to detect a specific DNA sequence. Other applications include SNP analysis and genotyping. The reaction is inherently simple to multiplex and is amenable to automation.
Collapse
|
31
|
Mechanism of chimera formation during the Multiple Displacement Amplification reaction. BMC Biotechnol 2007; 7:19. [PMID: 17430586 PMCID: PMC1855051 DOI: 10.1186/1472-6750-7-19] [Citation(s) in RCA: 202] [Impact Index Per Article: 11.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2007] [Accepted: 04/12/2007] [Indexed: 11/21/2022] Open
Abstract
Background Multiple Displacement Amplification (MDA) is a method used for amplifying limiting DNA sources. The high molecular weight amplified DNA is ideal for DNA library construction. While this has enabled genomic sequencing from one or a few cells of unculturable microorganisms, the process is complicated by the tendency of MDA to generate chimeric DNA rearrangements in the amplified DNA. Determining the source of the DNA rearrangements would be an important step towards reducing or eliminating them. Results Here, we characterize the major types of chimeras formed by carrying out an MDA whole genome amplification from a single E. coli cell and sequencing by the 454 Life Sciences method. Analysis of 475 chimeras revealed the predominant reaction mechanisms that create the DNA rearrangements. The highly branched DNA synthesized in MDA can assume many alternative secondary structures. DNA strands extended on an initial template can be displaced becoming available to prime on a second template creating the chimeras. Evidence supports a model in which branch migration can displace 3'-ends freeing them to prime on the new templates. More than 85% of the resulting DNA rearrangements were inverted sequences with intervening deletions that the model predicts. Intramolecular rearrangements were favored, with displaced 3'-ends reannealing to single stranded 5'-strands contained within the same branched DNA molecule. In over 70% of the chimeric junctions, the 3' termini had initiated priming at complimentary sequences of 2–21 nucleotides (nts) in the new templates. Conclusion Formation of chimeras is an important limitation to the MDA method, particularly for whole genome sequencing. Identification of the mechanism for chimera formation provides new insight into the MDA reaction and suggests methods to reduce chimeras. The 454 sequencing approach used here will provide a rapid method to assess the utility of reaction modifications.
Collapse
|
32
|
Specific single-cell isolation and genomic amplification of uncultured microorganisms. Appl Microbiol Biotechnol 2006; 74:926-35. [PMID: 17109170 DOI: 10.1007/s00253-006-0725-7] [Citation(s) in RCA: 60] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2006] [Revised: 10/16/2006] [Accepted: 10/16/2006] [Indexed: 10/23/2022]
Abstract
We in this study describe a new method for genomic studies of individual uncultured prokaryotic organisms, which was used for the isolation and partial genome sequencing of a soil archaeon. The diversity of Archaea in a soil sample was mapped by generating a clone library using group-specific primers in combination with a terminal restriction fragment length polymorphism profile. Intact cells were extracted from the environmental sample, and fluorescent in situ hybridization probing with Cy3-labeled probes designed from the clone library was subsequently used to detect the organisms of interest. Single cells with a bright fluorescent signal were isolated using a micromanipulator and the genome of the single isolated cells served as a template for multiple displacement amplification (MDA) using the Phi29 DNA polymerase. The generated MDA product was afterwards used for 16S rRNA gene sequence analysis and shotgun-cloned for additional genomic analysis. Sequence analysis showed >99% 16S rRNA gene homology to soil crenarchaeotal clone SCA1170 and shotgun fragments had the closest match to a crenarchaeotal BAC clone previously retrieved from a soil sample. The system was validated using Methanothermobacter thermoautotrophicus as single-cell test organism, and the validation setup produced 100% sequence homology to the ten tested regions of the genome of this organism.
Collapse
MESH Headings
- Archaea/genetics
- Archaea/isolation & purification
- Base Sequence
- Biodiversity
- DNA Primers/genetics
- DNA, Archaeal/chemistry
- DNA, Archaeal/genetics
- DNA, Archaeal/isolation & purification
- DNA, Ribosomal/chemistry
- DNA, Ribosomal/genetics
- Genome, Archaeal/genetics
- In Situ Hybridization, Fluorescence
- Methanobacteriaceae/genetics
- Micromanipulation
- Molecular Sequence Data
- Nucleic Acid Amplification Techniques/methods
- Phylogeny
- RNA, Ribosomal, 16S/genetics
- Sequence Analysis, DNA
- Sequence Homology
- Soil Microbiology
Collapse
|
33
|
Abstract
Genomic DNA was amplified about 5 billion-fold from single, flow-sorted bacterial cells by the multiple displacement amplification (MDA) reaction, using phi 29 DNA polymerase. A 662-bp segment of the 16S rRNA gene could be accurately sequenced from the amplified DNA. MDA methods enable new strategies for studying non-culturable microorganisms.
Collapse
|
34
|
Two methods of whole-genome amplification enable accurate genotyping across a 2320-SNP linkage panel. Genome Res 2004; 14:901-7. [PMID: 15123587 PMCID: PMC479118 DOI: 10.1101/gr.1949704] [Citation(s) in RCA: 164] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
Comprehensive genome scans involving many thousands of SNP assays will require significant amounts of genomic DNA from each sample. We report two successful methods for amplifying whole-genomic DNA prior to SNP analysis, multiple displacement amplification, and OmniPlex technology. We determined the coverage of amplification by analyzing a SNP linkage marker set that contained 2320 SNP markers spread across the genome at an average distance of 2.5 cM. We observed a concordance of >99.8% in genotyping results from genomic DNA and amplified DNA, strongly indicating the ability of both methods used to amplify genomic DNA in a highly representative manner. Furthermore, we were able to achieve a SNP call rate of >98% in both genomic and amplified DNA. The combination of whole-genome amplification and comprehensive SNP linkage analysis offers new opportunities for genetic analysis in clinical trials, disease association studies, and archiving of DNA samples.
Collapse
|
35
|
Whole genome amplification: abundant supplies of DNA from precious samples or clinical specimens. Trends Biotechnol 2003; 21:531-5. [PMID: 14624861 DOI: 10.1016/j.tibtech.2003.09.010] [Citation(s) in RCA: 140] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
36
|
High accuracy genotyping directly from genomic DNA using a rolling circle amplification based assay. BMC Genomics 2003; 4:21. [PMID: 12777185 PMCID: PMC165428 DOI: 10.1186/1471-2164-4-21] [Citation(s) in RCA: 40] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2003] [Accepted: 05/30/2003] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Rolling circle amplification of ligated probes is a simple and sensitive means for genotyping directly from genomic DNA. SNPs and mutations are interrogated with open circle probes (OCP) that can be circularized by DNA ligase when the probe matches the genotype. An amplified detection signal is generated by exponential rolling circle amplification (ERCA) of the circularized probe. The low cost and scalability of ligation/ERCA genotyping makes it ideally suited for automated, high throughput methods. RESULTS A retrospective study using human genomic DNA samples of known genotype was performed for four different clinically relevant mutations: Factor V Leiden, Factor II prothrombin, and two hemochromatosis mutations, C282Y and H63D. Greater than 99% accuracy was obtained genotyping genomic DNA samples from hundreds of different individuals. The combined process of ligation/ERCA was performed in a single tube and produced fluorescent signal directly from genomic DNA in less than an hour. In each assay, the probes for both normal and mutant alleles were combined in a single reaction. Multiple ERCA primers combined with a quenched-peptide nucleic acid (Q-PNA) fluorescent detection system greatly accellerated the appearance of signal. Probes designed with hairpin structures reduced misamplification. Genotyping accuracy was identical from either purified genomic DNA or genomic DNA generated using whole genome amplification (WGA). Fluorescent signal output was measured in real time and as an end point. CONCLUSIONS Combining the optimal elements for ligation/ERCA genotyping has resulted in a highly accurate single tube assay for genotyping directly from genomic DNA samples. Accuracy exceeded 99 % for four probe sets targeting clinically relevant mutations. No genotypes were called incorrectly using either genomic DNA or whole genome amplified sample.
Collapse
|
37
|
Abstract
Preparation of genomic DNA from clinical samples is a bottleneck in genotyping and DNA sequencing analysis and is frequently limited by the amount of specimen available. We use Multiple Displacement Amplification (MDA) to amplify the whole genome 10,000-fold directly from small amounts of whole blood, dried blood, buccal cells, cultured cells, and buffy coats specimens, generating large amounts of DNA for genetic testing. Genomic DNA was evenly amplified with complete coverage and consistent representation of all genes. All 47 loci analyzed from 44 individuals were represented in the amplified DNA at between 0.5- and 3.0-fold of the copy number in the starting genomic DNA template. A high-fidelity DNA polymerase ensures accurate representation of the DNA sequence. The amplified DNA was indistinguishable from the original genomic DNA template in 5 SNP and 10 microsatellite DNA assays on three different clinical sample types for 20 individuals. Amplification of genomic DNA directly from cells is highly reproducible, eliminates the need for DNA template purification, and allows genetic testing from small clinical samples. The low amplification bias of MDA represents a dramatic technical improvement in the ability to amplify a whole genome compared with older, PCR-based methods.
Collapse
|
38
|
A full-coverage, high-resolution human chromosome 22 genomic microarray for clinical and research applications. Hum Mol Genet 2002; 11:3221-9. [PMID: 12444106 DOI: 10.1093/hmg/11.25.3221] [Citation(s) in RCA: 110] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
We have constructed the first comprehensive microarray representing a human chromosome for analysis of DNA copy number variation. This chromosome 22 array covers 34.7 Mb, representing 1.1% of the genome, with an average resolution of 75 kb. To demonstrate the utility of the array, we have applied it to profile acral melanoma, dermatofibrosarcoma, DiGeorge syndrome and neurofibromatosis 2. We accurately diagnosed homozygous/heterozygous deletions, amplifications/gains, IGLV/IGLC locus instability, and breakpoints of an imbalanced translocation. We further identified the 14-3-3 eta isoform as a candidate tumor suppressor in glioblastoma. Two significant methodological advances in array construction were also developed and validated. These include a strictly sequence defined, repeat-free, and non-redundant strategy for array preparation. This approach allows an increase in array resolution and analysis of any locus; disregarding common repeats, genomic clone availability and sequence redundancy. In addition, we report that the application of phi29 DNA polymerase is advantageous in microarray preparation. A broad spectrum of issues in medical research and diagnostics can be approached using the array. This well annotated and gene-rich autosome contains numerous uncharacterized disease genes. It is therefore crucial to associate these genes to specific 22q-related conditions and this array will be instrumental towards this goal. Furthermore, comprehensive epigenetic profiling of 22q-located genes and high-resolution analysis of replication timing across the entire chromosome can be studied using our array.
Collapse
|
39
|
Comprehensive human genome amplification using multiple displacement amplification. Proc Natl Acad Sci U S A 2002; 99:5261-6. [PMID: 11959976 PMCID: PMC122757 DOI: 10.1073/pnas.082089499] [Citation(s) in RCA: 1010] [Impact Index Per Article: 45.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023] Open
Abstract
Fundamental to most genetic analysis is availability of genomic DNA of adequate quality and quantity. Because DNA yield from human samples is frequently limiting, much effort has been invested in developing methods for whole genome amplification (WGA) by random or degenerate oligonucleotide-primed PCR. However, existing WGA methods like degenerate oligonucleotide-primed PCR suffer from incomplete coverage and inadequate average DNA size. We describe a method, termed multiple displacement amplification (MDA), which provides a highly uniform representation across the genome. Amplification bias among eight chromosomal loci was less than 3-fold in contrast to 4-6 orders of magnitude for PCR-based WGA methods. Average product length was >10 kb. MDA is an isothermal, strand-displacing amplification yielding about 20-30 microg product from as few as 1-10 copies of human genomic DNA. Amplification can be carried out directly from biological samples including crude whole blood and tissue culture cells. MDA-amplified human DNA is useful for several common methods of genetic analysis, including genotyping of single nucleotide polymorphisms, chromosome painting, Southern blotting and restriction fragment length polymorphism analysis, subcloning, and DNA sequencing. MDA-based WGA is a simple and reliable method that could have significant implications for genetic studies, forensics, diagnostics, and long-term sample storage.
Collapse
|
40
|
High-throughput genotyping of single nucleotide polymorphisms with rolling circle amplification. BMC Genomics 2001; 2:4. [PMID: 11511324 PMCID: PMC37402 DOI: 10.1186/1471-2164-2-4] [Citation(s) in RCA: 106] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2001] [Accepted: 08/01/2001] [Indexed: 12/04/2022] Open
Abstract
BACKGROUND Single nucleotide polymorphisms (SNPs) are the foundation of powerful complex trait and pharmacogenomic analyses. The availability of large SNP databases, however, has emphasized a need for inexpensive SNP genotyping methods of commensurate simplicity, robustness, and scalability. We describe a solution-based, microtiter plate method for SNP genotyping of human genomic DNA. The method is based upon allele discrimination by ligation of open circle probes followed by rolling circle amplification of the signal using fluorescent primers. Only the probe with a 3' base complementary to the SNP is circularized by ligation. RESULTS SNP scoring by ligation was optimized to a 100,000 fold discrimination against probe mismatched to the SNP. The assay was used to genotype 10 SNPs from a set of 192 genomic DNA samples in a high-throughput format. Assay directly from genomic DNA eliminates the need to preamplify the target as done for many other genotyping methods. The sensitivity of the assay was demonstrated by genotyping from 1 ng of genomic DNA. We demonstrate that the assay can detect a single molecule of the circularized probe. CONCLUSIONS Compatibility with homogeneous formats and the ability to assay small amounts of genomic DNA meets the exacting requirements of automated, high-throughput SNP scoring.
Collapse
|
41
|
Rapid amplification of plasmid and phage DNA using Phi 29 DNA polymerase and multiply-primed rolling circle amplification. Genome Res 2001; 11:1095-9. [PMID: 11381035 PMCID: PMC311129 DOI: 10.1101/gr.180501] [Citation(s) in RCA: 749] [Impact Index Per Article: 32.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Abstract
We describe a simple method of using rolling circle amplification to amplify vector DNA such as M13 or plasmid DNA from single colonies or plaques. Using random primers and phi29 DNA polymerase, circular DNA templates can be amplified 10,000-fold in a few hours. This procedure removes the need for lengthy growth periods and traditional DNA isolation methods. Reaction products can be used directly for DNA sequencing after phosphatase treatment to inactivate unincorporated nucleotides. Amplified products can also be used for in vitro cloning, library construction, and other molecular biology applications.
Collapse
|
42
|
Abstract
We show that archaebacterial DNA polymerases are strongly inhibited by the presence of small amounts of uracil-containing DNA. Inhibition appears to be competitive, with the DNA polymerase exhibiting approximately 6500-fold greater affinity for binding the inhibitor than a DNase I-activated DNA substrate. All six archaebacterial DNA polymerases tested were inhibited, while no eubacterial, eukaryotic, or bacteriophage enzymes showed this effect. Only a small inhibition resulted when uracil was present as the deoxynucleoside triphosphate, dUTP. The rate of DNA synthesis was reduced by approximately 40% when dUTP was used in place of dTTP for archaebacterial DNA polymerases. Furthermore, an incorporated dUMP served as a productive 3'-primer terminus for subsequent elongation. In contrast, the presence of an oligonucleotide containing as little as a single dUrd residue was extremely inhibitory to DNA polymerase activity on other primer-template DNA.
Collapse
|
43
|
The dnaB-dnaC replication protein complex of Escherichia coli. I. Formation and properties. J Biol Chem 1989; 264:2463-8. [PMID: 2536712] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
The complex formed between the dnaB and dnaC replication proteins of Escherichia coli is stabilized by ATP binding to dnaC. The dnaB6-dnaC6-ATP6 complex can be maintained without ATP hydrolysis at a concentration as low as 5 x 10(-10) M. The complex is also formed with adenosine 5'-(gamma-thio)triphosphate but generates little or no dnaB activity, suggesting a requirement for ATP hydrolysis in the subsequent stage of binding of the complex to DNA. In this step, dnaC is released, leaving dnaB to function on the associated DNA.
Collapse
|
44
|
The dnaB-dnaC replication protein complex of Escherichia coli. II. Role of the complex in mobilizing dnaB functions. J Biol Chem 1989; 264:2469-75. [PMID: 2536713] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
The dnaC protein of Escherichia coli, by forming a complex with the dnaB protein, facilitates the interactions with single-stranded DNA that enable dnaB to perform its ATPase, helicase, and priming functions. Within the dnaB-dnaC complex, dnaB appears to be inactive but becomes active upon the ATP-dependent release of dnaC from the complex. With adenosine 5'-(gamma-thio)triphosphate substituted for ATP, the dnaB-dnaC complex does not direct dnaB to its targeted actions. Excess dnaC inhibits dna beta actions and augments the ATP gamma S effects. In the dnaA protein-driven initiation of duplex chromosome replication, dnaB is introduced for its essential helicase role via the dnaB-dnaC complex. Similarly, when the dnaA protein interacts nonspecifically with single-stranded DNA, the dnaB-dnaC complex is essential to introduce dnaB for its role in primer formation by primase.
Collapse
|
45
|
|
46
|
|
47
|
The primosomal protein n' of Escherichia coli is a DNA helicase. J Biol Chem 1988; 263:5512-8. [PMID: 2833507] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023] Open
Abstract
Protein n' of Escherichia coli functions in assembly and translocation of the primosome, a mobile multiprotein complex involved in priming DNA replication (Kornberg, A. (1982) Supplement to DNA Replication, Freeman Publications, San Francisco). By itself, protein n' translocates on single-stranded DNA and destabilizes duplex regions by acting as a DNA helicase, using the energy of ATP or dATP hydrolysis. Single-stranded DNA binding protein was required for melting of duplex regions longer than 40 base pairs. Initial binding of protein n' to a specific site on DNA (Shlomai, J., and Kornberg, A. (1980) Proc. Natl. Acad. Sci. U.S.A. 77, 799-803) is essential for its helicase function. The polarity of protein n' translocation on DNA, in the 3' to 5' direction of the chain, suggests a mechanism for how the primosome may contribute to concurrent replication of both strands at a replication fork.
Collapse
|
48
|
|
49
|
The beta subunit dissociates readily from the Escherichia coli DNA polymerase III holoenzyme. J Biol Chem 1987; 262:1720-4. [PMID: 3543011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023] Open
Abstract
Purified DNA polymerase III holoenzyme (holoenzyme) was separated by glycerol gradient sedimentation into the beta subunit and the subassembly that lacks it (pol III). In the presence of ATP, beta subunit dimer dissociated from holoenzyme with a KD of 1 nM; in the absence of ATP, the KD was greater than 5 nM. The beta subunit was known to remain tightly associated in the holoenzyme upon formation of an initiation complex with a primed template and during the course of replication. With separation from the template, holoenzyme dissociated into beta and pol III. Cycling to a new template depended on the reformation of holoenzyme. Holoenzyme was in equilibrium with pol III and the beta subunit in crude enzyme fractions as well as in pure preparations.
Collapse
|
50
|
A fidelity assay using "dideoxy" DNA sequencing: a measurement of sequence dependence and frequency of forming 5-bromouracil X guanine base mispairs. Proc Natl Acad Sci U S A 1985; 82:1301-5. [PMID: 3856263 PMCID: PMC397248 DOI: 10.1073/pnas.82.5.1301] [Citation(s) in RCA: 19] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023] Open
Abstract
DNA replication fidelity has been assayed by using a modified DNA sequencing reaction. In one experimental approach, dideoxycytidine 5'-triphosphate (ddCTP) was used as a chain terminator during replication of M13 phage DNA by the large fragment of DNA polymerase I. The deoxyribonucleotide analogue BrdUTP was used to compete against ddCTP-induced chain terminations as an assay for B X G base mispairing (B represents bromodeoxyuridine when the analogue is present as a base pair or base mispair). By comparing BrdUTP to dCTP for competition against ddCTP, an average misincorporation frequency for BrdUMP of 0.2% was found. A similar average misincorporation frequency has been measured previously for the incorporation of radioactively labeled BrdUMP and dCMP into the synthetic template-primer poly-[d(G,T)] X oligo(dA). The advantage of the sequencing method is that an error frequency is determined for each template guanine in a defined DNA sequence, thus providing information on the effect of neighboring base sequences on fidelity. Misincorporation frequencies varied no more than 5-fold among 50 template guanines tested. The approach used here is not limited for use with nucleotide analogues but is generally applicable in determining misincorporation frequencies and sequence specificities for any deoxynucleoside triphosphate substrate. In a second experimental approach, base mispairing between bromouracil and guanine was demonstrated directly by using 5-bromodideoxyuridine 5'-triphosphate (BrddUTP). A comparison of chain terminations attributable to BrddUTP and to dideoxythymidine 5'-triphosphate (ddTTP) revealed that B X A and T X A base pairs formed at about the same rate, whereas B X G mispairs occurred 4-10 times more frequently than T X G. The elevation in the frequency of B X G over T X G mispairs is consistent with the mutagenic behavior of the base analogue.
Collapse
|