1
|
Abstract
Next generation sequencing is a transformative technology for discovering and diagnosing genetic disorders. However, high-throughput sequencing remains error-prone, necessitating variant confirmation in order to meet the exacting demands of clinical diagnostic sequencing. To address this, we devised an orthogonal, dual platform approach employing complementary target capture and sequencing chemistries to improve speed and accuracy of variant calls at a genomic scale. We combined DNA selection by bait-based hybridization followed by Illumina NextSeq reversible terminator sequencing with DNA selection by amplification followed by Ion Proton semiconductor sequencing. This approach yields genomic scale orthogonal confirmation of ~95% of exome variants. Overall variant sensitivity improves as each method covers thousands of coding exons missed by the other. We conclude that orthogonal NGS offers improvements in variant calling sensitivity when two platforms are used, better specificity for variants identified on both platforms, and greatly reduces the time and expense of Sanger follow-up, thus enabling physicians to act on genomic results more quickly.
Collapse
|
2
|
Deadenylase depletion protects inherited mRNAs in primordial germ cells. Development 2014; 141:3134-42. [PMID: 25100654 PMCID: PMC4197533 DOI: 10.1242/dev.110395] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2014] [Accepted: 06/22/2014] [Indexed: 12/12/2022]
Abstract
A crucial event in animal development is the specification of primordial germ cells (PGCs), which become the stem cells that create sperm and eggs. How PGCs are created provides a valuable paradigm for understanding stem cells in general. We find that the PGCs of the sea urchin Strongylocentrotus purpuratus exhibit broad transcriptional repression, yet enrichment for a set of inherited mRNAs. Enrichment of several germline determinants in the PGCs requires the RNA-binding protein Nanos to target the transcript that encodes CNOT6, a deadenylase, for degradation in the PGCs, thereby creating a stable environment for RNA. Misexpression of CNOT6 in the PGCs results in their failure to retain Seawi transcripts and Vasa protein. Conversely, broad knockdown of CNOT6 expands the domain of Seawi RNA as well as exogenous reporters. Thus, Nanos-dependent spatially restricted CNOT6 differential expression is used to selectively localize germline RNAs to the PGCs. Our findings support a 'time capsule' model of germline determination, whereby the PGCs are insulated from differentiation by retaining the molecular characteristics of the totipotent egg and early embryo.
Collapse
|
3
|
On the importance of small changes in RNA expression. Methods 2013; 63:18-24. [DOI: 10.1016/j.ymeth.2013.03.027] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2012] [Revised: 02/26/2013] [Accepted: 03/22/2013] [Indexed: 01/09/2023] Open
|
4
|
Correction: Corrigendum: RNA sequencing of pancreatic circulating tumour cells implicates WNT signalling in metastasis. Nature 2012. [DOI: 10.1038/nature11644] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
|
5
|
An in-depth map of polyadenylation sites in cancer. Nucleic Acids Res 2012; 40:8460-71. [PMID: 22753024 PMCID: PMC3458571 DOI: 10.1093/nar/gks637] [Citation(s) in RCA: 115] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2011] [Revised: 05/16/2012] [Accepted: 06/06/2012] [Indexed: 12/22/2022] Open
Abstract
We present a comprehensive map of over 1 million polyadenylation sites and quantify their usage in major cancers and tumor cell lines using direct RNA sequencing. We built the Expression and Polyadenylation Database to enable the visualization of the polyadenylation maps in various cancers and to facilitate the discovery of novel genes and gene isoforms that are potentially important to tumorigenesis. Analyses of polyadenylation sites indicate that a large fraction (∼30%) of mRNAs contain alternative polyadenylation sites in their 3' untranslated regions, independent of the cell type. The shortest 3' untranslated region isoforms are preferentially upregulated in cancer tissues, genome-wide. Candidate targets of alternative polyadenylation-mediated upregulation of short isoforms include POLR2K, and signaling cascades of cell-cell and cell-extracellular matrix contact, particularly involving regulators of Rho GTPases. Polyadenylation maps also helped to improve 3' untranslated region annotations and identify candidate regulatory marks such as sequence motifs, H3K36Me3 and Pabpc1 that are isoform dependent and occur in a position-specific manner. In summary, these results highlight the need to go beyond monitoring only the cumulative transcript levels for a gene, to separately analysing the expression of its RNA isoforms.
Collapse
|
6
|
RNA sequencing of pancreatic circulating tumour cells implicates WNT signalling in metastasis. Nature 2012; 487:510-3. [PMID: 22763454 PMCID: PMC3408856 DOI: 10.1038/nature11217] [Citation(s) in RCA: 384] [Impact Index Per Article: 32.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2011] [Accepted: 05/08/2012] [Indexed: 01/05/2023]
Abstract
Circulating tumour cells (CTCs) shed into blood from primary cancers include putative precursors that initiate distal metastases1. While these cells are extraordinarily rare, they may identify cellular pathways contributing to the blood-borne dissemination of cancer. Here, we adapted a microfluidic device2 for efficient capture of CTCs from an endogenous mouse pancreatic cancer model3 and subjected CTCs to single molecule RNA sequencing4, identifying Wnt2 as a candidate gene enriched in CTCs. Expression of Wnt2 in pancreatic cancer cells suppresses anoikis, enhances anchorage-independent sphere formation, and increases metastatic propensity in vivo. This effect is correlated with fibronectin upregulation and suppressed by inhibition of Map3k7 (Tak1) kinase. In humans, formation of non-adherent tumour spheres by pancreatic cancer cells is associated with upregulation of multiple Wnt genes, and pancreatic CTCs revealed enrichment for Wnt signaling in 5 of 11 cases. Thus, molecular analysis of CTCs may identify candidate therapeutic targets to prevent the distal spread of cancer.
Collapse
|
7
|
Direct sequencing of Arabidopsis thaliana RNA reveals patterns of cleavage and polyadenylation. Nat Struct Mol Biol 2012; 19:845-52. [PMID: 22820990 PMCID: PMC3533403 DOI: 10.1038/nsmb.2345] [Citation(s) in RCA: 131] [Impact Index Per Article: 10.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2012] [Accepted: 06/19/2012] [Indexed: 11/22/2022]
Abstract
It has recently been shown that RNA 3′ end formation plays a more widespread role in controlling gene expression than previously thought. In order to examine the impact of regulated 3′ end formation genome-wide we applied direct RNA sequencing to A. thaliana. Here we show the authentic transcriptome in unprecedented detail and how 3′ end formation impacts genome organization. We reveal extreme heterogeneity in RNA 3′ ends, discover previously unrecognized non-coding RNAs and propose widespread re-annotation of the genome. We explain the origin of most poly(A)+ antisense RNAs and identify cis-elements that control 3′ end formation in different registers. These findings are essential to understand what the genome actually encodes, how it is organized and the impact of regulated 3′ end formation on these processes.
Collapse
|
8
|
Abstract 5304: RNA sequencing of circulating tumor cells implicates WNT signaling in pancreatic cancer metastasis. Cancer Res 2012. [DOI: 10.1158/1538-7445.am2012-5304] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Abstract
Circulating tumor cells (CTCs) shed into blood from primary cancers include putative precursors that initiate distal metastases. While isolation and molecular analysis of these cells is technically challenging, they may identify cellular pathways that contribute to the blood-borne dissemination of cancer cells. Here, we adapted a microfluidic device for efficient capture of CTCs from an endogenous mouse pancreatic cancer model and subjected CTCs to digital gene expression (DGE) analysis using single molecule RNA sequencing. We identified Wnt2 as enriched in both CTCs and metastatic ascites cells, compared with primary tumors. Expression of Wnt2 in pancreatic cancer cells suppresses anoikis, enhances anchorage-independent sphere formation, and increases metastatic propensity in vivo. The effect of Wnt2 is correlated with fibronectin upregulation, and it is mediated in part through non-canonical Wnt signaling and suppressed by inhibition of the Map3k7 (Tak1) kinase, an integrator of Wnt, BMP and TGF-β signaling. DGE analysis of human pancreatic CTCs reveals significant enrichment for non-canonical Wnt signaling in 5 of 11 cases, with Wnt2 RNA detectable by in situ hybridization in a subset of CTCs. Identifying CTC-enriched transcripts such as Wnt2 may point to pathways that enhance haematogenous metastasis, thus providing novel therapeutic targets to prevent the distal spread of cancer.
Citation Format: {Authors}. {Abstract title} [abstract]. In: Proceedings of the 103rd Annual Meeting of the American Association for Cancer Research; 2012 Mar 31-Apr 4; Chicago, IL. Philadelphia (PA): AACR; Cancer Res 2012;72(8 Suppl):Abstract nr 5304. doi:1538-7445.AM2012-5304
Collapse
|
9
|
Abstract
The importance of short (<200 nt) RNAs in cell biogenesis has been well documented. These short RNAs include crucial classes of molecules such as transfer RNAs, small nuclear RNA, microRNAs, and many others (reviewed in Storz et al., Annu Rev Biochem 74:199-217, 2005; Ghildiyal and Zamore, Nat Rev Genet 10:94-108, 2009). Furthermore, the realm of functional RNAs that fall within this size range is growing to include less well-characterized RNAs such as short RNAs found at the promoters and 3' termini of genes (Affymetrix ENCODE Transcriptome Project et al., Nature 457:1028-1032, 2009; Davis and Ares, Proc Natl Acad Sci USA 103:3262-3267, 2006; Kapranov et al., Science 316:1484-1488, 2007; Taft et al., Nat Genet 41:572-578, 2009; Kapranov et al., Nature 466:642-646, 2010), short RNAs involved in paramutation (Rassoulzadegan et al., Nature 441:469-474, 2006), and others (reviewed in Kawaji and Hayashizaki, PLoS Genet 4:e22, 2008). Discovery and accurate quantification of these RNA molecules, less than 200 bases in size, is thus an important and also challenging aspect of understanding the full repertoire of cellular and extracellular RNAs. Here, we describe the strategies and procedures we developed to profile short RNA species using single-molecule sequencing (SMS) and the advantages SMS offers.
Collapse
|
10
|
Native molecular state of adeno-associated viral vectors revealed by single-molecule sequencing. Hum Gene Ther 2011; 23:46-55. [PMID: 21875357 DOI: 10.1089/hum.2011.160] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The single-stranded genome of adeno-associated viral (AAV) vectors is one of the key factors leading to slow-rising but long-term transgene expression kinetics. Previous molecular studies have established what is now considered a textbook molecular model of AAV genomes with two copies of inverted tandem repeats at either end. In this study, we profiled hundreds of thousands of individual molecules of AAV vector DNA directly isolated from capsids, using single-molecule sequencing (SMS), which avoids any intermediary steps such as plasmid cloning. The sequence profile at 3' ends of both the regular and oversized vector did show the presence of an inverted terminal repeat (ITR), which provided direct confirmation that AAV vector packaging initiates from its 3' end. Furthermore, the vector 5'-terminus profile showed inconsistent termination for oversized vectors. Such incomplete vectors would not be expected to undergo canonical synthesis of the second strand of their genomic DNA and thus could function only via annealing of complementary strands of DNA. Furthermore, low levels of contaminating plasmid DNA were also detected. SMS may become a valuable tool during the development phase of vectors that are candidates for clinical use and for facilitating/accelerating studies on vector biology.
Collapse
|
11
|
Abstract
Genetic testing for disease risk is an increasingly important component of medical care. However, testing can be expensive, which can lead to patients and physicians having limited access to the genetic information needed for medical decisions. To simplify DNA sample preparation and lower costs, we have developed a system in which any gene can be captured and sequenced directly from human genomic DNA without amplification, using no proteins or enzymes prior to sequencing. Extracted whole-genome DNA is acoustically sheared and loaded in a flow cell channel for single-molecule sequencing. Gene isolation, amplification, or ligation is not necessary. Accurate and low-cost detection of DNA sequence variants is demonstrated for the BRCA1 gene. Disease-causing mutations as well as common variants from well-characterized samples are identified. Single-molecule sequencing generates very reproducible coverage patterns, and these can be used to detect any size insertion or deletion directly, unlike PCR-based methods, which require additional assays. Because no gene isolation or amplification is required for sequencing, the exceptionally low costs of sample preparation and analysis could make genetic tests more accessible to those who wish to know their own disease susceptibility. Additionally, this approach has applications for sequencing integration sites for gene therapy vectors, transposons, retroviruses, and other mobile DNA elements in a more facile manner than possible with other methods.
Collapse
|
12
|
Genome-wide mapping of 5-hydroxymethylcytosine in embryonic stem cells. Nature 2011; 473:394-7. [PMID: 21552279 DOI: 10.1038/nature10102] [Citation(s) in RCA: 620] [Impact Index Per Article: 47.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2010] [Accepted: 04/11/2011] [Indexed: 01/05/2023]
Abstract
5-hydroxymethylcytosine (5hmC) is a modified base present at low levels in diverse cell types in mammals. 5hmC is generated by the TET family of Fe(II) and 2-oxoglutarate-dependent enzymes through oxidation of 5-methylcytosine (5mC). 5hmC and TET proteins have been implicated in stem cell biology and cancer, but information on the genome-wide distribution of 5hmC is limited. Here we describe two novel and specific approaches to profile the genomic localization of 5hmC. The first approach, termed GLIB (glucosylation, periodate oxidation, biotinylation) uses a combination of enzymatic and chemical steps to isolate DNA fragments containing as few as a single 5hmC. The second approach involves conversion of 5hmC to cytosine 5-methylenesulphonate (CMS) by treatment of genomic DNA with sodium bisulphite, followed by immunoprecipitation of CMS-containing DNA with a specific antiserum to CMS. High-throughput sequencing of 5hmC-containing DNA from mouse embryonic stem (ES) cells showed strong enrichment within exons and near transcriptional start sites. 5hmC was especially enriched at the start sites of genes whose promoters bear dual histone 3 lysine 27 trimethylation (H3K27me3) and histone 3 lysine 4 trimethylation (H3K4me3) marks. Our results indicate that 5hmC has a probable role in transcriptional regulation, and suggest a model in which 5hmC contributes to the 'poised' chromatin signature found at developmentally-regulated genes in ES cells.
Collapse
|
13
|
Protocol dependence of sequencing-based gene expression measurements. PLoS One 2011; 6:e19287. [PMID: 21573114 PMCID: PMC3089619 DOI: 10.1371/journal.pone.0019287] [Citation(s) in RCA: 74] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2011] [Accepted: 03/25/2011] [Indexed: 11/22/2022] Open
Abstract
RNA Seq provides unparalleled levels of information about the transcriptome including precise expression levels over a wide dynamic range. It is essential to understand how technical variation impacts the quality and interpretability of results, how potential errors could be introduced by the protocol, how the source of RNA affects transcript detection, and how all of these variations can impact the conclusions drawn. Multiple human RNA samples were used to assess RNA fragmentation, RNA fractionation, cDNA synthesis, and single versus multiple tag counting. Though protocols employing polyA RNA selection generate the highest number of non-ribosomal reads and the most precise measurements for coding transcripts, such protocols were found to detect only a fraction of the non-ribosomal RNA in human cells. PolyA RNA excludes thousands of annotated and even more unannotated transcripts, resulting in an incomplete view of the transcriptome. Ribosomal-depleted RNA provides a more cost-effective method for generating complete transcriptome coverage. Expression measurements using single tag counting provided advantages for assessing gene expression and for detecting short RNAs relative to multi-read protocols. Detection of short RNAs was also hampered by RNA fragmentation. Thus, this work will help researchers choose from among a range of options when analyzing gene expression, each with its own advantages and disadvantages.
Collapse
|
14
|
Abstract 1177: The majority of total nuclear-encoded non-ribosomal RNA in human normal and tumor cells is “dark matter” unannotated RNA. Cancer Res 2011. [DOI: 10.1158/1538-7445.am2011-1177] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Abstract
Transcriptional output of human genome is far more complex than predicted by the current set of protein-coding annotations and most of the novel RNAs being produced appear to not encode proteins. This has transformed our understanding of genome complexity in recent years and suggested new paradigms of genome regulation. However, the fraction of the genome that is utilized to produce cellular RNA whose function we do not understand and even more so, their relative mass in a cell remains a controversial issue.
RNA from normal human liver and brain, the K562 leukemia cell line and 6 paired Ewing primary and metastatic tumors was converted into cDNA using random hexamers and sequenced using single-molecule sequencing (SMS). No amplification, ligation, or size selection were used thus minimizing methodological biases. PolyA+ RNA, total RNA, and total RNA depleted of ribosomal RNA were studied. The SMS reads were aligned to the complete human genome and uniquely mapping reads from human tissue sources were further filtered to exclude sequences aligning to rDNA sequences, the mitochondrial genome, as well as to genomic repeats annotated by the RepeatMasker program as rRNA. After filtering, the remaining informative reads were used for subsequent analyses, including comparison to known annotations defined by the exons of UCSC Genes.
This investigation makes the following key observations. 1. We show clearly that the so-called “dark matter RNAs”, which represent mostly non-coding RNA, not only exist in human cells but can comprise the majority of total non-ribosomal, non-mitochondrial RNA. In fact, we estimate that half to two-thirds of all such RNAs in a human cell is non-coding. 2. It shows a significant loss of this complexity if only polyA+ RNA is profiled. In this respect, most, if not all, contemporary RNA-seq papers continue to focus on this type of RNA and thus report significantly skewed results in terms of the true complexity of human RNA. 3. We show the presence of a large number of very long (100's of kbs), abundant intergenic transcribed regions located in areas of the genome that are devoid of protein-coding annotations. We show evidence that these very long and likely non-coding RNA transcripts are expressed during normal development, silenced in adult tissues and are then re-activated during cancer progression.
Our understanding of the repertoire of human RNAs remains far from complete, and almost all RNA-Seq studies have missed this complexity due to the limited view obtained when using only the polyA+ RNA fraction. Moreover, many novel genomic regions give rise to RNAs differentially expressed in different tumor types and also in primary vs metastatic tumors derived from the same patient. This brings a tantalizing possibility that a great number of hitherto uncharacterized RNAs are involved in tumoregenesis and they could be used as both diagnostic and potentially therapeutic targets.
Citation Format: {Authors}. {Abstract title} [abstract]. In: Proceedings of the 102nd Annual Meeting of the American Association for Cancer Research; 2011 Apr 2-6; Orlando, FL. Philadelphia (PA): AACR; Cancer Res 2011;71(8 Suppl):Abstract nr 1177. doi:10.1158/1538-7445.AM2011-1177
Collapse
|
15
|
Single-molecule direct RNA sequencing without cDNA synthesis. WILEY INTERDISCIPLINARY REVIEWS-RNA 2011; 2:565-70. [PMID: 21957044 DOI: 10.1002/wrna.84] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
Methods for in-depth genome-wide characterization of transcriptomes and quantification of transcript levels using various microarray and next-generation sequencing technologies have emerged as valuable tools for understanding cellular physiology and human disease biology and have begun to be utilized in various clinical diagnostic applications. Current methods, however, typically require RNA to be converted to complementary DNA prior to measurements. This step has been shown to introduce many biases and artifacts. In order to best characterize the 'true' transcriptome, the single-molecule direct RNA sequencing (DRS) technology was developed. This review focuses on the underlying principles behind the DRS, sample preparation steps, and the current and novel avenues of research and applications DRS offers.
Collapse
|
16
|
A comparison of single molecule and amplification based sequencing of cancer transcriptomes. PLoS One 2011; 6:e17305. [PMID: 21390249 PMCID: PMC3046973 DOI: 10.1371/journal.pone.0017305] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2010] [Accepted: 01/28/2011] [Indexed: 11/19/2022] Open
Abstract
The second wave of next generation sequencing technologies, referred to as single-molecule sequencing (SMS), carries the promise of profiling samples directly without employing polymerase chain reaction steps used by amplification-based sequencing (AS) methods. To examine the merits of both technologies, we examine mRNA sequencing results from single-molecule and amplification-based sequencing in a set of human cancer cell lines and tissues. We observe a characteristic coverage bias towards high abundance transcripts in amplification-based sequencing. A larger fraction of AS reads cover highly expressed genes, such as those associated with translational processes and housekeeping genes, resulting in relatively lower coverage of genes at low and mid-level abundance. In contrast, the coverage of high abundance transcripts plateaus off using SMS. Consequently, SMS is able to sequence lower- abundance transcripts more thoroughly, including some that are undetected by AS methods; however, these include many more mapping artifacts. A better understanding of the technical and analytical factors introducing platform specific biases in high throughput transcriptome sequencing applications will be critical in cross platform meta-analytic studies.
Collapse
|
17
|
Abstract
Single-molecule sequencing enables DNA or RNA to be sequenced directly from biological samples, making it well-suited for diagnostic and clinical applications. Here we review the properties and applications of this rapidly evolving and promising technology.
Collapse
|
18
|
Abstract
In the few years since its initial application, massively parallel cDNA sequencing, or RNA-seq, has allowed many advances in the characterization and quantification of transcriptomes. Recently, several developments in RNA-seq methods have provided an even more complete characterization of RNA transcripts. These developments include improvements in transcription start site mapping, strand-specific measurements, gene fusion detection, small RNA characterization and detection of alternative splicing events. Ongoing developments promise further advances in the application of RNA-seq, particularly direct RNA sequencing and approaches that allow RNA quantification from very small amounts of cellular materials.
Collapse
|
19
|
Comprehensive polyadenylation site maps in yeast and human reveal pervasive alternative polyadenylation. Cell 2011; 143:1018-29. [PMID: 21145465 DOI: 10.1016/j.cell.2010.11.020] [Citation(s) in RCA: 311] [Impact Index Per Article: 23.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2010] [Revised: 09/28/2010] [Accepted: 11/09/2010] [Indexed: 01/12/2023]
Abstract
The emerging discoveries on the link between polyadenylation and disease states underline the need to fully characterize genome-wide polyadenylation states. Here, we report comprehensive maps of global polyadenylation events in human and yeast generated using refinements to the Direct RNA Sequencing technology. This direct approach provides a quantitative view of genome-wide polyadenylation states in a strand-specific manner and requires only attomole RNA quantities. The polyadenylation profiles revealed an abundance of unannotated polyadenylation sites, alternative polyadenylation patterns, and regulatory element-associated poly(A)(+) RNAs. We observed differences in sequence composition surrounding canonical and noncanonical human polyadenylation sites, suggesting novel noncoding RNA-specific polyadenylation mechanisms in humans. Furthermore, we observed the correlation level between sense and antisense transcripts to depend on gene expression levels, supporting the view that overlapping transcription from opposite strands may play a regulatory role. Our data provide a comprehensive view of the polyadenylation state and overlapping transcription.
Collapse
|
20
|
Abstract
The recent transition in gene expression analysis technology to ultra high-throughput cDNA sequencing provides a means for higher quantitation sensitivity across a wider dynamic range than previously possible. Sensitivity of detection is mostly a function of the sheer number of sequence reads generated. Typically, RNA is converted to cDNA using random hexamers and the cDNA is subsequently sequenced (RNA-Seq). With this approach, higher read numbers are generated for long transcripts as compared to short ones. This length bias necessitates the generation of very high read numbers to achieve sensitive quantitation of short, low-expressed genes. To eliminate this length bias, we have developed an ultra high-throughput sequencing approach where only a single read is generated for each transcript molecule (single-molecule sequencing Digital Gene Expression (smsDGE)). So, for example, equivalent quantitation accuracy of the yeast transcriptome can be achieved by smsDGE using only 25% of the reads that would be required using RNA-Seq. For sample preparation, RNA is first reverse-transcribed into single-stranded cDNA using oligo-dT as a primer. A poly-A tail is then added to the 3' ends of cDNA to facilitate the hybridization of the sample to the Helicos(®) single-molecule sequencing Flow-Cell to which a poly dT oligo serves as the substrate for subsequent sequencing by synthesis. No PCR, sample-size selection, or ligation steps are required, thus avoiding possible biases that may be introduced by such manipulations. Each tailed cDNA sample is injected into one of 50 flow-cell channels and sequenced on the Helicos(®) Genetic Analysis System. Thus, 50 samples are sequenced simultaneously generating 10-20 million sequence reads on average for each sample channel. The sequence reads can then be aligned to the reference of choice such as the transcriptome, for quantitation of known transcripts, or the genome for novel transcript discovery. This chapter provides a summary of the methods required for smsDGE.
Collapse
MESH Headings
- DNA Primers/genetics
- DNA Primers/metabolism
- DNA, Complementary/biosynthesis
- DNA, Complementary/metabolism
- DNA, Single-Stranded/biosynthesis
- DNA, Single-Stranded/metabolism
- Deoxyribonucleases/metabolism
- High-Throughput Nucleotide Sequencing/methods
- Nucleic Acid Hybridization
- Poly A/metabolism
- Polyadenylation
- RNA/genetics
- RNA/metabolism
- RNA, Fungal/genetics
- RNA, Fungal/metabolism
- RNA, Messenger/genetics
- RNA, Messenger/metabolism
- Sequence Analysis, DNA/methods
- Sequence Analysis, RNA/methods
Collapse
|
21
|
Abstract
With the advent of high-throughput sequencing technologies, multiple bacterial genomes can be sequenced in days. While the ultimate goal of de novo assembly of bacterial genomes is progressing, changes in the genomic sequence of closely related bacterial strains and isolates are now easily monitored by comparison of their sequences to those of a reference genome. Such studies can be applied to the fields of bacterial evolution, epidemiology, and diagnostics. We present a protocol for single-molecule sequencing of bacterial DNA whose end result is the identification of single nucleotide variants, and various size insertions and deletions relative to a reference genome. The protocol is characterized by the simplicity of sample preparation and the lack of amplification-related sequencing bias.
Collapse
|
22
|
Abstract
In the few years since its initial application, massively parallel cDNA sequencing, or RNA-seq, has allowed many advances in the characterization and quantification of transcriptomes. Recently, several developments in RNA-seq methods have provided an even more complete characterization of RNA transcripts. These developments include improvements in transcription start site mapping, strand-specific measurements, gene fusion detection, small RNA characterization and detection of alternative splicing events. Ongoing developments promise further advances in the application of RNA-seq, particularly direct RNA sequencing and approaches that allow RNA quantification from very small amounts of cellular materials.
Collapse
|
23
|
Abstract
Helicos Single-Molecule Sequencing provides a unique view of genome biology through direct sequencing of cellular and extracellular nucleic acids in an unbiased manner, providing both quantitation and sequence information. Using a simple sample preparation, involving no ligation or amplification, genomic DNA is sheared, tailed with poly-A and hybridized to the flow-cell surface containing oligo-dT for initiating sequencing-by-synthesis. RNA measurements involving direct RNA hybridization to the flow cell allows for the direct sequencing and quantitation of RNA molecules. From these methods, a diverse array of applications has now been successfully demonstrated with the Helicos Genetic Analysis System, including human genome sequencing for accurate variant detection, ChIP Seq studies involving picogram quantities of DNA obtained from small cell numbers, copy number variation studies from both fresh tumor tissue and formalin-fixed paraffin-embedded tissue and archival tissue samples, small RNA studies leading to the identification of new classes of RNAs, and the direct capture and sequencing of nucleic acids from cell quantities as few as 400 cells with our end goal of single cell measurements. Helicos methods provide an important opportunity to researchers, including genomic scientists, translational researchers, and diagnostic experts, to benefit from biological measurements at the single-molecule level. This chapter will describe the various methods available to researchers.
Collapse
|
24
|
New class of gene-termini-associated human RNAs suggests a novel RNA copying mechanism. Nature 2010; 466:642-6. [PMID: 20671709 DOI: 10.1038/nature09190] [Citation(s) in RCA: 76] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2009] [Accepted: 05/20/2010] [Indexed: 11/09/2022]
Abstract
Small (<200 nucleotide) RNA (sRNA) profiling of human cells using various technologies demonstrates unexpected complexity of sRNAs with hundreds of thousands of sRNA species present. Genetic and in vitro studies show that these RNAs are not merely degradation products of longer transcripts but could indeed have a function. Furthermore, profiling of RNAs, including the sRNAs, can reveal not only novel transcripts, but also make clear predictions about the existence and properties of novel biochemical pathways operating in a cell. For example, sRNA profiling in human cells indicated the existence of an unknown capping mechanism operating on cleaved RNA, a biochemical component of which was later identified. Here we show that human cells contain a novel type of sRNA that has non-genomically encoded 5' poly(U) tails. The presence of these RNAs at the termini of genes, specifically at the very 3' ends of known mRNAs, strongly argues for the presence of a yet uncharacterized endogenous biochemical pathway in cells that can copy RNA. We show that this pathway can operate on multiple genes, with specific enrichment towards transcript-encoding components of the translational machinery. Finally, we show that genes are also flanked by sense, 3' polyadenylated sRNAs that are likely to be capped.
Collapse
|
25
|
Amplification-free digital gene expression profiling from minute cell quantities. Nat Methods 2010; 7:619-21. [PMID: 20639869 DOI: 10.1038/nmeth.1480] [Citation(s) in RCA: 50] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2010] [Accepted: 06/02/2010] [Indexed: 11/09/2022]
Abstract
Generating reliable expression profiles from minute cell quantities is critical for scientific discovery and potential clinical applications. Here we present low-quantity digital gene expression (LQ-DGE), an amplification-free approach involving capture of poly(A)(+) RNAs from cellular lysates onto poly(dT)-coated sequencing surfaces, followed by on-surface reverse transcription and sequencing. We applied LQ-DGE to profile malignant and nonmalignant mouse and human cells, demonstrating its quantitative power and potential applicability to archival specimens.
Collapse
|
26
|
Abstract 4087: Large intergenic noncoding RNAs associated with Ewing sarcoma family of tumors. Cancer Res 2010. [DOI: 10.1158/1538-7445.am10-4087] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Abstract
BACKGROUND: Ewing sarcoma family of tumors (EFT) represents the second most common primary malignant bone tumor in children and adolescents. The majority of EFTs harbor a translocation (t11;22)(q24;q12) resulting in the expression of the EWS-FLI1 chimeric oncoprotein. We hypothesized that microarray gene expression profiling, in combination with next-generation sequencing technology, could be used to identify EFT-specific genes and transcripts including large intergenic noncoding (linc)RNAs. Next-generation sequencing, coupled with target enrichment for validation, can also identify driver mutations of metastasis and treatment resistance in high-risk cases.
RESULTS: Genome-wide expression profiles of childhood sarcoma and normal tissues were analyzed using both Partek Genomics Suite as well as our own customized software, Genetrix. These are compared with Helicos single molecule sequencing data. The results show that both known genes and candidate lincRNAs strongly associate with EFT and can be used to distinguish EFT from other childhood tumors. Moreover, many lincRNAs are seen on both the poly-A selected track and on the random hexamer primed total RNA expression profile, indicating that the transcripts are multi-exonic and poly-adenylated. Importantly, regions of EFT-associated lincRNA expression may indicate regions that are deregulated by EWS-FLI1 in these tumors. We also focused on lincRNAs with differential expression in an EFT cell line pair (CHLA-9 and −10) derived, respectively, from patient-matched primary and metastatic tumors. RNA-seq determined ∼160,000 regions representing ∼16MB of genomic sequence as being at least 3-fold differentially expressed between the two cell lines. A higher threshold of at least 10-fold expression difference still revealed ∼20,000 regions representing ∼2MB of genomic sequence. In addition, the proportion of differentially-expressed intergenic transcripts was higher in CHLA-9 (18.1%) versus CHLA-10 (11.2%). Annotated transcripts with at least 10-fold expression in CHLA-10 are enriched in plasma membrane and adhesion-related functions, and this is consistent with metastatic behavior. Genes most highly expressed in CHLA-10 include ANXA5, FABP3, and HIST1H1A, whereas genes with much lower expression include CXCL14, MCTP2, and TRIM22. These genes, along with lincRNAs, may play important roles in disease progression and drug resistance.
CONCLUSIONS: We have identified several large intergenic noncoding (linc)RNAs that are highly and differentially expressed by EFT. We hypothesize that these lincRNAs may be novel therapeutic targets in EFT.
Citation Format: {Authors}. {Abstract title} [abstract]. In: Proceedings of the 101st Annual Meeting of the American Association for Cancer Research; 2010 Apr 17-21; Washington, DC. Philadelphia (PA): AACR; Cancer Res 2010;70(8 Suppl):Abstract nr 4087.
Collapse
|
27
|
Abstract
Accurate profiling of minute quantities of RNA in a global manner can enable key advances in many scientific and clinical disciplines. Here, we present low-quantity RNA sequencing (LQ-RNAseq), a high-throughput sequencing-based technique allowing whole transcriptome surveys from subnanogram RNA quantities in an amplification/ligation-free manner. LQ-RNAseq involves first-strand cDNA synthesis from RNA templates, followed by 3' polyA tailing of the single-stranded cDNA products and direct single molecule sequencing. We applied LQ-RNAseq to profile S. cerevisiae polyA+ transcripts, demonstrate the reproducibility of the approach across different sample preparations and independent instrument runs, and establish the absolute quantitative power of this method through comparisons with other reported transcript profiling techniques and through utilization of RNA spike-in experiments. We demonstrate the practical application of this approach to define the transcriptional landscape of mouse embryonic and induced pluripotent stem cells, observing transcriptional differences, including over 100 genes exhibiting differential expression between these otherwise very similar stem cell populations. This amplification-independent technology, which utilizes small quantities of nucleic acid and provides quantitative measurements of cellular transcripts, enables global gene expression measurements from minute amounts of materials and offers broad utility in both basic research and translational biology for characterization of rare cells.
Collapse
|
28
|
|
29
|
Chromatin profiling by directly sequencing small quantities of immunoprecipitated DNA. Nat Methods 2010; 7:47-9. [PMID: 19946276 PMCID: PMC2862482 DOI: 10.1038/nmeth.1404] [Citation(s) in RCA: 99] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2009] [Accepted: 11/03/2009] [Indexed: 11/08/2022]
Abstract
Chromatin structure and transcription factor localization can be assayed genome-wide by sequencing genomic DNA fractionated by protein occupancy or other properties, but current technologies involve multiple steps that introduce bias and inefficiency. Here we apply a single-molecule approach to directly sequence chromatin immunoprecipitated DNA with minimal sample manipulation. This method is compatible with just 50 pg of DNA and should thus facilitate charting chromatin maps from limited cell populations.
Collapse
|
30
|
Emergence of single-molecule sequencing and potential for molecular diagnostic applications. Expert Rev Mol Diagn 2009; 9:659-66. [PMID: 19817551 DOI: 10.1586/erm.09.50] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
The effective demonstration of single-molecule sequencing at scale over the last several years offers the exciting opportunity for a new era in the field of molecular diagnostics. As we aim to personalize and deliver cost-effective healthcare, we must consider the need to fully integrate genomics into decision-making. We must be able to accurately and cost effectively obtain a complete genome sequence for disease diagnosis, interrogate a molecular signature from blood for therapeutic monitoring, obtain a tumor mutation profile for optimizing therapeutic choice - each molecular diagnostic measurement utilized to better inform patient care. Would a physician or molecular pathology laboratory want to utilize a PCR process in which millions of DNA copies of a patient's nucleic acid are created when an alternative approach allowing direct measurement of the nucleic acids is possible? I would suggest not! In this article we will focus on the emergence of single-molecule sequencing, the single-molecule sequencing methodologies in the marketplace or under development today, as well as the importance of these methods for molecular characterization and diagnosis of disease with the ultimate application for molecular diagnostics.
Collapse
|
31
|
Abstract
Our understanding of human biology and disease is ultimately dependent on a complete understanding of the genome and its functions. The recent application of microarray and sequencing technologies to transcriptomics has changed the simplistic view of transcriptomes to a more complicated view of genome-wide transcription where a large fraction of transcripts emanates from unannotated parts of genomes, and underlined our limited knowledge of the dynamic state of transcription. Most of this broad body of knowledge was obtained indirectly because current transcriptome analysis methods typically require RNA to be converted to complementary DNA (cDNA) before measurements, even though the cDNA synthesis step introduces multiple biases and artefacts that interfere with both the proper characterization and quantification of transcripts. Furthermore, cDNA synthesis is not particularly suitable for the analysis of short, degraded and/or small quantity RNA samples. Here we report direct single molecule RNA sequencing without prior conversion of RNA to cDNA. We applied this technology to sequence femtomole quantities of poly(A)(+) Saccharomyces cerevisiae RNA using a surface coated with poly(dT) oligonucleotides to capture the RNAs at their natural poly(A) tails and initiate sequencing by synthesis. We observed transcript 3' end heterogeneity and polyadenylated small nucleolar RNAs. This study provides a path to high-throughput and low-cost direct RNA sequencing and achieving the ultimate goal of a comprehensive and bias-free understanding of transcriptomes.
Collapse
|
32
|
New models of collaboration in genome-wide association studies: the Genetic Association Information Network. Nat Genet 2007; 39:1045-51. [PMID: 17728769 DOI: 10.1038/ng2127] [Citation(s) in RCA: 256] [Impact Index Per Article: 15.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
The Genetic Association Information Network (GAIN) is a public-private partnership established to investigate the genetic basis of common diseases through a series of collaborative genome-wide association studies. GAIN has used new approaches for project selection, data deposition and distribution, collaborative analysis, publication and protection from premature intellectual property claims. These demonstrate a new commitment to shared scientific knowledge that should facilitate rapid advances in understanding the genetics of complex diseases.
Collapse
|
33
|
Association of torsades de pointes with novel and known single nucleotide polymorphisms in long QT syndrome genes. Am Heart J 2006; 152:1116-22. [PMID: 17161064 DOI: 10.1016/j.ahj.2006.08.020] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/11/2006] [Accepted: 08/23/2006] [Indexed: 01/08/2023]
Abstract
BACKGROUND Reduction of drug-induced adverse events may be achievable through a better understanding of the underlying causes of such events. Identifying phenotypes and genotypes that allow event prediction would provide greater safety margins for new therapeutics. Torsades de pointes (TdP) is one such life-threatening adverse event and can arise from excessive lengthening of the QT interval. This study was designed to better understand the role of genetics in the development of TdP and to determine whether genotypes can be used to predict susceptibility and thus reduce adverse events. METHODS Seven known familial long QT syndrome genes were scanned for sequence variations in 34 patients with TdP. This group of patients is the largest such cohort ever assembled for this type of analysis. The allele frequencies for novel and known polymorphisms in these patients were compared with those in healthy control subjects. RESULTS Six novel mutations--4 in ANK2, 1 in KCNQ1, and 1 in SCN5A--were found in the patients with TdP. Two mutations were also found in 595 healthy control subjects, whereas the others were unique to patients with TdP. Two common single nucleotide polymorphisms may be associated with the risk of TdP. The entire ANK2 gene had not been screened in a population this large previously. CONCLUSIONS Genotypes alone could not be used to completely predict susceptibility to TdP, even when used with phenotypes. The best model using genotypic and phenotypic variables was unable to predict all events. It is unclear what other risk genes or environmental effects might be necessary to predict such cases.
Collapse
|
34
|
Drug-metabolizing enzymes: Evidence for clinical utility of pharmacogenomic tests. Clin Pharmacol Ther 2005; 78:559-81. [PMID: 16338273 DOI: 10.1016/j.clpt.2005.08.013] [Citation(s) in RCA: 101] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2005] [Accepted: 08/12/2005] [Indexed: 11/15/2022]
|
35
|
An association study of 43 SNPs in 16 candidate genes with atorvastatin response. THE PHARMACOGENOMICS JOURNAL 2005; 5:352-8. [PMID: 16103896 DOI: 10.1038/sj.tpj.6500328] [Citation(s) in RCA: 160] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Variation in individual response to statin therapy has been widely studied for a potential genetic component. Multiple genes have been identified as potential modulators of statin response, but few study findings have replicated. To further examine these associations, 2735 individuals on statin therapy, half on atorvastatin and the other half divided among fluvastatin, lovastatin, pravastatin and simvastatin were genotyped for 43 SNPs in 16 genes that have been implicated in statin response. Associations with low-density lipoprotein cholesterol (LDL-C) lowering, total cholesterol lowering, HDL-C elevation and triglyceride lowering were examined. The only significant associations with LDL-C lowering were found with apoE2 in which carriers of the rare allele who took atorvastatin lowered their LDL-C by 3.5% more than those homozygous for the common allele and with rs2032582 (S893A in ABCB1) in which the two groups of homozygotes differed by 3% in LDL-C lowering. These genetic effects were smaller than those observed with the demographic variables of age and gender. The magnitude of all the differences found is sufficiently small that genetic data from these genes should not influence clinical decisions on statin administration.
Collapse
|
36
|
Abstract
The rapid pace of genomic science advancements, including the completion of the human genome sequence, the extensive cataloguing of genetic variation and the acceleration of technologies to assess such variation, combined with clinical programmes with rich phenotypic data, serve as the foundation for the design and execution of pharmacogenomic studies which have an impact on the pharmaceutical pipeline from early discovery through to the marketplace. The authors discuss the required infrastructure to support pharmacogenomic studies and provide insight into the strategies and practical application to influence decision making in the pharmaceutical setting. Further, the influence of pharmacogenomics is currently affecting patient care in the oncology area and is highlighted as evident impact in the marketplace.
Collapse
|
37
|
Abstract
Association studies are used to identify genetic determinants of complex human traits of medical interest. With the large number of validated single nucleotide polymorphisms (SNPs) currently available, two limiting factors in association studies are genotyping capability and costs. Pooled DNA genotyping has been proposed as an efficient means of screening SNPs for allele frequency differences in case-control studies and for prioritising them for subsequent individual genotyping analysis. Here, we apply quantitative pooled genotyping followed by individual genotyping and replication to identify associations with human serum high-density lipoprotein (HDL) cholesterol levels. The DNA from individuals with low and high HDL cholesterol levels was pooled separately, each pool was amplified by polymerase chain reaction in triplicate and each amplified product was separately hybridised to a high-density oligonucleotide array. Allele frequency differences between case and control groups with low and high HDL cholesterol levels were estimated for 7,283 SNPs distributed across 71 candidate gene regions spanning a total of 17.1 megabases. A novel method was developed to take advantage of independently derived haplotype map information to improve the pooled estimates of allele frequency differences. A subset of SNPs with the largest estimated allele frequency differences between low and high HDL cholesterol groups was chosen for individual genotyping in the study population, as well as in a separate replication population. Four SNPs in a single haplotype block within the cholesteryl ester transfer protein (CETP) gene interval were significantly associated with HDL cholesterol levels in both populations. Our study is among the first to demonstrate the application of pooled genotyping followed by confirmation with individual genotyping to identify genetic determinants of a complex trait.
Collapse
|
38
|
Cholesteryl Ester Transfer Protein Variants Have Differential Stability but Uniform Inhibition by Torcetrapib. J Biol Chem 2005; 280:14918-22. [PMID: 15681846 DOI: 10.1074/jbc.m500523200] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Cholesteryl ester transfer protein (CETP) is an important modulator of high density lipoprotein cholesterol in humans and thus considered to be a therapeutic target for preventing cardiovascular disease. The gene encoding CETP has been shown to be highly variable, with multiple single nucleotide polymorphisms responsible for altering both its transcription and sequence. Examining nine missense variants of CETP, we found some had significant associations with CETP mass and high density lipoprotein cholesterol levels. Two variants, Pro-373 and Gln-451, appear to be more stable in vivo, an observation mirrored by partial proteolysis studies performed in vitro. Because these naturally occurring variant proteins are potentially present in clinical populations that will be treated with CETP inhibitors, all commonly occurring haplotypes were tested to determine whether the proteins they encode could be inhibited by torcetrapib, a compound currently in clinical trials in combination with atorvastatin. Torcetrapib behaved similarly with all variants, with no significant differences in inhibition.
Collapse
|
39
|
Cholesteryl ester transfer protein promoter single-nucleotide polymorphisms in Sp1-binding sites affect transcription and are associated with high-density lipoprotein cholesterol. Clin Genet 2005; 66:223-8. [PMID: 15324321 DOI: 10.1111/j.1399-0004.2004.00289.x] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Abstract
Genetic variation in the human cholesteryl ester transfer protein (CETP) promoter has been shown to be associated with high-density lipoprotein cholesterol (HDL-C) levels and cardiovascular disease. Some of this variation occurs in Sp1/Sp3 binding sites in the proximal promoter. We find that both the known promoter polymorphism at -629 and the previously uncharacterized polymorphism at -38 are associated with HDL-C levels in vivo and affect transcription in vitro. While the -629 polymorphism is common in all ethnic groups, the -38 polymorphism is found at significant levels (6.4%) only among African Americans. Those homozygous for the less common -38A allele have higher HDL-C levels than those with the more frequent -38G allele. This association was found in a population of African Americans at risk of cardiovascular disease and then replicated in a different population chosen from among patients with extremes of HDL-C. When studied in vitro, the most transcriptionally active allele (-629C/-38G) yields 51% more reporter protein than the least active allele (-629A/-38A) in HepG2 cells. These transcriptional effects reflect the projected impact of increased CETP expression on HDL-C phenotypes seen in vivo.
Collapse
|
40
|
CETP polymorphisms associated with HDL cholesterol may differ from those associated with cardiovascular disease. Atherosclerosis 2005; 181:45-53. [PMID: 15939053 DOI: 10.1016/j.atherosclerosis.2005.01.015] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/17/2004] [Revised: 12/12/2004] [Accepted: 01/10/2005] [Indexed: 12/01/2022]
Abstract
To better understand the role of cholesteryl ester transfer protein (CETP) in cardiovascular disease, nine polymorphisms spanning the gene from the upstream promoter region to beyond the 3'UTR were genotyped in 2553 individuals from multiple ethnic groups and with different cardiovascular disease profiles. The frequency of four of these SNPs varied by 40-300% between Caucasians and African Americans. SNPs in each ethnic group fell into two haploblocks with significant linkage disequilibrium within each block. SNPs in the 5' haploblock were significantly associated with HDL cholesterol while SNPs in the 3' haploblock were, at best, only weakly associated with HDL-C. One SNP in the 3' haploblock (rs1800774 in intron 12) was highly associated with history of myocardial infarction even though it was not associated with HDL-C. This association was driven by the effect in Caucasian women where 11.9% of the women with no history of MI are homozygous for the less common allele while 23.7% of those with a history of MI share this genotype. In addition, this SNP was highly associated with BMI among Caucasians (p < 0.0001). The association of HDL-C with CETP genotype was found to be independent of smoking or alcohol consumption. These results replicate some earlier findings and also help to explain some of the apparent contradictions in the literature surrounding the role of CETP in modulating HDL-C and cardiovascular disease.
Collapse
|
41
|
Abstract
Various drugs are reported to prolong the QT-interval on the surface ECG, thereby increasing the risk of developing a potentially fatal arrhythmia known as Torsades de Pointes (TdP). TdP case reports for these drugs have often been associated with risk factors such as overdosing, concomitant drugs and/or existing pathophysiological conditions. A few cases appear to be devoid of these factors. To determine what role genetic variation in the hERG gene plays in drug-induced arrhythmias, we screened DNA samples collected from 105 atrial-fibrillation patients treated with dofetilide for polymorphisms, seven of whom developed TdP. An uncommon missense change, R1047L, was identified in two of seven patients who experienced TdP as compared with five of 98 individuals who were free of TdP. Included in the affected individuals was the only subject homozygous for this SNP. Cellular electrophysiological studies revealed a 10-mV positive shift in the steady-state activation curve of the 1047L hERG channel stably expressed in HEK-293 cells as compared with the wild-type (WT) channel. The activation and inactivation kinetics of the 1047L current were significantly slower than the WT (P < 0.05) at given membrane potentials. A computer simulation using a rabbit ventricular myocyte model indicated that same extent of changes in the I(Kr) channel may result in an approximately 15% prolongation in the action potential duration. Our study suggests that 1047L leads to a functional impairment of the hERG channel, which may contribute to the higher incidence of TdP in 1047L carriers when challenged with a channel blocker.
Collapse
|
42
|
The serotonin transporter polymorphism, 5HTTLPR, is associated with a faster response time to sertraline in an elderly population with major depressive disorder. Psychopharmacology (Berl) 2004; 174:525-9. [PMID: 12955294 DOI: 10.1007/s00213-003-1562-3] [Citation(s) in RCA: 100] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/15/2002] [Accepted: 05/29/2003] [Indexed: 10/26/2022]
Abstract
RATIONALE A common polymorphism (5HTTLPR) within the promoter region of the serotonin transporter gene (LSC6A4) has been shown to influence response time as well as overall response to selective serotonin reuptake inhibitors (SSRIs) in subjects with major depressive disorder. We hypothesized that a similar effect in response time to sertraline would be observed and that no effect on response time would be seen in a placebo arm. OBJECTIVES We tested the hypothesis that subjects homozygous for the long allele at the 5HTTLPR polymorphism would respond more rapidly to sertraline than subjects carrying one or two copies of the short allele. METHODS HAM-D and CGI-I responses to sertraline and placebo were measured weekly in the context of an 8-week, placebo-controlled study in elderly depressed subjects. Genotyping of the 5HTTLPR polymorphism was performed to test for correlations with response at each week in the sertraline and placebo groups ( n=206). RESULTS Subjects homozygous for the long allele of 5HTTLPR showed a significant increase in response at week 1 and week 2, as assessed by the CGI-I scale compared with subjects carrying one or two copies of the short allele ( P=0.01 at both weeks). No significant difference was observed in the placebo group. CONCLUSIONS These results suggest that genetic variation in the serotonin transporter gene effects the response time to sertraline and provides complementing evidence to previous reports that this polymorphism affects response time to other SSRIs.
Collapse
|
43
|
Highly polymorphic repeat region in the CETP promoter induces unusual DNA structure. Biochim Biophys Acta Mol Cell Biol Lipids 2004; 1684:38-45. [PMID: 15450208 DOI: 10.1016/j.bbalip.2004.06.002] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2004] [Revised: 05/19/2004] [Accepted: 06/03/2004] [Indexed: 10/26/2022]
Abstract
Genetic variation in the human cholesteryl ester transfer protein (CETP) promoter is associated with HDL cholesterol levels and cardiovascular disease with much of the genetic variation in CETP attributed to the promoter region. In this region, there are several single nucleotide polymorphisms as well as a variable length tandem repeat located 1946 base pairs upstream of the CETP transcription start that is highly polymorphic with respect to both length and sequence. There are more than 10 different long alleles and these vary in their repeat structure. We find that the short allele of this repeat is associated with high HDL cholesterol levels in vivo (P<0.0001). In males, this association is independent of the nearby -629 polymorphism. In addition, the variable length GAAA repeat can stimulate an adjacent GGGGA repeat to form a structure that hinders DNA amplification and sequencing. This structure also has an effect in vivo as shown by orientation effects and cloning efficiency in Escherichia coli.
Collapse
|
44
|
Association between single-nucleotide polymorphisms in the endothelial lipase (LIPG) gene and high-density lipoprotein cholesterol levels. Biochim Biophys Acta Mol Cell Biol Lipids 2004; 1636:40-6. [PMID: 14984737 DOI: 10.1016/j.bbalip.2003.12.001] [Citation(s) in RCA: 52] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2003] [Revised: 12/01/2003] [Accepted: 12/05/2003] [Indexed: 11/23/2022]
Abstract
Endothelial lipase (LIPG) is the latest addition to the triglyceride lipase family of genes that includes pancreatic lipase (PL), hepatic lipase (HL), and lipoprotein lipase (LPL). These lipolytic enzymes demonstrate both triglyceride lipase as well as phospholipase activities and are integrally involved in lipid absorption, transport, and metabolism. Several studies have demonstrated that LIPG is important for affecting lipid levels in mice but the data in humans is less complete. To more thoroughly characterize the LIPG gene, we resequenced it from an ethnically diverse population. Thirteen novel single-nucleotide polymorphisms (SNPs) were identified and seven others confirmed. High linkage disequilibrium was found among these SNPs spanning the length of the transcript, allowing interrogation of the entire gene for functional variation. Subjects with either high or low HDL cholesterol were used to investigate its association with LIPG gene variation. Associations were found with the most significant being the intronic variants C+42T/In5 and T+2864C/In8 (P=0.007 and 0.004, respectively). A trend for an association of the same SNPs with fewer myocardial infarctions (P=0.03) was also observed but was not significant after correction for multiple testing. The results of this study provide data linking variation in the human LIPG gene with HDL cholesterol levels as well as further evidence in support of LIPG as a potential target for therapeutic intervention.
Collapse
|
45
|
Abstract
The cholesteryl ester transfer protein (CETP) gene has been implicated in the variation of HDL levels but most studies have focused on only one or a few genetic variations. In order to properly understand the role of CETP in determining phenotype, it is necessary to examine the entire gene and all its common polymorphisms. The coding regions, adjacent introns, and proximal 5' and 3' regions were resequenced from an ethnically diverse population. Novel and previously known polymorphisms were then characterized and associations with HDL and CETP mass levels determined. The polymorphism most highly associated with CETP was 629 bp upstream of the transcription start site while the polymorphism most highly associated with HDL was a VNTR 1946 bp upstream of the transcription start site. Genetic variation in the CETP gene is associated with protective HDL levels. The ethnic diversity of some SNPs and complex interplay among them dictate careful analysis of the whole gene prior to conclusions about the role of individual polymorphisms.
Collapse
|
46
|
CYP2D6 genotyping as an alternative to phenotyping for determination of metabolic status in a clinical trial setting. AAPS PHARMSCI 2000; 2:E33. [PMID: 11741249 PMCID: PMC2751283 DOI: 10.1208/ps020433] [Citation(s) in RCA: 39] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
The emerging application of pharmacogenomics in the clinical trial setting requires careful comparison with more traditional phenotyping methodologies, particularly in the drug metabolism area where phenotyping is used extensively. The research objectives of this study were 1) to assess the utility of cytochrome P450 2D6 (CYP2D6) genotyping as an alternative to traditional phenotyping as a predictor of poor metabolizer status; 2) to identify issues for consideration when implementing CYP2D6 genotyping in clinical trials; and 3) to outline the advantages and disadvantages of CYP2D6 genotyping compared with phenotyping. DNA samples obtained from 558 previously phenotyped individuals were blindly genotyped at the CYP2D6 locus, and the genotype-phenotype correlation was then determined. The CYP2D6 genotyping methodology successfully predicted all but 1 of the 46 poor metabolizer subjects, and it was determined that this 1 individual had a novel (presumably inactive) mutation within the coding region. In addition, we identified 2 subjects with CYP2D6 genotypes indicative of poor metabolizers who had extensive metabolizer phenotypes as determined by dextromethorphan/dextrorphan ratios. This finding suggests that traditional phenotyping methods do not always offer 100% specificity. Our results suggest that CYP2D6 genotyping is a valid alternative to traditional phenotyping in a clinical trial setting, and in some cases may be better. We also discuss some of the issues and considerations related to the use of genotyping in clinical trials and medical practice.
Collapse
|
47
|
Monocyte chemoattractant protein-1 accelerates atherosclerosis in apolipoprotein E-deficient mice. Arterioscler Thromb Vasc Biol 1999; 19:1518-25. [PMID: 10364084 DOI: 10.1161/01.atv.19.6.1518] [Citation(s) in RCA: 306] [Impact Index Per Article: 12.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
The pro-inflammatory chemokine, monocyte chemoattractant protein-1 (MCP-1), plays a fundamental role in monocyte recruitment and has been implicated as a contributing factor to atherosclerosis. The predominant cell types within the vessel wall--endothelial cells, smooth muscle cells, and macrophages--all contribute to overexpression of MCP-1 in atherosclerotic tissue. In this report we assess the role of MCP-1 expression by leukocytes on lesion progression in a murine model susceptible to atherosclerosis. Bone marrow cells from mice overexpressing a murine MCP-1 transgene on a background of apoE-deficiency or from control mice were transplanted into irradiated apoE-knockout mice. After repopulation of apoE-knockout mice with bone marrow containing the MCP-1 transgene, macrophages expressing the MCP-1 transgene were found in several tissues, including the aorta. Qualitative assessment of atherosclerosis in these mice revealed increased lipid staining, a 3-fold (P<0.001) increase in the amount of oxidized lipid, and increased immunostaining for macrophage cell surface markers with anti-F4/80 and anti-CD11b antibodies. There were no differences in plasma lipids, plasma lipoprotein profiles, or body weight between the 2 groups. These results provide the first direct evidence that MCP-1 expression by leukocytes, predominately macrophages, increases the progression of atherosclerosis by increasing both macrophage numbers and oxidized lipid accumulation.
Collapse
|
48
|
Abstract
Squalene synthase catalyzes the first committed step in cholesterol biosynthesis and thus is important as a potential target for therapeutic intervention. In order to determine the important functional domains of the protein, the amino and carboxyl terminal regions thought to be involved in membrane association of the enzyme were removed genetically. The 30 N-terminal amino acids were deleted with no apparent effect on activity. Additional deletion of 81 or 97 amino acids from the C-terminus completely ablated activity. However, a protein with a C-terminal deletion of 47 amino acids retained full activity. The latter enzyme was readily overexpressed in Escherichia coli and purified to homogeneity. The pure, doubly truncated enzyme exhibited a specific activity similar to that reported for the protease-solubilized rat liver enzyme, had a KM for farnesyl diphosphate similar to that observed for native enzyme, and was inhibited by anionic compounds to the same degree as native enzyme. Using the vapor diffusion method, the protein was crystallized as an enzyme-inhibitor complex, yielding orthorhombic crystals which diffracted to 2.2 A.
Collapse
|
49
|
Sequencing of the cholesteryl ester transfer protein 5' regulatory region using artificial transposons. Gene X 1997; 197:101-7. [PMID: 9332354 DOI: 10.1016/s0378-1119(97)00247-3] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open
Abstract
We have isolated and sequenced genomic clones encompassing more than 5 kb of the 5' flanking region of the cholesteryl ester transfer protein gene. This region contains multiple Alu repeats, a Mermaid repeat, and an extensive GA repeat, which made sequencing exceedingly difficult. To circumvent the problems that these repeats posed to traditional sequencing methodologies, we employed a novel transposon-facilitated technique, which greatly simplified sequencing of regions that had been difficult to accomplish otherwise. We utilized the artificial transposon, AT-2, a Bluescript derivative containing the dhfr gene and unique primer sites at both ends of the insertion DNA. Integration of the transposon occurred efficiently and covered the entire region of interest. Analysis of the sequence indicates a number of potential regulatory factor binding sites upstream of the previously characterized minimal promoter. The 5.7-kb regulatory region confers significant transcriptional activation in a conditionally transformed mouse hepatocyte line as compared to a minimal 137-bp promoter fragment. In addition, a tetranucleotide repeat of variable length that may provide a useful genetic marker has been identified 2 kb upstream of the CETP transcriptional start site.
Collapse
|
50
|
Estrogen reduces atherosclerotic lesion development in apolipoprotein E-deficient mice. Proc Natl Acad Sci U S A 1996; 93:10022-7. [PMID: 8816744 PMCID: PMC38329 DOI: 10.1073/pnas.93.19.10022] [Citation(s) in RCA: 197] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023] Open
Abstract
We have studied the effects of endogenous and exogenous estrogen on atherosclerotic lesions in apolipoprotein E-deficient mice. Female mice ovariectomized (OVX) at weaning displayed increases (P < 0.01) in fatty streak lesions in the proximal aorta and aortic sinus compared with female mice with intact ovarian function. These differences between the OVX and sham controls were apparent in both chow- and "Western-type" diet-fed mice. Moreover, increases in lesion size following OVX occurred without changes in plasma cholesterol. Hormone replacement with subdermal 17-beta-estradiol pellets releasing either 6, 14, or 28 micrograms/day significantly decreased (P < 0.001) atherosclerotic lesion area in both male and OVX female mice. In contrast, neither 17-alpha-estradiol (28 micrograms/day) or tamoxifen (85 micrograms/day) affected lesion progression in OVX female mice. In the Western diet-fed group, exogenous estradiol markedly reduced plasma cholesterol and triglycerides, whereas, in animals fed the chow diet, exogenous estrogen and tamoxifen treatment only decreased plasma and very low density lipoprotein triglycerides. However, lesion area was only weakly correlated with plasma cholesterol and triglycerides, 0.35 and 0.44 tau values, respectively (P < 0.01). In summary, in the apolipoprotein E-deficient mouse 17-beta-estradiol protects against atherosclerotic lesion formation, and this can only be partially explained through effects on plasma lipoprotein levels.
Collapse
|