Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Lavender CA, Shapiro AJ, Burkholder AB, Bennett BD, Adelman K, Fargo DC. ORIO (Online Resource for Integrative Omics): a web-based platform for rapid integration of next generation sequencing data. Nucleic Acids Res 2017;45:5678-5690. [PMID: 28402545 PMCID: PMC5449597 DOI: 10.1093/nar/gkx270] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Received: 01/10/2017] [Accepted: 04/05/2017] [Indexed: 11/14/2022]

Cited by Other Article(s)

Zhan C, Tang T, Wu E, Zhang Y, He M, Wu R, Bi C, Wang J, Zhang Y, Shen B. From multi-omics approaches to personalized medicine in myocardial infarction. Front Cardiovasc Med 2023;10:1250340. [PMID: 37965091 PMCID: PMC10642346 DOI: 10.3389/fcvm.2023.1250340] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/30/2023] [Accepted: 10/17/2023] [Indexed: 11/16/2023]

Affiliation(s)

Chaoying Zhan Department of Cardiology and Institutes for Systems Genetics, Frontiers Science Center for Disease-Related Molecular Network, West China Hospital, Sichuan University, Chengdu, China
Tong Tang Department of Cardiology and Institutes for Systems Genetics, Frontiers Science Center for Disease-Related Molecular Network, West China Hospital, Sichuan University, Chengdu, China
Erman Wu Department of Cardiology and Institutes for Systems Genetics, Frontiers Science Center for Disease-Related Molecular Network, West China Hospital, Sichuan University, Chengdu, China
Yuxin Zhang Department of Cardiology and Institutes for Systems Genetics, Frontiers Science Center for Disease-Related Molecular Network, West China Hospital, Sichuan University, Chengdu, China; Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, China
Mengqiao He Department of Cardiology and Institutes for Systems Genetics, Frontiers Science Center for Disease-Related Molecular Network, West China Hospital, Sichuan University, Chengdu, China
Rongrong Wu Department of Cardiology and Institutes for Systems Genetics, Frontiers Science Center for Disease-Related Molecular Network, West China Hospital, Sichuan University, Chengdu, China
Cheng Bi Department of Cardiology and Institutes for Systems Genetics, Frontiers Science Center for Disease-Related Molecular Network, West China Hospital, Sichuan University, Chengdu, China; Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, China
Jiao Wang Department of Cardiology and Institutes for Systems Genetics, Frontiers Science Center for Disease-Related Molecular Network, West China Hospital, Sichuan University, Chengdu, China
Yingbo Zhang Department of Cardiology and Institutes for Systems Genetics, Frontiers Science Center for Disease-Related Molecular Network, West China Hospital, Sichuan University, Chengdu, China; Tropical Crops Genetic Resources Institute, Chinese Academy of Tropical Agricultural Sciences, Haikou, China
Bairong Shen Department of Cardiology and Institutes for Systems Genetics, Frontiers Science Center for Disease-Related Molecular Network, West China Hospital, Sichuan University, Chengdu, China

Olsen SN, Godfrey L, Healy JP, Choi YA, Kai Y, Hatton C, Perner F, Haarer EL, Nabet B, Yuan GC, Armstrong SA. MLL::AF9 degradation induces rapid changes in transcriptional elongation and subsequent loss of an active chromatin landscape. Mol Cell 2022;82:1140-1155.e11. [PMID: 35245435 PMCID: PMC9044330 DOI: 10.1016/j.molcel.2022.02.013] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Received: 05/06/2021] [Revised: 11/17/2021] [Accepted: 02/06/2022] [Indexed: 12/15/2022]

Tripp BA, Otu HH. Integration of Multi-Omics Data Using Probabilistic Graph Models and External Knowledge. Curr Bioinform 2022. [DOI: 10.2174/1574893616666210906141545] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/22/2022]

Abstract

Background: High-throughput sequencing technologies have revolutionized the ability to perform systems-level biology and elucidate molecular mechanisms of disease through the comprehensive characterization of different layers of biological information. Integration of these heterogeneous layers can provide insight into the underlying biology but is challenged by modeling complex interactions. Objective: We introduce OBaNK: omics integration using Bayesian networks and external knowledge, an algorithm to model interactions between heterogeneous high-dimensional biological data to elucidate complex functional clusters and emergent relationships associated with an observed phenotype. Method: Using Bayesian network learning, we modeled the statistical dependencies and interactions between lipidomics, proteomics, and metabolomics data. The strength of a learned interaction between molecules was altered based on external knowledge. Results: Networks learned from synthetic datasets based on real pathways achieved an average area under the curve score of ~0.85, an improvement of ~0.23 from baseline methods. When applied to real multi-omics data collected during pregnancy, five distinct functional networks of heterogeneous biological data were identified, and the results were compared to other multi-omics integration approaches. Conclusion: OBaNK successfully improved the accuracy of learning interaction networks from data integrating external knowledge, identified heterogeneous functional networks from real data, and suggested potential novel interactions associated with the phenotype. These findings can guide future hypothesis generation. OBaNK source code is available at: https://github.com/bridgettripp/OBaNK.git, and a graphical user interface is available at: http://otulab.unl.edu/OBaNK.

Elrod ND, Henriques T, Huang KL, Tatomer DC, Wilusz JE, Wagner EJ, Adelman K. The Integrator Complex Attenuates Promoter-Proximal Transcription at Protein-Coding Genes. Mol Cell 2019;76:738-752.e7. [PMID: 31809743 DOI: 10.1016/j.molcel.2019.10.034] [Citation(s) in RCA: 110] [Impact Index Per Article: 27.5] [Received: 07/13/2019] [Revised: 09/15/2019] [Accepted: 10/25/2019] [Indexed: 12/11/2022]

Lavender CA, Shapiro AJ, Day FS, Fargo DC. ORSO (Online Resource for Social Omics): A data-driven social network connecting scientists to genomics datasets. PLoS Comput Biol 2020;16:e1007571. [PMID: 31978042 PMCID: PMC7001987 DOI: 10.1371/journal.pcbi.1007571] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 10/17/2018] [Revised: 02/05/2020] [Accepted: 11/29/2019] [Indexed: 11/17/2022]

Abstract

High-throughput sequencing has become ubiquitous in biomedical sciences. As new technologies emerge and sequencing costs decline, the diversity and volume of available data increases exponentially, and successfully navigating the data becomes more challenging. Though datasets are often hosted by public repositories, scientists must rely on inconsistent annotation to identify and interpret meaningful data. Moreover, the experimental heterogeneity and wide-ranging quality of high-throughput biological data means that even data with desired cell lines, tissue types, or molecular targets may not be readily interpretable or integrated. We have developed ORSO (Online Resource for Social Omics) as an easy-to-use web application to connect life scientists with genomics data. In ORSO, users interact within a data-driven social network, where they can favorite datasets and follow other users. In addition to more than 30,000 datasets hosted from major biomedical consortia, users may contribute their own data to ORSO, facilitating its discovery by other users. Leveraging user interactions, ORSO provides a novel recommendation system to automatically connect users with hosted data. In addition to social interactions, the recommendation system considers primary read coverage information and annotated metadata. Similarities used by the recommendation system are presented by ORSO in a graph display, allowing exploration of dataset associations. The topology of the network graph reflects established biology, with samples from related systems grouped together. We tested the recommendation system using an RNA-seq time course dataset from differentiation of embryonic stem cells to cardiomyocytes. The ORSO recommendation system correctly predicted early data point sources as embryonic stem cells and late data point sources as heart and muscle samples, resulting in recommendation of related datasets. 
By connecting scientists with relevant data, ORSO provides a critical new service that facilitates wide-ranging research interests.

New sequencing technologies have rapidly transformed biomedical research. Public data repositories now contain millions of datasets, which have the potential to accelerate and bolster research projects. However, the sheer magnitude of available data makes navigation difficult. We created ORSO (Online Resource for Social Omics) to address these challenges. ORSO is a social network where entries are not status updates or tweets, but biological datasets. Users may add their own data to ORSO, joining 30,000 validated datasets that are already hosted, and other users may find these data through intuitive search functions and informative analytics. Users can then favorite datasets relevant to their interests or follow contributing users. ORSO also uses a recommendation system like those used on commercial websites to automatically recommend data to users based on user interactions and dataset similarities. By making data more accessible and by connecting users to relevant data, we anticipate that ORSO will be an important resource for scientists. ORSO may be the first of many applications that use methods originating in social media and ecommerce to enhance and further research projects in the life sciences.
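
The similarity-based part of such a recommendation system can be illustrated with a small sketch. This is not ORSO's implementation: the function names (`cosine`, `recommend`), the toy coverage vectors, and the choice of cosine similarity are all assumptions made for illustration. It ranks datasets a user has not yet favorited by how closely their coverage profile resembles any favorited dataset.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length coverage vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an all-zero profile is treated as similar to nothing
    return dot / (norm_a * norm_b)


def recommend(favorites, catalog, top_n=3):
    """Rank non-favorited datasets by their best similarity to any favorite.

    favorites: set of dataset names the user has favorited (hypothetical input)
    catalog:   dict mapping dataset name -> coverage vector (hypothetical input)
    """
    scores = {}
    for name, vector in catalog.items():
        if name in favorites:
            continue
        scores[name] = max(
            (cosine(vector, catalog[fav]) for fav in favorites),
            default=0.0,
        )
    return sorted(scores, key=scores.get, reverse=True)[:top_n]


# Toy catalog: three made-up datasets with three coverage bins each.
catalog = {
    "esc_rnaseq": [9, 1, 0],
    "esc_like": [8, 2, 0],
    "heart": [0, 1, 9],
}
ranked = recommend({"esc_rnaseq"}, catalog, top_n=2)
```

In this toy catalog the stem-cell-like profile ranks ahead of the heart profile for a user who favorited the stem cell dataset. A system like the one described would additionally weigh annotated metadata and user interactions, which this sketch omits.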

Elrod ND, Henriques T, Huang KL, Tatomer DC, Wilusz JE, Wagner EJ, Adelman K. The Integrator Complex Attenuates Promoter-Proximal Transcription at Protein-Coding Genes. Mol Cell 2019;76:738-752.e7. [PMID: 31809743 DOI: 10.1101/725507] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Received: 07/13/2019] [Revised: 09/15/2019] [Accepted: 10/25/2019] [Indexed: 05/27/2023]

Fosslie M, Manaf A, Lerdrup M, Hansen K, Gilfillan GD, Dahl JA. Going low to reach high: Small-scale ChIP-seq maps new terrain. Wiley Interdiscip Rev Syst Biol Med 2019;12:e1465. [PMID: 31478357 DOI: 10.1002/wsbm.1465] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Received: 04/15/2019] [Revised: 07/02/2019] [Accepted: 07/25/2019] [Indexed: 12/20/2022]

FQStat: a parallel architecture for very high-speed assessment of sequencing quality metrics. BMC Bioinformatics 2019;20:424. [PMID: 31416440 PMCID: PMC6694608 DOI: 10.1186/s12859-019-3015-y] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Received: 05/12/2019] [Accepted: 07/30/2019] [Indexed: 01/23/2023]

Abstract

Background

High throughput DNA/RNA sequencing has revolutionized biological and clinical research. Sequencing is widely used, and generates very large amounts of data, mainly due to reduced cost and advanced technologies. Quickly assessing the quality of giga-to-tera base levels of sequencing data has become a routine but important task. Identification and elimination of low-quality sequence data is crucial for reliability of downstream analysis results. There is a need for a high-speed tool that uses optimized parallel programming for batch processing and simply gauges the quality of sequencing data from multiple datasets independent of any other processing steps.

Results

FQStat is a stand-alone, platform-independent software tool that assesses the quality of FASTQ files using parallel programming. Based on the machine architecture and input data, FQStat automatically determines the number of cores and the amount of memory to be allocated per file for optimum performance. Our results indicate that in a core-limited case, core assignment overhead exceeds the benefit of additional cores. In a core-unlimited case, there is a saturation point reached in performance by increasingly assigning additional cores per file. We also show that memory allocation per file has a lower priority in performance when compared to the allocation of cores. FQStat’s output is summarized in HTML web page, tab-delimited text file, and high-resolution image formats. FQStat calculates and plots read count, read length, quality score, and high-quality base statistics. FQStat identifies and marks low-quality sequencing data to suggest removal from downstream analysis. We applied FQStat on real sequencing data to optimize performance and to demonstrate its capabilities. We also compared FQStat’s performance to similar quality control (QC) tools that utilize parallel programming and attained improvements in run time.

Conclusions

FQStat is a user-friendly tool with a graphical interface that employs a parallel programming architecture and automatically optimizes its performance to generate quality control statistics for sequencing data. Unlike existing tools, these statistics are calculated for multiple datasets and separately at the “lane,” “sample,” and “experiment” level to identify subsets of the samples with low quality, thereby preventing the loss of complete samples when reliable data can still be obtained.
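
The allocation behaviour described in the Results can be sketched in a few lines. This is not FQStat's code: `plan_cores`, `mean_phred`, the `saturation` cap of 4 cores per file, and the Phred+33 offset are assumptions chosen for illustration. The sketch assigns one core per file when cores are scarce, caps the cores per file once cores are plentiful, and computes a mean quality score from one FASTQ quality line.

```python
import os


def plan_cores(n_files, total_cores=None, saturation=4):
    """Return (files_in_parallel, cores_per_file) for a batch of FASTQ files.

    saturation is a hypothetical cap on useful cores per file; the abstract
    reports that such a saturation point exists but gives no number for it.
    """
    if n_files < 1:
        raise ValueError("n_files must be at least 1")
    total = total_cores or os.cpu_count() or 1
    if total <= n_files:
        # Core-limited case: one core per file, as many files as cores allow.
        return total, 1
    # Core-unlimited case: share cores evenly, but stop at the saturation cap.
    return n_files, min(total // n_files, saturation)


def mean_phred(quality_line, offset=33):
    """Mean Phred score of one FASTQ quality string (Phred+33 assumed)."""
    scores = [ord(ch) - offset for ch in quality_line]
    return sum(scores) / len(scores)


limited = plan_cores(8, total_cores=4)     # more files than cores
unlimited = plan_cores(2, total_cores=16)  # more cores than files
quality = mean_phred("IIII")               # 'I' encodes Q40 under Phred+33
```

With 8 files on 4 cores the plan runs 4 files at a time on one core each; with 2 files on 16 cores each file receives the capped 4 cores rather than 8.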

Electronic supplementary material

The online version of this article (10.1186/s12859-019-3015-y) contains supplementary material, which is available to authorized users.

Nguyen TAT, Grimm SA, Bushel PR, Li J, Li Y, Bennett BD, Lavender CA, Ward JM, Fargo DC, Anderson CW, Li L, Resnick MA, Menendez D. Revealing a human p53 universe. Nucleic Acids Res 2018;46:8153-8167. [PMID: 30107566 PMCID: PMC6144829 DOI: 10.1093/nar/gky720] [Citation(s) in RCA: 57] [Impact Index Per Article: 11.4] [Received: 03/16/2018] [Accepted: 07/27/2018] [Indexed: 12/13/2022]

Affiliation(s)

Thuy-Ai T Nguyen Genome Integrity & Structural Biology Laboratory, National Institute of Environmental Health Sciences/National Institutes of Health, Research Triangle Park, NC 27709, USA
Sara A Grimm Integrative Bioinformatics Support Group, National Institute of Environmental Health Sciences/National Institutes of Health, Research Triangle Park, NC 27709, USA
Pierre R Bushel Biostatistics & Computational Biology Branch, National Institute of Environmental Health Sciences/National Institutes of Health, Research Triangle Park, NC 27709, USA
Jianying Li Integrative Bioinformatics Support Group, National Institute of Environmental Health Sciences/National Institutes of Health, Research Triangle Park, NC 27709, USA
Yuanyuan Li Biostatistics & Computational Biology Branch, National Institute of Environmental Health Sciences/National Institutes of Health, Research Triangle Park, NC 27709, USA
Brian D Bennett Integrative Bioinformatics Support Group, National Institute of Environmental Health Sciences/National Institutes of Health, Research Triangle Park, NC 27709, USA
Christopher A Lavender Integrative Bioinformatics Support Group, National Institute of Environmental Health Sciences/National Institutes of Health, Research Triangle Park, NC 27709, USA
James M Ward Integrative Bioinformatics Support Group, National Institute of Environmental Health Sciences/National Institutes of Health, Research Triangle Park, NC 27709, USA
David C Fargo Integrative Bioinformatics Support Group, National Institute of Environmental Health Sciences/National Institutes of Health, Research Triangle Park, NC 27709, USA; Office of Scientific Computing, National Institute of Environmental Health Sciences/National Institutes of Health, Research Triangle Park, NC 27709, USA
Carl W Anderson Genome Integrity & Structural Biology Laboratory, National Institute of Environmental Health Sciences/National Institutes of Health, Research Triangle Park, NC 27709, USA
Leping Li Biostatistics & Computational Biology Branch, National Institute of Environmental Health Sciences/National Institutes of Health, Research Triangle Park, NC 27709, USA
Michael A Resnick Genome Integrity & Structural Biology Laboratory, National Institute of Environmental Health Sciences/National Institutes of Health, Research Triangle Park, NC 27709, USA
Daniel Menendez Genome Integrity & Structural Biology Laboratory, National Institute of Environmental Health Sciences/National Institutes of Health, Research Triangle Park, NC 27709, USA

Henriques T, Scruggs BS, Inouye MO, Muse GW, Williams LH, Burkholder AB, Lavender CA, Fargo DC, Adelman K. Widespread transcriptional pausing and elongation control at enhancers. Genes Dev 2018;32:26-41. [PMID: 29378787 PMCID: PMC5828392 DOI: 10.1101/gad.309351.117] [Citation(s) in RCA: 215] [Impact Index Per Article: 35.8] [Received: 11/07/2017] [Accepted: 12/21/2017] [Indexed: 02/07/2023]

Abstract

In this study, Henriques et al. demonstrate that transcription is a nearly universal feature of enhancers in Drosophila and mammalian cells and that nascent RNA sequencing strategies are optimal for identification of both enhancers and superenhancers. Their findings provide insights into the unique characteristics of superenhancers, which stimulate high-level gene expression through rapid pause release; interestingly, this property renders associated genes resistant to loss of factors that stabilize paused RNAPII.

Regulation by gene-distal enhancers is critical for cell type-specific and condition-specific patterns of gene expression. Thus, to understand the basis of gene activity in a given cell type or tissue, we must identify the precise locations of enhancers and functionally characterize their behaviors. Here, we demonstrate that transcription is a nearly universal feature of enhancers in Drosophila and mammalian cells and that nascent RNA sequencing strategies are optimal for identification of both enhancers and superenhancers. We dissect the mechanisms governing enhancer transcription and discover remarkable similarities to transcription at protein-coding genes. We show that RNA polymerase II (RNAPII) undergoes regulated pausing and release at enhancers. However, as compared with mRNA genes, RNAPII at enhancers is less stable and more prone to early termination. Furthermore, we found that the level of histone H3 Lys4 (H3K4) methylation at enhancers corresponds to transcriptional activity such that highly active enhancers display H3K4 trimethylation rather than the H3K4 monomethylation considered a hallmark of enhancers. Finally, our work provides insights into the unique characteristics of superenhancers, which stimulate high-level gene expression through rapid pause release; interestingly, this property renders associated genes resistant to the loss of factors that stabilize paused RNAPII.
