Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For:	Najim SA, Lim IS. Trustworthy dimension reduction for visualization different data sets. Inf Sci (N Y) 2014. [DOI: 10.1016/j.ins.2014.03.048] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]

Number

Cited by Other Article(s)

EMBEDR: Distinguishing signal from noise in single-cell omics data. PATTERNS (NEW YORK, N.Y.) 2022;3:100443. [PMID: 35510181 PMCID: PMC9058925 DOI: 10.1016/j.patter.2022.100443] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/17/2021] [Revised: 06/25/2021] [Accepted: 01/14/2022] [Indexed: 01/16/2023]

Abstract

Single-cell “omics”-based measurements are often high dimensional so that dimensionality reduction (DR) algorithms are necessary for data visualization and analysis. The lack of methods for separating signal from noise in DR outputs has limited their utility in generating data-driven discoveries in single-cell data. In this work we present EMBEDR, which assesses the output of any DR algorithm to distinguish evidence of structure from algorithm-induced noise in DR outputs. We apply EMBEDR to DR-generated representations of single-cell omics data of several modalities to show where they visually show real—not spurious—structure. EMBEDR generates a “p” value for each sample, allowing for direct comparisons of DR algorithms and facilitating optimization of algorithm hyperparameters. We show that the scale of a sample’s neighborhood can thus be determined and used to generate a novel “cell-wise optimal” embedding. EMBEDR is available as a Python package for immediate use.

An overview of the benefits and difficulties of dimensionality reduction

A novel algorithm for quantifying and identifying quality within embeddings of data

Quality can be optimized to find data scales and set algorithm parameters

A cell-wise view of quality generates robust and interpretable representations of data

Modern technologies have enabled biologists to construct enormous datasets containing millions of observations of thousands of measurements. These datasets push the limits of traditional analysis techniques, leaving doubts about the quality and fidelity of these methods. In this work, we present a sort of meta-algorithm, called EMBEDR, which seeks to evaluate when a certain class of methods, known as dimensionality reduction methods, are generating high-quality representations of data. We show that EMBEDR allows for visualizations of even large datasets to be interpreted with confidence. Furthermore, we show how asking about the method quality itself can lead to improved analyses of data. These improved analyses may directly impact our understanding of cellular biology, including how cells behave, grow, and respond to stimuli.

Collapse

Ren F, Liu N. Emotion computing using Word Mover's Distance features based on Ren_CECps. PLoS One 2018;13:e0194136. [PMID: 29624573 PMCID: PMC5889067 DOI: 10.1371/journal.pone.0194136] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2017] [Accepted: 02/09/2018] [Indexed: 11/27/2022] Open