1
|
Lee A, Tsekouras K, Calderon C, Bustamante C, Pressé S. Unraveling the Thousand Word Picture: An Introduction to Super-Resolution Data Analysis. Chem Rev 2017; 117:7276-7330. [PMID: 28414216 PMCID: PMC5487374 DOI: 10.1021/acs.chemrev.6b00729] [Citation(s) in RCA: 61] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
Super-resolution microscopy provides direct insight into fundamental biological processes occurring at length scales smaller than light's diffraction limit. The analysis of data at such scales has brought statistical and machine learning methods into the mainstream. Here we provide a survey of data analysis methods starting from an overview of basic statistical techniques underlying the analysis of super-resolution and, more broadly, imaging data. We subsequently break down the analysis of super-resolution data into four problems: the localization problem, the counting problem, the linking problem, and what we've termed the interpretation problem.
Collapse
Affiliation(s)
- Antony Lee
- Department of Physics, University of California at Berkeley, Berkeley, California 94720, United States
- Jason L. Choy Laboratory of Single-Molecule Biophysics, University of California at Berkeley, Berkeley, California 94720, United States
| | - Konstantinos Tsekouras
- Department of Physics, University of California at Berkeley, Berkeley, California 94720, United States
- Department of Physics, Arizona State University, Tempe, Arizona 85287, United States
| | | | - Carlos Bustamante
- Jason L. Choy Laboratory of Single-Molecule Biophysics, University of California at Berkeley, Berkeley, California 94720, United States
- Biophysics Graduate Group, University of California at Berkeley, Berkeley, California 94720, United States
- Institute for Quantitative Biosciences-QB3, University of California at Berkeley, Berkeley, California 94720, United States
- Department of Molecular and Cell Biology, University of California at Berkeley, Berkeley, California 94720, United States
- Department of Chemistry, University of California at Berkeley, Berkeley, California 94720, United States
- Howard Hughes Medical Institute, University of California at Berkeley, Berkeley, California 94720, United States
- Kavli Energy Nanosciences Institute, University of California at Berkeley, Berkeley, California 94720, United States
| | - Steve Pressé
- Department of Physics, University of California at Berkeley, Berkeley, California 94720, United States
- Department of Chemistry and Chemical Biology, Indiana University–Purdue University Indianapolis, Indianapolis, Indiana 46202, United States
- Department of Cell and Integrative Physiology, Indiana University School of Medicine, Indianapolis, Indiana 46202, United States
- School of Molecular Sciences, Arizona State University, Tempe, Arizona 85287, United States
- Department of Physics, Arizona State University, Tempe, Arizona 85287, United States
| |
Collapse
|
2
|
Calderon CP. Motion blur filtering: A statistical approach for extracting confinement forces and diffusivity from a single blurred trajectory. Phys Rev E 2016; 93:053303. [PMID: 27301001 DOI: 10.1103/physreve.93.053303] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2015] [Indexed: 12/13/2022]
Abstract
Single particle tracking (SPT) can aid in understanding a variety of complex spatiotemporal processes. However, quantifying diffusivity and confinement forces from individual live cell trajectories is complicated by inter- and intratrajectory kinetic heterogeneity, thermal fluctuations, and (experimentally resolvable) statistical temporal dependence inherent to the underlying molecule's time correlated confined dynamics experienced in the cell. The problem is further complicated by experimental artifacts such as localization uncertainty and motion blur. The latter is caused by the tagged molecule emitting photons at different spatial positions during the exposure time of a single frame. The aforementioned experimental artifacts induce spurious time correlations in measured SPT time series that obscure the information of interest (e.g., confinement forces and diffusivity). We develop a maximum likelihood estimation (MLE) technique that decouples the above noise sources and systematically treats temporal correlation via time series methods. This ultimately permits a reliable algorithm for extracting diffusivity and effective forces in confined or unconfined environments. We illustrate how our approach avoids complications inherent to mean square displacement or autocorrelation techniques. Our algorithm modifies the established Kalman filter (which does not handle motion blur artifacts) to provide a likelihood based time series estimation procedure. The result extends A. J. Berglund's motion blur model [Phys. Rev. E 82, 011917 (2010)PLEEE81539-375510.1103/PhysRevE.82.011917] to handle confined dynamics. The approach can also systematically utilize (possibly time dependent) localization uncertainty estimates afforded by image analysis if available. This technique, which explicitly treats confinement and motion blur within a time domain MLE framework, uses an exact likelihood (time domain methods facilitate analyzing nonstationary signals). Our estimator is demonstrated to be consistent over a wide range of exposure times (5 to 100 ms), diffusion coefficients (1×10^{-3} to 1μm^{2}/s), and confinement widths (100 nm to 2μm). We demonstrate that neglecting motion blur or confinement can substantially bias estimation of kinetic parameters of interest to researchers. The technique also permits one to check statistical model assumptions against measured individual trajectories without "ground truth." The ability to reliably and consistently extract motion parameters in trajectories exhibiting confined and/or non-stationary dynamics, without exposure time artifacts corrupting estimates, is expected to aid in directly comparing trajectories obtained from different experiments or imaging modalities. A Python implementation is provided (open-source code will be maintained on GitHub; see also the Supplemental Material with this paper).
Collapse
|
3
|
Calderon CP, Bloom K. Inferring Latent States and Refining Force Estimates via Hierarchical Dirichlet Process Modeling in Single Particle Tracking Experiments. PLoS One 2015; 10:e0137633. [PMID: 26384324 PMCID: PMC4575198 DOI: 10.1371/journal.pone.0137633] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/01/2015] [Accepted: 08/20/2015] [Indexed: 12/14/2022] Open
Abstract
Understanding the basis for intracellular motion is critical as the field moves toward a deeper understanding of the relation between Brownian forces, molecular crowding, and anisotropic (or isotropic) energetic forcing. Effective forces and other parameters used to summarize molecular motion change over time in live cells due to latent state changes, e.g., changes induced by dynamic micro-environments, photobleaching, and other heterogeneity inherent in biological processes. This study discusses limitations in currently popular analysis methods (e.g., mean square displacement-based analyses) and how new techniques can be used to systematically analyze Single Particle Tracking (SPT) data experiencing abrupt state changes in time or space. The approach is to track GFP tagged chromatids in metaphase in live yeast cells and quantitatively probe the effective forces resulting from dynamic interactions that reflect the sum of a number of physical phenomena. State changes can be induced by various sources including: microtubule dynamics exerting force through the centromere, thermal polymer fluctuations, and DNA-based molecular machines including polymerases and protein exchange complexes such as chaperones and chromatin remodeling complexes. Simulations aiming to show the relevance of the approach to more general SPT data analyses are also studied. Refined force estimates are obtained by adopting and modifying a nonparametric Bayesian modeling technique, the Hierarchical Dirichlet Process Switching Linear Dynamical System (HDP-SLDS), for SPT applications. The HDP-SLDS method shows promise in systematically identifying dynamical regime changes induced by unobserved state changes when the number of underlying states is unknown in advance (a common problem in SPT applications). We expand on the relevance of the HDP-SLDS approach, review the relevant background of Hierarchical Dirichlet Processes, show how to map discrete time HDP-SLDS models to classic SPT models, and discuss limitations of the approach. In addition, we demonstrate new computational techniques for tuning hyperparameters and for checking the statistical consistency of model assumptions directly against individual experimental trajectories; the techniques circumvent the need for "ground-truth" and/or subjective information.
Collapse
Affiliation(s)
| | - Kerry Bloom
- Department of Biology, University of North Carolina, Chapel Hill, NC, United States of America
| |
Collapse
|
4
|
Calderon CP. Data-driven techniques for detecting dynamical state changes in noisily measured 3D single-molecule trajectories. Molecules 2014; 19:18381-98. [PMID: 25397733 PMCID: PMC6271607 DOI: 10.3390/molecules191118381] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2014] [Revised: 10/28/2014] [Accepted: 10/29/2014] [Indexed: 11/16/2022] Open
Abstract
Optical microscopes and nanoscale probes (AFM, optical tweezers, etc.) afford researchers tools capable of quantitatively exploring how molecules interact with one another in live cells. The analysis of in vivo single-molecule experimental data faces numerous challenges due to the complex, crowded, and time changing environments associated with live cells. Fluctuations and spatially varying systematic forces experienced by molecules change over time; these changes are obscured by "measurement noise" introduced by the experimental probe monitoring the system. In this article, we demonstrate how the Hierarchical Dirichlet Process Switching Linear Dynamical System (HDP-SLDS) of Fox et al. [IEEE Transactions on Signal Processing 59] can be used to detect both subtle and abrupt state changes in time series containing "thermal" and "measurement" noise. The approach accounts for temporal dependencies induced by random and "systematic overdamped" forces. The technique does not require one to subjectively select the number of "hidden states" underlying a trajectory in an a priori fashion. The number of hidden states is simultaneously inferred along with change points and parameters characterizing molecular motion in a data-driven fashion. We use large scale simulations to study and compare the new approach to state-of-the-art Hidden Markov Modeling techniques. Simulations mimicking single particle tracking (SPT) experiments are the focus of this study.
Collapse
|
5
|
Calderon CP, Weiss LE, Moerner WE. Robust hypothesis tests for detecting statistical evidence of two-dimensional and three-dimensional interactions in single-molecule measurements. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2014; 89:052705. [PMID: 25353827 DOI: 10.1103/physreve.89.052705] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/06/2013] [Indexed: 06/04/2023]
Abstract
Experimental advances have improved the two- (2D) and three-dimensional (3D) spatial resolution that can be extracted from in vivo single-molecule measurements. This enables researchers to quantitatively infer the magnitude and directionality of forces experienced by biomolecules in their native environment. Situations where such force information is relevant range from mitosis to directed transport of protein cargo along cytoskeletal structures. Models commonly applied to quantify single-molecule dynamics assume that effective forces and velocity in the x,y (or x,y,z) directions are statistically independent, but this assumption is physically unrealistic in many situations. We present a hypothesis testing approach capable of determining if there is evidence of statistical dependence between positional coordinates in experimentally measured trajectories; if the hypothesis of independence between spatial coordinates is rejected, then a new model accounting for 2D (3D) interactions can and should be considered. Our hypothesis testing technique is robust, meaning it can detect interactions, even if the noise statistics are not well captured by the model. The approach is demonstrated on control simulations and on experimental data (directed transport of intraflagellar transport protein 88 homolog in the primary cilium).
Collapse
Affiliation(s)
| | - Lucien E Weiss
- Department of Chemistry, Stanford University, Stanford, California 94305, USA
| | - W E Moerner
- Department of Chemistry, Stanford University, Stanford, California 94305, USA
| |
Collapse
|
6
|
Calderon CP, Thompson MA, Casolari JM, Paffenroth RC, Moerner WE. Quantifying transient 3D dynamical phenomena of single mRNA particles in live yeast cell measurements. J Phys Chem B 2013; 117:15701-13. [PMID: 24015725 DOI: 10.1021/jp4064214] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Single-particle tracking (SPT) has been extensively used to obtain information about diffusion and directed motion in a wide range of biological applications. Recently, new methods have appeared for obtaining precise (10s of nm) spatial information in three dimensions (3D) with high temporal resolution (measurements obtained every 4 ms), which promise to more accurately sense the true dynamical behavior in the natural 3D cellular environment. Despite the quantitative 3D tracking information, the range of mathematical methods for extracting information about the underlying system has been limited mostly to mean-squared displacement analysis and other techniques not accounting for complex 3D kinetic interactions. There is a great need for new analysis tools aiming to more fully extract the biological information content from in vivo SPT measurements. High-resolution SPT experimental data has enormous potential to objectively scrutinize various proposed mechanistic schemes arising from theoretical biophysics and cell biology. At the same time, methods for rigorously checking the statistical consistency of both model assumptions and estimated parameters against observed experimental data (i.e., goodness-of-fit tests) have not received great attention. We demonstrate methods enabling (1) estimation of the parameters of 3D stochastic differential equation (SDE) models of the underlying dynamics given only one trajectory; and (2) construction of hypothesis tests checking the consistency of the fitted model with the observed trajectory so that extracted parameters are not overinterpreted (the tools are applicable to linear or nonlinear SDEs calibrated from nonstationary time series data). The approach is demonstrated on high-resolution 3D trajectories of single ARG3 mRNA particles in yeast cells in order to show the power of the methods in detecting signatures of transient directed transport. The methods presented are generally relevant to a wide variety of 2D and 3D SPT tracking applications.
Collapse
|