1
|
Tu D, Wrobel J, Satterthwaite TD, Goldsmith J, Gur RC, Gur RE, Gertheiss J, Bassett DS, Shinohara RT. Regression and alignment for functional data and network topology. Biostatistics 2024; 26:kxae026. [PMID: 39140988 PMCID: PMC11822954 DOI: 10.1093/biostatistics/kxae026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2023] [Revised: 07/05/2024] [Accepted: 07/08/2024] [Indexed: 08/15/2024] Open
Abstract
In the brain, functional connections form a network whose topological organization can be described by graph-theoretic network diagnostics. These include characterizations of the community structure, such as modularity and participation coefficient, which have been shown to change over the course of childhood and adolescence. To investigate if such changes in the functional network are associated with changes in cognitive performance during development, network studies often rely on an arbitrary choice of preprocessing parameters, in particular the proportional threshold of network edges. Because the choice of parameter can impact the value of the network diagnostic, and therefore downstream conclusions, we propose to circumvent that choice by conceptualizing the network diagnostic as a function of the parameter. As opposed to a single value, a network diagnostic curve describes the connectome topology at multiple scales-from the sparsest group of the strongest edges to the entire edge set. To relate these curves to executive function and other covariates, we use scalar-on-function regression, which is more flexible than previous functional data-based models used in network neuroscience. We then consider how systematic differences between networks can manifest in misalignment of diagnostic curves, and consequently propose a supervised curve alignment method that incorporates auxiliary information from other variables. Our algorithm performs both functional regression and alignment via an iterative, penalized, and nonlinear likelihood optimization. The illustrated method has the potential to improve the interpretability and generalizability of neuroscience studies where the goal is to study heterogeneity among a mixture of function- and scalar-valued measures.
Collapse
Affiliation(s)
- Danni Tu
- The Penn Statistics in Imaging and Visualization Endeavor (PennSIVE), Department of Biostatistics, Epidemiology, and Informatics, 423 Guardian Drive, University of Pennsylvania, Philadelphia, PA, 19104, United States
| | - Julia Wrobel
- Department of Biostatistics and Bioinformatics, 1518 Clifton Rd. NE, Emory University, Atlanta, GA, 30322, United States
| | - Theodore D Satterthwaite
- Department of Psychiatry, 3700 Hamilton Walk, Perelman School of Medicine, Philadelphia, PA, 19104, United States
- Penn Lifespan Informatics and Neuroimaging Center, 3700 Hamilton Walk, Philadelphia, PA, 19104, United States
| | - Jeff Goldsmith
- Department of Biostatistics, 722 W. 168th St, Columbia University, New York, NY, 10032, United States
| | - Ruben C Gur
- Department of Psychiatry, 3700 Hamilton Walk, Perelman School of Medicine, Philadelphia, PA, 19104, United States
- The Penn Medicine-CHOP Lifespan Brain Institute, 3700 Hamilton Walk, Philadelphia, PA, 19104, United States
| | - Raquel E Gur
- Department of Psychiatry, 3700 Hamilton Walk, Perelman School of Medicine, Philadelphia, PA, 19104, United States
- The Penn Medicine-CHOP Lifespan Brain Institute, 3700 Hamilton Walk, Philadelphia, PA, 19104, United States
| | - Jan Gertheiss
- Department of Mathematics and Statistics, School of Economics and Social Sciences, Holstenhofweg 85, Helmut Schmidt University, 22043 Hamburg, Germany
| | - Dani S Bassett
- Department of Bioengineering, 210 S 33rd St, University of Pennsylvania, Philadelphia, PA, 19104, United States
- Department of Physics and Astronomy, 209 S 33rd St, University of Pennsylvania, Philadelphia, PA, 19104, United States
- Department of Electrical and Systems Engineering, 200 S 33rd St, University of Pennsylvania, Philadelphia, PA, 19104, United States
- Department of Neurology, 3400 Spruce St, University of Pennsylvania, Philadelphia, PA, 19104, United States
| | - Russell T Shinohara
- The Penn Statistics in Imaging and Visualization Endeavor (PennSIVE), Department of Biostatistics, Epidemiology, and Informatics, 423 Guardian Drive, University of Pennsylvania, Philadelphia, PA, 19104, United States
| |
Collapse
|
2
|
Conde S, Tavakoli S, Ezer D. Functional regression clustering with multiple functional gene expressions. PLoS One 2024; 19:e0310991. [PMID: 39585813 PMCID: PMC11588248 DOI: 10.1371/journal.pone.0310991] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2024] [Accepted: 08/26/2024] [Indexed: 11/27/2024] Open
Abstract
Gene expression data is often collected in time series experiments, under different experimental conditions. There may be genes that have very different gene expression profiles over time, but that adjust their gene expression patterns in the same way under experimental conditions. Our aim is to develop a method that finds clusters of genes in which the relationship between these temporal gene expression profiles are similar to one another, even if the individual temporal gene expression profiles differ. We propose a K-means-type algorithm in which each cluster is defined by a function-on-function regression model, which, inter alia, allows for multiple functional explanatory variables. We validate this novel approach through extensive simulations and then apply it to identify groups of genes whose diurnal expression pattern is perturbed by the season in a similar way. Our clusters are enriched for genes with similar biological functions, including one cluster enriched in both photosynthesis-related functions and polysomal ribosomes, which shows that our method provides useful and novel biological insights.
Collapse
Affiliation(s)
- Susana Conde
- Department of Statistics, University of Warwick, Coventry, United Kingdom
- The Alan Turing Institute, London, United Kingdom
- Department of Biology, University of York, York, United Kingdom
- School of Mathematical Sciences, University of Southampton, Southampton, United Kingdom
| | - Shahin Tavakoli
- Research Institute for Statistics and Information Science, Geneva School of Economics and Management, University of Geneva, Geneva, Switzerland
| | - Daphne Ezer
- Department of Statistics, University of Warwick, Coventry, United Kingdom
- The Alan Turing Institute, London, United Kingdom
- Department of Biology, University of York, York, United Kingdom
| |
Collapse
|
3
|
Tu D, Wrobel J, Satterthwaite TD, Goldsmith J, Gur RC, Gur RE, Gertheiss J, Bassett DS, Shinohara RT. Regression and Alignment for Functional Data and Network Topology. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.07.13.548836. [PMID: 37503017 PMCID: PMC10370026 DOI: 10.1101/2023.07.13.548836] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/29/2023]
Abstract
In the brain, functional connections form a network whose topological organization can be described by graph-theoretic network diagnostics. These include characterizations of the community structure, such as modularity and participation coefficient, which have been shown to change over the course of childhood and adolescence. To investigate if such changes in the functional network are associated with changes in cognitive performance during development, network studies often rely on an arbitrary choice of pre-processing parameters, in particular the proportional threshold of network edges. Because the choice of parameter can impact the value of the network diagnostic, and therefore downstream conclusions, we propose to circumvent that choice by conceptualizing the network diagnostic as a function of the parameter. As opposed to a single value, a network diagnostic curve describes the connectome topology at multiple scales-from the sparsest group of the strongest edges to the entire edge set. To relate these curves to executive function and other covariates, we use scalar-on-function regression, which is more flexible than previous functional data-based models used in network neuroscience. We then consider how systematic differences between networks can manifest in misalignment of diagnostic curves, and consequently propose a supervised curve alignment method that incorporates auxiliary information from other variables. Our algorithm performs both functional regression and alignment via an iterative, penalized, and nonlinear likelihood optimization. The illustrated method has the potential to improve the interpretability and generalizability of neuroscience studies where the goal is to study heterogeneity among a mixture of function- and scalar-valued measures.
Collapse
Affiliation(s)
- Danni Tu
- The Penn Statistics in Imaging and Visualization Endeavor (PennSIVE), Department of Biostatistics, Epidemiology, and Informatics, University of Pennsylvania, Philadelphia, PA, USA
| | - Julia Wrobel
- Department of Biostatistics and Bioinformatics, Emory University, Atlanta, GA, USA
| | - Theodore D. Satterthwaite
- Department of Psychiatry, Perelman School of Medicine, Philadelphia, PA USA
- Penn Lifespan Informatics and Neuroimaging Center, Philadelphia, PA, USA
| | - Jeff Goldsmith
- Department of Biostatistics, Columbia University, New York, NY, USA
| | - Ruben C. Gur
- Department of Psychiatry, Perelman School of Medicine, Philadelphia, PA USA
- The Penn Medicine-CHOP Lifespan Brain Institute, Philadelphia, PA, USA
| | - Raquel E. Gur
- Department of Psychiatry, Perelman School of Medicine, Philadelphia, PA USA
- The Penn Medicine-CHOP Lifespan Brain Institute, Philadelphia, PA, USA
| | - Jan Gertheiss
- Department of Mathematics and Statistics, School of Economics and Social Sciences, Helmut Schmidt University, Hamburg, Germany
| | - Dani S. Bassett
- Department of Bioengineering, University of Pennsylvania, Philadelphia, PA, USA
- Department of Physics and Astronomy, University of Pennsylvania, Philadelphia, PA, USA
- Department of Electrical and Systems Engineering, University of Pennsylvania, Philadelphia, PA, USA
- Department of Neurology, University of Pennsylvania, Philadelphia, PA, USA
| | - Russell T. Shinohara
- The Penn Statistics in Imaging and Visualization Endeavor (PennSIVE), Department of Biostatistics, Epidemiology, and Informatics, University of Pennsylvania, Philadelphia, PA, USA
| |
Collapse
|
4
|
Meng S, Huang Z, Zhang J, Jiang Z. Estimation on functional partially linear single index measurement error model. COMMUN STAT-THEOR M 2021. [DOI: 10.1080/03610926.2021.1999979] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
Affiliation(s)
- Shuyu Meng
- School of Science, Nanjing University of Science and Technology, Nanjing, Jiangsu, P. R. China
| | - Zhensheng Huang
- School of Science, Nanjing University of Science and Technology, Nanjing, Jiangsu, P. R. China
| | - Jing Zhang
- School of Science, Nanjing University of Science and Technology, Nanjing, Jiangsu, P. R. China
| | - Zhiqiang Jiang
- School of Science, Nanjing University of Science and Technology, Nanjing, Jiangsu, P. R. China
| |
Collapse
|
5
|
Chakraborty A, Panaretos VM. Functional registration and local variations: Identifiability, rank, and tuning. BERNOULLI 2021. [DOI: 10.3150/20-bej1267] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Affiliation(s)
| | - Victor M. Panaretos
- Institut de Mathématiques, École Polytechnique Fédérale de Lausanne (EPFL), Switzerland
| |
Collapse
|
6
|
Wang B, Luo X, Zhao Y, Caffo B. Semiparametric partial common principal component analysis for covariance matrices. Biometrics 2020; 77:1175-1186. [PMID: 32935852 DOI: 10.1111/biom.13369] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2019] [Revised: 08/30/2020] [Accepted: 09/01/2020] [Indexed: 11/29/2022]
Abstract
We consider the problem of jointly modeling multiple covariance matrices by partial common principal component analysis (PCPCA), which assumes a proportion of eigenvectors to be shared across covariance matrices and the rest to be individual-specific. This paper proposes consistent estimators of the shared eigenvectors in the PCPCA as the number of matrices or the number of samples to estimate each matrix goes to infinity. We prove such asymptotic results without making any assumptions on the ranks of eigenvalues that are associated with the shared eigenvectors. When the number of samples goes to infinity, our results do not require the data to be Gaussian distributed. Furthermore, this paper introduces a sequential testing procedure to identify the number of shared eigenvectors in the PCPCA. In simulation studies, our method shows higher accuracy in estimating the shared eigenvectors than competing methods. Applied to a motor-task functional magnetic resonance imaging data set, our estimator identifies meaningful brain networks that are consistent with current scientific understandings of motor networks during a motor paradigm.
Collapse
Affiliation(s)
- Bingkai Wang
- Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland
| | - Xi Luo
- The University of Texas, Health Science Center at Houston School of Public Health, Houston, Texas
| | - Yi Zhao
- Indiana University School of Medicine, Indianapolis, Indiana
| | - Brian Caffo
- Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland
| |
Collapse
|
7
|
|
8
|
Tavakoli S, Pigoli D, Aston JAD, Coleman JS. A Spatial Modeling Approach for Linguistic Object Data: Analyzing Dialect Sound Variations Across Great Britain. J Am Stat Assoc 2019. [DOI: 10.1080/01621459.2019.1607357] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
Affiliation(s)
- Shahin Tavakoli
- Department of Statistics, University of Warwick, Coventry, UK
| | - Davide Pigoli
- Department of Mathematics, King’s College London, London, UK
| | | | | |
Collapse
|
9
|
Lila E, Aston JAD. Statistical Analysis of Functions on Surfaces, With an Application to Medical Imaging. J Am Stat Assoc 2019. [DOI: 10.1080/01621459.2019.1635479] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
Affiliation(s)
- Eardi Lila
- Cambridge Centre for Analysis, University of Cambridge, Cambridge, UK
| | - John A. D. Aston
- Statistical Laboratory, DPMMS, University of Cambridge, Cambridge, UK
| |
Collapse
|
10
|
Wrobel J, Zipunnikov V, Schrack J, Goldsmith J. Registration for exponential family functional data. Biometrics 2019; 75:48-57. [PMID: 30129091 PMCID: PMC10585654 DOI: 10.1111/biom.12963] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2017] [Revised: 08/01/2018] [Accepted: 08/01/2018] [Indexed: 12/01/2022]
Abstract
We introduce a novel method for separating amplitude and phase variability in exponential family functional data. Our method alternates between two steps: the first uses generalized functional principal components analysis to calculate template functions, and the second estimates smooth warping functions that map observed curves to templates. Existing approaches to registration have primarily focused on continuous functional observations, and the few approaches for discrete functional data require a pre-smoothing step; these methods are frequently computationally intensive. In contrast, we focus on the likelihood of the observed data and avoid the need for preprocessing, and we implement both steps of our algorithm in a computationally efficient way. Our motivation comes from the Baltimore Longitudinal Study on Aging, in which accelerometer data provides valuable insights into the timing of sedentary behavior. We analyze binary functional data with observations each minute over 24 hours for 592 participants, where values represent activity and inactivity. Diurnal patterns of activity are obscured due to misalignment in the original data but are clear after curves are aligned. Simulations designed to mimic the application indicate that the proposed methods outperform competing approaches in terms of estimation accuracy and computational efficiency. Code for our method and simulations is publicly available.
Collapse
Affiliation(s)
- Julia Wrobel
- Department of Biostatistics, Mailman School of Public Health, Columbia University, New York, New York, U.S.A
| | - Vadim Zipunnikov
- Department of Biostatistics, Bloomberg School of Public Health, Johns Hopkins University, Baltimore, Maryland, U.S.A
| | - Jennifer Schrack
- Department of Epidemiology, Bloomberg School of Public Health, Johns Hopkins University, Baltimore, Maryland, U.S.A
- Longitudinal Studies Section, Translational Gerontology Branch, National Institute on Aging, National Institutes of Health, Bethesda, Maryland, U.S.A
| | - Jeff Goldsmith
- Department of Biostatistics, Mailman School of Public Health, Columbia University, New York, New York, U.S.A
| |
Collapse
|
11
|
Zhu H, Zhang R, Yu Z, Lian H, Liu Y. Estimation and testing for partially functional linear errors-in-variables models. J MULTIVARIATE ANAL 2019. [DOI: 10.1016/j.jmva.2018.11.005] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
12
|
Happ C, Scheipl F, Gabriel A, Greven S. A general framework for multivariate functional principal component analysis of amplitude and phase variation. Stat (Int Stat Inst) 2019. [DOI: 10.1002/sta4.220] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Affiliation(s)
- Clara Happ
- Department of StatisticsLMU Munich Munich Germany
| | | | | | - Sonja Greven
- Department of StatisticsLMU Munich Munich Germany
| |
Collapse
|
13
|
Leroux A, Xiao L, Crainiceanu C, Checkley W. Dynamic prediction in functional concurrent regression with an application to child growth. Stat Med 2018; 37:1376-1388. [PMID: 29230836 PMCID: PMC5847461 DOI: 10.1002/sim.7582] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2016] [Revised: 11/07/2017] [Accepted: 11/12/2017] [Indexed: 12/24/2022]
Abstract
In many studies, it is of interest to predict the future trajectory of subjects based on their historical data, referred to as dynamic prediction. Mixed effects models have traditionally been used for dynamic prediction. However, the commonly used random intercept and slope model is often not sufficiently flexible for modeling subject-specific trajectories. In addition, there may be useful exposures/predictors of interest that are measured concurrently with the outcome, complicating dynamic prediction. To address these problems, we propose a dynamic functional concurrent regression model to handle the case where both the functional response and the functional predictors are irregularly measured. Currently, such a model cannot be fit by existing software. We apply the model to dynamically predict children's length conditional on prior length, weight, and baseline covariates. Inference on model parameters and subject-specific trajectories is conducted using the mixed effects representation of the proposed model. An extensive simulation study shows that the dynamic functional regression model provides more accurate estimation and inference than existing methods. Methods are supported by fast, flexible, open source software that uses heavily tested smoothing techniques.
Collapse
Affiliation(s)
- Andrew Leroux
- Department of BiostatisticsJohns Hopkins UniversityBaltimoreMD 21205USA
| | - Luo Xiao
- Department of StatisticsNorth Carolina State UniversityRaleighNC 27606USA
| | | | | |
Collapse
|
14
|
Olsen NL, Markussen B, Raket LL. Simultaneous inference for misaligned multivariate functional data. J R Stat Soc Ser C Appl Stat 2018. [DOI: 10.1111/rssc.12276] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
|
15
|
Grigsby MR, Di J, Leroux A, Zipunnikov V, Xiao L, Crainiceanu C, Checkley W. Novel metrics for growth model selection. Emerg Themes Epidemiol 2018; 15:4. [PMID: 29483933 PMCID: PMC5824542 DOI: 10.1186/s12982-018-0072-z] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2017] [Accepted: 02/14/2018] [Indexed: 12/27/2022] Open
Abstract
BACKGROUND Literature surrounding the statistical modeling of childhood growth data involves a diverse set of potential models from which investigators can choose. However, the lack of a comprehensive framework for comparing non-nested models leads to difficulty in assessing model performance. This paper proposes a framework for comparing non-nested growth models using novel metrics of predictive accuracy based on modifications of the mean squared error criteria. METHODS Three metrics were created: normalized, age-adjusted, and weighted mean squared error (MSE). Predictive performance metrics were used to compare linear mixed effects models and functional regression models. Prediction accuracy was assessed by partitioning the observed data into training and test datasets. This partitioning was constructed to assess prediction accuracy for backward (i.e., early growth), forward (i.e., late growth), in-range, and on new-individuals. Analyses were done with height measurements from 215 Peruvian children with data spanning from near birth to 2 years of age. RESULTS Functional models outperformed linear mixed effects models in all scenarios tested. In particular, prediction errors for functional concurrent regression (FCR) and functional principal component analysis models were approximately 6% lower when compared to linear mixed effects models. When we weighted subject-specific MSEs according to subject-specific growth rates during infancy, we found that FCR was the best performer in all scenarios. CONCLUSION With this novel approach, we can quantitatively compare non-nested models and weight subgroups of interest to select the best performing growth model for a particular application or problem at hand.
Collapse
Affiliation(s)
- Matthew R. Grigsby
- Division of Pulmonary and Critical Care, School of Medicine, Johns Hopkins University, 1830 E. Monument Street, 5th Floor, Baltimore, MD 21287 USA
| | - Junrui Di
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD USA
| | - Andrew Leroux
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD USA
| | - Vadim Zipunnikov
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD USA
| | - Luo Xiao
- Department of Statistics, North Carolina State University, Raleigh, NC USA
| | - Ciprian Crainiceanu
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD USA
| | - William Checkley
- Division of Pulmonary and Critical Care, School of Medicine, Johns Hopkins University, 1830 E. Monument Street, 5th Floor, Baltimore, MD 21287 USA
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD USA
| |
Collapse
|
16
|
Application of Linear Mixed-Effects Models in Human Neuroscience Research: A Comparison with Pearson Correlation in Two Auditory Electrophysiology Studies. Brain Sci 2017; 7:brainsci7030026. [PMID: 28264422 PMCID: PMC5366825 DOI: 10.3390/brainsci7030026] [Citation(s) in RCA: 52] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2016] [Revised: 02/15/2017] [Accepted: 02/24/2017] [Indexed: 01/08/2023] Open
Abstract
Neurophysiological studies are often designed to examine relationships between measures from different testing conditions, time points, or analysis techniques within the same group of participants. Appropriate statistical techniques that can take into account repeated measures and multivariate predictor variables are integral and essential to successful data analysis and interpretation. This work implements and compares conventional Pearson correlations and linear mixed-effects (LME) regression models using data from two recently published auditory electrophysiology studies. For the specific research questions in both studies, the Pearson correlation test is inappropriate for determining strengths between the behavioral responses for speech-in-noise recognition and the multiple neurophysiological measures as the neural responses across listening conditions were simply treated as independent measures. In contrast, the LME models allow a systematic approach to incorporate both fixed-effect and random-effect terms to deal with the categorical grouping factor of listening conditions, between-subject baseline differences in the multiple measures, and the correlational structure among the predictor variables. Together, the comparative data demonstrate the advantages as well as the necessity to apply mixed-effects models to properly account for the built-in relationships among the multiple predictor variables, which has important implications for proper statistical modeling and interpretation of human behavior in terms of neural correlates and biomarkers.
Collapse
|
17
|
Abstract
We congratulate the authors for their excellent work that provides a clear overview of the large and now mature field of regression models for functional data. We here complement their discussion indicating some directions of further research that we deem particularly important.
Collapse
|
18
|
Abstract
Researchers are increasingly interested in regression models for functional data. This article discusses a comprehensive framework for additive (mixed) models for functional responses and/or functional covariates based on the guiding principle of reframing functional regression in terms of corresponding models for scalar data, allowing the adaptation of a large body of existing methods for these novel tasks. The framework encompasses many existing as well as new models. It includes regression for ‘generalized’ functional data, mean regression, quantile regression as well as generalized additive models for location, shape and scale (GAMLSS) for functional data. It admits many flexible linear, smooth or interaction terms of scalar and functional covariates as well as (functional) random effects and allows flexible choices of bases—particularly splines and functional principal components—and corresponding penalties for each term. It covers functional data observed on common (dense) or curve-specific (sparse) grids. Penalized-likelihood-based and gradient-boosting-based inference for these models are implemented in R packages refund and FDboost , respectively. We also discuss identifiability and computational complexity for the functional regression models covered. A running example on a longitudinal multiple sclerosis imaging study serves to illustrate the flexibility and utility of the proposed model class. Reproducible code for this case study is made available online.
Collapse
Affiliation(s)
- Sonja Greven
- Department of Statistics, Ludwig-Maximilians-Universität München, Germany
| | - Fabian Scheipl
- Department of Statistics, Ludwig-Maximilians-Universität München, Germany
| |
Collapse
|
19
|
Tavakoli S, Panaretos VM. Detecting and Localizing Differences in Functional Time Series Dynamics: A Case Study in Molecular Biophysics. J Am Stat Assoc 2016. [DOI: 10.1080/01621459.2016.1147355] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
Affiliation(s)
- Shahin Tavakoli
- Statistical Laboratory, University of Cambridge, Cambridge, UK
| | - Victor M. Panaretos
- Institut de Mathématiques, Ecole Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
| |
Collapse
|
20
|
Park J, Ahn J. Clustering multivariate functional data with phase variation. Biometrics 2016; 73:324-333. [PMID: 27218696 DOI: 10.1111/biom.12546] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2015] [Revised: 03/01/2016] [Accepted: 04/01/2016] [Indexed: 11/27/2022]
Abstract
When functional data come as multiple curves per subject, characterizing the source of variations is not a trivial problem. The complexity of the problem goes deeper when there is phase variation in addition to amplitude variation. We consider clustering problem with multivariate functional data that have phase variations among the functional variables. We propose a conditional subject-specific warping framework in order to extract relevant features for clustering. Using multivariate growth curves of various parts of the body as a motivating example, we demonstrate the effectiveness of the proposed approach. The found clusters have individuals who show different relative growth patterns among different parts of the body.
Collapse
Affiliation(s)
- Juhyun Park
- Department of Mathematics and Statistics, Lancaster University, Lancaster LA1 4YF, UK
| | - Jeongyoun Ahn
- Department of Statistics, University of Georgia, Athens, Georgia 30602-1952, U.S.A
| |
Collapse
|
21
|
Marron JS, Ramsay JO, Sangalli LM, Srivastava A. Functional Data Analysis of Amplitude and Phase Variation. Stat Sci 2015. [DOI: 10.1214/15-sts524] [Citation(s) in RCA: 81] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|