1
|
Liu T, Yuan S, Zhang L, Zhang D. Hepatotoxicity Evaluation of Levornidazole and Its Three Main Impurities: Based on Structure-Toxicity Classification Prediction Combined with Zebrafish Toxicity Assessment. Molecules 2025; 30:995. [PMID: 40076220 PMCID: PMC11901814 DOI: 10.3390/molecules30050995] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2024] [Revised: 02/14/2025] [Accepted: 02/19/2025] [Indexed: 03/14/2025] Open
Abstract
Levornidazole, a nitroimidazole compound, has been linked to hepatotoxic adverse effects in clinical settings. However, the hepatotoxicity of levornidazole and its impurities has not been fully elucidated. This study aimed to predict and evaluate the potential hepatotoxicity of levornidazole, and elucidate the underlying mechanisms of action. Computational models based on support vector machines (SVM) and artificial neural networks (ANN) predicted that levornidazole, ornidazole, and impurity II exhibited hepatotoxic effects. The hepatotoxicity of levornidazole and impurity II was confirmed using a zebrafish toxicity study, with impurity II demonstrating hepatotoxicity at lower doses. Molecular structure analysis revealed that the electronegativity of the side-chain groups and the molecular polarity structure were correlated with the degree of hepatotoxicity. The toxic response was primarily associated with specific structural domains of the molecule, including the 2-methyl-5-nitro-1H-imiddaster-1-yl structure and the substituent groups of 1-chloro and 2(S)-2-methyloxirane. Transcriptome sequencing analysis indicated that levornidazole and impurity II affect multiple metabolic processes in the liver, including glucose, lipid, protein, hormone, and drug metabolism. These findings highlight the potential hepatotoxic risks associated with levomeprazole and its impurities, emphasizing the importance of further investigation and regulatory attention to ensure patient safety.
Collapse
Affiliation(s)
- Ting Liu
- Institute for Food Control, National Institutes for Food and Drug Control, Beijing 102629, China; (T.L.); (L.Z.)
| | - Song Yuan
- Institute for Drug Control, National Institutes for Food and Drug Control, Beijing 102629, China;
| | - Luyong Zhang
- Institute for Food Control, National Institutes for Food and Drug Control, Beijing 102629, China; (T.L.); (L.Z.)
| | - Dousheng Zhang
- Institute for Drug Control, National Institutes for Food and Drug Control, Beijing 102629, China;
| |
Collapse
|
2
|
Chin JL, Tan ZC, Chan LC, Ruffin F, Parmar R, Ahn R, Taylor SD, Bayer AS, Hoffmann A, Fowler VG, Reed EF, Yeaman MR, Meyer AS. Tensor modeling of MRSA bacteremia cytokine and transcriptional patterns reveals coordinated, outcome-associated immunological programs. PNAS NEXUS 2024; 3:pgae185. [PMID: 38779114 PMCID: PMC11109816 DOI: 10.1093/pnasnexus/pgae185] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/23/2023] [Accepted: 04/17/2024] [Indexed: 05/25/2024]
Abstract
Methicillin-resistant Staphylococcus aureus (MRSA) bacteremia is a common and life-threatening infection that imposes up to 30% mortality even when appropriate therapy is used. Despite in vitro efficacy determined by minimum inhibitory concentration breakpoints, antibiotics often fail to resolve these infections in vivo, resulting in persistent MRSA bacteremia. Recently, several genetic, epigenetic, and proteomic correlates of persistent outcomes have been identified. However, the extent to which single variables or their composite patterns operate as independent predictors of outcome or reflect shared underlying mechanisms of persistence is unknown. To explore this question, we employed a tensor-based integration of host transcriptional and cytokine datasets across a well-characterized cohort of patients with persistent or resolving MRSA bacteremia outcomes. This method yielded high correlative accuracy with outcomes and immunologic signatures united by transcriptomic and cytokine datasets. Results reveal that patients with persistent MRSA bacteremia (PB) exhibit signals of granulocyte dysfunction, suppressed antigen presentation, and deviated lymphocyte polarization. In contrast, patients with resolving bacteremia (RB) heterogeneously exhibit correlates of robust antigen-presenting cell trafficking and enhanced neutrophil maturation corresponding to appropriate T lymphocyte polarization and B lymphocyte response. These results suggest that transcriptional and cytokine correlates of PB vs. RB outcomes are complex and may not be disclosed by conventional modeling. In this respect, a tensor-based integration approach may help to reveal consensus molecular and cellular mechanisms and their biological interpretation.
Collapse
Affiliation(s)
- Jackson L Chin
- Department of Bioengineering, University of California, Los Angeles, Los Angeles, CA 90024, USA
| | - Zhixin Cyrillus Tan
- Bioinformatics Interdepartmental Program, University of California, Los Angeles, Los Angeles, CA 90024, USA
| | - Liana C Chan
- The Lundquist Institute for Biomedical Innovation, Harbor-UCLA Medical Center, Torrance, CA 90502, USA
- Department of Medicine, David Geffen School of Medicine, University of California, Los Angeles, CA 90095, USA
- Division of Infectious Diseases, Department of Medicine, Harbor-UCLA Medical Center, Torrance, CA 90502, USA
- Division of Molecular Medicine, Department of Medicine, Harbor-UCLA Medical Center, Torrance, CA 90502, USA
| | - Felicia Ruffin
- Division of Infectious Diseases, Duke University School of Medicine, Durham, NC 27710, USA
| | - Rajesh Parmar
- Department of Pathology and Laboratory Medicine, University of California, Los Angeles, Los Angeles, CA 90095, USA
| | - Richard Ahn
- Institute for Quantitative and Computational Biosciences, David Geffen School of Medicine at UCLA, Los Angeles, CA 90095, USA
| | - Scott D Taylor
- Department of Bioengineering, University of California, Los Angeles, Los Angeles, CA 90024, USA
| | - Arnold S Bayer
- The Lundquist Institute for Biomedical Innovation, Harbor-UCLA Medical Center, Torrance, CA 90502, USA
- Department of Medicine, David Geffen School of Medicine, University of California, Los Angeles, CA 90095, USA
| | - Alexander Hoffmann
- Institute for Quantitative and Computational Biosciences, David Geffen School of Medicine at UCLA, Los Angeles, CA 90095, USA
| | - Vance G Fowler
- Division of Infectious Diseases, Duke University School of Medicine, Durham, NC 27710, USA
| | - Elaine F Reed
- Department of Pathology and Laboratory Medicine, University of California, Los Angeles, Los Angeles, CA 90095, USA
| | - Michael R Yeaman
- The Lundquist Institute for Biomedical Innovation, Harbor-UCLA Medical Center, Torrance, CA 90502, USA
- Department of Medicine, David Geffen School of Medicine, University of California, Los Angeles, CA 90095, USA
- Division of Infectious Diseases, Department of Medicine, Harbor-UCLA Medical Center, Torrance, CA 90502, USA
- Division of Molecular Medicine, Department of Medicine, Harbor-UCLA Medical Center, Torrance, CA 90502, USA
- Division of Infectious Diseases, Duke University School of Medicine, Durham, NC 27710, USA
| | - Aaron S Meyer
- Department of Bioengineering, University of California, Los Angeles, Los Angeles, CA 90024, USA
- Bioinformatics Interdepartmental Program, University of California, Los Angeles, Los Angeles, CA 90024, USA
- Jonsson Comprehensive Cancer Center, University of California, Los Angeles, Los Angeles, CA 90024, USA
- Eli and Edythe Broad Center of Regenerative Medicine and Stem Cell Research, University of California, Los Angeles, Los Angeles, CA 90024, USA
| |
Collapse
|
3
|
Su Z, Li B, Cook D. Envelope model for function-on-function linear regression. J Comput Graph Stat 2023. [DOI: 10.1080/10618600.2022.2163652] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Affiliation(s)
- Zhihua Su
- Department of Statistics, University of Florida
| | - Bing Li
- Department of Statistics, Pennsylvania State University
| | - Dennis Cook
- School of Statistics, University of Minnesota
| |
Collapse
|
4
|
Zhang J, Sun WW, Li L. Generalized Connectivity Matrix Response Regression with Applications in Brain Connectivity Studies. J Comput Graph Stat 2022; 32:252-262. [PMID: 36970553 PMCID: PMC10035565 DOI: 10.1080/10618600.2022.2074434] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2021] [Accepted: 04/23/2022] [Indexed: 10/18/2022]
Abstract
Multiple-subject network data are fast emerging in recent years, where a separate connectivity matrix is measured over a common set of nodes for each individual subject, along with subject covariates information. In this article, we propose a new generalized matrix response regression model, where the observed network is treated as a matrix-valued response and the subject covariates as predictors. The new model characterizes the population-level connectivity pattern through a low-rank intercept matrix, and the effect of subject covariates through a sparse slope tensor. We develop an efficient alternating gradient descent algorithm for parameter estimation, and establish the non-asymptotic error bound for the actual estimator from the algorithm, which quantifies the interplay between the computational and statistical errors. We further show the strong consistency for graph community recovery, as well as the edge selection consistency. We demonstrate the efficacy of our method through simulations and two brain connectivity studies.
Collapse
Affiliation(s)
- Jingfei Zhang
- Department of Management Science, Miami Herbert Business School, University of Miami, Miami, FL, 33146
| | - Will Wei Sun
- Krannert School of Management, Purdue University, West Lafayette, IN, 47906
| | - Lexin Li
- Department of Biostatistics and Epidemiology, School of Public Health, University of California at Berkeley, Berkeley, CA, 94720
| |
Collapse
|
5
|
Yan P, Zhang C, Mei K, Chen F, Wang Y. Research on fault diagnosis of transformer based on laser induced fluorescence technology. J Mol Struct 2022. [DOI: 10.1016/j.molstruc.2022.132645] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
|
6
|
Ghannam M, Nkurunziza S. The risk of tensor Stein-rules in elliptically contoured distributions. STATISTICS-ABINGDON 2022. [DOI: 10.1080/02331888.2022.2051508] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Affiliation(s)
- Mai Ghannam
- Mathematics and Statistics, University of Windsor Faculty of Science, Windsor, Ontario, Canada
| | - Sévérien Nkurunziza
- Mathematics and Statistics, University of Windsor Faculty of Science, Windsor, Ontario, Canada
| |
Collapse
|
7
|
Ghannam M, Nkurunziza S. Improved estimation in tensor regression with multiple change-points. Electron J Stat 2022. [DOI: 10.1214/22-ejs2035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Affiliation(s)
- Mai Ghannam
- University of Windsor, Mathematics and Statistics department 401 Sunset Avenue, Windsor, ON, N9B 3P4
| | - Sévérien Nkurunziza
- University of Windsor, Mathematics and Statistics department 401 Sunset Avenue, Windsor, ON, N9B 3P4
| |
Collapse
|
8
|
Min K, Mai Q. A general framework for tensor screening through smoothing. Electron J Stat 2022. [DOI: 10.1214/21-ejs1954] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Affiliation(s)
- Keqian Min
- Department of Statistics, Florida State University, Tallahassee, Florida 32306, U.S.A
| | - Qing Mai
- Department of Statistics, Florida State University, Tallahassee, Florida 32306, U.S.A
| |
Collapse
|
9
|
Zhang J, Huang Z. Efficient simultaneous partial envelope model in multivariate linear regression. J STAT COMPUT SIM 2021. [DOI: 10.1080/00949655.2021.1995866] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
Affiliation(s)
- Jing Zhang
- School of Science, Nanjing University of Science and Technology, Nanjing, Jiangsu, People's Republic of China
- School of Mathematics and Finance, Chuzhou University, Chuzhou, Anhui, People's Republic of China
| | - Zhensheng Huang
- School of Science, Nanjing University of Science and Technology, Nanjing, Jiangsu, People's Republic of China
| |
Collapse
|
10
|
Tan ZC, Murphy MC, Alpay HS, Taylor SD, Meyer AS. Tensor-structured decomposition improves systems serology analysis. Mol Syst Biol 2021; 17:e10243. [PMID: 34487431 PMCID: PMC8420856 DOI: 10.15252/msb.202110243] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2021] [Revised: 08/12/2021] [Accepted: 08/16/2021] [Indexed: 01/04/2023] Open
Abstract
Systems serology provides a broad view of humoral immunity by profiling both the antigen-binding and Fc properties of antibodies. These studies contain structured biophysical profiling across disease-relevant antigen targets, alongside additional measurements made for single antigens or in an antigen-generic manner. Identifying patterns in these measurements helps guide vaccine and therapeutic antibody development, improve our understanding of diseases, and discover conserved regulatory mechanisms. Here, we report that coupled matrix-tensor factorization (CMTF) can reduce these data into consistent patterns by recognizing the intrinsic structure of these data. We use measurements from two previous studies of HIV- and SARS-CoV-2-infected subjects as examples. CMTF outperforms standard methods like principal components analysis in the extent of data reduction while maintaining equivalent prediction of immune functional responses and disease status. Under CMTF, model interpretation improves through effective data reduction, separation of the Fc and antigen-binding effects, and recognition of consistent patterns across individual measurements. Data reduction also helps make prediction models more replicable. Therefore, we propose that CMTF is an effective general strategy for data exploration in systems serology.
Collapse
Affiliation(s)
- Zhixin Cyrillus Tan
- Bioinformatics Interdepartmental ProgramUniversity of California, Los AngelesLos AngelesCAUSA
| | - Madeleine C Murphy
- Computational and Systems BiologyUniversity of California, Los AngelesLos AngelesCAUSA
| | - Hakan S Alpay
- Department of Computer ScienceUniversity of California, Los AngelesLos AngelesCAUSA
| | - Scott D Taylor
- Department of BioengineeringUniversity of California, Los AngelesLos AngelesCAUSA
| | - Aaron S Meyer
- Bioinformatics Interdepartmental ProgramUniversity of California, Los AngelesLos AngelesCAUSA
- Department of BioengineeringUniversity of California, Los AngelesLos AngelesCAUSA
- Jonsson Comprehensive Cancer CenterUniversity of California, Los AngelesLos AngelesCAUSA
- Eli and Edythe Broad Center of Regenerative Medicine and Stem Cell ResearchUniversity of California, Los AngelesLos AngelesCAUSA
| |
Collapse
|
11
|
Deng K, Zhang X. Tensor envelope mixture model for simultaneous clustering and multiway dimension reduction. Biometrics 2021; 78:1067-1079. [PMID: 34010459 DOI: 10.1111/biom.13486] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2020] [Revised: 04/28/2021] [Accepted: 05/06/2021] [Indexed: 11/26/2022]
Abstract
In the form of multidimensional arrays, tensor data have become increasingly prevalent in modern scientific studies and biomedical applications such as computational biology, brain imaging analysis, and process monitoring system. These data are intrinsically heterogeneous with complex dependencies and structure. Therefore, ad-hoc dimension reduction methods on tensor data may lack statistical efficiency and can obscure essential findings. Model-based clustering is a cornerstone of multivariate statistics and unsupervised learning; however, existing methods and algorithms are not designed for tensor-variate samples. In this article, we propose a tensor envelope mixture model (TEMM) for simultaneous clustering and multiway dimension reduction of tensor data. TEMM incorporates tensor-structure-preserving dimension reduction into mixture modeling and drastically reduces the number of free parameters and estimative variability. An expectation-maximization-type algorithm is developed to obtain likelihood-based estimators of the cluster means and covariances, which are jointly parameterized and constrained onto a series of lower dimensional subspaces known as the tensor envelopes. We demonstrate the encouraging empirical performance of the proposed method in extensive simulation studies and a real data application in comparison with existing vector and tensor clustering methods.
Collapse
Affiliation(s)
- Kai Deng
- Department of Statistics, Florida State University, Tallahassee, Florida, USA
| | - Xin Zhang
- Department of Statistics, Florida State University, Tallahassee, Florida, USA
| |
Collapse
|
12
|
Zhang J, Huang Z, Jiang Z. Groupwise partial envelope model: efficient estimation in multivariate linear regression. COMMUN STAT-SIMUL C 2021. [DOI: 10.1080/03610918.2021.1921800] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
Affiliation(s)
- Jing Zhang
- School of Science, Nanjing University of Science and Technology, Nanjing, Jiangsu, P. R. China
- School of Mathematics and Finance, Chuzhou University, Chuzhou, Anhui, P. R. China
| | - Zhensheng Huang
- School of Science, Nanjing University of Science and Technology, Nanjing, Jiangsu, P. R. China
| | - Zhiqiang Jiang
- School of Science, Nanjing University of Science and Technology, Nanjing, Jiangsu, P. R. China
| |
Collapse
|
13
|
Mai Q, Zhang X, Pan Y, Deng K. A Doubly Enhanced EM Algorithm for Model-Based Tensor Clustering. J Am Stat Assoc 2021. [DOI: 10.1080/01621459.2021.1904959] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
Affiliation(s)
- Qing Mai
- Department of Statistics, Florida State University, Tallahassee, FL
| | - Xin Zhang
- Department of Statistics, Florida State University, Tallahassee, FL
| | - Yuqing Pan
- Department of Statistics, Florida State University, Tallahassee, FL
| | - Kai Deng
- Department of Statistics, Florida State University, Tallahassee, FL
| |
Collapse
|
14
|
Liu Y, Liu J, Zhu C. Low-Rank Tensor Train Coefficient Array Estimation for Tensor-on-Tensor Regression. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2020; 31:5402-5411. [PMID: 32054589 DOI: 10.1109/tnnls.2020.2967022] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
The tensor-on-tensor regression can predict a tensor from a tensor, which generalizes most previous multilinear regression approaches, including methods to predict a scalar from a tensor, and a tensor from a scalar. However, the coefficient array could be much higher dimensional due to both high-order predictors and responses in this generalized way. Compared with the current low CANDECOMP/PARAFAC (CP) rank approximation-based method, the low tensor train (TT) approximation can further improve the stability and efficiency of the high or even ultrahigh-dimensional coefficient array estimation. In the proposed low TT rank coefficient array estimation for tensor-on-tensor regression, we adopt a TT rounding procedure to obtain adaptive ranks, instead of selecting ranks by experience. Besides, an l2 constraint is imposed to avoid overfitting. The hierarchical alternating least square is used to solve the optimization problem. Numerical experiments on a synthetic data set and two real-life data sets demonstrate that the proposed method outperforms the state-of-the-art methods in terms of prediction accuracy with comparable computational complexity, and the proposed method is more computationally efficient when the data are high dimensional with small size in each mode.
Collapse
|
15
|
Abstract
Biclustering is an important exploratory analysis tool that simultaneously clusters rows (e.g., samples) and columns (e.g., variables) of a data matrix. Checkerboard-like biclusters reveal intrinsic associations between rows and columns. However, most existing methods rely on Gaussian assumptions and only apply to matrix data. In practice, non-Gaussian and/or multi-way tensor data are frequently encountered. A new CO-clustering method via Regularized Alternating Least Squares (CORALS) is proposed, which generalizes biclustering to non-Gaussian data and multi-way tensor arrays. Non-Gaussian data are modeled with single-parameter exponential family distributions and co-clusters are identified in the natural parameter space via sparse CANDECOMP/PARAFAC tensor decomposition. A regularized alternating (iteratively reweighted) least squares algorithm is devised for model fitting and a deflation procedure is exploited to automatically determine the number of co-clusters. Comprehensive simulation studies and three real data examples demonstrate the efficacy of the proposed method. The data and code are publicly available.
Collapse
Affiliation(s)
- Gen Li
- Department of Biostatistics, Columbia University. New York, NY 10032
| |
Collapse
|
16
|
Hu W, Pan T, Kong D, Shen W. Nonparametric matrix response regression with application to brain imaging data analysis. Biometrics 2020; 77:1227-1240. [PMID: 32869275 DOI: 10.1111/biom.13362] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2020] [Revised: 07/19/2020] [Accepted: 08/20/2020] [Indexed: 11/26/2022]
Abstract
With the rapid growth of neuroimaging technologies, a great effort has been dedicated recently to investigate the dynamic changes in brain activity. Examples include time course calcium imaging and dynamic brain functional connectivity. In this paper, we propose a novel nonparametric matrix response regression model to characterize the nonlinear association between 2D image outcomes and predictors such as time and patient information. Our estimation procedure can be formulated as a nuclear norm regularization problem, which can capture the underlying low-rank structure of the dynamic 2D images. We present a computationally efficient algorithm, derive the asymptotic theory, and show that the method outperforms other existing approaches in simulations. We then apply the proposed method to a calcium imaging study for estimating the change of fluorescent intensities of neurons, and an electroencephalography study for a comparison in the dynamic connectivity covariance matrices between alcoholic and control individuals. For both studies, the method leads to a substantial improvement in prediction error.
Collapse
Affiliation(s)
- Wei Hu
- Department of Statistics, University of California, Irvine, California
| | - Tianyu Pan
- Department of Statistics, University of California, Irvine, California
| | - Dehan Kong
- Department of Statistical Sciences, University of Toronto, Canada
| | - Weining Shen
- Department of Statistics, University of California, Irvine, California
| |
Collapse
|
17
|
Pfeiffer RM, Kapla DB, Bura E. Least squares and maximum likelihood estimation of sufficient reductions in regressions with matrix-valued predictors. INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS 2020; 11:11-26. [PMID: 33553594 PMCID: PMC7840662 DOI: 10.1007/s41060-020-00228-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2020] [Accepted: 07/16/2020] [Indexed: 11/24/2022]
Abstract
We propose methods to estimate sufficient reductions in matrix-valued predictors for regression or classification. We assume that the first moment of the predictor matrix given the response can be decomposed into a row and column component via a Kronecker product structure. We obtain least squares and maximum likelihood estimates of the sufficient reductions in the matrix predictors, derive statistical properties of the resulting estimates and present fast computational algorithms with assured convergence. The performance of the proposed approaches in regression and classification is compared in simulations.We illustrate the methods on two examples, using longitudinally measured serum biomarker and neuroimaging data.
Collapse
Affiliation(s)
- Ruth M. Pfeiffer
- Biostatistics Branch, DCEG, National Cancer Institute, NIH, Bethesda, USA
| | - Daniel B. Kapla
- Faculty of Mathematics, Institute of Statistics and Mathematical Methods in Economics, TU Wien, Vienna, Austria
| | - Efstathia Bura
- Faculty of Mathematics, Institute of Statistics and Mathematical Methods in Economics, TU Wien, Vienna, Austria
| |
Collapse
|
18
|
Zhang X, Lee CE, Shao X. Envelopes in multivariate regression models with nonlinearity and heteroscedasticity. Biometrika 2020. [DOI: 10.1093/biomet/asaa036] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Summary
Envelopes have been proposed in recent years as a nascent methodology for sufficient dimension reduction and efficient parameter estimation in multivariate linear models. We extend the classical definition of envelopes in Cook et al. (2010) to incorporate a nonlinear conditional mean function and a heteroscedastic error. Given any two random vectors ${X}\in\mathbb{R}^{p}$ and ${Y}\in\mathbb{R}^{r}$, we propose two new model-free envelopes, called the martingale difference divergence envelope and the central mean envelope, and study their relationships to the standard envelope in the context of response reduction in multivariate linear models. The martingale difference divergence envelope effectively captures the nonlinearity in the conditional mean without imposing any parametric structure or requiring any tuning in estimation. Heteroscedasticity, or nonconstant conditional covariance of ${Y}\mid{X}$, is further detected by the central mean envelope based on a slicing scheme for the data. We reveal the nested structure of different envelopes: (i) the central mean envelope contains the martingale difference divergence envelope, with equality when ${Y}\mid{X}$ has a constant conditional covariance; and (ii) the martingale difference divergence envelope contains the standard envelope, with equality when ${Y}\mid{X}$ has a linear conditional mean. We develop an estimation procedure that first obtains the martingale difference divergence envelope and then estimates the additional envelope components in the central mean envelope. We establish consistency in envelope estimation of the martingale difference divergence envelope and central mean envelope without stringent model assumptions. Simulations and real-data analysis demonstrate the advantages of the martingale difference divergence envelope and the central mean envelope over the standard envelope in dimension reduction.
Collapse
Affiliation(s)
- X Zhang
- Department of Statistics, Florida State University, 117 N.Woodward Ave., Tallahassee, Florida 32306, U.S.A
| | - C E Lee
- Department of Business Analytics and Statistics, University of Tennessee, Knoxville, 916 Volunteer Blvd, Knoxville, Tennessee 37996, U.S.A
| | - X Shao
- Department of Statistics, University of Illinois at Urbana Champaign, 725 South Wright St, Champaign, Illinois 61820, U.S.A
| |
Collapse
|
19
|
Jiang B, Petkova E, Tarpey T, Ogden RT. A Bayesian approach to joint modeling of matrix-valued imaging data and treatment outcome with applications to depression studies. Biometrics 2020; 76:87-97. [PMID: 31529701 PMCID: PMC7067625 DOI: 10.1111/biom.13151] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2018] [Accepted: 08/28/2019] [Indexed: 11/28/2022]
Abstract
In this paper, we propose a unified Bayesian joint modeling framework for studying association between a binary treatment outcome and a baseline matrix-valued predictor. Specifically, a joint modeling approach relating an outcome to a matrix-valued predictor through a probabilistic formulation of multilinear principal component analysis is developed. This framework establishes a theoretical relationship between the outcome and the matrix-valued predictor, although the predictor is not explicitly expressed in the model. Simulation studies are provided showing that the proposed method is superior or competitive to other methods, such as a two-stage approach and a classical principal component regression in terms of both prediction accuracy and estimation of association; its advantage is most notable when the sample size is small and the dimensionality in the imaging covariate is large. Finally, our proposed joint modeling approach is shown to be a very promising tool in an application exploring the association between baseline electroencephalography data and a favorable response to treatment in a depression treatment study by achieving a substantial improvement in prediction accuracy in comparison to competing methods.
Collapse
Affiliation(s)
- Bei Jiang
- Department of Mathematical and Statistical Sciences, University of Alberta, Edmonton, AB T6G 2E8, Canada
| | - Eva Petkova
- Department of Population Health, New York University, New York, NY 10016, USA
- Department of Child and Adolescent Psychiatry, New York University, New York, NY 10016, USA
| | - Thaddeus Tarpey
- Department of Population Health, New York University, New York, NY 10016, USA
| | - R. Todd Ogden
- Department of Biostatistics, Columbia University, New York, NY 10032, USA
| |
Collapse
|
20
|
Affiliation(s)
- Minji Lee
- Department of Statistics University of Florida Gainesville Florida USA
| | - Zhihua Su
- Department of Statistics University of Florida Gainesville Florida USA
| |
Collapse
|
21
|
|
22
|
Affiliation(s)
- Yuqing Pan
- Department of Statistics, Florida State University, Tallahassee, FL
| | - Qing Mai
- Department of Statistics, Florida State University, Tallahassee, FL
| | - Xin Zhang
- Department of Statistics, Florida State University, Tallahassee, FL
| |
Collapse
|
23
|
Zhang X, Mai Q. Efficient Integration of Sufficient Dimension Reduction and Prediction in Discriminant Analysis. Technometrics 2018. [DOI: 10.1080/00401706.2018.1512901] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
Affiliation(s)
- Xin Zhang
- Department of Statistics, Florida State University, Tallahassee, FL
| | - Qing Mai
- Department of Statistics, Florida State University, Tallahassee, FL
| |
Collapse
|
24
|
Zhang X, Wang C, Wu Y. Functional envelope for model-free sufficient dimension reduction. J MULTIVARIATE ANAL 2018. [DOI: 10.1016/j.jmva.2017.09.010] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
25
|
|
26
|
Eliseyev A, Auboiroux V, Costecalde T, Langar L, Charvet G, Mestais C, Aksenova T, Benabid AL. Recursive Exponentially Weighted N-way Partial Least Squares Regression with Recursive-Validation of Hyper-Parameters in Brain-Computer Interface Applications. Sci Rep 2017; 7:16281. [PMID: 29176638 PMCID: PMC5701264 DOI: 10.1038/s41598-017-16579-9] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2017] [Accepted: 11/14/2017] [Indexed: 12/02/2022] Open
Abstract
A tensor-input/tensor-output Recursive Exponentially Weighted N-Way Partial Least Squares (REW-NPLS) regression algorithm is proposed for high dimension multi-way (tensor) data treatment and adaptive modeling of complex processes in real-time. The method unites fast and efficient calculation schemes of the Recursive Exponentially Weighted PLS with the robustness of tensor-based approaches. Moreover, contrary to other multi-way recursive algorithms, no loss of information occurs in the REW-NPLS. In addition, the Recursive-Validation method for recursive estimation of the hyper-parameters is proposed instead of conventional cross-validation procedure. The approach was then compared to state-of-the-art methods. The efficiency of the methods was tested in electrocorticography (ECoG) and magnetoencephalography (MEG) datasets. The algorithms are implemented in software suitable for real-time operation. Although the Brain-Computer Interface applications are used to demonstrate the methods, the proposed approaches could be efficiently used in a wide range of tasks beyond neuroscience uniting complex multi-modal data structures, adaptive modeling, and real-time computational requirements.
Collapse
Affiliation(s)
- Andrey Eliseyev
- Univ. Grenoble Alpes, CEA, LETI, CLINATEC, MINATEC Campus, 38000, Grenoble, France.
| | - Vincent Auboiroux
- Univ. Grenoble Alpes, CEA, LETI, CLINATEC, MINATEC Campus, 38000, Grenoble, France
| | - Thomas Costecalde
- Univ. Grenoble Alpes, CEA, LETI, CLINATEC, MINATEC Campus, 38000, Grenoble, France
| | - Lilia Langar
- Centre Hospitalier Universitaire Grenoble Alpes, 38700, La Tronche, France
| | - Guillaume Charvet
- Univ. Grenoble Alpes, CEA, LETI, CLINATEC, MINATEC Campus, 38000, Grenoble, France
| | - Corinne Mestais
- Univ. Grenoble Alpes, CEA, LETI, CLINATEC, MINATEC Campus, 38000, Grenoble, France
| | - Tetiana Aksenova
- Univ. Grenoble Alpes, CEA, LETI, CLINATEC, MINATEC Campus, 38000, Grenoble, France
| | - Alim-Louis Benabid
- Univ. Grenoble Alpes, CEA, LETI, CLINATEC, MINATEC Campus, 38000, Grenoble, France
| |
Collapse
|