For: He Q, Zhang HH, Avery CL, Lin DY. Sparse meta-analysis with high-dimensional data. Biostatistics 2015;17:205-20. [PMID: 26395907 DOI: 10.1093/biostatistics/kxv038] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Received: 03/04/2015] [Accepted: 08/31/2015] [Indexed: 01/10/2023]

Cited by Other Article(s)

Ye X, Yang S, Wang X, Liu Y. Integrative analysis of high-dimensional RCT and RWD subject to censoring and hidden confounding. Lifetime Data Analysis 2025. [PMID: 40301269 DOI: 10.1007/s10985-025-09654-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/26/2024] [Accepted: 03/31/2025] [Indexed: 05/01/2025]

Chang C, Bu Z, Long Q. CEDAR: communication efficient distributed analysis for regressions. Biometrics 2023;79:2357-2369. [PMID: 36305019 PMCID: PMC10133408 DOI: 10.1111/biom.13786] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Received: 08/02/2021] [Accepted: 10/05/2022] [Indexed: 11/27/2022]

Liu Y, Sun W, Hsu L, He Q. Statistical inference for high-dimensional pathway analysis with multiple responses. Comput Stat Data Anal 2022;169. [PMID: 35125572 PMCID: PMC8813039 DOI: 10.1016/j.csda.2021.107418] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/03/2022]

Hu Z, Zhou Y, Tong T. Meta-Analyzing Multiple Omics Data With Robust Variable Selection. Front Genet 2021;12:656826. [PMID: 34290735 PMCID: PMC8288516 DOI: 10.3389/fgene.2021.656826] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/21/2021] [Accepted: 05/24/2021] [Indexed: 12/03/2022]

Cai T, Liu M, Xia Y. Individual Data Protected Integrative Regression Analysis of High-Dimensional Heterogeneous Data. J Am Stat Assoc 2021;117:2105-2119. [PMID: 37975021 PMCID: PMC10653033 DOI: 10.1080/01621459.2021.1904958] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Received: 07/31/2019] [Revised: 03/05/2021] [Accepted: 03/13/2021] [Indexed: 01/29/2023]

Sheng Y, Sun Y, Huang CY, Kim MO. Synthesizing external aggregated information in the presence of population heterogeneity: A penalized empirical likelihood approach. Biometrics 2021;78:679-690. [PMID: 33528028 DOI: 10.1111/biom.13429] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Received: 04/14/2020] [Revised: 08/23/2020] [Accepted: 12/31/2020] [Indexed: 01/04/2023]

Hong C, Wang Y, Cai T. A divide-and-conquer method for sparse risk prediction and evaluation. Biostatistics 2020;23:397-411. [PMID: 32909599 DOI: 10.1093/biostatistics/kxaa031] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Received: 08/12/2019] [Revised: 07/06/2020] [Accepted: 07/10/2020] [Indexed: 11/12/2022]

Abstract

Divide-and-conquer (DAC) is a commonly used strategy to overcome the challenges of extraordinarily large data by first breaking the dataset into a series of data blocks and then combining results from the individual blocks to obtain a final estimate. Various DAC algorithms have been proposed to fit a sparse predictive regression model in the $L_1$ regularization setting. However, many existing DAC algorithms remain computationally intensive when the sample size and the number of candidate predictors are both large. In addition, no existing DAC procedure provides inference for quantifying the accuracy of risk prediction models. In this article, we propose a screening and one-step linearization infused DAC (SOLID) algorithm to fit sparse logistic regression to massive datasets, by integrating the DAC strategy with a screening step and sequences of linearization. This enables us to maximize the likelihood with only selected covariates and perform penalized estimation via a fast approximation to the likelihood. To assess the accuracy of a predictive regression model, we develop a modified cross-validation (MCV) that utilizes the side products of SOLID, substantially reducing the computational burden. Compared with existing DAC methods, the MCV procedure is the first to make inference on accuracy. Extensive simulation studies suggest that the proposed SOLID and MCV procedures substantially outperform the existing methods with respect to computational speed and achieve statistical efficiency similar to that of the full sample-based estimator. We also demonstrate that the proposed inference procedure provides valid interval estimators. We apply the proposed SOLID procedure to develop and validate a classification model for disease diagnosis using narrative clinical notes based on electronic medical record data from Partners HealthCare.

Yang T, Kim J, Wu C, Ma Y, Wei P, Pan W. An adaptive test for meta-analysis of rare variant association studies. Genet Epidemiol 2020;44:104-116. [PMID: 31830326 PMCID: PMC6980317 DOI: 10.1002/gepi.22273] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Received: 08/15/2019] [Revised: 11/12/2019] [Accepted: 11/25/2019] [Indexed: 01/02/2023]

Liu Y, Sun W, Reiner AP, Kooperberg C, He Q. Statistical inference of genetic pathway analysis in high dimensions. Biometrika 2019;106:651. [PMID: 31427824 DOI: 10.1093/biomet/asz033] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Received: 07/03/2017] [Indexed: 11/13/2022]

Silva-Fernández L, Carmona L. Meta-analysis in the era of big data. Clin Rheumatol 2019;38:2027-2028. [PMID: 31273634 DOI: 10.1007/s10067-019-04666-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Received: 06/20/2019] [Revised: 06/20/2019] [Accepted: 06/26/2019] [Indexed: 02/07/2023]

el Bouhaddani S, Uh HW, Hayward C, Jongbloed G, Houwing-Duistermaat J. Probabilistic partial least squares model: Identifiability, estimation and application. J Multivariate Anal 2018. [DOI: 10.1016/j.jmva.2018.05.009] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Indexed: 11/29/2022]