1. Ma TF, Wang F, Zhu J. On generalized latent factor modeling and inference for high-dimensional binomial data. Biometrics 2023; 79:2311-2320. [PMID: 36200926] [DOI: 10.1111/biom.13768]
Abstract
We explore a hierarchical generalized latent factor model for discrete and bounded response variables, in particular binomial responses. Specifically, we develop a novel two-step estimation procedure, and the corresponding statistical inference, that is computationally efficient and scalable to high dimensions in both the number of subjects and the number of features per subject. We also establish the validity of the estimation procedure, particularly the asymptotic properties of the estimated effect size and latent structure, as well as the estimated number of latent factors. The results are corroborated by a simulation study, and for illustration the proposed methodology is applied to analyze a dataset from a gene-environment association study.
Affiliation(s)
- Ting Fung Ma
- Department of Statistics, University of South Carolina, Columbia, South Carolina, USA
- Fangfang Wang
- Department of Mathematical Sciences, Worcester Polytechnic Institute, Worcester, Massachusetts, USA
- Jun Zhu
- Department of Statistics, University of Wisconsin-Madison, Madison, Wisconsin, USA
2. Ye H, Zhang X, Wang C, Goode EL, Chen J. Batch-effect correction with sample remeasurement in highly confounded case-control studies. Nat Comput Sci 2023; 3:709-719. [PMID: 38177326] [PMCID: PMC10993308] [DOI: 10.1038/s43588-023-00500-8]
Abstract
Batch effects are pervasive in biomedical studies. One approach to addressing batch effects is to repeatedly measure a subset of samples in each batch; these remeasured samples are then used to estimate and correct the batch effects. However, rigorous statistical methods for batch-effect correction with remeasured samples are severely underdeveloped. Here we developed a framework for batch-effect correction using remeasured samples in highly confounded case-control studies. We provided theoretical analyses of the proposed procedure, evaluated its power characteristics, and provided a power calculation tool to aid in study design. We found that the number of samples that need to be remeasured depends strongly on the between-batch correlation: when the correlation is high, remeasuring a small subset of samples can rescue most of the power.
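The remeasurement idea can be illustrated with a simple additive batch-effect model (a sketch, not the authors' estimator; the additive assumption, the noise levels, and all names are ours): the per-feature batch effect is estimated from the paired remeasured samples and then subtracted from batch-2 measurements.

```python
import numpy as np

rng = np.random.default_rng(0)
n, g, m = 30, 100, 8                  # samples, features, remeasured samples
shift = rng.normal(0, 2, size=g)      # true additive batch effect per feature

batch1 = rng.normal(size=(n, g))      # all samples measured in batch 1
# a subset of m samples is remeasured in batch 2 (small technical noise)
batch2_re = batch1[:m] + shift + 0.1 * rng.normal(size=(m, g))

# estimate the per-feature batch effect from the remeasured pairs
shift_hat = (batch2_re - batch1[:m]).mean(axis=0)

# correct batch-2 measurements by subtracting the estimate
batch2_corrected = batch2_re - shift_hat
```

With more remeasured samples, or higher between-batch correlation, `shift_hat` becomes more precise, which is the power trade-off the abstract describes.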
Affiliation(s)
- Hanxuan Ye
- Department of Statistics, Texas A&M University, College Station, TX, USA
- Xianyang Zhang
- Department of Statistics, Texas A&M University, College Station, TX, USA
- Chen Wang
- Department of Quantitative Health Sciences, Mayo Clinic, Rochester, MN, USA
- Ellen L Goode
- Department of Quantitative Health Sciences, Mayo Clinic, Rochester, MN, USA
- Jun Chen
- Department of Quantitative Health Sciences, Mayo Clinic, Rochester, MN, USA
3. Huang C, Zhu H. Functional hybrid factor regression model for handling heterogeneity in imaging studies. Biometrika 2022; 109:1133-1148. [PMID: 36531154] [PMCID: PMC9754099] [DOI: 10.1093/biomet/asac007]
Abstract
This paper develops a functional hybrid factor regression modelling framework to handle the heterogeneity of many large-scale imaging studies, such as the Alzheimer's Disease Neuroimaging Initiative study. Despite the numerous successes of such imaging studies, heterogeneity may be caused by differences in study environment, population, design, protocols or other hidden factors, and it has posed major challenges for integrative analysis of imaging data collected from multiple centres or studies. We propose both estimation and inference procedures for estimating unknown parameters and detecting unknown factors under our new model. The asymptotic properties of both procedures are systematically investigated. The finite-sample performance of the proposed procedures is assessed using Monte Carlo simulations and a real data example on hippocampal surface data from the Alzheimer's disease study.
Affiliation(s)
- C Huang
- Department of Statistics, Florida State University, 117 N. Woodward Ave., Tallahassee, Florida 32304, U.S.A
- H Zhu
- Department of Biostatistics, The University of North Carolina at Chapel Hill, 135 Dauer Drive, Chapel Hill, North Carolina 27599, U.S.A
4. Guo Z, Ćevid D, Bühlmann P. Doubly debiased lasso: high-dimensional inference under hidden confounding. Ann Stat 2022; 50:1320-1347. [DOI: 10.1214/21-aos2152]
Affiliation(s)
- Zijian Guo
- Department of Statistics, Rutgers University
5. Bing X, Ning Y, Xu Y. Adaptive estimation in multivariate response regression with hidden variables. Ann Stat 2022. [DOI: 10.1214/21-aos2059]
Affiliation(s)
- Xin Bing
- Department of Statistics and Data Science, Cornell University
- Yang Ning
- Department of Statistics and Data Science, Cornell University
- Yaosheng Xu
- Department of Statistics and Data Science, Cornell University
6. Payne NY, Gagnon-Bartsch JA. Separating and reintegrating latent variables to improve classification of genomic data. Biostatistics 2022; 23:1133-1149. [DOI: 10.1093/biostatistics/kxab046]
Abstract
Genomic data sets contain the effects of various unobserved biological variables in addition to the variable of primary interest. These latent variables often affect a large number of features (e.g., genes), giving rise to dense latent variation. This latent variation presents both challenges and opportunities for classification. While some of these latent variables may be partially correlated with the phenotype of interest and thus helpful, others may be uncorrelated and merely contribute additional noise. Moreover, whether potentially helpful or not, these latent variables may obscure weaker effects that impact only a small number of features but more directly capture the signal of primary interest. To address these challenges, we propose the cross-residualization classifier (CRC). Through an adjustment and ensemble procedure, the CRC estimates and residualizes out the latent variation, trains a classifier on the residuals, and then reintegrates the latent variation in a final ensemble classifier. Thus, the latent variables are accounted for without discarding any potentially predictive information. We apply the method to simulated data and a variety of genomic data sets from multiple platforms. In general, we find that the CRC performs well relative to existing classifiers and sometimes offers substantial gains.
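The residualize-and-reintegrate idea can be sketched in a few lines of numpy. This is a deliberate simplification, not the CRC itself: the actual method uses a cross-residualization and ensemble procedure, while here the factor count, the nearest-centroid scorer, and the simulated data are all illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(5)
n, g, k = 120, 60, 2
y = np.repeat([0, 1], n // 2)                        # class labels
latent = rng.normal(size=(n, k)) + 0.5 * y[:, None]  # dense latent variation
X = latent @ rng.normal(size=(k, g)) + rng.normal(size=(n, g))
X[:, :5] += 0.8 * y[:, None]                         # sparse direct signal

# estimate latent variation with PCA and residualize it out
Xc = X - X.mean(0)
U, s, Vt = np.linalg.svd(Xc, full_matrices=False)
L = U[:, :k] * s[:k]                                 # latent scores
resid = Xc - L @ Vt[:k]                              # residual features

def centroid_score(F, y):
    """Project onto the difference of class centroids: larger => class 1."""
    d = F[y == 1].mean(0) - F[y == 0].mean(0)
    return F @ d

# one classifier on the residuals, one on the latent scores, then combine,
# so the latent variation is reintegrated rather than discarded
score = centroid_score(resid, y) + centroid_score(L, y)
pred = (score > np.median(score)).astype(int)
```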
Affiliation(s)
- Nora Yujia Payne
- Department of Statistics, University of Michigan, 1085 S. University Ave., Ann Arbor, MI 48109, USA
- Johann A Gagnon-Bartsch
- Department of Statistics, University of Michigan, 1085 S. University Ave., Ann Arbor, MI 48109, USA
7. McKennan C, Nicolae D. Estimating and accounting for unobserved covariates in high-dimensional correlated data. J Am Stat Assoc 2022; 117:225-236. [PMID: 35615339] [PMCID: PMC9126075] [DOI: 10.1080/01621459.2020.1769635]
Abstract
Many high-dimensional and high-throughput biological datasets have complex sample correlation structures, including longitudinal and multiple-tissue data, as well as data with multiple treatment conditions or related individuals. These data, like nearly all high-throughput 'omic' data, are influenced by technical and biological factors unknown to the researcher, which, if unaccounted for, can severely obfuscate estimation of and inference on the effects of interest. We therefore developed CBCV and CorrConf: provably accurate and computationally efficient methods to choose the number of, and estimate, latent confounding factors present in high-dimensional data with correlated or nonexchangeable residuals. We demonstrate each method's superior performance compared with other state-of-the-art methods by analyzing simulated multi-tissue gene expression data and by identifying sex-associated DNA methylation sites in a real, longitudinal twin study.
Affiliation(s)
- Dan Nicolae
- Department of Statistics, University of Chicago
8. Jernigan R, Jia K, Ren Z, Zhou W. Large-scale multiple inference of collective dependence with applications to protein function. Ann Appl Stat 2021; 15:902-924. [DOI: 10.1214/20-aoas1431]
Affiliation(s)
- Robert Jernigan
- Department of Biochemistry, Biophysics, and Molecular Biology, Program of Bioinformatics and Computational Biology, Iowa State University
- Kejue Jia
- Department of Biochemistry, Biophysics, and Molecular Biology, Program of Bioinformatics and Computational Biology, Iowa State University
- Zhao Ren
- Department of Statistics, University of Pittsburgh
- Wen Zhou
- Department of Statistics, Colorado State University
9.
Abstract
BACKGROUND With the explosion in the number of methods designed to analyze bulk and single-cell RNA-seq data, there is a growing need for approaches that assess and compare these methods. The usual technique is to compare methods on data simulated according to some theoretical model. However, as real data often exhibit violations from theoretical models, this can result in unsubstantiated claims of a method's performance. RESULTS Rather than generate data from a theoretical model, in this paper we develop methods to add signal to real RNA-seq datasets. Since the resulting simulated data are not generated from an unrealistic theoretical model, they exhibit realistic (annoying) attributes of real data. This lets RNA-seq methods developers assess their procedures in non-ideal (model-violating) scenarios. Our procedures may be applied to both single-cell and bulk RNA-seq. We show that our simulation method results in more realistic datasets and can alter the conclusions of a differential expression analysis study. We also demonstrate our approach by comparing various factor analysis techniques on RNA-seq datasets. CONCLUSIONS Using data simulated from a theoretical model can substantially impact the results of a study. We developed more realistic simulation techniques for RNA-seq data. Our tools are available in the seqgendiff R package on the Comprehensive R Archive Network: https://cran.r-project.org/package=seqgendiff.
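The seqgendiff package adds signal to real counts by binomial thinning. A minimal numpy sketch of that idea follows; the function name, the simulated counts, and the restriction to non-positive log2 effects (so thinning probabilities stay in (0, 1]) are our illustrative choices, not the package's interface.

```python
import numpy as np

rng = np.random.default_rng(1)

def thin_counts(counts, log2_fold, design):
    """Add differential-expression signal to a real count matrix by
    binomial thinning.

    counts:    (genes, samples) nonnegative integer matrix
    log2_fold: (genes,) desired log2 effect per gene; must be <= 0 here
               so each thinning probability 2**(x_i * b_g) lies in (0, 1]
    design:    (samples,) 0/1 group indicator; group 1 is thinned
    """
    p = 2.0 ** np.outer(log2_fold, design)   # per-cell thinning probability
    return rng.binomial(counts, p)           # thinned counts keep real-data quirks

# toy example: 100 genes x 6 samples, first 10 genes halved in group 1
counts = rng.poisson(50, size=(100, 6))
beta = np.zeros(100)
beta[:10] = -1.0                             # 2-fold reduction
x = np.array([0, 0, 0, 1, 1, 1])
thinned = thin_counts(counts, beta, x)
```

Because the new counts are subsamples of real counts, they inherit the realistic attributes the abstract emphasizes.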
Affiliation(s)
- David Gerard
- Department of Mathematics and Statistics, American University, Massachusetts Ave NW, Washington, DC, 20016, USA.
10. Gerard D, Stephens M. Empirical Bayes shrinkage and false discovery rate estimation, allowing for unwanted variation. Biostatistics 2020; 21:15-32. [PMID: 29985984] [PMCID: PMC8204175] [DOI: 10.1093/biostatistics/kxy029]
Abstract
We combine two important ideas in the analysis of large-scale genomics experiments (e.g. experiments that aim to identify genes that are differentially expressed between two conditions). The first is use of Empirical Bayes (EB) methods to handle the large number of potentially-sparse effects, and estimate false discovery rates and related quantities. The second is use of factor analysis methods to deal with sources of unwanted variation such as batch effects and unmeasured confounders. We describe a simple modular fitting procedure that combines key ideas from both these lines of research. This yields new, powerful EB methods for analyzing genomics experiments that account for both sparse effects and unwanted variation. In realistic simulations, these new methods provide significant gains in power and calibration over competing methods. In real data analysis, we find that different methods, while often conceptually similar, can vary widely in their assessments of statistical significance. This highlights the need for care in both choice of methods and interpretation of results.
Affiliation(s)
- David Gerard
- Department of Human Genetics, Cummings Life Science Center, University of Chicago, 920 E 58th Street, Chicago, IL 60637, USA
- Matthew Stephens
- Department of Human Genetics, Cummings Life Science Center, University of Chicago, 920 E 58th Street, Chicago, IL 60637, USA and Department of Statistics, George Herbert Jones Laboratory, University of Chicago, 5747 S Ellis Avenue, Chicago, IL 60637, USA
11. McKennan C, Nicolae D. Accounting for unobserved covariates with varying degrees of estimability in high-dimensional biological data. Biometrika 2019; 106:823-840. [PMID: 31754283] [DOI: 10.1093/biomet/asz037]
Abstract
An important phenomenon in high-throughput biological data is the presence of unobserved covariates that can have a significant impact on the measured response. When these covariates are also correlated with the covariate of interest, ignoring or improperly estimating them can lead to inaccurate estimates of and spurious inference on the corresponding coefficients of interest in a multivariate linear model. We first prove that existing methods to account for these unobserved covariates often inflate Type I error for the null hypothesis that a given coefficient of interest is zero. We then provide alternative estimators for the coefficients of interest that correct the inflation, and prove that our estimators are asymptotically equivalent to the ordinary least squares estimators obtained when every covariate is observed. Lastly, we use previously published DNA methylation data to show that our method can more accurately estimate the direct effect of asthma on DNA methylation levels compared to existing methods, the latter of which likely fail to recover and account for latent cell type heterogeneity.
Affiliation(s)
- Chris McKennan
- Department of Statistics, University of Chicago, 5747 S. Ellis Avenue, Chicago, Illinois, U.S.A
- Dan Nicolae
- Department of Statistics, University of Chicago, 5747 S. Ellis Avenue, Chicago, Illinois, U.S.A
12. Zhou W, Koudijs KKM, Böhringer S. Influence of batch effect correction methods on drug induced differential gene expression profiles. BMC Bioinformatics 2019; 20:437. [PMID: 31438848] [PMCID: PMC6706913] [DOI: 10.1186/s12859-019-3028-6]
Abstract
Background Batch effects were not accounted for in most studies of computational drug repositioning based on gene expression signatures, and it is unknown how batch-effect removal methods impact the results of signature-based drug repositioning. Herein, we conducted differential analyses on the Connectivity Map (CMAP) database using several batch-effect correction methods, to evaluate and compare their influence on computational drug repositioning with microarray data. Results Differences in average signature size were observed across methods. The gene signatures identified by the Latent Effect Adjustment after Primary Projection (LEAPP) method and by the methods fitted with the Linear Models for Microarray Data (limma) software showed little agreement. The external validity of the gene signatures was evaluated by connectivity mapping between the CMAP database and the Library of Integrated Network-based Cellular Signatures (LINCS) database. The results indicate that the genes identified were not reliable for drugs with total sample size (drug plus control samples) smaller than 40, irrespective of the batch-effect correction method applied. With total sample size larger than 40, the methods correcting for batch effects produced significantly better results than no batch-effect correction. In a simulation study, power was generally low for simulated data with sample size smaller than 40; we observed the best performance when using limma with two principal components as covariates. Conclusion Batch-effect correction methods strongly impact differential gene expression analysis, and thus downstream drug repositioning, when the sample size is large enough to contain sufficient information. We recommend including two or three principal components as covariates when fitting models with limma, provided the sample size is sufficient (larger than 40 drug and control samples combined).
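The recommendation above — include a few principal components as covariates in the per-gene linear model — can be sketched with ordinary least squares standing in for limma's empirical-Bayes machinery. This is an illustrative simplification: the simulated data, the number of PCs, and all names are our assumptions.

```python
import numpy as np

rng = np.random.default_rng(1)
n, g = 60, 500
treat = np.repeat([0.0, 1.0], n // 2)             # treatment indicator
hidden = rng.normal(size=n)                       # unmeasured batch-like factor
Y = np.outer(hidden, rng.normal(size=g)) + rng.normal(size=(n, g))
Y[:, :25] += np.outer(treat, np.ones(25))         # true effects in 25 genes

# top two principal components of the centred expression matrix
Yc = Y - Y.mean(0)
U, s, _ = np.linalg.svd(Yc, full_matrices=False)
pcs = U[:, :2]

# per-gene linear model: intercept + treatment + two PCs as covariates
X = np.column_stack([np.ones(n), treat, pcs])
beta, *_ = np.linalg.lstsq(X, Y, rcond=None)
treat_effect = beta[1]                            # estimated treatment effect per gene
```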
Affiliation(s)
- Wei Zhou
- Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, The Netherlands; Department of Internal Medicine, Erasmus Medical Center, Rotterdam, The Netherlands
- Karel K M Koudijs
- Department of Clinical Pharmacy & Toxicology, Leiden University Medical Center, Leiden, The Netherlands
- Stefan Böhringer
- Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, The Netherlands
13. Hornstein M, Fan R, Shedden K, Zhou S. Joint mean and covariance estimation with unreplicated matrix-variate data. J Am Stat Assoc 2019. [DOI: 10.1080/01621459.2018.1429275]
Affiliation(s)
- Roger Fan
- Department of Statistics, University of Michigan, Ann Arbor, MI
- Kerby Shedden
- Department of Statistics, University of Michigan, Ann Arbor, MI
- Shuheng Zhou
- Department of Statistics, University of Michigan, Ann Arbor, MI
- Department of Statistics, University of California, Riverside, CA
14. Dahl A, Guillemot V, Mefford J, Aschard H, Zaitlen N. Adjusting for principal components of molecular phenotypes induces replicating false positives. Genetics 2019; 211:1179-1189. [PMID: 30692194] [PMCID: PMC6456307] [DOI: 10.1534/genetics.118.301768]
Abstract
High-throughput measurements of molecular phenotypes provide an unprecedented opportunity to model cellular processes and their impact on disease. These highly structured datasets are usually strongly confounded, creating false positives and reducing power. This has motivated many approaches based on principal components analysis (PCA) to estimate and correct for confounders, which have become indispensable elements of association tests between molecular phenotypes and both genetic and nongenetic factors. Here, we show that these correction approaches induce a bias, and that it persists for large sample sizes and replicates out-of-sample. We prove this theoretically for PCA by deriving an analytic, deterministic, and intuitive bias approximation. We assess other methods with realistic simulations, which show that perturbing any of several basic parameters can cause false positive rate (FPR) inflation. Our experiments show the bias depends on covariate and confounder sparsity, effect sizes, and their correlation. Surprisingly, when the covariate and confounder have [Formula: see text], standard two-step methods all have [Formula: see text]-fold FPR inflation. Our analysis informs best practices for confounder correction in genomic studies, and suggests many false discoveries have been made and replicated in some differential expression analyses.
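The standard two-step procedure the authors analyze can be reproduced in a few lines. This sketch simulates a null (the covariate has no true effect on the phenotypes, but is correlated with the confounders); parameter choices and names are our illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(2)
n, g, k = 200, 300, 3
confounder = rng.normal(size=(n, k))
# covariate correlated with the confounding, as in the paper's setting
covariate = 0.7 * confounder[:, 0] + 0.5 * rng.normal(size=n)
# null phenotypes: driven by confounders only, no covariate effect
Y = confounder @ rng.normal(size=(k, g)) + rng.normal(size=(n, g))

# step 1: estimate confounders as the top principal components of Y
Yc = Y - Y.mean(0)
U, s, _ = np.linalg.svd(Yc, full_matrices=False)
pc_hat = U[:, :k]

# step 2: per-feature regression of Y on the covariate, adjusting for PCs
X = np.column_stack([np.ones(n), covariate, pc_hat])
beta, rss, *_ = np.linalg.lstsq(X, Yc, rcond=None)
df = n - X.shape[1]
XtX_inv = np.linalg.inv(X.T @ X)
se = np.sqrt(rss / df * XtX_inv[1, 1])
z = beta[1] / se                                  # per-feature test statistics
```

Under the paper's analysis, statistics from this pipeline are subtly biased rather than exactly standard normal, which is why the FPR can inflate and replicate out-of-sample.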
Affiliation(s)
- Andy Dahl
- Department of Medicine, University of California San Francisco, 94158 California
- Vincent Guillemot
- Centre de Bioinformatique, Biostatistique et Biologie Intégrative, Institut Pasteur, Paris, 75015 France
- Joel Mefford
- Department of Medicine, University of California San Francisco, 94158 California
- Hugues Aschard
- Centre de Bioinformatique, Biostatistique et Biologie Intégrative, Institut Pasteur, Paris, 75015 France
- Department of Epidemiology, Harvard TH Chan School of Public Health, Boston, 02115 Massachusetts
- Noah Zaitlen
- Department of Medicine, University of California San Francisco, 94158 California
15. Hung H. A robust removing unwanted variation-testing procedure via γ-divergence. Biometrics 2018; 75:650-662. [PMID: 30430537] [DOI: 10.1111/biom.13002]
Abstract
Identification of differentially expressed (DE) genes is commonly conducted in modern biomedical research. However, unwanted variation inevitably arises during the data collection process and can heavily bias the detection results. Various methods have been suggested for removing the unwanted variation while keeping the biological variation, to ensure reliable analysis results. Removing unwanted variation (RUV) has recently been proposed for this purpose, and works by virtue of negative control genes. On the other hand, outliers frequently appear in modern high-throughput genetic data and can heavily affect the performance of RUV and its downstream analysis. In this work, we propose a robust RUV-testing procedure via γ-divergence: a robust RUV procedure to remove unwanted variation, followed by a robust testing procedure to identify DE genes. The advantages of our method are twofold: (a) it does not involve any modeling of the outlier distribution, which makes it applicable to various situations; (b) it is easy to implement, in the sense that its robustness is controlled by a single tuning parameter γ of γ-divergence, and a data-driven criterion is developed to select γ. When applied to real datasets, our method can successfully remove unwanted variation and identify more DE genes than conventional methods.
Affiliation(s)
- Hung Hung
- Institute of Epidemiology and Preventive Medicine, College of Public Health, National Taiwan University, Taipei, Taiwan
16. Dobriban E, Owen AB. Deterministic parallel analysis: an improved method for selecting factors and principal components. J R Stat Soc Series B Stat Methodol 2018. [DOI: 10.1111/rssb.12301]
17.
Affiliation(s)
- Qingyuan Zhao
- Department of Statistics, The Wharton School, University of Pennsylvania, Philadelphia, PA
18. Guillaume B, Wang C, Poh J, Shen MJ, Ong ML, Tan PF, Karnani N, Meaney M, Qiu A. Improving mass-univariate analysis of neuroimaging data by modelling important unknown covariates: application to epigenome-wide association studies. Neuroimage 2018; 173:57-71. [PMID: 29448075] [DOI: 10.1016/j.neuroimage.2018.01.073]
Abstract
Statistical inference on neuroimaging data is often conducted using a mass-univariate model, equivalent to fitting a linear model at every voxel with a known set of covariates. Due to the large number of linear models, it is challenging to check if the selection of covariates is appropriate and to modify this selection adequately. The use of standard diagnostics, such as residual plotting, is clearly not practical for neuroimaging data. However, the selection of covariates is crucial for linear regression to ensure valid statistical inference. In particular, the mean model of regression needs to be reasonably well specified. Unfortunately, this issue is often overlooked in the field of neuroimaging. This study aims to adopt the existing Confounder Adjusted Testing and Estimation (CATE) approach and to extend it for use with neuroimaging data. We propose a modification of CATE that can yield valid statistical inferences using Principal Component Analysis (PCA) estimators instead of Maximum Likelihood (ML) estimators. We then propose a non-parametric hypothesis testing procedure that can improve upon parametric testing. Monte Carlo simulations show that the modification of CATE allows for more accurate modelling of neuroimaging data and can in turn yield a better control of False Positive Rate (FPR) and Family-Wise Error Rate (FWER). We demonstrate its application to an Epigenome-Wide Association Study (EWAS) on neonatal brain imaging and umbilical cord DNA methylation data obtained as part of a longitudinal cohort study. Software for this CATE study is freely available at http://www.bioeng.nus.edu.sg/cfa/Imaging_Genetics2.html.
Affiliation(s)
- Bryan Guillaume
- Department of Biomedical Engineering, National University of Singapore, Singapore
- Changqing Wang
- Department of Biomedical Engineering, National University of Singapore, Singapore
- Joann Poh
- Department of Biomedical Engineering, National University of Singapore, Singapore; Singapore Institute for Clinical Sciences, Agency for Science, Technology, and Research, Singapore
- Mo Jun Shen
- Department of Biomedical Engineering, National University of Singapore, Singapore; Singapore Institute for Clinical Sciences, Agency for Science, Technology, and Research, Singapore
- Mei Lyn Ong
- Singapore Institute for Clinical Sciences, Agency for Science, Technology, and Research, Singapore
- Pei Fang Tan
- Singapore Institute for Clinical Sciences, Agency for Science, Technology, and Research, Singapore
- Neerja Karnani
- Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, 119228, Singapore; Singapore Institute for Clinical Sciences, Agency for Science, Technology, and Research, Singapore
- Michael Meaney
- Ludmer Centre for Neuroinformatics and Mental Health, Douglas Mental Health University Institute, McGill University, Canada; Sackler Program for Epigenetics and Psychobiology at McGill University, Canada; Singapore Institute for Clinical Sciences, Agency for Science, Technology, and Research, Singapore
- Anqi Qiu
- Department of Biomedical Engineering, National University of Singapore, Singapore; Clinical Imaging Research Centre, National University of Singapore, Singapore; Singapore Institute for Clinical Sciences, Agency for Science, Technology, and Research, Singapore
19. Controlling for confounding effects in single cell RNA sequencing studies using both control and target genes. Sci Rep 2017; 7:13587. [PMID: 29051597] [PMCID: PMC5648789] [DOI: 10.1038/s41598-017-13665-w]
Abstract
The single-cell RNA sequencing (scRNAseq) technique is becoming increasingly popular for unbiased, high-resolution transcriptome analysis of heterogeneous cell populations. Despite its many advantages, scRNAseq, like any other genomic sequencing technique, is susceptible to confounding effects, and controlling for them is a crucial step for accurate downstream analysis. Here, we present a novel statistical method, which we refer to as scPLS (single cell partial least squares), for robust and accurate inference of confounding effects. scPLS takes advantage of the fact that genes in a scRNAseq study can often be naturally classified into two sets: a control set of genes that are free of effects of the predictor variables, and a target set of genes that are of primary interest. By modeling the two sets of genes jointly using partial least squares regression, scPLS makes full use of the data to improve the inference of confounding effects. With extensive simulations and comparisons with other methods, we demonstrate the effectiveness of scPLS. Finally, we apply scPLS to two scRNAseq datasets to illustrate its benefits in removing technical confounding effects as well as cell cycle effects.
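scPLS fits the two gene sets jointly by partial least squares. As a simpler illustration of why control genes help, one can estimate the confounders by PCA on the control set alone and regress them out of the target set — an RUV-style sketch, not scPLS itself; the simulated data and all names are our assumptions.

```python
import numpy as np

rng = np.random.default_rng(3)
n, g_ctrl, g_tgt, k = 100, 50, 200, 2
Z = rng.normal(size=(n, k))                          # hidden confounders
ctrl = Z @ rng.normal(size=(k, g_ctrl)) + 0.3 * rng.normal(size=(n, g_ctrl))
tgt = Z @ rng.normal(size=(k, g_tgt)) + rng.normal(size=(n, g_tgt))

# estimate confounders from control genes (assumed free of primary effects)
U, s, _ = np.linalg.svd(ctrl - ctrl.mean(0), full_matrices=False)
z_hat = U[:, :k]

# remove their estimated effect from the target genes by least squares
coef, *_ = np.linalg.lstsq(z_hat, tgt - tgt.mean(0), rcond=None)
tgt_clean = (tgt - tgt.mean(0)) - z_hat @ coef
```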
20.
Abstract
We consider large-scale studies in which thousands of significance tests are performed simultaneously. In some of these studies, the multiple testing procedure can be severely biased by latent confounding factors such as batch effects and unmeasured covariates that correlate with both primary variable(s) of interest (e.g., treatment variable, phenotype) and the outcome. Over the past decade, many statistical methods have been proposed to adjust for the confounders in hypothesis testing. We unify these methods in the same framework, generalize them to include multiple primary variables and multiple nuisance variables, and analyze their statistical properties. In particular, we provide theoretical guarantees for RUV-4 [Gagnon-Bartsch, Jacob and Speed (2013)] and LEAPP [Ann. Appl. Stat. 6 (2012) 1664-1688], which correspond to two different identification conditions in the framework: the first requires a set of "negative controls" that are known a priori to follow the null distribution; the second requires the true nonnulls to be sparse. Two different estimators which are based on RUV-4 and LEAPP are then applied to these two scenarios. We show that if the confounding factors are strong, the resulting estimators can be asymptotically as powerful as the oracle estimator which observes the latent confounding factors. For hypothesis testing, we show the asymptotic z-tests based on the estimators can control the type I error. Numerical experiments show that the false discovery rate is also controlled by the Benjamini-Hochberg procedure when the sample size is reasonably large.
Collapse
Affiliation(s)
- Jingshu Wang
- Department of Statistics, The Wharton School, University of Pennsylvania, 400 Huntsman Hall, 3730 Walnut St, Philadelphia, Pennsylvania 19104, USA
| | - Qingyuan Zhao
- Department of Statistics, The Wharton School, University of Pennsylvania, 400 Huntsman Hall, 3730 Walnut St, Philadelphia, Pennsylvania 19104, USA
| | - Trevor Hastie
- Department of Statistics, Stanford University, 390 Serra Mall, Stanford, California 94305, USA
| | - Art B. Owen
- Department of Statistics, Stanford University, 390 Serra Mall, Stanford, California 94305, USA
| |
Collapse
|
21
|
Lee S, Sun W, Wright FA, Zou F. An improved and explicit surrogate variable analysis procedure by coefficient adjustment. Biometrika 2017; 104:303-316. [PMID: 29430031 PMCID: PMC5627626 DOI: 10.1093/biomet/asx018] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2015] [Indexed: 01/31/2023] Open
Abstract
Unobserved environmental, demographic and technical factors can adversely affect the estimation and testing of the effects of primary variables. Surrogate variable analysis, proposed to tackle this problem, has been widely used in genomic studies. To estimate hidden factors that are correlated with the primary variables, surrogate variable analysis performs principal component analysis either on a subset of features or on all features, but weighting each differently. However, existing approaches may fail to identify hidden factors that are strongly correlated with the primary variables, and the extra step of feature selection and weight calculation makes the theoretical investigation of surrogate variable analysis challenging. In this paper, we propose an improved surrogate variable analysis, using all measured features, that has a natural connection with restricted least squares, which allows us to study its theoretical properties. Simulation studies and real-data analysis show that the method is competitive with state-of-the-art methods.
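A bare-bones version of the residual-PCA step that classical surrogate variable analysis builds on can be sketched as follows. This is a simplified illustration only, not the coefficient-adjusted procedure proposed in the paper, and all variable names are hypothetical.

```python
import numpy as np

def surrogate_variables(Y, x, n_sv=1):
    """Estimate surrogate variables as the top principal components of
    the residuals after regressing each feature on the primary variable."""
    X = np.column_stack([np.ones(len(x)), x])        # design with intercept
    beta, *_ = np.linalg.lstsq(X, Y, rcond=None)     # per-feature OLS fits
    resid = Y - X @ beta                             # remove primary effects
    u, _, _ = np.linalg.svd(resid, full_matrices=False)
    return u[:, :n_sv]                               # samples x n_sv

# Toy data: expression driven by a primary variable plus a hidden factor
rng = np.random.default_rng(1)
n, p = 200, 100
x = rng.normal(size=n)                 # primary variable
h = rng.normal(size=n)                 # hidden factor to recover
Y = (np.outer(x, rng.normal(size=p))
     + np.outer(h, 2.0 * rng.normal(size=p))
     + 0.5 * rng.normal(size=(n, p)))
sv = surrogate_variables(Y, x, n_sv=1)
```

The recovered surrogate variable can then be included as a covariate in the downstream per-feature regressions to absorb the hidden factor's effect.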
Collapse
Affiliation(s)
- Seunggeun Lee
- Department of Biostatistics, University of Michigan, 1415 Washington Heights, Ann Arbor, Michigan 48109,
| | - Wei Sun
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave. N., Seattle, Washington 98109,
| | - Fred A Wright
- Bioinformatics Research Center, North Carolina State University, 1 Lampe Drive, Raleigh, North Carolina 27607,
| | - Fei Zou
- Department of Biostatistics, University of Florida, 2004 Mowry Rd, Gainesville, Florida 32611,
| |
Collapse
|
22
|
Du L, Zhang C. Estimation of false discovery proportion in multiple testing: From normal to chi-squared test statistics. Electron J Stat 2017. [DOI: 10.1214/17-ejs1256] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
23
|
Sheu CF, Perthame É, Lee YS, Causeur D. Accounting for time dependence in large-scale multiple testing of event-related potential data. Ann Appl Stat 2016. [DOI: 10.1214/15-aoas888] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
25
|
Delattre S, Roquain E. On empirical distribution function of high-dimensional Gaussian vector components with an application to multiple testing. BERNOULLI 2016. [DOI: 10.3150/14-bej659] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
26
|
Blum Y, Houée-Bigot M, Causeur D. Sparse factor model for co-expression networks with an application using prior biological knowledge. Stat Appl Genet Mol Biol 2016; 15:253-72. [DOI: 10.1515/sagmb-2015-0002] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Inference on gene regulatory networks from high-throughput expression data turns out to be one of the main current challenges in systems biology. Such networks can be very insightful for a deep understanding of interactions between genes. Because gene-gene interactions are often viewed as joint contributions to known biological mechanisms, inference on the dependence among gene expressions is expected to be consistent to some extent with the functional characterization of genes that can be derived from ontologies (GO, KEGG, …). The present paper introduces a sparse factor model as a general framework either to account for prior knowledge on joint contributions of modules of genes to latent biological processes or to infer the corresponding co-expression network. We propose an
Collapse
|
27
|
Jiang Y, Oldridge DA, Diskin SJ, Zhang NR. CODEX: a normalization and copy number variation detection method for whole exome sequencing. Nucleic Acids Res 2015; 43:e39. [PMID: 25618849 PMCID: PMC4381046 DOI: 10.1093/nar/gku1363] [Citation(s) in RCA: 95] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2014] [Accepted: 12/19/2014] [Indexed: 01/24/2023] Open
Abstract
High-throughput sequencing of DNA coding regions has become a common way of assaying genomic variation in the study of human diseases. Copy number variation (CNV) is an important type of genomic variation, but detecting and characterizing CNV from exome sequencing is challenging due to the high level of biases and artifacts. We propose CODEX, a normalization and CNV calling procedure for whole exome sequencing data. The Poisson latent factor model in CODEX includes terms that specifically remove biases due to GC content, exon capture and amplification efficiency, and latent systemic artifacts. CODEX also includes a Poisson likelihood-based recursive segmentation procedure that explicitly models the count-based exome sequencing data. CODEX is compared to existing methods on a population analysis of HapMap samples from the 1000 Genomes Project, and shown to be more accurate on three microarray-based validation data sets. We further evaluate performance on 222 neuroblastoma samples with matched normals and focus on a well-studied rare somatic CNV within the ATRX gene. We show that the cross-sample normalization procedure of CODEX removes more noise than normalizing the tumor against the matched normal and that the segmentation procedure performs well in detecting CNVs with nested structures.
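As an intuition for the kind of GC-content adjustment that CODEX performs within its Poisson latent factor model, here is a much simpler median-binning normalization. It is a hedged sketch only; `gc_normalize` is not part of CODEX, and the data below are synthetic.

```python
import numpy as np

def gc_normalize(counts, gc, n_bins=10):
    """Divide each exon's read count by the median count of exons with
    similar GC fraction, rescaled so the overall median is preserved.
    A toy stand-in for model-based GC-bias correction."""
    counts = np.asarray(counts, dtype=float)
    # Quantile bin edges over the GC fractions
    edges = np.quantile(gc, np.linspace(0, 1, n_bins + 1))
    which = np.clip(np.digitize(gc, edges[1:-1]), 0, n_bins - 1)
    overall = np.median(counts)
    out = np.empty_like(counts)
    for b in range(n_bins):
        idx = which == b
        med = np.median(counts[idx]) if idx.any() else overall
        out[idx] = counts[idx] * (overall / med)
    return out

# Synthetic exons whose coverage rises with GC fraction
rng = np.random.default_rng(2)
gc = rng.uniform(0.3, 0.7, size=2000)      # GC fraction per exon
depth = rng.poisson(100 * (0.5 + gc))      # GC-biased coverage
norm = gc_normalize(depth, gc)
```

After normalization, the residual correlation between coverage and GC fraction should be much weaker than in the raw counts, which is the prerequisite for calling copy number from coverage ratios.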
Collapse
Affiliation(s)
- Yuchao Jiang
- Genomics and Computational Biology Graduate Program, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Derek A Oldridge
- Medical Scientist Training Program, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA; Division of Oncology and Center for Childhood Cancer Research, The Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA; Department of Pediatrics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Sharon J Diskin
- Division of Oncology and Center for Childhood Cancer Research, The Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA; Department of Pediatrics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA; Abramson Family Cancer Research Institute, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Nancy R Zhang
- Department of Statistics, The Wharton School, University of Pennsylvania, Philadelphia, PA 19104, USA
| |
Collapse
|