1
|
Kappenberg F, Grinberg M, Jiang X, Kopp-Schneider A, Hengstler JG, Rahnenführer J. Comparison Of Observation-Based And Model-Based Identification Of Alert Concentrations From Concentration-Expression Data. Bioinformatics 2021; 37:btab043. [PMID: 33515236 PMCID: PMC8337003 DOI: 10.1093/bioinformatics/btab043] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2020] [Revised: 01/15/2021] [Accepted: 01/18/2021] [Indexed: 11/23/2022] Open
Abstract
MOTIVATION An important goal of concentration-response studies in toxicology is to determine an 'alert' concentration where a critical level of the response variable is exceeded. In a classical observation-based approach, only measured concentrations are considered as potential alert concentrations. Alternatively, a parametric curve is fitted to the data that describes the relationship between concentration and response. For a prespecified effect level, both an absolute estimate of the alert concentration and an estimate of the lowest concentration where the effect level is exceeded significantly are of interest. RESULTS In a simulation study for gene expression data, we compared the observation-based and the model-based approach for both absolute and significant exceedance of the prespecified effect level. Results show that, compared to the observation-based approach, the model-based approach overestimates the true alert concentration less often and more frequently leads to a valid estimate, especially for genes with large variance. AVAILABILITY AND IMPLEMENTATION The code used for the simulation studies is available via the GitHub repository: https://github.com/FKappenberg/Paper-IdentificationAlertConcentrations. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
| | - Marianna Grinberg
- Department of Statistics, TU Dortmund University, Dortmund 44221, Germany
| | - Xiaoqi Jiang
- Division of Biostatistics, German Cancer Research Center (DKFZ), Heidelberg 69120, Germany
| | - Annette Kopp-Schneider
- Division of Biostatistics, German Cancer Research Center (DKFZ), Heidelberg 69120, Germany
| | - Jan G Hengstler
- Department of Toxikologie/Systemtoxikologie, Leibniz Research Centre for Working Environment and Human Factors (IfADo), TU Dortmund University, Dortmund 44139, Germany
| | - Jörg Rahnenführer
- Department of Statistics, TU Dortmund University, Dortmund 44221, Germany
| |
Collapse
|
2
|
Mazo G, Portier F. Parametric versus nonparametric: The fitness coefficient. Scand Stat Theory Appl 2020. [DOI: 10.1111/sjos.12495] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Gildas Mazo
- MaIAGE, INRAE Université Paris Saclay France
| | | |
Collapse
|
3
|
Affiliation(s)
- Jiahui Yu
- Department of Mathematics and Statistics, Boston University, Boston, MA
| | | | - Anna Liu
- Department of Mathematics and Statistics, University of Massachusetts, Amherst, MA
| | - Yuedong Wang
- Department Statistics and Applied Probability, University of California, Santa Barbara, CA
| |
Collapse
|
4
|
Yuan A, Chen X, Zhou Y, Tan MT. Subgroup analysis with semiparametric models toward precision medicine. Stat Med 2018; 37:1830-1845. [PMID: 29575056 DOI: 10.1002/sim.7638] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2017] [Revised: 01/23/2018] [Accepted: 01/26/2018] [Indexed: 11/11/2022]
Abstract
In analyzing clinical trials, one important objective is to classify the patients into treatment-favorable and nonfavorable subgroups. Existing parametric methods are not robust, and the commonly used classification rules ignore the fact that the implications of treatment-favorable and nonfavorable subgroups can be different. To address these issues, we propose a semiparametric model, incorporating both our knowledge and uncertainty about the true model. The Wald statistics is used to test the existence of subgroups, while the Neyman-Pearson rule to classify each subject. Asymptotic properties are derived, simulation studies are conducted to evaluate the performance of the method, and then method is used to analyze a real-world trial data.
Collapse
Affiliation(s)
- Ao Yuan
- Department of Biostatistics, Bioinformatics and Biomathematics, Georgetown University, Washington DC, 20057, USA
| | - Xiaofei Chen
- Department of Biostatistics, Bioinformatics and Biomathematics, Georgetown University, Washington DC, 20057, USA
| | - Yizhao Zhou
- Department of Biostatistics, Bioinformatics and Biomathematics, Georgetown University, Washington DC, 20057, USA
| | - Ming T Tan
- Department of Biostatistics, Bioinformatics and Biomathematics, Georgetown University, Washington DC, 20057, USA
| |
Collapse
|
5
|
Xie Y, Zhu Y, Cotton CA, Wu P. A model averaging approach for estimating propensity scores by optimizing balance. Stat Methods Med Res 2017; 28:84-101. [DOI: 10.1177/0962280217715487] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Many approaches, including traditional parametric modeling and machine learning techniques, have been proposed to estimate propensity scores. This paper describes a new model averaging approach to propensity score estimation in which parametric and nonparametric estimates are combined to achieve covariate balance. Simulation studies are conducted across different scenarios varying in the degree of interactions and nonlinearities in the treatment model. The results show that, based on inverse probability weighting, the proposed propensity score estimator produces less bias and smaller standard errors than existing approaches. They also show that a model averaging approach with the objective of minimizing the average Kolmogorov–Smirnov statistic leads to the best performing IPW estimator. The proposed approach is also applied to a real data set in evaluating the causal effect of formula or mixed feeding versus exclusive breastfeeding on a child’s body mass index Z-score at age 4. The data analysis shows that formula or mixed feeding is more likely to lead to obesity at age 4, compared to exclusive breastfeeding.
Collapse
Affiliation(s)
- Yuying Xie
- Department of Statistics and Actuarial Science, University of Waterloo, Waterloo, Canada
| | - Yeying Zhu
- Department of Statistics and Actuarial Science, University of Waterloo, Waterloo, Canada
| | - Cecilia A Cotton
- Department of Statistics and Actuarial Science, University of Waterloo, Waterloo, Canada
| | - Pan Wu
- Value Institute, Christiana Care Health System, Newark, USA
| |
Collapse
|
6
|
Dosne AG, Bergstrand M, Karlsson MO, Renard D, Heimann G. Model averaging for robust assessment of QT prolongation by concentration-response analysis. Stat Med 2017; 36:3844-3857. [PMID: 28703360 DOI: 10.1002/sim.7395] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2016] [Revised: 05/22/2017] [Accepted: 06/11/2017] [Indexed: 11/07/2022]
Abstract
Assessing the QT prolongation potential of a drug is typically done based on pivotal safety studies called thorough QT studies. Model-based estimation of the drug-induced QT prolongation at the estimated mean maximum drug concentration could increase efficiency over the currently used intersection-union test. However, robustness against model misspecification needs to be guaranteed in pivotal settings. The objective of this work was to develop an efficient, fully prespecified model-based inference method for thorough QT studies, which controls the type I error and provides satisfactory test power. This is achieved by model averaging: The proposed estimator of the concentration-response relationship is a weighted average of a parametric (linear) and a nonparametric (monotonic I-splines) estimator, with weights based on mean integrated square error. The desired properties of the method were confirmed in an extensive simulation study, which demonstrated that the proposed method controlled the type I error adequately, and that its power was higher than the power of the nonparametric method alone. The method can be extended from thorough QT studies to the analysis of QT data from pooled phase I studies.
Collapse
Affiliation(s)
| | | | | | - D Renard
- Novartis Pharma AG, Basel, Switzerland
| | - G Heimann
- Novartis Pharma AG, Basel, Switzerland
| |
Collapse
|
7
|
Verhoeven T, Hübner D, Tangermann M, Müller KR, Dambre J, Kindermans PJ. Improving zero-training brain-computer interfaces by mixing model estimators. J Neural Eng 2017; 14:036021. [PMID: 28287076 DOI: 10.1088/1741-2552/aa6639] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Abstract
OBJECTIVE Brain-computer interfaces (BCI) based on event-related potentials (ERP) incorporate a decoder to classify recorded brain signals and subsequently select a control signal that drives a computer application. Standard supervised BCI decoders require a tedious calibration procedure prior to every session. Several unsupervised classification methods have been proposed that tune the decoder during actual use and as such omit this calibration. Each of these methods has its own strengths and weaknesses. Our aim is to improve overall accuracy of ERP-based BCIs without calibration. APPROACH We consider two approaches for unsupervised classification of ERP signals. Learning from label proportions (LLP) was recently shown to be guaranteed to converge to a supervised decoder when enough data is available. In contrast, the formerly proposed expectation maximization (EM) based decoding for ERP-BCI does not have this guarantee. However, while this decoder has high variance due to random initialization of its parameters, it obtains a higher accuracy faster than LLP when the initialization is good. We introduce a method to optimally combine these two unsupervised decoding methods, letting one method's strengths compensate for the weaknesses of the other and vice versa. The new method is compared to the aforementioned methods in a resimulation of an experiment with a visual speller. MAIN RESULTS Analysis of the experimental results shows that the new method exceeds the performance of the previous unsupervised classification approaches in terms of ERP classification accuracy and symbol selection accuracy during the spelling experiment. Furthermore, the method shows less dependency on random initialization of model parameters and is consequently more reliable. SIGNIFICANCE Improving the accuracy and subsequent reliability of calibrationless BCIs makes these systems more appealing for frequent use.
Collapse
Affiliation(s)
- T Verhoeven
- Electronics and Informations Systems, Ghent University, Ghent, Belgium
| | | | | | | | | | | |
Collapse
|
8
|
Lee SSM, Soleymani M. A Simple Formula for Mixing Estimators With Different Convergence Rates. J Am Stat Assoc 2016. [DOI: 10.1080/01621459.2014.960966] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
9
|
Talamakrouni M, Van Keilegom I, El Ghouch A. Parametrically guided nonparametric density and hazard estimation with censored data. Comput Stat Data Anal 2016. [DOI: 10.1016/j.csda.2015.01.009] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
10
|
Mohseni Ahooyi T, Arbogast JE, Soroush M. Rolling Pin Method: Efficient General Method of Joint Probability Modeling. Ind Eng Chem Res 2014. [DOI: 10.1021/ie503584q] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Taha Mohseni Ahooyi
- Department
of Chemical and Biological Engineering, Drexel University, Philadelphia, Pennsylvania 19104, United States
| | | | - Masoud Soroush
- Department
of Chemical and Biological Engineering, Drexel University, Philadelphia, Pennsylvania 19104, United States
| |
Collapse
|
11
|
A bootstrap procedure for local semiparametric density estimation amid model uncertainties. J Stat Plan Inference 2014. [DOI: 10.1016/j.jspi.2014.05.004] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
12
|
A data-adaptive strategy for inverse weighted estimation of causal effects. HEALTH SERVICES AND OUTCOMES RESEARCH METHODOLOGY 2014. [DOI: 10.1007/s10742-014-0124-y] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
13
|
Delaigle A, Hall P. Parametrically Assisted Nonparametric Estimation of a Density in the Deconvolution Problem. J Am Stat Assoc 2014. [DOI: 10.1080/01621459.2013.857611] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|
14
|
|
15
|
Weak conditions for shrinking multivariate nonparametric density estimators. J MULTIVARIATE ANAL 2013. [DOI: 10.1016/j.jmva.2012.09.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
|
16
|
Qu L, Nettleton D, Dekkers JCM. Improved estimation of the noncentrality parameter distribution from a large number of t-statistics, with applications to false discovery rate estimation in microarray data analysis. Biometrics 2012; 68:1178-87. [PMID: 22551000 DOI: 10.1111/j.1541-0420.2012.01764.x] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Given a large number of t-statistics, we consider the problem of approximating the distribution of noncentrality parameters (NCPs) by a continuous density. This problem is closely related to the control of false discovery rates (FDR) in massive hypothesis testing applications, e.g., microarray gene expression analysis. Our methodology is similar to, but improves upon, the existing approach by Ruppert, Nettleton, and Hwang (2007, Biometrics, 63, 483-495). We provide parametric, nonparametric, and semiparametric estimators for the distribution of NCPs, as well as estimates of the FDR and local FDR. In the parametric situation, we assume that the NCPs follow a distribution that leads to an analytically available marginal distribution for the test statistics. In the nonparametric situation, we use convex combinations of basis density functions to estimate the density of the NCPs. A sequential quadratic programming procedure is developed to maximize the penalized likelihood. The smoothing parameter is selected with the approximate network information criterion. A semiparametric estimator is also developed to combine both parametric and nonparametric fits. Simulations show that, under a variety of situations, our density estimates are closer to the underlying truth and our FDR estimates are improved compared with alternative methods. Data-based simulations and the analyses of two microarray datasets are used to evaluate the performance in realistic situations.
Collapse
Affiliation(s)
- Long Qu
- Biostat Solutions, Inc., MD 21771, USA
| | | | | |
Collapse
|
17
|
Affiliation(s)
- Carey E. Priebe
- a Department of Mathematical Sciences , The Johns Hopkins University , Baltimore , MD , 21218
| |
Collapse
|
18
|
Pan W. Incorporating gene functional annotations in detecting differential gene expression. J R Stat Soc Ser C Appl Stat 2006. [DOI: 10.1111/1467-9876.00066-i1] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
19
|
Abstract
In the analysis of a quantal dose-response experiment with grouped data, the most commonly used parametric procedure is logistic regression, commonly referred to as 'logit analysis'. The adequacy of the fit by the logistic regression curve is tested using the chi-square lack-of-fit test. If the lack-of-fit test is not significant, then the logistic model is assumed to be adequate and estimation of effective doses and confidence intervals on the effective doses can be made. When the tolerance distribution of the dose-response data is not known and cannot be assumed by the user, one can use non-parametric methods, such as kernel regression or local linear regression, to estimate the dose-response curve, effective doses and confidence intervals. This research proposes another alternative based on semi-parametric regression to analysing quantal dose-response data called model-robust quantal regression (MRQR). MRQR linearly combines the parametric and non-parametric predictions with the use of a mixing parameter. MRQR uses logistic regression as the parametric portion of the model and local linear regression as the non-parametric portion of the model. Our research has shown that the MRQR procedure can improve the fit of the dose-response curve by producing narrower confidence intervals for predictions while providing improved precision of estimates of the effective doses with respect to either logistic or local linear regression results.
Collapse
Affiliation(s)
- Q J Nottingham
- Department of Management Science & Information Technology, Virginia Polytechnic Institute and State University, 1007 Pamplin Hall, Blacksburg, VA 24061-0235, USA
| | | |
Collapse
|
20
|
|
21
|
Kouassi DA, Singh J. A Semiparametric Approach to Hazard Estimation with Randomly Censored Observations. J Am Stat Assoc 1997. [DOI: 10.1080/01621459.1997.10473656] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
|
22
|
|
23
|
|