1
|
Christoffersen B, Mahjani B, Clements M, Kjellström H, Humphreys K. Quasi-Monte Carlo Methods for Binary Event Models with Complex Family Data. J Comput Graph Stat 2022. [DOI: 10.1080/10618600.2022.2151454] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022]
Affiliation(s)
- Benjamin Christoffersen
- Division of Robotics, Perception and Learning, KTH Royal Institute of Technology
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet
- Swedish e-Science Research Center (SeRC)
| | - Behrang Mahjani
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet
- Seaver Autism Center for Research and Treatment, Icahn School of Medicine at Mount Sinai
| | - Mark Clements
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet
- Swedish e-Science Research Center (SeRC)
| | - Hedvig Kjellström
- Division of Robotics, Perception and Learning, KTH Royal Institute of Technology
- Swedish e-Science Research Center (SeRC)
| | - Keith Humphreys
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet
- Swedish e-Science Research Center (SeRC)
| |
Collapse
|
2
|
Gower RM, Richtárik P, Bach F. Stochastic quasi-gradient methods: variance reduction via Jacobian sketching. MATHEMATICAL PROGRAMMING 2020; 188:135-192. [PMID: 34720193 PMCID: PMC8550794 DOI: 10.1007/s10107-020-01506-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/08/2018] [Accepted: 04/09/2020] [Indexed: 06/13/2023]
Abstract
We develop a new family of variance reduced stochastic gradient descent methods for minimizing the average of a very large number of smooth functions. Our method-JacSketch-is motivated by novel developments in randomized numerical linear algebra, and operates by maintaining a stochastic estimate of a Jacobian matrix composed of the gradients of individual functions. In each iteration, JacSketch efficiently updates the Jacobian matrix by first obtaining a random linear measurement of the true Jacobian through (cheap) sketching, and then projecting the previous estimate onto the solution space of a linear matrix equation whose solutions are consistent with the measurement. The Jacobian estimate is then used to compute a variance-reduced unbiased estimator of the gradient. Our strategy is analogous to the way quasi-Newton methods maintain an estimate of the Hessian, and hence our method can be seen as a stochastic quasi-gradient method. Our method can also be seen as stochastic gradient descent applied to a controlled stochastic optimization reformulation of the original problem, where the control comes from the Jacobian estimates. We prove that for smooth and strongly convex functions, JacSketch converges linearly with a meaningful rate dictated by a single convergence theorem which applies to general sketches. We also provide a refined convergence theorem which applies to a smaller class of sketches, featuring a novel proof technique based on a stochastic Lyapunov function. This enables us to obtain sharper complexity results for variants of JacSketch with importance sampling. By specializing our general approach to specific sketching strategies, JacSketch reduces to the celebrated stochastic average gradient (SAGA) method, and its several existing and many new minibatch, reduced memory, and importance sampling variants. Our rate for SAGA with importance sampling is the current best-known rate for this method, resolving a conjecture by Schmidt et al. (Proceedings of the eighteenth international conference on artificial intelligence and statistics, AISTATS 2015, San Diego, California, 2015). The rates we obtain for minibatch SAGA are also superior to existing rates and are sufficiently tight as to show a decrease in total complexity as the minibatch size increases. Moreover, we obtain the first minibatch SAGA method with importance sampling.
Collapse
Affiliation(s)
- Robert M. Gower
- LTCI, Telécom Paris, Institut Polytechnique de Paris, Palaiseau, France
| | - Peter Richtárik
- King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia
- University of Edinburgh, Edinburgh, UK
- Moscow Institute of Physics and Technology (MIPT), Dolgoprudny, Russia
| | - Francis Bach
- INRIA - ENS - PSL Research University, Paris, France
| |
Collapse
|
3
|
Zhang A, Brown LD, Cai TT. Semi-supervised inference: General theory and estimation of means. Ann Stat 2019. [DOI: 10.1214/18-aos1756] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
4
|
Briol FX, Oates CJ, Girolami M, Osborne MA, Sejdinovic D. Probabilistic Integration: A Role in Statistical Computation? Stat Sci 2019. [DOI: 10.1214/18-sts660] [Citation(s) in RCA: 29] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
5
|
Randomized Quasi-Monte Carlo: An Introduction for Practitioners. SPRINGER PROCEEDINGS IN MATHEMATICS & STATISTICS 2018. [DOI: 10.1007/978-3-319-91436-7_2] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/02/2022]
|
6
|
Cecere S, Jara A, Lesaffre E. Analyzing the emergence times of permanent teeth: an example of modeling the covariance matrix with interval-censored data. STAT MODEL 2016. [DOI: 10.1177/1471082006071844] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Based on a data set obtained in a large dental longitudinal study, conducted in Flanders (Belgium), the joint emergence distribution of seven teeth was modeled as a function of gender and caries experience on primary teeth. Besides establishing the marginal dependence of emergence on the covariates, there was also interest in examining the impact of the covariates on the association among emergence times. This allows the establishment of the preferred rankings of emergence and their dependence on covariates. To this end, the covariance matrix was modeled as a function of covariates. Modeling the covariance matrix in this way needs to ensure the positive definiteness of the covariance matrix and it is preferable that the regression parameters of the model are interpretable. The modified Cholesky decomposition of the covariance matrix, as suggested by Pourahmadi, splits up the covariance matrix into two parts where the parameters can be interpreted, given a natural ranking of the responses. This approach was used here taking into account that the emergence times are interval-censored. Hence, we opted for a Bayesian implementation of the data augmentation algorithm.
Collapse
Affiliation(s)
- Silvia Cecere
- Biostatistical Centre, Catholic University of Leuven, Leuven, Belgium
| | - Alejandro Jara
- Biostatistical Centre, Catholic University of Leuven, Leuven, Belgium
| | - Emmanuel Lesaffre
- Biostatistical Centre, Catholic University of Leuven, Leuven, Belgium,
| |
Collapse
|
7
|
Affiliation(s)
- Mathieu Gerber
- Université de Lausanne; Switzerland
- Centre de Recherche en Economie et Statistique; Paris France
| | - Nicolas Chopin
- Centre de Recherche en Economie et Statistique; Paris France
- Ecole Nationale de la Statistique et de l'Administration Economique; Paris France
| |
Collapse
|
8
|
Fang HB, Chen X, Pei XY, Grant S, Tan M. Experimental design and statistical analysis for three-drug combination studies. Stat Methods Med Res 2015; 26:1261-1280. [PMID: 25744107 DOI: 10.1177/0962280215574320] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Abstract
Drug combination is a critically important therapeutic approach for complex diseases such as cancer and HIV due to its potential for efficacy at lower, less toxic doses and the need to move new therapies rapidly into clinical trials. One of the key issues is to identify which combinations are additive, synergistic, or antagonistic. While the value of multidrug combinations has been well recognized in the cancer research community, to our best knowledge, all existing experimental studies rely on fixing the dose of one drug to reduce the dimensionality, e.g. looking at pairwise two-drug combinations, a suboptimal design. Hence, there is an urgent need to develop experimental design and analysis methods for studying multidrug combinations directly. Because the complexity of the problem increases exponentially with the number of constituent drugs, there has been little progress in the development of methods for the design and analysis of high-dimensional drug combinations. In fact, contrary to common mathematical reasoning, the case of three-drug combinations is fundamentally more difficult than two-drug combinations. Apparently, finding doses of the combination, number of combinations, and replicates needed to detect departures from additivity depends on dose-response shapes of individual constituent drugs. Thus, different classes of drugs of different dose-response shapes need to be treated as a separate case. Our application and case studies develop dose finding and sample size method for detecting departures from additivity with several common (linear and log-linear) classes of single dose-response curves. Furthermore, utilizing the geometric features of the interaction index, we propose a nonparametric model to estimate the interaction index surface by B-spine approximation and derive its asymptotic properties. Utilizing the method, we designed and analyzed a combination study of three anticancer drugs, PD184, HA14-1, and CEP3891 inhibiting myeloma H929 cell line. To our best knowledge, this is the first ever three drug combinations study performed based on the original 4D dose-response surface formed by dose ranges of three drugs.
Collapse
Affiliation(s)
- Hong-Bin Fang
- 1 Department of Biostatistics, Bioinformatics and Biomathematics, Georgetown University Medical Center, Washington, USA
| | - Xuerong Chen
- 2 School of Statistics, Southwestern University of Finance and Economics, Chengdu, China
| | - Xin-Yan Pei
- 3 Departments of Medicine, Virginia Commonwealth University and Massey Cancer Center, Richmond, USA
| | - Steven Grant
- 3 Departments of Medicine, Virginia Commonwealth University and Massey Cancer Center, Richmond, USA
| | - Ming Tan
- 1 Department of Biostatistics, Bioinformatics and Biomathematics, Georgetown University Medical Center, Washington, USA
| |
Collapse
|
9
|
Tan MT, Fang HB, Tian GL. Dose and Sample Size Determination for Multi-Drug Combination Studies. Stat Biopharm Res 2009. [DOI: 10.1198/sbr.2009.0029] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|
10
|
Fang HB, Tian GL, Li W, Tan M. Design and Sample Size for Evaluating Combinations of Drugs of Linear and Loglinear Dose-Response Curves. J Biopharm Stat 2009; 19:625-40. [DOI: 10.1080/10543400902964019] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
Affiliation(s)
- Hong-Bin Fang
- a Division of Biostatistics , University of Maryland Greenebaum Cancer Center , Baltimore, Maryland, USA
| | - Guo-Liang Tian
- a Division of Biostatistics , University of Maryland Greenebaum Cancer Center , Baltimore, Maryland, USA
| | - Wei Li
- b Department of Pharmaceutical Sciences , College of Pharmacy, The University of Tennessee Health Science Center , Memphis, Tennessee, USA
| | - Ming Tan
- a Division of Biostatistics , University of Maryland Greenebaum Cancer Center , Baltimore, Maryland, USA
| |
Collapse
|
11
|
Abstract
Semiparametric regression is a fusion between parametric regression and nonparametric regression that integrates low-rank penalized splines, mixed model and hierarchical Bayesian methodology - thus allowing more streamlined handling of longitudinal and spatial correlation. We review progress in the field over the five-year period between 2003 and 2007. We find semiparametric regression to be a vibrant field with substantial involvement and activity, continual enhancement and widespread application.
Collapse
Affiliation(s)
- David Ruppert
- School of Operations Research and Information Engineering, Cornell University, 1170 Comstock Hall, Ithaca, NY 14853, U.S.A
| | | | | |
Collapse
|
12
|
|
13
|
Kuo FY, Dunsmuir WTM, Sloan IH, Wand MP, Womersley RS. Quasi-Monte Carlo for Highly Structured Generalised Response Models. Methodol Comput Appl Probab 2007. [DOI: 10.1007/s11009-007-9045-3] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
14
|
|