1
|
Mathis SM, Webber AE, León TM, Murray EL, Sun M, White LA, Brooks LC, Green A, Hu AJ, McDonald DJ, Rosenfeld R, Shemetov D, Tibshirani RJ, Kandula S, Pei S, Shaman J, Yaari R, Yamana TK, Agarwal P, Balusu S, Gururajan G, Kamarthi H, Prakash BA, Raman R, Rodríguez A, Zhao Z, Meiyappan A, Omar S, Baccam P, Gurung HL, Stage SA, Suchoski BT, Ajelli M, Kummer AG, Litvinova M, Ventura PC, Wadsworth S, Niemi J, Carcelen E, Hill AL, Jung SM, Lemaitre JC, Lessler J, Loo SL, McKee CD, Sato K, Smith C, Truelove S, McAndrew T, Ye W, Bosse N, Hlavacek WS, Lin YT, Mallela A, Chen Y, Lamm SM, Lee J, Posner RG, Perofsky AC, Viboud C, Clemente L, Lu F, Meyer AG, Santillana M, Chinazzi M, Davis JT, Mu K, Piontti APY, Vespignani A, Xiong X, Ben-Nun M, Riley P, Turtle J, Hulme-Lowe C, Jessa S, Nagraj VP, Turner SD, Williams D, Basu A, Drake JM, Fox SJ, Gibson GC, Suez E, Thommes EW, Cojocaru MG, Cramer EY, Gerding A, Stark A, Ray EL, Reich NG, Shandross L, Wattanachit N, Wang Y, Zorn MW, Al Aawar M, Srivastava A, Meyers LA, Adiga A, Hurt B, Kaur G, Lewis BL, Marathe M, Venkatramanan S, Butler P, Farabow A, Muralidhar N, Ramakrishnan N, Reed C, Biggerstaff M, Borchering RK. Evaluation of FluSight influenza forecasting in the 2021-22 and 2022-23 seasons with a new target laboratory-confirmed influenza hospitalizations. medRxiv 2023:2023.12.08.23299726. [PMID: 38168429 PMCID: PMC10760285 DOI: 10.1101/2023.12.08.23299726] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/05/2024]
Abstract
Accurate forecasts can enable more effective public health responses during seasonal influenza epidemics. Forecasting teams were asked to provide national and jurisdiction-specific probabilistic predictions of weekly confirmed influenza hospital admissions for one through four weeks ahead for the 2021-22 and 2022-23 influenza seasons. Across both seasons, 26 teams submitted forecasts, with the submitting teams varying between seasons. Forecast skill was evaluated using the Weighted Interval Score (WIS), relative WIS, and coverage. Six out of 23 models outperformed the baseline model across forecast weeks and locations in 2021-22 and 12 out of 18 models in 2022-23. Averaging across all forecast targets, the FluSight ensemble was the 2nd most accurate model measured by WIS in 2021-22 and the 5th most accurate in the 2022-23 season. Forecast skill and 95% coverage for the FluSight ensemble and most component models degraded over longer forecast horizons and during periods of rapid change. Current influenza forecasting efforts help inform situational awareness, but research is needed to address limitations, including decreased performance during periods of changing epidemic dynamics.
Collapse
Affiliation(s)
- Sarabeth M Mathis
- Centers for Disease Control and Prevention, Atlanta, Georgia, 30329, USA
| | - Alexander E Webber
- Centers for Disease Control and Prevention, Atlanta, Georgia, 30329, USA
| | - Tomás M León
- California Department of Public Health, Richmond, CA, 95899
| | - Erin L Murray
- California Department of Public Health, Richmond, CA, 95899
| | - Monica Sun
- California Department of Public Health, Richmond, CA, 95899
| | - Lauren A White
- California Department of Public Health, Richmond, CA, 95899
| | - Logan C Brooks
- Carnegie Mellon University, Pittsburgh, PA, 15213
- University of California, Berkeley, Berkeley, CA 94720
| | - Alden Green
- Carnegie Mellon University, Pittsburgh, PA, 15213
| | - Addison J Hu
- Carnegie Mellon University, Pittsburgh, PA, 15213
| | | | | | | | - Ryan J Tibshirani
- Carnegie Mellon University, Pittsburgh, PA, 15213
- University of California, Berkeley, Berkeley, CA 94720
| | | | - Sen Pei
- Columbia University, New York, NY, 10032
| | - Jeffrey Shaman
- Columbia University, New York, NY, 10032
- Columbia University School of Climate, New York, NY 10025
| | - Rami Yaari
- Columbia University, New York, NY, 10032
| | | | | | | | | | | | | | - Rishi Raman
- Georgia Institute of Technology, Atlanta, GA, 30318
| | | | - Zhiyuan Zhao
- Georgia Institute of Technology, Atlanta, GA, 30318
| | | | - Shalina Omar
- Guidehouse Advisory and Consulting Services, McClean VA, 22102
| | | | | | | | | | - Marco Ajelli
- Indiana University School of Public Health, Bloomington, IN, 47405
| | | | - Maria Litvinova
- Indiana University School of Public Health, Bloomington, IN, 47405
| | - Paulo C Ventura
- Indiana University School of Public Health, Bloomington, IN, 47405
| | | | | | | | | | - Sung-Mok Jung
- University of North Carolina at Chapel Hill, Chapel Hill, NC
| | | | - Justin Lessler
- University of North Carolina at Chapel Hill, Chapel Hill, NC
| | - Sara L Loo
- Johns Hopkins University, Baltimore, MD, 21205
| | | | - Koji Sato
- Johns Hopkins University, Baltimore, MD, 21205
| | | | | | | | | | - Nikos Bosse
- London School of Health and Tropical Medicine, London, UK, WC1E 7HT
| | | | - Yen Ting Lin
- Los Alamos National Laboratory, Los Alamos, NM, 87545
| | | | - Ye Chen
- Northern Arizona University, Flagstaff, AZ, 86011
| | | | - Jaechoul Lee
- Northern Arizona University, Flagstaff, AZ, 86011
| | | | - Amanda C Perofsky
- Fogarty International Center, National Institutes of Health, Bethesda, MD, 20892
| | - Cécile Viboud
- Fogarty International Center, National Institutes of Health, Bethesda, MD, 20892
| | | | - Fred Lu
- Northeastern University, Boston, MA, 02115
| | | | | | | | | | - Kunpeng Mu
- Northeastern University, Boston, MA, 02115
| | | | | | | | | | - Pete Riley
- Predictive Science Inc, San Diego, CA 92121
| | | | | | | | - V P Nagraj
- Signature Science, LLC, Charlottesville, VA, 22911
| | | | | | | | | | | | | | - Ehsan Suez
- University of Georgia, Athens, GA, 30609
| | - Edward W Thommes
- University of Guelph, Guelph, ON N1G 2W1, Canada
- Sanofi, Toronto, ON, M2R 3T4
| | | | | | - Aaron Gerding
- University of Massachusetts Amherst, Amherst, MA, 01003
| | - Ariane Stark
- University of Massachusetts Amherst, Amherst, MA, 01003
| | - Evan L Ray
- University of Massachusetts Amherst, Amherst, MA, 01003
| | | | - Li Shandross
- University of Massachusetts Amherst, Amherst, MA, 01003
| | | | - Yijin Wang
- University of Massachusetts Amherst, Amherst, MA, 01003
| | - Martha W Zorn
- University of Massachusetts Amherst, Amherst, MA, 01003
| | - Majd Al Aawar
- University of Southern California, Los Angeles, CA, 90089
| | | | | | | | | | | | | | | | | | | | | | | | | | - Carrie Reed
- Centers for Disease Control and Prevention, Atlanta, Georgia, 30329, USA
| | | | | |
Collapse
|
2
|
Rumack A, Tibshirani RJ, Rosenfeld R. Recalibrating probabilistic forecasts of epidemics. PLoS Comput Biol 2022; 18:e1010771. [PMID: 36520949 PMCID: PMC9799311 DOI: 10.1371/journal.pcbi.1010771] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2021] [Revised: 12/29/2022] [Accepted: 11/28/2022] [Indexed: 12/23/2022] Open
Abstract
Distributional forecasts are important for a wide variety of applications, including forecasting epidemics. Often, forecasts are miscalibrated, or unreliable in assigning uncertainty to future events. We present a recalibration method that can be applied to a black-box forecaster given retrospective forecasts and observations, as well as an extension to make this method more effective in recalibrating epidemic forecasts. This method is guaranteed to improve calibration and log score performance when trained and measured in-sample. We also prove that the increase in expected log score of a recalibrated forecaster is equal to the entropy of the PIT distribution. We apply this recalibration method to the 27 influenza forecasters in the FluSight Network and show that recalibration reliably improves forecast accuracy and calibration. This method, available on Github, is effective, robust, and easy to use as a post-processing tool to improve epidemic forecasts.
Collapse
Affiliation(s)
- Aaron Rumack
- Machine Learning Department, Carnegie Mellon University, Pittsburgh, Pennsylvania, United States of America
| | - Ryan J. Tibshirani
- Machine Learning Department, Carnegie Mellon University, Pittsburgh, Pennsylvania, United States of America
- Department of Statistics & Data Science, Carnegie Mellon University, Pittsburgh, Pennsylvania, United States of America
| | - Roni Rosenfeld
- Machine Learning Department, Carnegie Mellon University, Pittsburgh, Pennsylvania, United States of America
| |
Collapse
|
3
|
Ray EL, Brooks LC, Bien J, Biggerstaff M, Bosse NI, Bracher J, Cramer EY, Funk S, Gerding A, Johansson MA, Rumack A, Wang Y, Zorn M, Tibshirani RJ, Reich NG. Comparing trained and untrained probabilistic ensemble forecasts of COVID-19 cases and deaths in the United States. Int J Forecast 2022:S0169-2070(22)00096-6. [PMID: 35791416 PMCID: PMC9247236 DOI: 10.1016/j.ijforecast.2022.06.005] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
The U.S. COVID-19 Forecast Hub aggregates forecasts of the short-term burden of COVID-19 in the United States from many contributing teams. We study methods for building an ensemble that combines forecasts from these teams. These experiments have informed the ensemble methods used by the Hub. To be most useful to policy makers, ensemble forecasts must have stable performance in the presence of two key characteristics of the component forecasts: (1) occasional misalignment with the reported data, and (2) instability in the relative performance of component forecasters over time. Our results indicate that in the presence of these challenges, an untrained and robust approach to ensembling using an equally weighted median of all component forecasts is a good choice to support public health decision makers. In settings where some contributing forecasters have a stable record of good performance, trained ensembles that give those forecasters higher weight can also be helpful.
Collapse
Affiliation(s)
- Evan L Ray
- School of Public Health and Health Sciences, University of Massachusetts Amherst, United States of America
| | - Logan C Brooks
- Machine Learning Department, Carnegie Mellon University, United States of America
| | - Jacob Bien
- Department of Data Sciences and Operations, University of Southern California, United States of America
| | - Matthew Biggerstaff
- COVID-19 Response, U.S. Centers for Disease Control and Prevention, United States of America
| | - Nikos I Bosse
- London School of Hygiene & Tropical Medicine, United Kingdom
| | - Johannes Bracher
- Chair of Statistical Methods and Econometrics, Karlsruhe Institute of Technology, Germany
- Computational Statistics Group, Heidelberg Institute for Theoretical Studies, Germany
| | - Estee Y Cramer
- School of Public Health and Health Sciences, University of Massachusetts Amherst, United States of America
| | - Sebastian Funk
- London School of Hygiene & Tropical Medicine, United Kingdom
| | - Aaron Gerding
- School of Public Health and Health Sciences, University of Massachusetts Amherst, United States of America
| | - Michael A Johansson
- COVID-19 Response, U.S. Centers for Disease Control and Prevention, United States of America
| | - Aaron Rumack
- Machine Learning Department, Carnegie Mellon University, United States of America
| | - Yijin Wang
- School of Public Health and Health Sciences, University of Massachusetts Amherst, United States of America
| | - Martha Zorn
- School of Public Health and Health Sciences, University of Massachusetts Amherst, United States of America
| | - Ryan J Tibshirani
- Machine Learning Department, Carnegie Mellon University, United States of America
| | - Nicholas G Reich
- School of Public Health and Health Sciences, University of Massachusetts Amherst, United States of America
| |
Collapse
|
4
|
Reich NG, Lessler J, Funk S, Viboud C, Vespignani A, Tibshirani RJ, Shea K, Schienle M, Runge MC, Rosenfeld R, Ray EL, Niehus R, Johnson HC, Johansson MA, Hochheiser H, Gardner L, Bracher J, Borchering RK, Biggerstaff M. Collaborative Hubs: Making the Most of Predictive Epidemic Modeling. Am J Public Health 2022; 112:839-842. [PMID: 35420897 PMCID: PMC9137029 DOI: 10.2105/ajph.2022.306831] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/04/2022] [Indexed: 12/16/2022]
Affiliation(s)
- Nicholas G Reich
- Nicholas G. Reich and Evan L. Ray are with the Department of Biostatistics and Epidemiology, University of Massachusetts-Amherst. Justin Lessler is with the Department of Epidemiology, UNC Gillings School of Global Public Health, University of North Carolina at Chapel Hill. Sebastian Funk is with the Centre for Mathematical Modelling of Infectious Diseases and Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, UK. Cecile Viboud is with the Fogarty International Center, National Institutes of Health, Bethesda, MD. Alessandro Vespignani is with the Laboratory for the Modeling of Biological and Socio-technical Systems, Northeastern University, Boston, MA. Ryan J. Tibshirani is with the Department of Statistics & Data Science and the Department of Machine Learning, Carnegie Mellon University, Pittsburgh, PA. Katriona Shea and Rebecca K. Borchering are with the Department of Biology and Center for Infectious Disease Dynamics, The Pennsylvania State University, University Park. Melanie Schienle and Johannes Bracher are with the Chair of Statistics and Econometrics, Karlsruhe Institute of Technology, Karlsruhe, Germany. Michael C. Runge is with the Eastern Ecological Science Center, US Geological Survey, Laurel, MD. Roni Rosenfeld is with the Machine Learning Department, Carnegie Mellon University. Rene Niehus and Helen C. Johnson are with the European Centre for Disease Prevention and Control, Solna, Sweden. Michael A. Johansson and Matthew Biggerstaff are with the COVID-19 Response, Centers for Disease Control and Prevention, Atlanta, GA. Harry Hochheiser is with the Department of Biomedical Informatics and Intelligent Systems Program, University of Pittsburgh, Pittsburgh, PA. Lauren Gardner is with the Department of Civil and Systems Engineering, Johns Hopkins University, Baltimore, MD
| | - Justin Lessler
- Nicholas G. Reich and Evan L. Ray are with the Department of Biostatistics and Epidemiology, University of Massachusetts-Amherst. Justin Lessler is with the Department of Epidemiology, UNC Gillings School of Global Public Health, University of North Carolina at Chapel Hill. Sebastian Funk is with the Centre for Mathematical Modelling of Infectious Diseases and Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, UK. Cecile Viboud is with the Fogarty International Center, National Institutes of Health, Bethesda, MD. Alessandro Vespignani is with the Laboratory for the Modeling of Biological and Socio-technical Systems, Northeastern University, Boston, MA. Ryan J. Tibshirani is with the Department of Statistics & Data Science and the Department of Machine Learning, Carnegie Mellon University, Pittsburgh, PA. Katriona Shea and Rebecca K. Borchering are with the Department of Biology and Center for Infectious Disease Dynamics, The Pennsylvania State University, University Park. Melanie Schienle and Johannes Bracher are with the Chair of Statistics and Econometrics, Karlsruhe Institute of Technology, Karlsruhe, Germany. Michael C. Runge is with the Eastern Ecological Science Center, US Geological Survey, Laurel, MD. Roni Rosenfeld is with the Machine Learning Department, Carnegie Mellon University. Rene Niehus and Helen C. Johnson are with the European Centre for Disease Prevention and Control, Solna, Sweden. Michael A. Johansson and Matthew Biggerstaff are with the COVID-19 Response, Centers for Disease Control and Prevention, Atlanta, GA. Harry Hochheiser is with the Department of Biomedical Informatics and Intelligent Systems Program, University of Pittsburgh, Pittsburgh, PA. Lauren Gardner is with the Department of Civil and Systems Engineering, Johns Hopkins University, Baltimore, MD
| | - Sebastian Funk
- Nicholas G. Reich and Evan L. Ray are with the Department of Biostatistics and Epidemiology, University of Massachusetts-Amherst. Justin Lessler is with the Department of Epidemiology, UNC Gillings School of Global Public Health, University of North Carolina at Chapel Hill. Sebastian Funk is with the Centre for Mathematical Modelling of Infectious Diseases and Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, UK. Cecile Viboud is with the Fogarty International Center, National Institutes of Health, Bethesda, MD. Alessandro Vespignani is with the Laboratory for the Modeling of Biological and Socio-technical Systems, Northeastern University, Boston, MA. Ryan J. Tibshirani is with the Department of Statistics & Data Science and the Department of Machine Learning, Carnegie Mellon University, Pittsburgh, PA. Katriona Shea and Rebecca K. Borchering are with the Department of Biology and Center for Infectious Disease Dynamics, The Pennsylvania State University, University Park. Melanie Schienle and Johannes Bracher are with the Chair of Statistics and Econometrics, Karlsruhe Institute of Technology, Karlsruhe, Germany. Michael C. Runge is with the Eastern Ecological Science Center, US Geological Survey, Laurel, MD. Roni Rosenfeld is with the Machine Learning Department, Carnegie Mellon University. Rene Niehus and Helen C. Johnson are with the European Centre for Disease Prevention and Control, Solna, Sweden. Michael A. Johansson and Matthew Biggerstaff are with the COVID-19 Response, Centers for Disease Control and Prevention, Atlanta, GA. Harry Hochheiser is with the Department of Biomedical Informatics and Intelligent Systems Program, University of Pittsburgh, Pittsburgh, PA. Lauren Gardner is with the Department of Civil and Systems Engineering, Johns Hopkins University, Baltimore, MD
| | - Cecile Viboud
- Nicholas G. Reich and Evan L. Ray are with the Department of Biostatistics and Epidemiology, University of Massachusetts-Amherst. Justin Lessler is with the Department of Epidemiology, UNC Gillings School of Global Public Health, University of North Carolina at Chapel Hill. Sebastian Funk is with the Centre for Mathematical Modelling of Infectious Diseases and Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, UK. Cecile Viboud is with the Fogarty International Center, National Institutes of Health, Bethesda, MD. Alessandro Vespignani is with the Laboratory for the Modeling of Biological and Socio-technical Systems, Northeastern University, Boston, MA. Ryan J. Tibshirani is with the Department of Statistics & Data Science and the Department of Machine Learning, Carnegie Mellon University, Pittsburgh, PA. Katriona Shea and Rebecca K. Borchering are with the Department of Biology and Center for Infectious Disease Dynamics, The Pennsylvania State University, University Park. Melanie Schienle and Johannes Bracher are with the Chair of Statistics and Econometrics, Karlsruhe Institute of Technology, Karlsruhe, Germany. Michael C. Runge is with the Eastern Ecological Science Center, US Geological Survey, Laurel, MD. Roni Rosenfeld is with the Machine Learning Department, Carnegie Mellon University. Rene Niehus and Helen C. Johnson are with the European Centre for Disease Prevention and Control, Solna, Sweden. Michael A. Johansson and Matthew Biggerstaff are with the COVID-19 Response, Centers for Disease Control and Prevention, Atlanta, GA. Harry Hochheiser is with the Department of Biomedical Informatics and Intelligent Systems Program, University of Pittsburgh, Pittsburgh, PA. Lauren Gardner is with the Department of Civil and Systems Engineering, Johns Hopkins University, Baltimore, MD
| | - Alessandro Vespignani
- Nicholas G. Reich and Evan L. Ray are with the Department of Biostatistics and Epidemiology, University of Massachusetts-Amherst. Justin Lessler is with the Department of Epidemiology, UNC Gillings School of Global Public Health, University of North Carolina at Chapel Hill. Sebastian Funk is with the Centre for Mathematical Modelling of Infectious Diseases and Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, UK. Cecile Viboud is with the Fogarty International Center, National Institutes of Health, Bethesda, MD. Alessandro Vespignani is with the Laboratory for the Modeling of Biological and Socio-technical Systems, Northeastern University, Boston, MA. Ryan J. Tibshirani is with the Department of Statistics & Data Science and the Department of Machine Learning, Carnegie Mellon University, Pittsburgh, PA. Katriona Shea and Rebecca K. Borchering are with the Department of Biology and Center for Infectious Disease Dynamics, The Pennsylvania State University, University Park. Melanie Schienle and Johannes Bracher are with the Chair of Statistics and Econometrics, Karlsruhe Institute of Technology, Karlsruhe, Germany. Michael C. Runge is with the Eastern Ecological Science Center, US Geological Survey, Laurel, MD. Roni Rosenfeld is with the Machine Learning Department, Carnegie Mellon University. Rene Niehus and Helen C. Johnson are with the European Centre for Disease Prevention and Control, Solna, Sweden. Michael A. Johansson and Matthew Biggerstaff are with the COVID-19 Response, Centers for Disease Control and Prevention, Atlanta, GA. Harry Hochheiser is with the Department of Biomedical Informatics and Intelligent Systems Program, University of Pittsburgh, Pittsburgh, PA. Lauren Gardner is with the Department of Civil and Systems Engineering, Johns Hopkins University, Baltimore, MD
| | - Ryan J Tibshirani
- Nicholas G. Reich and Evan L. Ray are with the Department of Biostatistics and Epidemiology, University of Massachusetts-Amherst. Justin Lessler is with the Department of Epidemiology, UNC Gillings School of Global Public Health, University of North Carolina at Chapel Hill. Sebastian Funk is with the Centre for Mathematical Modelling of Infectious Diseases and Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, UK. Cecile Viboud is with the Fogarty International Center, National Institutes of Health, Bethesda, MD. Alessandro Vespignani is with the Laboratory for the Modeling of Biological and Socio-technical Systems, Northeastern University, Boston, MA. Ryan J. Tibshirani is with the Department of Statistics & Data Science and the Department of Machine Learning, Carnegie Mellon University, Pittsburgh, PA. Katriona Shea and Rebecca K. Borchering are with the Department of Biology and Center for Infectious Disease Dynamics, The Pennsylvania State University, University Park. Melanie Schienle and Johannes Bracher are with the Chair of Statistics and Econometrics, Karlsruhe Institute of Technology, Karlsruhe, Germany. Michael C. Runge is with the Eastern Ecological Science Center, US Geological Survey, Laurel, MD. Roni Rosenfeld is with the Machine Learning Department, Carnegie Mellon University. Rene Niehus and Helen C. Johnson are with the European Centre for Disease Prevention and Control, Solna, Sweden. Michael A. Johansson and Matthew Biggerstaff are with the COVID-19 Response, Centers for Disease Control and Prevention, Atlanta, GA. Harry Hochheiser is with the Department of Biomedical Informatics and Intelligent Systems Program, University of Pittsburgh, Pittsburgh, PA. Lauren Gardner is with the Department of Civil and Systems Engineering, Johns Hopkins University, Baltimore, MD
| | - Katriona Shea
- Nicholas G. Reich and Evan L. Ray are with the Department of Biostatistics and Epidemiology, University of Massachusetts-Amherst. Justin Lessler is with the Department of Epidemiology, UNC Gillings School of Global Public Health, University of North Carolina at Chapel Hill. Sebastian Funk is with the Centre for Mathematical Modelling of Infectious Diseases and Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, UK. Cecile Viboud is with the Fogarty International Center, National Institutes of Health, Bethesda, MD. Alessandro Vespignani is with the Laboratory for the Modeling of Biological and Socio-technical Systems, Northeastern University, Boston, MA. Ryan J. Tibshirani is with the Department of Statistics & Data Science and the Department of Machine Learning, Carnegie Mellon University, Pittsburgh, PA. Katriona Shea and Rebecca K. Borchering are with the Department of Biology and Center for Infectious Disease Dynamics, The Pennsylvania State University, University Park. Melanie Schienle and Johannes Bracher are with the Chair of Statistics and Econometrics, Karlsruhe Institute of Technology, Karlsruhe, Germany. Michael C. Runge is with the Eastern Ecological Science Center, US Geological Survey, Laurel, MD. Roni Rosenfeld is with the Machine Learning Department, Carnegie Mellon University. Rene Niehus and Helen C. Johnson are with the European Centre for Disease Prevention and Control, Solna, Sweden. Michael A. Johansson and Matthew Biggerstaff are with the COVID-19 Response, Centers for Disease Control and Prevention, Atlanta, GA. Harry Hochheiser is with the Department of Biomedical Informatics and Intelligent Systems Program, University of Pittsburgh, Pittsburgh, PA. Lauren Gardner is with the Department of Civil and Systems Engineering, Johns Hopkins University, Baltimore, MD
| | - Melanie Schienle
- Nicholas G. Reich and Evan L. Ray are with the Department of Biostatistics and Epidemiology, University of Massachusetts-Amherst. Justin Lessler is with the Department of Epidemiology, UNC Gillings School of Global Public Health, University of North Carolina at Chapel Hill. Sebastian Funk is with the Centre for Mathematical Modelling of Infectious Diseases and Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, UK. Cecile Viboud is with the Fogarty International Center, National Institutes of Health, Bethesda, MD. Alessandro Vespignani is with the Laboratory for the Modeling of Biological and Socio-technical Systems, Northeastern University, Boston, MA. Ryan J. Tibshirani is with the Department of Statistics & Data Science and the Department of Machine Learning, Carnegie Mellon University, Pittsburgh, PA. Katriona Shea and Rebecca K. Borchering are with the Department of Biology and Center for Infectious Disease Dynamics, The Pennsylvania State University, University Park. Melanie Schienle and Johannes Bracher are with the Chair of Statistics and Econometrics, Karlsruhe Institute of Technology, Karlsruhe, Germany. Michael C. Runge is with the Eastern Ecological Science Center, US Geological Survey, Laurel, MD. Roni Rosenfeld is with the Machine Learning Department, Carnegie Mellon University. Rene Niehus and Helen C. Johnson are with the European Centre for Disease Prevention and Control, Solna, Sweden. Michael A. Johansson and Matthew Biggerstaff are with the COVID-19 Response, Centers for Disease Control and Prevention, Atlanta, GA. Harry Hochheiser is with the Department of Biomedical Informatics and Intelligent Systems Program, University of Pittsburgh, Pittsburgh, PA. Lauren Gardner is with the Department of Civil and Systems Engineering, Johns Hopkins University, Baltimore, MD
| | - Michael C Runge
- Nicholas G. Reich and Evan L. Ray are with the Department of Biostatistics and Epidemiology, University of Massachusetts-Amherst. Justin Lessler is with the Department of Epidemiology, UNC Gillings School of Global Public Health, University of North Carolina at Chapel Hill. Sebastian Funk is with the Centre for Mathematical Modelling of Infectious Diseases and Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, UK. Cecile Viboud is with the Fogarty International Center, National Institutes of Health, Bethesda, MD. Alessandro Vespignani is with the Laboratory for the Modeling of Biological and Socio-technical Systems, Northeastern University, Boston, MA. Ryan J. Tibshirani is with the Department of Statistics & Data Science and the Department of Machine Learning, Carnegie Mellon University, Pittsburgh, PA. Katriona Shea and Rebecca K. Borchering are with the Department of Biology and Center for Infectious Disease Dynamics, The Pennsylvania State University, University Park. Melanie Schienle and Johannes Bracher are with the Chair of Statistics and Econometrics, Karlsruhe Institute of Technology, Karlsruhe, Germany. Michael C. Runge is with the Eastern Ecological Science Center, US Geological Survey, Laurel, MD. Roni Rosenfeld is with the Machine Learning Department, Carnegie Mellon University. Rene Niehus and Helen C. Johnson are with the European Centre for Disease Prevention and Control, Solna, Sweden. Michael A. Johansson and Matthew Biggerstaff are with the COVID-19 Response, Centers for Disease Control and Prevention, Atlanta, GA. Harry Hochheiser is with the Department of Biomedical Informatics and Intelligent Systems Program, University of Pittsburgh, Pittsburgh, PA. Lauren Gardner is with the Department of Civil and Systems Engineering, Johns Hopkins University, Baltimore, MD
| | - Roni Rosenfeld
- Nicholas G. Reich and Evan L. Ray are with the Department of Biostatistics and Epidemiology, University of Massachusetts-Amherst. Justin Lessler is with the Department of Epidemiology, UNC Gillings School of Global Public Health, University of North Carolina at Chapel Hill. Sebastian Funk is with the Centre for Mathematical Modelling of Infectious Diseases and Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, UK. Cecile Viboud is with the Fogarty International Center, National Institutes of Health, Bethesda, MD. Alessandro Vespignani is with the Laboratory for the Modeling of Biological and Socio-technical Systems, Northeastern University, Boston, MA. Ryan J. Tibshirani is with the Department of Statistics & Data Science and the Department of Machine Learning, Carnegie Mellon University, Pittsburgh, PA. Katriona Shea and Rebecca K. Borchering are with the Department of Biology and Center for Infectious Disease Dynamics, The Pennsylvania State University, University Park. Melanie Schienle and Johannes Bracher are with the Chair of Statistics and Econometrics, Karlsruhe Institute of Technology, Karlsruhe, Germany. Michael C. Runge is with the Eastern Ecological Science Center, US Geological Survey, Laurel, MD. Roni Rosenfeld is with the Machine Learning Department, Carnegie Mellon University. Rene Niehus and Helen C. Johnson are with the European Centre for Disease Prevention and Control, Solna, Sweden. Michael A. Johansson and Matthew Biggerstaff are with the COVID-19 Response, Centers for Disease Control and Prevention, Atlanta, GA. Harry Hochheiser is with the Department of Biomedical Informatics and Intelligent Systems Program, University of Pittsburgh, Pittsburgh, PA. Lauren Gardner is with the Department of Civil and Systems Engineering, Johns Hopkins University, Baltimore, MD
| | - Evan L Ray
- Nicholas G. Reich and Evan L. Ray are with the Department of Biostatistics and Epidemiology, University of Massachusetts-Amherst. Justin Lessler is with the Department of Epidemiology, UNC Gillings School of Global Public Health, University of North Carolina at Chapel Hill. Sebastian Funk is with the Centre for Mathematical Modelling of Infectious Diseases and Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, UK. Cecile Viboud is with the Fogarty International Center, National Institutes of Health, Bethesda, MD. Alessandro Vespignani is with the Laboratory for the Modeling of Biological and Socio-technical Systems, Northeastern University, Boston, MA. Ryan J. Tibshirani is with the Department of Statistics & Data Science and the Department of Machine Learning, Carnegie Mellon University, Pittsburgh, PA. Katriona Shea and Rebecca K. Borchering are with the Department of Biology and Center for Infectious Disease Dynamics, The Pennsylvania State University, University Park. Melanie Schienle and Johannes Bracher are with the Chair of Statistics and Econometrics, Karlsruhe Institute of Technology, Karlsruhe, Germany. Michael C. Runge is with the Eastern Ecological Science Center, US Geological Survey, Laurel, MD. Roni Rosenfeld is with the Machine Learning Department, Carnegie Mellon University. Rene Niehus and Helen C. Johnson are with the European Centre for Disease Prevention and Control, Solna, Sweden. Michael A. Johansson and Matthew Biggerstaff are with the COVID-19 Response, Centers for Disease Control and Prevention, Atlanta, GA. Harry Hochheiser is with the Department of Biomedical Informatics and Intelligent Systems Program, University of Pittsburgh, Pittsburgh, PA. Lauren Gardner is with the Department of Civil and Systems Engineering, Johns Hopkins University, Baltimore, MD
| | - Rene Niehus
- Nicholas G. Reich and Evan L. Ray are with the Department of Biostatistics and Epidemiology, University of Massachusetts-Amherst. Justin Lessler is with the Department of Epidemiology, UNC Gillings School of Global Public Health, University of North Carolina at Chapel Hill. Sebastian Funk is with the Centre for Mathematical Modelling of Infectious Diseases and Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, UK. Cecile Viboud is with the Fogarty International Center, National Institutes of Health, Bethesda, MD. Alessandro Vespignani is with the Laboratory for the Modeling of Biological and Socio-technical Systems, Northeastern University, Boston, MA. Ryan J. Tibshirani is with the Department of Statistics & Data Science and the Department of Machine Learning, Carnegie Mellon University, Pittsburgh, PA. Katriona Shea and Rebecca K. Borchering are with the Department of Biology and Center for Infectious Disease Dynamics, The Pennsylvania State University, University Park. Melanie Schienle and Johannes Bracher are with the Chair of Statistics and Econometrics, Karlsruhe Institute of Technology, Karlsruhe, Germany. Michael C. Runge is with the Eastern Ecological Science Center, US Geological Survey, Laurel, MD. Roni Rosenfeld is with the Machine Learning Department, Carnegie Mellon University. Rene Niehus and Helen C. Johnson are with the European Centre for Disease Prevention and Control, Solna, Sweden. Michael A. Johansson and Matthew Biggerstaff are with the COVID-19 Response, Centers for Disease Control and Prevention, Atlanta, GA. Harry Hochheiser is with the Department of Biomedical Informatics and Intelligent Systems Program, University of Pittsburgh, Pittsburgh, PA. Lauren Gardner is with the Department of Civil and Systems Engineering, Johns Hopkins University, Baltimore, MD
| | - Helen C Johnson
- Nicholas G. Reich and Evan L. Ray are with the Department of Biostatistics and Epidemiology, University of Massachusetts-Amherst. Justin Lessler is with the Department of Epidemiology, UNC Gillings School of Global Public Health, University of North Carolina at Chapel Hill. Sebastian Funk is with the Centre for Mathematical Modelling of Infectious Diseases and Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, UK. Cecile Viboud is with the Fogarty International Center, National Institutes of Health, Bethesda, MD. Alessandro Vespignani is with the Laboratory for the Modeling of Biological and Socio-technical Systems, Northeastern University, Boston, MA. Ryan J. Tibshirani is with the Department of Statistics & Data Science and the Department of Machine Learning, Carnegie Mellon University, Pittsburgh, PA. Katriona Shea and Rebecca K. Borchering are with the Department of Biology and Center for Infectious Disease Dynamics, The Pennsylvania State University, University Park. Melanie Schienle and Johannes Bracher are with the Chair of Statistics and Econometrics, Karlsruhe Institute of Technology, Karlsruhe, Germany. Michael C. Runge is with the Eastern Ecological Science Center, US Geological Survey, Laurel, MD. Roni Rosenfeld is with the Machine Learning Department, Carnegie Mellon University. Rene Niehus and Helen C. Johnson are with the European Centre for Disease Prevention and Control, Solna, Sweden. Michael A. Johansson and Matthew Biggerstaff are with the COVID-19 Response, Centers for Disease Control and Prevention, Atlanta, GA. Harry Hochheiser is with the Department of Biomedical Informatics and Intelligent Systems Program, University of Pittsburgh, Pittsburgh, PA. Lauren Gardner is with the Department of Civil and Systems Engineering, Johns Hopkins University, Baltimore, MD
| | - Michael A Johansson
- Nicholas G. Reich and Evan L. Ray are with the Department of Biostatistics and Epidemiology, University of Massachusetts-Amherst. Justin Lessler is with the Department of Epidemiology, UNC Gillings School of Global Public Health, University of North Carolina at Chapel Hill. Sebastian Funk is with the Centre for Mathematical Modelling of Infectious Diseases and Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, UK. Cecile Viboud is with the Fogarty International Center, National Institutes of Health, Bethesda, MD. Alessandro Vespignani is with the Laboratory for the Modeling of Biological and Socio-technical Systems, Northeastern University, Boston, MA. Ryan J. Tibshirani is with the Department of Statistics & Data Science and the Department of Machine Learning, Carnegie Mellon University, Pittsburgh, PA. Katriona Shea and Rebecca K. Borchering are with the Department of Biology and Center for Infectious Disease Dynamics, The Pennsylvania State University, University Park. Melanie Schienle and Johannes Bracher are with the Chair of Statistics and Econometrics, Karlsruhe Institute of Technology, Karlsruhe, Germany. Michael C. Runge is with the Eastern Ecological Science Center, US Geological Survey, Laurel, MD. Roni Rosenfeld is with the Machine Learning Department, Carnegie Mellon University. Rene Niehus and Helen C. Johnson are with the European Centre for Disease Prevention and Control, Solna, Sweden. Michael A. Johansson and Matthew Biggerstaff are with the COVID-19 Response, Centers for Disease Control and Prevention, Atlanta, GA. Harry Hochheiser is with the Department of Biomedical Informatics and Intelligent Systems Program, University of Pittsburgh, Pittsburgh, PA. Lauren Gardner is with the Department of Civil and Systems Engineering, Johns Hopkins University, Baltimore, MD
| | - Harry Hochheiser
- Nicholas G. Reich and Evan L. Ray are with the Department of Biostatistics and Epidemiology, University of Massachusetts-Amherst. Justin Lessler is with the Department of Epidemiology, UNC Gillings School of Global Public Health, University of North Carolina at Chapel Hill. Sebastian Funk is with the Centre for Mathematical Modelling of Infectious Diseases and Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, UK. Cecile Viboud is with the Fogarty International Center, National Institutes of Health, Bethesda, MD. Alessandro Vespignani is with the Laboratory for the Modeling of Biological and Socio-technical Systems, Northeastern University, Boston, MA. Ryan J. Tibshirani is with the Department of Statistics & Data Science and the Department of Machine Learning, Carnegie Mellon University, Pittsburgh, PA. Katriona Shea and Rebecca K. Borchering are with the Department of Biology and Center for Infectious Disease Dynamics, The Pennsylvania State University, University Park. Melanie Schienle and Johannes Bracher are with the Chair of Statistics and Econometrics, Karlsruhe Institute of Technology, Karlsruhe, Germany. Michael C. Runge is with the Eastern Ecological Science Center, US Geological Survey, Laurel, MD. Roni Rosenfeld is with the Machine Learning Department, Carnegie Mellon University. Rene Niehus and Helen C. Johnson are with the European Centre for Disease Prevention and Control, Solna, Sweden. Michael A. Johansson and Matthew Biggerstaff are with the COVID-19 Response, Centers for Disease Control and Prevention, Atlanta, GA. Harry Hochheiser is with the Department of Biomedical Informatics and Intelligent Systems Program, University of Pittsburgh, Pittsburgh, PA. Lauren Gardner is with the Department of Civil and Systems Engineering, Johns Hopkins University, Baltimore, MD
| | - Lauren Gardner
- Nicholas G. Reich and Evan L. Ray are with the Department of Biostatistics and Epidemiology, University of Massachusetts-Amherst. Justin Lessler is with the Department of Epidemiology, UNC Gillings School of Global Public Health, University of North Carolina at Chapel Hill. Sebastian Funk is with the Centre for Mathematical Modelling of Infectious Diseases and Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, UK. Cecile Viboud is with the Fogarty International Center, National Institutes of Health, Bethesda, MD. Alessandro Vespignani is with the Laboratory for the Modeling of Biological and Socio-technical Systems, Northeastern University, Boston, MA. Ryan J. Tibshirani is with the Department of Statistics & Data Science and the Department of Machine Learning, Carnegie Mellon University, Pittsburgh, PA. Katriona Shea and Rebecca K. Borchering are with the Department of Biology and Center for Infectious Disease Dynamics, The Pennsylvania State University, University Park. Melanie Schienle and Johannes Bracher are with the Chair of Statistics and Econometrics, Karlsruhe Institute of Technology, Karlsruhe, Germany. Michael C. Runge is with the Eastern Ecological Science Center, US Geological Survey, Laurel, MD. Roni Rosenfeld is with the Machine Learning Department, Carnegie Mellon University. Rene Niehus and Helen C. Johnson are with the European Centre for Disease Prevention and Control, Solna, Sweden. Michael A. Johansson and Matthew Biggerstaff are with the COVID-19 Response, Centers for Disease Control and Prevention, Atlanta, GA. Harry Hochheiser is with the Department of Biomedical Informatics and Intelligent Systems Program, University of Pittsburgh, Pittsburgh, PA. Lauren Gardner is with the Department of Civil and Systems Engineering, Johns Hopkins University, Baltimore, MD
| | - Johannes Bracher
- Nicholas G. Reich and Evan L. Ray are with the Department of Biostatistics and Epidemiology, University of Massachusetts-Amherst. Justin Lessler is with the Department of Epidemiology, UNC Gillings School of Global Public Health, University of North Carolina at Chapel Hill. Sebastian Funk is with the Centre for Mathematical Modelling of Infectious Diseases and Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, UK. Cecile Viboud is with the Fogarty International Center, National Institutes of Health, Bethesda, MD. Alessandro Vespignani is with the Laboratory for the Modeling of Biological and Socio-technical Systems, Northeastern University, Boston, MA. Ryan J. Tibshirani is with the Department of Statistics & Data Science and the Department of Machine Learning, Carnegie Mellon University, Pittsburgh, PA. Katriona Shea and Rebecca K. Borchering are with the Department of Biology and Center for Infectious Disease Dynamics, The Pennsylvania State University, University Park. Melanie Schienle and Johannes Bracher are with the Chair of Statistics and Econometrics, Karlsruhe Institute of Technology, Karlsruhe, Germany. Michael C. Runge is with the Eastern Ecological Science Center, US Geological Survey, Laurel, MD. Roni Rosenfeld is with the Machine Learning Department, Carnegie Mellon University. Rene Niehus and Helen C. Johnson are with the European Centre for Disease Prevention and Control, Solna, Sweden. Michael A. Johansson and Matthew Biggerstaff are with the COVID-19 Response, Centers for Disease Control and Prevention, Atlanta, GA. Harry Hochheiser is with the Department of Biomedical Informatics and Intelligent Systems Program, University of Pittsburgh, Pittsburgh, PA. Lauren Gardner is with the Department of Civil and Systems Engineering, Johns Hopkins University, Baltimore, MD
| | - Rebecca K Borchering
- Nicholas G. Reich and Evan L. Ray are with the Department of Biostatistics and Epidemiology, University of Massachusetts-Amherst. Justin Lessler is with the Department of Epidemiology, UNC Gillings School of Global Public Health, University of North Carolina at Chapel Hill. Sebastian Funk is with the Centre for Mathematical Modelling of Infectious Diseases and Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, UK. Cecile Viboud is with the Fogarty International Center, National Institutes of Health, Bethesda, MD. Alessandro Vespignani is with the Laboratory for the Modeling of Biological and Socio-technical Systems, Northeastern University, Boston, MA. Ryan J. Tibshirani is with the Department of Statistics & Data Science and the Department of Machine Learning, Carnegie Mellon University, Pittsburgh, PA. Katriona Shea and Rebecca K. Borchering are with the Department of Biology and Center for Infectious Disease Dynamics, The Pennsylvania State University, University Park. Melanie Schienle and Johannes Bracher are with the Chair of Statistics and Econometrics, Karlsruhe Institute of Technology, Karlsruhe, Germany. Michael C. Runge is with the Eastern Ecological Science Center, US Geological Survey, Laurel, MD. Roni Rosenfeld is with the Machine Learning Department, Carnegie Mellon University. Rene Niehus and Helen C. Johnson are with the European Centre for Disease Prevention and Control, Solna, Sweden. Michael A. Johansson and Matthew Biggerstaff are with the COVID-19 Response, Centers for Disease Control and Prevention, Atlanta, GA. Harry Hochheiser is with the Department of Biomedical Informatics and Intelligent Systems Program, University of Pittsburgh, Pittsburgh, PA. Lauren Gardner is with the Department of Civil and Systems Engineering, Johns Hopkins University, Baltimore, MD
| | - Matthew Biggerstaff
- Nicholas G. Reich and Evan L. Ray are with the Department of Biostatistics and Epidemiology, University of Massachusetts-Amherst. Justin Lessler is with the Department of Epidemiology, UNC Gillings School of Global Public Health, University of North Carolina at Chapel Hill. Sebastian Funk is with the Centre for Mathematical Modelling of Infectious Diseases and Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, UK. Cecile Viboud is with the Fogarty International Center, National Institutes of Health, Bethesda, MD. Alessandro Vespignani is with the Laboratory for the Modeling of Biological and Socio-technical Systems, Northeastern University, Boston, MA. Ryan J. Tibshirani is with the Department of Statistics & Data Science and the Department of Machine Learning, Carnegie Mellon University, Pittsburgh, PA. Katriona Shea and Rebecca K. Borchering are with the Department of Biology and Center for Infectious Disease Dynamics, The Pennsylvania State University, University Park. Melanie Schienle and Johannes Bracher are with the Chair of Statistics and Econometrics, Karlsruhe Institute of Technology, Karlsruhe, Germany. Michael C. Runge is with the Eastern Ecological Science Center, US Geological Survey, Laurel, MD. Roni Rosenfeld is with the Machine Learning Department, Carnegie Mellon University. Rene Niehus and Helen C. Johnson are with the European Centre for Disease Prevention and Control, Solna, Sweden. Michael A. Johansson and Matthew Biggerstaff are with the COVID-19 Response, Centers for Disease Control and Prevention, Atlanta, GA. Harry Hochheiser is with the Department of Biomedical Informatics and Intelligent Systems Program, University of Pittsburgh, Pittsburgh, PA. Lauren Gardner is with the Department of Civil and Systems Engineering, Johns Hopkins University, Baltimore, MD
| |
Collapse
|
5
|
Affiliation(s)
- Maria Jahja
- Maria Jahja is Ph.D. Candidate, Department of Statistics & Data Science, Machine Learning Department, Carnegie Mellon University, Pittsburgh, Pennsylvania, USA
| | - Andrew Chin
- Andrew Chin is Statistical Developer, Machine Learning Department, Carnegie Mellon University, Pittsburgh, Pennsylvania, USA
| | - Ryan J. Tibshirani
- Ryan J. Tibshirani is Professor, Department of Statistics & Data Science, Machine Learning Department, Carnegie Mellon University, Pittsburgh, Pennsylvania, USA
| |
Collapse
|
6
|
Cramer EY, Ray EL, Lopez VK, Bracher J, Brennen A, Castro Rivadeneira AJ, Gerding A, Gneiting T, House KH, Huang Y, Jayawardena D, Kanji AH, Khandelwal A, Le K, Mühlemann A, Niemi J, Shah A, Stark A, Wang Y, Wattanachit N, Zorn MW, Gu Y, Jain S, Bannur N, Deva A, Kulkarni M, Merugu S, Raval A, Shingi S, Tiwari A, White J, Abernethy NF, Woody S, Dahan M, Fox S, Gaither K, Lachmann M, Meyers LA, Scott JG, Tec M, Srivastava A, George GE, Cegan JC, Dettwiller ID, England WP, Farthing MW, Hunter RH, Lafferty B, Linkov I, Mayo ML, Parno MD, Rowland MA, Trump BD, Zhang-James Y, Chen S, Faraone SV, Hess J, Morley CP, Salekin A, Wang D, Corsetti SM, Baer TM, Eisenberg MC, Falb K, Huang Y, Martin ET, McCauley E, Myers RL, Schwarz T, Sheldon D, Gibson GC, Yu R, Gao L, Ma Y, Wu D, Yan X, Jin X, Wang YX, Chen Y, Guo L, Zhao Y, Gu Q, Chen J, Wang L, Xu P, Zhang W, Zou D, Biegel H, Lega J, McConnell S, Nagraj VP, Guertin SL, Hulme-Lowe C, Turner SD, Shi Y, Ban X, Walraven R, Hong QJ, Kong S, van de Walle A, Turtle JA, Ben-Nun M, Riley S, Riley P, Koyluoglu U, DesRoches D, Forli P, Hamory B, Kyriakides C, Leis H, Milliken J, Moloney M, Morgan J, Nirgudkar N, Ozcan G, Piwonka N, Ravi M, Schrader C, Shakhnovich E, Siegel D, Spatz R, Stiefeling C, Wilkinson B, Wong A, Cavany S, España G, Moore S, Oidtman R, Perkins A, Kraus D, Kraus A, Gao Z, Bian J, Cao W, Ferres JL, Li C, Liu TY, Xie X, Zhang S, Zheng S, Vespignani A, Chinazzi M, Davis JT, Mu K, Pastore y Piontti A, Xiong X, Zheng A, Baek J, Farias V, Georgescu A, Levi R, Sinha D, Wilde J, Perakis G, Bennouna MA, Nze-Ndong D, Singhvi D, Spantidakis I, Thayaparan L, Tsiourvas A, Sarker A, Jadbabaie A, Shah D, Della Penna N, Celi LA, Sundar S, Wolfinger R, Osthus D, Castro L, Fairchild G, Michaud I, Karlen D, Kinsey M, Mullany LC, Rainwater-Lovett K, Shin L, Tallaksen K, Wilson S, Lee EC, Dent J, Grantz KH, Hill AL, Kaminsky J, Kaminsky K, Keegan LT, Lauer SA, Lemaitre JC, Lessler J, Meredith HR, Perez-Saez J, Shah S, Smith CP, Truelove SA, Wills J, Marshall M, Gardner L, Nixon K, Burant JC, Wang L, Gao L, Gu Z, Kim M, Li X, Wang G, Wang Y, Yu S, Reiner RC, Barber R, Gakidou E, Hay SI, Lim S, Murray C, Pigott D, Gurung HL, Baccam P, Stage SA, Suchoski BT, Prakash BA, Adhikari B, Cui J, Rodríguez A, Tabassum A, Xie J, Keskinocak P, Asplund J, Baxter A, Oruc BE, Serban N, Arik SO, Dusenberry M, Epshteyn A, Kanal E, Le LT, Li CL, Pfister T, Sava D, Sinha R, Tsai T, Yoder N, Yoon J, Zhang L, Abbott S, Bosse NI, Funk S, Hellewell J, Meakin SR, Sherratt K, Zhou M, Kalantari R, Yamana TK, Pei S, Shaman J, Li ML, Bertsimas D, Lami OS, Soni S, Bouardi HT, Ayer T, Adee M, Chhatwal J, Dalgic OO, Ladd MA, Linas BP, Mueller P, Xiao J, Wang Y, Wang Q, Xie S, Zeng D, Green A, Bien J, Brooks L, Hu AJ, Jahja M, McDonald D, Narasimhan B, Politsch C, Rajanala S, Rumack A, Simon N, Tibshirani RJ, Tibshirani R, Ventura V, Wasserman L, O’Dea EB, Drake JM, Pagano R, Tran QT, Ho LST, Huynh H, Walker JW, Slayton RB, Johansson MA, Biggerstaff M, Reich NG. Evaluation of individual and ensemble probabilistic forecasts of COVID-19 mortality in the United States. Proc Natl Acad Sci U S A 2022; 119:e2113561119. [PMID: 35394862 PMCID: PMC9169655 DOI: 10.1073/pnas.2113561119] [Citation(s) in RCA: 87] [Impact Index Per Article: 43.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2021] [Accepted: 01/24/2022] [Indexed: 01/15/2023] Open
Abstract
Short-term probabilistic forecasts of the trajectory of the COVID-19 pandemic in the United States have served as a visible and important communication channel between the scientific modeling community and both the general public and decision-makers. Forecasting models provide specific, quantitative, and evaluable predictions that inform short-term decisions such as healthcare staffing needs, school closures, and allocation of medical supplies. Starting in April 2020, the US COVID-19 Forecast Hub (https://covid19forecasthub.org/) collected, disseminated, and synthesized tens of millions of specific predictions from more than 90 different academic, industry, and independent research groups. A multimodel ensemble forecast that combined predictions from dozens of groups every week provided the most consistently accurate probabilistic forecasts of incident deaths due to COVID-19 at the state and national level from April 2020 through October 2021. The performance of 27 individual models that submitted complete forecasts of COVID-19 deaths consistently throughout this year showed high variability in forecast skill across time, geospatial units, and forecast horizons. Two-thirds of the models evaluated showed better accuracy than a naïve baseline model. Forecast accuracy degraded as models made predictions further into the future, with probabilistic error at a 20-wk horizon three to five times larger than when predicting at a 1-wk horizon. This project underscores the role that collaboration and active coordination between governmental public-health agencies, academic modeling teams, and industry partners can play in developing modern modeling capabilities to support local, state, and federal response to outbreaks.
Collapse
Affiliation(s)
- Estee Y. Cramer
- Department of Biostatistics and Epidemiology, University of Massachusetts, Amherst, MA 01003
| | - Evan L. Ray
- Department of Biostatistics and Epidemiology, University of Massachusetts, Amherst, MA 01003
| | - Velma K. Lopez
- COVID-19 Response, Centers for Disease Control and Prevention; Atlanta, GA 30333
| | - Johannes Bracher
- Chair of Econometrics and Statistics, Karlsruhe Institute of Technology, 76185 Karlsruhe, Germany
- Computational Statistics Group, Heidelberg Institute for Theoretical Studies, 69118 Heidelberg, Germany
| | | | | | - Aaron Gerding
- Department of Biostatistics and Epidemiology, University of Massachusetts, Amherst, MA 01003
| | - Tilmann Gneiting
- Computational Statistics Group, Heidelberg Institute for Theoretical Studies, 69118 Heidelberg, Germany
- Institute of Stochastics, Karlsruhe Institute of Technology, 69118 Karlsruhe, Germany
| | - Katie H. House
- Department of Biostatistics and Epidemiology, University of Massachusetts, Amherst, MA 01003
| | - Yuxin Huang
- Department of Biostatistics and Epidemiology, University of Massachusetts, Amherst, MA 01003
| | - Dasuni Jayawardena
- Department of Biostatistics and Epidemiology, University of Massachusetts, Amherst, MA 01003
| | - Abdul H. Kanji
- Department of Biostatistics and Epidemiology, University of Massachusetts, Amherst, MA 01003
| | - Ayush Khandelwal
- Department of Biostatistics and Epidemiology, University of Massachusetts, Amherst, MA 01003
| | - Khoa Le
- Department of Biostatistics and Epidemiology, University of Massachusetts, Amherst, MA 01003
| | - Anja Mühlemann
- Institute of Mathematical Statistics and Actuarial Science, University of Bern, CH-3012 Bern, Switzerland
| | - Jarad Niemi
- Department of Statistics, Iowa State University, Ames, IA 50011
| | - Apurv Shah
- Department of Biostatistics and Epidemiology, University of Massachusetts, Amherst, MA 01003
| | - Ariane Stark
- Department of Biostatistics and Epidemiology, University of Massachusetts, Amherst, MA 01003
| | - Yijin Wang
- Department of Biostatistics and Epidemiology, University of Massachusetts, Amherst, MA 01003
| | - Nutcha Wattanachit
- Department of Biostatistics and Epidemiology, University of Massachusetts, Amherst, MA 01003
| | - Martha W. Zorn
- Department of Biostatistics and Epidemiology, University of Massachusetts, Amherst, MA 01003
| | | | - Sansiddh Jain
- Wadhwani Institute of Artificial Intelligence, Andheri East, Mumbai, 400093, India
| | - Nayana Bannur
- Wadhwani Institute of Artificial Intelligence, Andheri East, Mumbai, 400093, India
| | - Ayush Deva
- Wadhwani Institute of Artificial Intelligence, Andheri East, Mumbai, 400093, India
| | - Mihir Kulkarni
- Wadhwani Institute of Artificial Intelligence, Andheri East, Mumbai, 400093, India
| | - Srujana Merugu
- Wadhwani Institute of Artificial Intelligence, Andheri East, Mumbai, 400093, India
| | - Alpan Raval
- Wadhwani Institute of Artificial Intelligence, Andheri East, Mumbai, 400093, India
| | - Siddhant Shingi
- Wadhwani Institute of Artificial Intelligence, Andheri East, Mumbai, 400093, India
| | - Avtansh Tiwari
- Wadhwani Institute of Artificial Intelligence, Andheri East, Mumbai, 400093, India
| | - Jerome White
- Wadhwani Institute of Artificial Intelligence, Andheri East, Mumbai, 400093, India
| | | | - Spencer Woody
- Department of Integrative Biology, University of Texas at Austin, Austin, TX 78712
| | - Maytal Dahan
- Texas Advanced Computing Center, Austin, TX 78758
| | - Spencer Fox
- Department of Integrative Biology, University of Texas at Austin, Austin, TX 78712
| | | | | | - Lauren Ancel Meyers
- Department of Integrative Biology, University of Texas at Austin, Austin, TX 78712
| | - James G. Scott
- Department of Information, Risk, and Operations Management, University of Texas at Austin, Austin, TX 78712
| | - Mauricio Tec
- Department of Statistics and Data Sciences, University of Texas at Austin, Austin, TX 78712
| | - Ajitesh Srivastava
- Ming Hsieh Department of Computer and Electrical Engineering, University of Southern California, Los Angeles, CA 90089
| | - Glover E. George
- US Army Engineer Research and Development Center, Vicksburg, MS 39180
| | - Jeffrey C. Cegan
- US Army Engineer Research and Development Center, Concord, MA 01742
| | - Ian D. Dettwiller
- US Army Engineer Research and Development Center, Vicksburg, MS 39180
| | | | | | - Robert H. Hunter
- US Army Engineer Research and Development Center, Vicksburg, MS 39180
| | - Brandon Lafferty
- US Army Engineer Research and Development Center, Vicksburg, MS 39180
| | - Igor Linkov
- US Army Engineer Research and Development Center, Concord, MA 01742
| | - Michael L. Mayo
- US Army Engineer Research and Development Center, Vicksburg, MS 39180
| | - Matthew D. Parno
- US Army Engineer Research and Development Center, Hanover, NH 03755
| | | | | | - Yanli Zhang-James
- Department of Psychiatry and Behavioral Sciences, State University of New York Upstate Medical University, Syracuse, NY 13210
| | - Samuel Chen
- School of Medicine, State University of New York Upstate Medical University, Syracuse, NY 13210
| | - Stephen V. Faraone
- Department of Psychiatry and Behavioral Sciences, State University of New York Upstate Medical University, Syracuse, NY 13210
| | - Jonathan Hess
- Department of Psychiatry and Behavioral Sciences, State University of New York Upstate Medical University, Syracuse, NY 13210
| | - Christopher P. Morley
- Department of Public Health & Preventive Medicine, State University of New York Upstate Medical University, Syracuse, NY 13210
| | - Asif Salekin
- Department of Electrical Engineering and Computer Science, Syracuse University, Syracuse, NY 13207
| | - Dongliang Wang
- Department of Public Health & Preventive Medicine, State University of New York Upstate Medical University, Syracuse, NY 13210
| | | | - Thomas M. Baer
- Department of Physics, Trinity University, San Antonio, TX 78212
| | - Marisa C. Eisenberg
- Department of Complex Systems, University of Michigan, Ann Arbor, MI 48109
- Department of Mathematics, University of Michigan, Ann Arbor, MI 48109
- School of Public Health, Department of Epidemiology, University of Michigan, Ann Arbor, MI 48109
| | - Karl Falb
- Department of Physics, University of Michigan, Ann Arbor, MI, 48109
| | - Yitao Huang
- Department of Physics, University of Michigan, Ann Arbor, MI, 48109
| | - Emily T. Martin
- School of Public Health, Department of Epidemiology, University of Michigan, Ann Arbor, MI 48109
| | - Ella McCauley
- Department of Physics, University of Michigan, Ann Arbor, MI, 48109
| | - Robert L. Myers
- Department of Physics, University of Michigan, Ann Arbor, MI, 48109
| | - Tom Schwarz
- Department of Physics, University of Michigan, Ann Arbor, MI, 48109
| | - Daniel Sheldon
- College of Information and Computer Sciences, University of Massachusetts, Amherst, MA 01003
| | - Graham Casey Gibson
- School of Public Health and Health Sciences, University of Massachusetts, Amherst, MA 01003
| | - Rose Yu
- Department of Computer Science and Engineering, University of California, San Diego, CA 92093
- Khoury College of Computer Sciences, Northeastern University, Boston, MA 02115
| | - Liyao Gao
- Department of Statistics, University of Washington, Seattle, WA 98185
| | - Yian Ma
- Halıcıoğlu Data Science Institute, University of California, San Diego, CA 92093
| | - Dongxia Wu
- Department of Computer Science and Engineering, University of California, San Diego, CA 92093
| | - Xifeng Yan
- Department of Computer Science, University of California, Santa Barbara, CA 93106
| | - Xiaoyong Jin
- Department of Computer Science, University of California, Santa Barbara, CA 93106
| | - Yu-Xiang Wang
- Department of Computer Science, University of California, Santa Barbara, CA 93106
| | - YangQuan Chen
- Mechatronics, Embedded Systems and Automation Lab, Department of Mechanical Engineering, University of California, Merced, CA 95301
| | - Lihong Guo
- Jilin University, Changchun City, Jilin Province, 130012, People's Republic of China
| | - Yanting Zhao
- University of Science and Technology of China, Heifei, Anhui, 230027, People's Republic of China
| | - Quanquan Gu
- Department of Computer Science, University of California, Los Angeles, CA 90095
| | - Jinghui Chen
- Department of Computer Science, University of California, Los Angeles, CA 90095
| | - Lingxiao Wang
- Department of Computer Science, University of California, Los Angeles, CA 90095
| | - Pan Xu
- Department of Computer Science, University of California, Los Angeles, CA 90095
| | - Weitong Zhang
- Department of Computer Science, University of California, Los Angeles, CA 90095
| | - Difan Zou
- Department of Computer Science, University of California, Los Angeles, CA 90095
| | - Hannah Biegel
- Department of Mathematics, University of Arizona, Tucson, AZ 85721
| | - Joceline Lega
- Department of Mathematics, University of Arizona, Tucson, AZ 85721
| | | | - V. P. Nagraj
- Quality Assurance and Data Science, Signature Science, LLC, Charlottesville, VA 22911
| | - Stephanie L. Guertin
- Quality Assurance and Data Science, Signature Science, LLC, Charlottesville, VA 22911
| | | | - Stephen D. Turner
- Quality Assurance and Data Science, Signature Science, LLC, Charlottesville, VA 22911
| | - Yunfeng Shi
- Department of Materials Science and Engineering, Rensselaer Polytechnic Institute, Troy, NY 12309
| | - Xuegang Ban
- Department of Civil and Environmental Engineering, University of Washington, Seattle, WA 98195
| | | | - Qi-Jun Hong
- School for Engineering of Matter, Transport and Energy, Arizona State University, Tempe, AZ 85287
- School of Engineering, Brown University, Providence, RI 02912
| | | | | | - James A. Turtle
- Infectious Disease Group, Predictive Science, Inc, San Diego, CA 92121
| | - Michal Ben-Nun
- Infectious Disease Group, Predictive Science, Inc, San Diego, CA 92121
| | - Steven Riley
- Medical Research Council Centre for Global Infectious Disease Analysis, Department of Infectious Disease Epidemiology, School of Public Health, Imperial College, W2 1PG London, United Kingdom
| | - Pete Riley
- Infectious Disease Group, Predictive Science, Inc, San Diego, CA 92121
| | | | | | - Pedro Forli
- Oliver Wyman Digital, Oliver Wyman, Sao Paolo, Brazil 04711-904
| | - Bruce Hamory
- Health & Life Sciences, Oliver Wyman, Boston, MA 02110
| | | | - Helen Leis
- Health & Life Sciences, Oliver Wyman, New York, NY 10036
| | - John Milliken
- Financial Services, Oliver Wyman, New York, NY 10036
| | | | - James Morgan
- Financial Services, Oliver Wyman, New York, NY 10036
| | | | - Gokce Ozcan
- Financial Services, Oliver Wyman, New York, NY 10036
| | - Noah Piwonka
- Health & Life Sciences, Oliver Wyman, New York, NY 10036
| | - Matt Ravi
- Core Consultant Group, Oliver Wyman, New York, NY 10036
| | - Chris Schrader
- Health & Life Sciences, Oliver Wyman, New York, NY 10036
| | | | - Daniel Siegel
- Financial Services, Oliver Wyman, New York, NY 10036
| | - Ryan Spatz
- Core Consultant Group, Oliver Wyman, New York, NY 10036
| | - Chris Stiefeling
- Financial Services, Oliver Wyman Digital, Toronto, ON, Canada M5J 0A1
| | | | | | - Sean Cavany
- Department of Biological Sciences, University of Notre Dame, Notre Dame, IN 46556
| | - Guido España
- Department of Biological Sciences, University of Notre Dame, Notre Dame, IN 46556
| | - Sean Moore
- Department of Biological Sciences, University of Notre Dame, Notre Dame, IN 46556
| | - Rachel Oidtman
- Department of Biological Sciences, University of Notre Dame, Notre Dame, IN 46556
- Department of Ecology and Evolution, University of Chicago, Chicago, IL 60637
| | - Alex Perkins
- Department of Biological Sciences, University of Notre Dame, Notre Dame, IN 46556
| | - David Kraus
- Department of Mathematics and Statistics, Masaryk University, 61137 Brno, Czech Republic
| | - Andrea Kraus
- Department of Mathematics and Statistics, Masaryk University, 61137 Brno, Czech Republic
| | | | | | - Wei Cao
- Microsoft, Redmond, WA 98029
| | | | | | | | | | | | | | - Alessandro Vespignani
- Institute for Scientific Interchange Foundation, Turin, 10133, Italy
- Laboratory for the Modeling of Biological and Socio-technical Systems, Northeastern University, Boston, MA 02115
| | - Matteo Chinazzi
- Laboratory for the Modeling of Biological and Socio-technical Systems, Northeastern University, Boston, MA 02115
| | - Jessica T. Davis
- Laboratory for the Modeling of Biological and Socio-technical Systems, Northeastern University, Boston, MA 02115
| | - Kunpeng Mu
- Laboratory for the Modeling of Biological and Socio-technical Systems, Northeastern University, Boston, MA 02115
| | - Ana Pastore y Piontti
- Laboratory for the Modeling of Biological and Socio-technical Systems, Northeastern University, Boston, MA 02115
| | - Xinyue Xiong
- Laboratory for the Modeling of Biological and Socio-technical Systems, Northeastern University, Boston, MA 02115
| | - Andrew Zheng
- Operations Research Center, Massachusetts Institute of Technology; Cambridge, MA 02139
| | - Jackie Baek
- Operations Research Center, Massachusetts Institute of Technology; Cambridge, MA 02139
| | - Vivek Farias
- Sloan School of Management, Massachusetts Institute of Technology, Cambridge, MA 02142
| | - Andreea Georgescu
- Operations Research Center, Massachusetts Institute of Technology; Cambridge, MA 02139
| | - Retsef Levi
- Sloan School of Management, Massachusetts Institute of Technology, Cambridge, MA 02142
| | - Deeksha Sinha
- Operations Research Center, Massachusetts Institute of Technology; Cambridge, MA 02139
| | - Joshua Wilde
- Operations Research Center, Massachusetts Institute of Technology; Cambridge, MA 02139
| | | | | | | | - Divya Singhvi
- Technology, Operations and Statistics (TOPS) group, Stern School of Business, New York University, New York, NY 10012
| | | | | | | | - Arnab Sarker
- Institute for Data, Systems, and Society, Massachusetts Institute of Technology, Cambridge, MA 02139
| | - Ali Jadbabaie
- Institute for Data, Systems, and Society, Massachusetts Institute of Technology, Cambridge, MA 02139
| | - Devavrat Shah
- Institute for Data, Systems, and Society, Massachusetts Institute of Technology, Cambridge, MA 02139
| | - Nicolas Della Penna
- Laboratory for Computational Physiology, Massachusetts Institute of Technology, Cambridge, MA 02139
| | - Leo A. Celi
- Laboratory for Computational Physiology, Massachusetts Institute of Technology, Cambridge, MA 02139
| | | | | | - Dave Osthus
- Statistical Sciences Group, Los Alamos National Laboratory, Los Alamos, NM 87545
| | - Lauren Castro
- Information Systems and Modeling Group, Los Alamos National Laboratory, Los Alamos, NM 87545
| | - Geoffrey Fairchild
- Information Systems and Modeling Group, Los Alamos National Laboratory, Los Alamos, NM 87545
| | - Isaac Michaud
- Statistical Sciences Group, Los Alamos National Laboratory, Los Alamos, NM 87545
| | - Dean Karlen
- Department of Physics and Astronomy, University of Victoria, Victoria, BC, V8W 2Y2, Canada
- Physical Sciences Division, TRIUMF, Vancouver, BC, V8W 2Y2, Canada
| | - Matt Kinsey
- Johns Hopkins University Applied Physics Laboratory, Laurel, MD 20723
| | - Luke C. Mullany
- Johns Hopkins University Applied Physics Laboratory, Laurel, MD 20723
| | | | - Lauren Shin
- Johns Hopkins University Applied Physics Laboratory, Laurel, MD 20723
| | | | - Shelby Wilson
- Johns Hopkins University Applied Physics Laboratory, Laurel, MD 20723
| | - Elizabeth C. Lee
- Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD 21215
| | - Juan Dent
- Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD 21215
| | - Kyra H. Grantz
- Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD 21215
| | - Alison L. Hill
- Institute for Computational Medicine, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD 21218
| | - Joshua Kaminsky
- Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD 21215
| | | | - Lindsay T. Keegan
- Division of Epidemiology, Department of Internal Medicine, University of Utah, Salt Lake City, UT 84108
| | - Stephen A. Lauer
- Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD 21215
| | - Joseph C. Lemaitre
- Laboratory of Ecohydrology, School of Architecture, Civil and Environmental Engineering, École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| | - Justin Lessler
- Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD 21215
| | - Hannah R. Meredith
- Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD 21215
| | - Javier Perez-Saez
- Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD 21215
| | - Sam Shah
- Unaffiliated, San Francisco, CA 94122
| | - Claire P. Smith
- Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD 21215
| | - Shaun A. Truelove
- Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD 21215
- International Vaccine Access Center, Johns Hopkins University, Baltimore, MD 21231
- Department of International Health, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD 21231
| | | | - Maximilian Marshall
- Department of Civil and Systems Engineering, Johns Hopkins University, Baltimore, MD 21218
| | - Lauren Gardner
- Department of Civil and Systems Engineering, Johns Hopkins University, Baltimore, MD 21218
| | - Kristen Nixon
- Department of Civil and Systems Engineering, Johns Hopkins University, Baltimore, MD 21218
| | | | - Lily Wang
- Department of Statistics, Iowa State University, Ames, IA 50011
| | - Lei Gao
- Department of Finance, Iowa State University, Ames, IA 50011
| | - Zhiling Gu
- Department of Statistics, Iowa State University, Ames, IA 50011
| | - Myungjin Kim
- Department of Statistics, Iowa State University, Ames, IA 50011
| | - Xinyi Li
- School of Mathematical and Statistical Sciences, Clemson University, Clemson, SC 29634
| | - Guannan Wang
- Department of Mathematics, College of William & Mary, Williamsburg, VA 23187
| | - Yueying Wang
- Department of Statistics, Iowa State University, Ames, IA 50011
| | - Shan Yu
- Department of Statistics, University of Virginia, Charlottesville, VA 22904
| | - Robert C. Reiner
- Institute for Health Metrics and Evaluation, University of Washington, Seattle, WA 98195
| | - Ryan Barber
- Institute for Health Metrics and Evaluation, University of Washington, Seattle, WA 98195
| | - Emmanuela Gakidou
- Institute for Health Metrics and Evaluation, University of Washington, Seattle, WA 98195
| | - Simon I. Hay
- Institute for Health Metrics and Evaluation, University of Washington, Seattle, WA 98195
| | - Steve Lim
- Institute for Health Metrics and Evaluation, University of Washington, Seattle, WA 98195
| | - Chris Murray
- Institute for Health Metrics and Evaluation, University of Washington, Seattle, WA 98195
| | - David Pigott
- Institute for Health Metrics and Evaluation, University of Washington, Seattle, WA 98195
| | | | | | | | | | - B. Aditya Prakash
- College of Computing, Georgia Institute of Technology, Atlanta, GA 30308
| | - Bijaya Adhikari
- Department of Computer Science, University of Iowa, Iowa City, IA 52242
| | - Jiaming Cui
- College of Computing, Georgia Institute of Technology, Atlanta, GA 30308
| | | | - Anika Tabassum
- Department of Computer Science, Virginia Tech, Falls Church, VA 22043
| | - Jiajia Xie
- College of Computing, Georgia Institute of Technology, Atlanta, GA 30308
| | - Pinar Keskinocak
- H. Milton Stewart School of Industrial and Systems Engineering, Georgia Institute of Technology, Atlanta, GA, 30332
| | - John Asplund
- Advanced Data Analytics, Metron, Inc., Reston, VA 20190
| | - Arden Baxter
- H. Milton Stewart School of Industrial and Systems Engineering, Georgia Institute of Technology, Atlanta, GA, 30332
| | - Buse Eylul Oruc
- H. Milton Stewart School of Industrial and Systems Engineering, Georgia Institute of Technology, Atlanta, GA, 30332
| | - Nicoleta Serban
- H. Milton Stewart School of Industrial and Systems Engineering, Georgia Institute of Technology, Atlanta, GA, 30332
| | | | | | | | | | | | | | | | | | | | - Thomas Tsai
- Department of Health Policy and Management, Harvard University, Cambridge, MA 02138
| | | | | | | | - Sam Abbott
- Centre for Mathematical Modelling of Infectious Diseases, London School of Hygiene & Tropical Medicine, WC1E 7HT London, United Kingdom
| | - Nikos I. Bosse
- Centre for Mathematical Modelling of Infectious Diseases, London School of Hygiene & Tropical Medicine, WC1E 7HT London, United Kingdom
| | - Sebastian Funk
- Centre for Mathematical Modelling of Infectious Diseases, London School of Hygiene & Tropical Medicine, WC1E 7HT London, United Kingdom
| | - Joel Hellewell
- Centre for Mathematical Modelling of Infectious Diseases, London School of Hygiene & Tropical Medicine, WC1E 7HT London, United Kingdom
| | - Sophie R. Meakin
- Centre for Mathematical Modelling of Infectious Diseases, London School of Hygiene & Tropical Medicine, WC1E 7HT London, United Kingdom
| | - Katharine Sherratt
- Centre for Mathematical Modelling of Infectious Diseases, London School of Hygiene & Tropical Medicine, WC1E 7HT London, United Kingdom
| | - Mingyuan Zhou
- McCombs School of Business, The University of Texas at Austin, Austin, TX 78712
| | - Rahi Kalantari
- Department of Electrical and Computer Engineering, The University of Texas at Austin, Austin, TX 78712
| | - Teresa K. Yamana
- Department of Environmental Health Sciences, Columbia University, New York, NY 10032
| | - Sen Pei
- Department of Environmental Health Sciences, Columbia University, New York, NY 10032
| | - Jeffrey Shaman
- Department of Environmental Health Sciences, Columbia University, New York, NY 10032
| | - Michael L. Li
- Operations Research Center, Massachusetts Institute of Technology; Cambridge, MA 02139
| | - Dimitris Bertsimas
- Sloan School of Management, Massachusetts Institute of Technology, Cambridge, MA 02142
| | - Omar Skali Lami
- Operations Research Center, Massachusetts Institute of Technology; Cambridge, MA 02139
| | - Saksham Soni
- Operations Research Center, Massachusetts Institute of Technology; Cambridge, MA 02139
| | - Hamza Tazi Bouardi
- Operations Research Center, Massachusetts Institute of Technology; Cambridge, MA 02139
| | - Turgay Ayer
- H. Milton Stewart School of Industrial and Systems Engineering, Georgia Institute of Technology, Atlanta, GA, 30332
- Winship Cancer Institute, Emory University Medical School, Atlanta, GA 30322
| | - Madeline Adee
- Radiology-Institute for Technology Assessment, Massachusetts General Hospital, Boston, MA 02114
| | - Jagpreet Chhatwal
- Radiology-Institute for Technology Assessment, Massachusetts General Hospital, Boston, MA 02114
| | - Ozden O. Dalgic
- Health Economic Modeling, Value Analytics Labs, 34776 İstanbul, Turkey
| | - Mary A. Ladd
- Radiology-Institute for Technology Assessment, Massachusetts General Hospital, Boston, MA 02114
| | - Benjamin P. Linas
- Department of Medicine, Section of Infectious Diseases, Boston University School of Medicine, Boston, MA 02118
| | - Peter Mueller
- Radiology-Institute for Technology Assessment, Massachusetts General Hospital, Boston, MA 02114
| | - Jade Xiao
- H. Milton Stewart School of Industrial and Systems Engineering, Georgia Institute of Technology, Atlanta, GA, 30332
| | - Yuanjia Wang
- Department of Biostatistics, Columbia University, New York, NY 10032
- Department of Psychiatry, Columbia University, New York, NY 10032
| | - Qinxia Wang
- Department of Biostatistics, Columbia University, New York, NY 10032
| | - Shanghong Xie
- Department of Biostatistics, Columbia University, New York, NY 10032
| | - Donglin Zeng
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599
| | - Alden Green
- Department of Statistics, Carnegie Mellon University, Pittsburgh, PA 15213
| | - Jacob Bien
- Marshall School of Business, Department of Data Sciences and Operations (DSO), University of Southern California, Los Angeles, CA 90089
| | - Logan Brooks
- Department of Statistics, Carnegie Mellon University, Pittsburgh, PA 15213
| | - Addison J. Hu
- Department of Statistics, Carnegie Mellon University, Pittsburgh, PA 15213
| | - Maria Jahja
- Department of Statistics, Carnegie Mellon University, Pittsburgh, PA 15213
| | - Daniel McDonald
- Department of Statistics, University of British Columbia, Vancouver, BC, V6T 1Z4, Canada
| | - Balasubramanian Narasimhan
- Department of Biomedical Data Sciences, Stanford University, Stanford, CA 94305
- Department of Statistics, Stanford University, Stanford, CA 94305
| | - Collin Politsch
- Machine Learning Department, Carnegie Mellon University, Pittsburgh, PA 15213
| | - Samyak Rajanala
- Department of Statistics, Stanford University, Stanford, CA 94305
| | - Aaron Rumack
- Machine Learning Department, Carnegie Mellon University, Pittsburgh, PA 15213
| | - Noah Simon
- Department of Biostatistics, University of Washington, Seattle, WA 98195
| | - Ryan J. Tibshirani
- Department of Statistics, Carnegie Mellon University, Pittsburgh, PA 15213
| | - Rob Tibshirani
- Department of Statistics, Stanford University, Stanford, CA 94305
| | - Valerie Ventura
- Department of Statistics, Carnegie Mellon University, Pittsburgh, PA 15213
| | - Larry Wasserman
- Department of Statistics, Carnegie Mellon University, Pittsburgh, PA 15213
| | - Eamon B. O’Dea
- Odum School of Ecology, University of Georgia, Athens, GA 30602
| | - John M. Drake
- Odum School of Ecology, University of Georgia, Athens, GA 30602
| | | | - Quoc T. Tran
- Catalog Data Science, Walmart Inc., Sunnyvale, CA 94085
| | - Lam Si Tung Ho
- Department of Mathematics and Statistics, Dalhousie University, Halifax, NS, B3H 4R2, Canada
| | - Huong Huynh
- Virtual Power System Inc, Milpitas, CA 95035
| | - Jo W. Walker
- COVID-19 Response, Centers for Disease Control and Prevention; Atlanta, GA 30333
| | - Rachel B. Slayton
- COVID-19 Response, Centers for Disease Control and Prevention; Atlanta, GA 30333
| | - Michael A. Johansson
- COVID-19 Response, Centers for Disease Control and Prevention; Atlanta, GA 30333
| | - Matthew Biggerstaff
- COVID-19 Response, Centers for Disease Control and Prevention; Atlanta, GA 30333
| | - Nicholas G. Reich
- Department of Biostatistics and Epidemiology, University of Massachusetts, Amherst, MA 01003
| |
Collapse
|
7
|
Abstract
Interpolators-estimators that achieve zero training error-have attracted growing attention in machine learning, mainly because state-of-the art neural networks appear to be models of this type. In this paper, we study minimum ℓ 2 norm ("ridgeless") interpolation least squares regression, focusing on the high-dimensional regime in which the number of unknown parameters p is of the same order as the number of samples n. We consider two different models for the feature distribution: a linear model, where the feature vectors x i ∈ ℝ p are obtained by applying a linear transform to a vector of i.i.d. entries, x i = Σ1/2 z i (with z i ∈ ℝ p ); and a nonlinear model, where the feature vectors are obtained by passing the input through a random one-layer neural network, xi = φ(Wz i ) (with z i ∈ ℝ d , W ∈ ℝ p × d a matrix of i.i.d. entries, and φ an activation function acting componentwise on Wz i ). We recover-in a precise quantitative way-several phenomena that have been observed in large-scale neural networks and kernel machines, including the "double descent" behavior of the prediction risk, and the potential benefits of overparametrization.
Collapse
Affiliation(s)
- Trevor Hastie
- Department of Statistics and Department of Biomedical Data Science, Stanford University
| | - Andrea Montanari
- Department of Statistics and Department of Electrical Engineering, Stanford University
| | | | - Ryan J. Tibshirani
- Department of Statistics and Department of Machine Learning, Carnegie Mellon University
| |
Collapse
|
8
|
McDonald DJ, Bien J, Green A, Hu AJ, DeFries N, Hyun S, Oliveira NL, Sharpnack J, Tang J, Tibshirani R, Ventura V, Wasserman L, Tibshirani RJ. Can auxiliary indicators improve COVID-19 forecasting and hotspot prediction? Proc Natl Acad Sci U S A 2021; 118:e2111453118. [PMID: 34903655 PMCID: PMC8713796 DOI: 10.1073/pnas.2111453118] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/02/2021] [Indexed: 02/07/2023] Open
Abstract
Short-term forecasts of traditional streams from public health reporting (such as cases, hospitalizations, and deaths) are a key input to public health decision-making during a pandemic. Since early 2020, our research group has worked with data partners to collect, curate, and make publicly available numerous real-time COVID-19 indicators, providing multiple views of pandemic activity in the United States. This paper studies the utility of five such indicators-derived from deidentified medical insurance claims, self-reported symptoms from online surveys, and COVID-related Google search activity-from a forecasting perspective. For each indicator, we ask whether its inclusion in an autoregressive (AR) model leads to improved predictive accuracy relative to the same model excluding it. Such an AR model, without external features, is already competitive with many top COVID-19 forecasting models in use today. Our analysis reveals that 1) inclusion of each of these five indicators improves on the overall predictive accuracy of the AR model; 2) predictive gains are in general most pronounced during times in which COVID cases are trending in "flat" or "down" directions; and 3) one indicator, based on Google searches, seems to be particularly helpful during "up" trends.
Collapse
Affiliation(s)
- Daniel J McDonald
- Department of Statistics, University of British Columbia, Vancouver, BC, Canada V6T 1Z4;
| | - Jacob Bien
- Department of Data Sciences and Operations, University of Southern California, Los Angeles, CA 90089
| | - Alden Green
- Department of Statistics & Data Science, Carnegie Mellon University, Pittsburgh, PA 15213
| | - Addison J Hu
- Department of Statistics & Data Science, Carnegie Mellon University, Pittsburgh, PA 15213
- Machine Learning Department, Carnegie Mellon University, Pittsburgh, PA 15213
| | - Nat DeFries
- Machine Learning Department, Carnegie Mellon University, Pittsburgh, PA 15213
| | - Sangwon Hyun
- Department of Data Sciences and Operations, University of Southern California, Los Angeles, CA 90089
| | - Natalia L Oliveira
- Department of Statistics & Data Science, Carnegie Mellon University, Pittsburgh, PA 15213
- Machine Learning Department, Carnegie Mellon University, Pittsburgh, PA 15213
| | - James Sharpnack
- Department of Statistics, University of California, Davis, CA 95616
| | - Jingjing Tang
- Computational Biology Department, Carnegie Mellon University, Pittsburgh, PA 15213
| | - Robert Tibshirani
- Department of Statistics, Stanford University, Stanford, CA 94305
- Department of Biomedical Data Science, Stanford University, Stanford, CA 94305
| | - Valérie Ventura
- Department of Statistics & Data Science, Carnegie Mellon University, Pittsburgh, PA 15213
| | - Larry Wasserman
- Department of Statistics & Data Science, Carnegie Mellon University, Pittsburgh, PA 15213
- Machine Learning Department, Carnegie Mellon University, Pittsburgh, PA 15213
| | - Ryan J Tibshirani
- Department of Statistics & Data Science, Carnegie Mellon University, Pittsburgh, PA 15213
- Machine Learning Department, Carnegie Mellon University, Pittsburgh, PA 15213
| |
Collapse
|
9
|
Stokell BG, Shah RD, Tibshirani RJ. Modelling high‐dimensional categorical data using nonconvex fusion penalties. J R Stat Soc Series B Stat Methodol 2021. [DOI: 10.1111/rssb.12432] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
10
|
|
11
|
Hyun S, Lin KZ, G'Sell M, Tibshirani RJ. Post-selection inference for changepoint detection algorithms with application to copy number variation data. Biometrics 2021; 77:1037-1049. [PMID: 33434289 DOI: 10.1111/biom.13422] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2019] [Revised: 07/06/2020] [Accepted: 08/11/2020] [Indexed: 11/30/2022]
Abstract
Changepoint detection methods are used in many areas of science and engineering, for example, in the analysis of copy number variation data to detect abnormalities in copy numbers along the genome. Despite the broad array of available tools, methodology for quantifying our uncertainty in the strength (or the presence) of given changepoints post-selection are lacking. Post-selection inference offers a framework to fill this gap, but the most straightforward application of these methods results in low-powered hypothesis tests and leaves open several important questions about practical usability. In this work, we carefully tailor post-selection inference methods toward changepoint detection, focusing on copy number variation data. To accomplish this, we study commonly used changepoint algorithms: binary segmentation, as well as two of its most popular variants, wild and circular, and the fused lasso. We implement some of the latest developments in post-selection inference theory, mainly auxiliary randomization. This improves the power, which requires implementations of Markov chain Monte Carlo algorithms (importance sampling and hit-and-run sampling) to carry out our tests. We also provide recommendations for improving practical useability, detailed simulations, and example analyses on array comparative genomic hybridization as well as sequencing data.
Collapse
Affiliation(s)
- Sangwon Hyun
- Department of Data Sciences and Operations, University of Southern California, Los Angeles, California, USA
| | - Kevin Z Lin
- Department of Statistics, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| | - Max G'Sell
- Department of Statistics and Data Science, Carnegie Mellon University, Pittsburgh, Pennsylvania, USA
| | - Ryan J Tibshirani
- Department of Statistics and Data Science, Carnegie Mellon University, Pittsburgh, Pennsylvania, USA
| |
Collapse
|
12
|
Hastie T, Tibshirani R, Tibshirani RJ. Rejoinder: Best Subset, Forward Stepwise or Lasso? Analysis and Recommendations Based on Extensive Comparisons. Stat Sci 2020. [DOI: 10.1214/20-sts733rej] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
13
|
Rosset S, Tibshirani RJ. From Fixed-X to Random-X Regression: Bias-Variance Decompositions, Covariance Penalties, and Prediction Error Estimation: Rejoinder. J Am Stat Assoc 2020. [DOI: 10.1080/01621459.2020.1727236] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
Affiliation(s)
- Saharon Rosset
- Department of Statistics and Operations Research, Tel Aviv University, Tel Aviv, Israel
| | - Ryan J. Tibshirani
- Department of Statistics and Machine Learning Department, Carnegie Mellon University, Pittsburgh, PA
| |
Collapse
|
14
|
|
15
|
|
16
|
Tibshirani RJ, Rosset S. Excess Optimism: How Biased is the Apparent Error of an Estimator Tuned by SURE? J Am Stat Assoc 2019. [DOI: 10.1080/01621459.2018.1429276] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Affiliation(s)
- Ryan J. Tibshirani
- Department of Statistics and Machine Learning Department, Carnegie Mellon University, Pittsburgh, PA
| | - Saharon Rosset
- Department of Statistics, Tel Aviv University, Tel Aviv, Israel
| |
Collapse
|
17
|
Rosset S, Tibshirani RJ. From Fixed-X to Random-X Regression: Bias-Variance Decompositions, Covariance Penalties, and Prediction Error Estimation. J Am Stat Assoc 2019. [DOI: 10.1080/01621459.2018.1424632] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Affiliation(s)
- Saharon Rosset
- Department of Statistics and Operations Research, Tel Aviv University, Tel Aviv, Israel
| | - Ryan J. Tibshirani
- Department of Statistics and Machine Learning Department, Carnegie Mellon University, Pittsburgh, PA
| |
Collapse
|
18
|
Adhikari S, Lecci F, Becker JT, Junker BW, Kuller LH, Lopez OL, Tibshirani RJ. High-dimensional longitudinal classification with the multinomial fused lasso. Stat Med 2019; 38:2184-2205. [PMID: 30701586 DOI: 10.1002/sim.8100] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2018] [Revised: 12/10/2018] [Accepted: 01/02/2019] [Indexed: 01/03/2023]
Abstract
We study regularized estimation in high-dimensional longitudinal classification problems, using the lasso and fused lasso regularizers. The constructed coefficient estimates are piecewise constant across the time dimension in the longitudinal problem, with adaptively selected change points (break points). We present an efficient algorithm for computing such estimates, based on proximal gradient descent. We apply our proposed technique to a longitudinal data set on Alzheimer's disease from the Cardiovascular Health Study Cognition Study. Using data analysis and a simulation study, we motivate and demonstrate several practical considerations such as the selection of tuning parameters and the assessment of model stability. While race, gender, vascular and heart disease, lack of caregivers, and deterioration of learning and memory are all important predictors of dementia, we also find that these risk factors become more relevant in the later stages of life.
Collapse
Affiliation(s)
- Samrachana Adhikari
- Department of Population Health, New York University School of Medicine, New York, New York
| | - Fabrizio Lecci
- Department of Statistics, Carnegie Mellon University, Pittsburgh, Pennsylvania
| | - James T Becker
- Department of Neurology, University of Pittsburgh, Pittsburgh, Pennsylvania
| | - Brian W Junker
- Department of Statistics, Carnegie Mellon University, Pittsburgh, Pennsylvania
| | - Lewis H Kuller
- Department of Epidemiology, University of Pittsburgh, Pittsburgh, Pennsylvania
| | - Oscar L Lopez
- Department of Neurology, University of Pittsburgh, Pittsburgh, Pennsylvania
| | - Ryan J Tibshirani
- Department of Statistics, Carnegie Mellon University, Pittsburgh, Pennsylvania
| |
Collapse
|
19
|
|
20
|
Brooks LC, Farrow DC, Hyun S, Tibshirani RJ, Rosenfeld R. Nonmechanistic forecasts of seasonal influenza with iterative one-week-ahead distributions. PLoS Comput Biol 2018; 14:e1006134. [PMID: 29906286 PMCID: PMC6034894 DOI: 10.1371/journal.pcbi.1006134] [Citation(s) in RCA: 38] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2017] [Revised: 07/06/2018] [Accepted: 04/10/2018] [Indexed: 11/18/2022] Open
Abstract
Accurate and reliable forecasts of seasonal epidemics of infectious disease can assist in the design of countermeasures and increase public awareness and preparedness. This article describes two main contributions we made recently toward this goal: a novel approach to probabilistic modeling of surveillance time series based on "delta densities", and an optimization scheme for combining output from multiple forecasting methods into an adaptively weighted ensemble. Delta densities describe the probability distribution of the change between one observation and the next, conditioned on available data; chaining together nonparametric estimates of these distributions yields a model for an entire trajectory. Corresponding distributional forecasts cover more observed events than alternatives that treat the whole season as a unit, and improve upon multiple evaluation metrics when extracting key targets of interest to public health officials. Adaptively weighted ensembles integrate the results of multiple forecasting methods, such as delta density, using weights that can change from situation to situation. We treat selection of optimal weightings across forecasting methods as a separate estimation task, and describe an estimation procedure based on optimizing cross-validation performance. We consider some details of the data generation process, including data revisions and holiday effects, both in the construction of these forecasting methods and when performing retrospective evaluation. The delta density method and an adaptively weighted ensemble of other forecasting methods each improve significantly on the next best ensemble component when applied separately, and achieve even better cross-validated performance when used in conjunction. We submitted real-time forecasts based on these contributions as part of CDC's 2015/2016 FluSight Collaborative Comparison. Among the fourteen submissions that season, this system was ranked by CDC as the most accurate.
Collapse
Affiliation(s)
- Logan C. Brooks
- School of Computer Science, Carnegie Mellon University, Pittsburgh, Pennsylvania, United States of America
- * E-mail:
| | - David C. Farrow
- School of Computer Science, Carnegie Mellon University, Pittsburgh, Pennsylvania, United States of America
| | - Sangwon Hyun
- Department of Statistics, Carnegie Mellon University, Pittsburgh, Pennsylvania, United States of America
| | - Ryan J. Tibshirani
- School of Computer Science, Carnegie Mellon University, Pittsburgh, Pennsylvania, United States of America
- Department of Statistics, Carnegie Mellon University, Pittsburgh, Pennsylvania, United States of America
| | - Roni Rosenfeld
- School of Computer Science, Carnegie Mellon University, Pittsburgh, Pennsylvania, United States of America
| |
Collapse
|
21
|
Affiliation(s)
- Jing Lei
- Department of Statistics, Carnegie Mellon University, Pittsburgh, PA
| | - Max G’Sell
- Department of Statistics, Carnegie Mellon University, Pittsburgh, PA
| | | | | | - Larry Wasserman
- Department of Statistics, Carnegie Mellon University, Pittsburgh, PA
| |
Collapse
|
22
|
|
23
|
Biggerstaff M, Johansson M, Alper D, Brooks LC, Chakraborty P, Farrow DC, Hyun S, Kandula S, McGowan C, Ramakrishnan N, Rosenfeld R, Shaman J, Tibshirani R, Tibshirani RJ, Vespignani A, Yang W, Zhang Q, Reed C. Results from the second year of a collaborative effort to forecast influenza seasons in the United States. Epidemics 2018; 24:26-33. [PMID: 29506911 DOI: 10.1016/j.epidem.2018.02.003] [Citation(s) in RCA: 66] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2017] [Revised: 02/06/2018] [Accepted: 02/20/2018] [Indexed: 11/25/2022] Open
Abstract
Accurate forecasts could enable more informed public health decisions. Since 2013, CDC has worked with external researchers to improve influenza forecasts by coordinating seasonal challenges for the United States and the 10 Health and Human Service Regions. Forecasted targets for the 2014-15 challenge were the onset week, peak week, and peak intensity of the season and the weekly percent of outpatient visits due to influenza-like illness (ILI) 1-4 weeks in advance. We used a logarithmic scoring rule to score the weekly forecasts, averaged the scores over an evaluation period, and then exponentiated the resulting logarithmic score. Poor forecasts had a score near 0, and perfect forecasts a score of 1. Five teams submitted forecasts from seven different models. At the national level, the team scores for onset week ranged from <0.01 to 0.41, peak week ranged from 0.08 to 0.49, and peak intensity ranged from <0.01 to 0.17. The scores for predictions of ILI 1-4 weeks in advance ranged from 0.02-0.38 and was highest 1 week ahead. Forecast skill varied by HHS region. Forecasts can predict epidemic characteristics that inform public health actions. CDC, state and local health officials, and researchers are working together to improve forecasts.
Collapse
Affiliation(s)
- Matthew Biggerstaff
- Epidemiology and Prevention Branch, Influenza Division, Centers for Disease Control and Prevention, Atlanta, GA, USA.
| | - Michael Johansson
- Dengue Branch, Division of Vector-Borne Diseases, Centers for Disease Control and Prevention, Atlanta, GA, USA
| | | | - Logan C Brooks
- Department of Computer Science, Carnegie Mellon University, Pittsburg, PA, USA
| | - Prithwish Chakraborty
- Discovery Analytics Center, Department of Computer Science, Virginia Tech, Blacksburg, VA, USA
| | - David C Farrow
- Department of Computational Biology, Carnegie Mellon University, Pittsburg, PA, USA
| | - Sangwon Hyun
- Deptartment of Statistics, Carnegie Mellon University, Pittsburg, PA, USA
| | - Sasikiran Kandula
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, NY, USA
| | - Craig McGowan
- Epidemiology and Prevention Branch, Influenza Division, Centers for Disease Control and Prevention, Atlanta, GA, USA
| | - Naren Ramakrishnan
- Discovery Analytics Center, Department of Computer Science, Virginia Tech, Blacksburg, VA, USA
| | - Roni Rosenfeld
- Deptartment of Machine Learning, Department of Language Technologies, Department of Computational Biology, Department of Computer Science, Carnegie Mellon University, Pittsburg, PA, USA
| | - Jeffrey Shaman
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, NY, USA
| | - Rob Tibshirani
- Department of Health Research and Policy, Department of Statistics, Stanford University, Stanford, CA, USA
| | - Ryan J Tibshirani
- Deptartment of Statistics, Department of Machine Learning, Carnegie Mellon University, Pittsburg, PA, USA
| | | | - Wan Yang
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, NY, USA
| | - Qian Zhang
- Northeastern University, Boston, MA, USA
| | - Carrie Reed
- Epidemiology and Prevention Branch, Influenza Division, Centers for Disease Control and Prevention, Atlanta, GA, USA
| |
Collapse
|
24
|
|
25
|
|
26
|
|
27
|
Tibshirani RJ, Taylor J, Lockhart R, Tibshirani R. Rejoinder. J Am Stat Assoc 2016. [DOI: 10.1080/01621459.2016.1182787] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
28
|
|
29
|
|
30
|
|
31
|
|
32
|
|
33
|
|
34
|
Abstract
In the sparse linear regression setting, we consider testing the significance of the predictor variable that enters the current lasso model, in the sequence of models visited along the lasso solution path. We propose a simple test statistic based on lasso fitted values, called the covariance test statistic, and show that when the true model is linear, this statistic has an Exp(1) asymptotic distribution under the null hypothesis (the null being that all truly active variables are contained in the current lasso model). Our proof of this result for the special case of the first predictor to enter the model (i.e., testing for a single significant predictor variable against the global null) requires only weak assumptions on the predictor matrix X. On the other hand, our proof for a general step in the lasso path places further technical assumptions on X and the generative model, but still allows for the important high-dimensional case p > n, and does not necessarily require that the current lasso model achieves perfect recovery of the truly active variables. Of course, for testing the significance of an additional variable between two nested linear models, one typically uses the chi-squared test, comparing the drop in residual sum of squares (RSS) to a [Formula: see text] distribution. But when this additional variable is not fixed, and has been chosen adaptively or greedily, this test is no longer appropriate: adaptivity makes the drop in RSS stochastically much larger than [Formula: see text] under the null hypothesis. Our analysis explicitly accounts for adaptivity, as it must, since the lasso builds an adaptive sequence of linear models as the tuning parameter λ decreases. In this analysis, shrinkage plays a key role: though additional variables are chosen adaptively, the coefficients of lasso active variables are shrunken due to the [Formula: see text] penalty. Therefore, the test statistic (which is based on lasso fitted values) is in a sense balanced by these two opposing properties-adaptivity and shrinkage-and its null distribution is tractable and asymptotically Exp(1).
Collapse
Affiliation(s)
- Richard Lockhart
- Department of Statistics and Actuarial Science, Simon Fraser University, Burnaby, British Columbia V5A 1S6, Canada
| | - Jonathan Taylor
- Department of Statistics, Stanford University, Stanford, California 94305, USA
| | - Ryan J Tibshirani
- Departments of Statistics and Machine Learning, Carnegie Mellon University, 229B Baker Hall, Pittsburgh, Pennsylvania 15213, USA
| | - Robert Tibshirani
- Department of Health, Research & Policy, Department of Statistics, Stanford University, Stanford, California 94305, USA
| |
Collapse
|
35
|
|
36
|
|
37
|
|
38
|
|
39
|
Tibshirani R, Bien J, Friedman J, Hastie T, Simon N, Taylor J, Tibshirani RJ. Strong rules for discarding predictors in lasso-type problems. J R Stat Soc Series B Stat Methodol 2011; 74:245-266. [PMID: 25506256 DOI: 10.1111/j.1467-9868.2011.01004.x] [Citation(s) in RCA: 243] [Impact Index Per Article: 18.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
Abstract
We consider rules for discarding predictors in lasso regression and related problems, for computational efficiency. El Ghaoui and his colleagues have propose 'SAFE' rules, based on univariate inner products between each predictor and the outcome, which guarantee that a coefficient will be 0 in the solution vector. This provides a reduction in the number of variables that need to be entered into the optimization. We propose strong rules that are very simple and yet screen out far more predictors than the SAFE rules. This great practical improvement comes at a price: the strong rules are not foolproof and can mistakenly discard active predictors, i.e. predictors that have non-zero coefficients in the solution. We therefore combine them with simple checks of the Karush-Kuhn-Tucker conditions to ensure that the exact solution to the convex problem is delivered. Of course, any (approximate) screening method can be combined with the Karush-Kuhn-Tucker, conditions to ensure the exact solution; the strength of the strong rules lies in the fact that, in practice, they discard a very large number of the inactive predictors and almost never commit mistakes. We also derive conditions under which they are foolproof. Strong rules provide substantial savings in computational time for a variety of statistical optimization problems.
Collapse
|
40
|
|
41
|
|
42
|
|
43
|
Redelmeier DA, Tibshirani RJ. Car phones and car crashes: some popular misconceptions. CMAJ 2001; 164:1581-2. [PMID: 11402799 PMCID: PMC81115] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/20/2023] Open
Affiliation(s)
- D A Redelmeier
- Departments of Medicine and of Health Administration, University of Toronto, Toronto, Ont.
| | | |
Collapse
|
44
|
Affiliation(s)
- D A Redelmeier
- University of Toronto, Sunnybrook and Women's College Health Sciences Centre, Ontario, Canada.
| | | |
Collapse
|
45
|
Abstract
The case-crossover design is an innovative epidemiologic technique with distinct strengths and limitations. We review the fundamental logic of this self-matching non-randomized design and direct attention to 15 concerns related to the available data, unavailable data, analytic technique, quantitative statistics, and etiologic model. Implications for each concern are discussed in the context of a recent report on whether cellular telephone calls are associated with an increased risk of a motor vehicle collision. We suggest that an understanding of the case-crossover design may help investigators explore selected questions in behavioral medical research.
Collapse
Affiliation(s)
- D A Redelmeier
- Department of Medicine, University of Toronto, Ontario, Canada
| | | |
Collapse
|
46
|
Abstract
BACKGROUND Because of a belief that the use of cellular telephones while driving may cause collisions, several countries have restricted their use in motor vehicles, and others are considering such regulations. We used an epidemiologic method, the case-crossover design, to study whether using a cellular telephone while driving increases the risk of a motor vehicle collision. METHODS We studied 699 drivers who had cellular telephones and who were involved in motor vehicle collisions resulting in substantial property damage but no personal injury. Each person's cellular-telephone calls on the day of the collision and during the previous week were analyzed through the use of detailed billing records. RESULTS A total of 26,798 cellular-telephone calls were made during the 14-month study period. The risk of a collision when using a cellular telephone was four times higher than the risk when a cellular telephone was not being used (relative risk, 4.3; 95 percent confidence interval, 3.0 to 6.5). The relative risk was similar for drivers who differed in personal characteristics such as age and driving experience; calls close to the time of the collision were particularly hazardous (relative risk, 4.8 for calls placed within 5 minutes of the accident, as compared with 1.3 for calls placed more than 15 minutes before the accident; P<0.001); and units that allowed the hands to be free (relative risk, 5.9) offered no safety advantage over hand-held units (relative risk, 3.9; P not significant). Thirty-nine percent of the drivers called emergency services after the collision, suggesting that having a cellular telephone may have had advantages in the aftermath of an event. CONCLUSIONS The use of cellular telephones in motor vehicles is associated with a quadrupling of the risk of a collision during the brief time interval involving a call. Decisions about regulation of such telephones, however, need to take into account the benefits of the technology and the role of individual responsibility.
Collapse
Affiliation(s)
- D A Redelmeier
- Department of Medicine, University of Toronto, ON, Canada
| | | |
Collapse
|
47
|
Abstract
Homeless adults often visit emergency departments and often leave dissatisfied. We tested whether compassionate care, by improving patient satisfaction, can alter subsequent use of emergency services. We identified 133 consecutive homeless adults visiting one inner-city emergency department who were not acutely psychotic, extremely intoxicated, unable to speak English, or medically unstable. Half were randomly assigned to receive compassionate contact from trained volunteers. All patients otherwise had usual care and were followed for repeat visits to emergency departments. We found that rates of use were high, with patients making an average of seven visits a year (0.60 per month). More than a third of all patients made two or more visits within two days of each other. The average number of visits per month after intervention was significantly lower for patients who received compassionate care (0.43 vs 0.65, p = 0.018). Analyses adjusting for each patient's previous rate of use confirmed that compassionate care led to a one third reduction in the number of return visits within one month (95% CI 14 to 40%). Compassionate management of selected homeless adults decreases repeat visits to the emergency department. One explanation is that patients tend to return frequently until they are satisfied with their treatment.
Collapse
Affiliation(s)
- D A Redelmeier
- Department of Medicine, University of Toronto, Ontario, Canada
| | | | | |
Collapse
|
48
|
Williams WG, Rebeyka IM, Tibshirani RJ, Coles JG, Lightfoot NE, Mehra A, Freedom RM, Trusler GA. Warm induction blood cardioplegia in the infant. A technique to avoid rapid cooling myocardial contracture. J Thorac Cardiovasc Surg 1990; 100:896-901. [PMID: 2246912] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
The use of profound hypothermia and total circulatory arrest for repair of heart defects in neonates usually involves a period of systemic and myocardial bypass cooling. Rapid cooling of muscle (skeletal, smooth, and myocardial) can result in contracture through elevation of cytosolic calcium levels. The increased myocardial tone caused by cooling might render the heart more vulnerable to a subsequent period of cardioplegic ischemic arrest. Infants may be more susceptible to contracture because their small body mass allows more rapid myocardial temperature change when prearrest bypass cooling is used. The influence of avoiding rapid myocardial cooling before induced cardioplegic arrest was analyzed in a group of infants weighing less than 6 kg at the time of open cardiac operation. Myocardial ischemic arrest by warm (37 degrees C) induction blood cardioplegia was used in 57 infants and compared with results in 440 infants treated with standard blood cardioplegia. Multivariate logistic regression analysis revealed that patient diagnosis, weight, and age at operation were significant risk factors for operative mortality. The use of warm induction blood cardioplegia had a strongly positive independent effect on survival (p = 0.0003) for any patient weight, age, or diagnostic group. We recommend the avoidance of rapid myocardial cooling on bypass in all patients before induction of cardioplegic ischemic arrest.
Collapse
Affiliation(s)
- W G Williams
- Department of Cardiovascular Surgery, Hospital for Sick Children, Toronto, Ontario, Canada
| | | | | | | | | | | | | | | |
Collapse
|
49
|
Tibshirani RJ, Ciampi A. A family of proportional- and additive-hazards models for survival data. Biometrics 1983; 39:141-7. [PMID: 6871343] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
Abstract
A family of proportional- and additive-hazards models for the analysis of grouped survival data is developed. This family generalizes the unpublished work of F.J. Aranda-Ordaz and follows Holford (1976, Biometrics 32, 227-237). It contains the proportional-hazards model for grouped data, as well as additive-hazards models with time trends. The time trends prove to be useful in an example in which the hazards of the two groups cross.
Collapse
|