1
|
Sewell DK. Leveraging network structure to improve pooled testing efficiency. J R Stat Soc Ser C Appl Stat 2022; 71:1648-1662. [PMID: 36632279 PMCID: PMC9826453 DOI: 10.1111/rssc.12594] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2021] [Accepted: 08/11/2022] [Indexed: 02/01/2023]
Abstract
Screening is a powerful tool for infection control, allowing for infectious individuals, whether they be symptomatic or asymptomatic, to be identified and isolated. The resource burden of regular and comprehensive screening can often be prohibitive, however. One such measure to address this is pooled testing, whereby groups of individuals are each given a composite test; should a group receive a positive diagnostic test result, those comprising the group are then tested individually. Infectious disease is spread through a transmission network, and this paper shows how assigning individuals to pools based on this underlying network can improve the efficiency of the pooled testing strategy, thereby reducing the resource burden. We designed a simulated annealing algorithm to improve the pooled testing efficiency as measured by the ratio of the expected number of correct classifications to the expected number of tests performed. We then evaluated our approach using an agent-based model designed to simulate the spread of SARS-CoV-2 in a school setting. Our results suggest that our approach can decrease the number of tests required to regularly screen the student body, and that these reductions are quite robust to assigning pools based on partially observed or noisy versions of the network.
Collapse
|
2
|
da Silva VH, Goes CP, Trevisoli PA, Lello R, Clemente LG, de Almeida TB, Petrini J, Coutinho LL. Simulation of group testing scenarios can boost COVID-19 screening power. Sci Rep 2022; 12:11854. [PMID: 35831373 PMCID: PMC9277601 DOI: 10.1038/s41598-022-14626-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2021] [Accepted: 06/09/2022] [Indexed: 12/02/2022] Open
Abstract
The COVID-19 has severely affected economies and health systems around the world. Mass testing could work as a powerful alternative to restrain disease dissemination, but the shortage of reagents is a limiting factor. A solution to optimize test usage relies on ‘grouping’ or ‘pooling’ strategies, which combine a set of individuals in a single reaction. To compare different group testing configurations, we developed the poolingr package, which performs an innovative hybrid in silico/in vitro approach to search for optimal testing configurations. We used 6759 viral load values, observed in 2389 positive individuals, to simulate a wide range of scenarios. We found that larger groups (>100) framed into multi-stage setups (up to six stages) could largely boost the power to detect spreaders. Although the boost was dependent on the disease prevalence, our method could point to cheaper grouping schemes to better mitigate COVID-19 dissemination through identification and quarantine recommendation for positive individuals.
Collapse
Affiliation(s)
- Vinicius Henrique da Silva
- Department of Animal Science, Luiz de Queiroz College of Agriculture (ESALQ), University of São Paulo (USP), Piracicaba, Brazil
| | - Carolina Purcell Goes
- Department of Animal Science, Luiz de Queiroz College of Agriculture (ESALQ), University of São Paulo (USP), Piracicaba, Brazil
| | - Priscila Anchieta Trevisoli
- Department of Animal Science, Luiz de Queiroz College of Agriculture (ESALQ), University of São Paulo (USP), Piracicaba, Brazil
| | - Raquel Lello
- Department of Animal Science, Luiz de Queiroz College of Agriculture (ESALQ), University of São Paulo (USP), Piracicaba, Brazil
| | - Luan Gaspar Clemente
- Department of Animal Science, Luiz de Queiroz College of Agriculture (ESALQ), University of São Paulo (USP), Piracicaba, Brazil
| | | | - Juliana Petrini
- Department of Animal Science, Luiz de Queiroz College of Agriculture (ESALQ), University of São Paulo (USP), Piracicaba, Brazil
| | - Luiz Lehmann Coutinho
- Department of Animal Science, Luiz de Queiroz College of Agriculture (ESALQ), University of São Paulo (USP), Piracicaba, Brazil.
| |
Collapse
|
3
|
Sewell DK. Network-Informed Constrained Divisive Pooled Testing Assignments. Front Big Data 2022; 5:893760. [PMID: 35875594 PMCID: PMC9304576 DOI: 10.3389/fdata.2022.893760] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2022] [Accepted: 06/06/2022] [Indexed: 11/13/2022] Open
Abstract
Frequent universal testing in a finite population is an effective approach to preventing large infectious disease outbreaks. Yet when the target group has many constituents, this strategy can be cost prohibitive. One approach to alleviate the resource burden is to group multiple individual tests into one unit in order to determine if further tests at the individual level are necessary. This approach, referred to as a group testing or pooled testing, has received much attention in finding the minimum cost pooling strategy. Existing approaches, however, assume either independence or very simple dependence structures between individuals. This assumption ignores the fact that in the context of infectious diseases there is an underlying transmission network that connects individuals. We develop a constrained divisive hierarchical clustering algorithm that assigns individuals to pools based on the contact patterns between individuals. In a simulation study based on real networks, we show the benefits of using our proposed approach compared to random assignments even when the network is imperfectly measured and there is a high degree of missingness in the data.
Collapse
Affiliation(s)
- Daniel K. Sewell
- Department of Biostatistics, University of Iowa, Iowa City, IA, United States
| |
Collapse
|
4
|
Best AF, Malinovsky Y, Albert PS. The efficient design of Nested Group Testing algorithms for disease identification in clustered data. J Appl Stat 2022; 50:2228-2245. [PMID: 37434628 PMCID: PMC10332225 DOI: 10.1080/02664763.2022.2071419] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2021] [Accepted: 04/23/2022] [Indexed: 10/18/2022]
Abstract
Group testing study designs have been used since the 1940s to reduce screening costs for uncommon diseases; for rare diseases, all cases are identifiable with substantially fewer tests than the population size. Substantial research has identified efficient designs under this paradigm. However, little work has focused on the important problem of disease screening among clustered data, such as geographic heterogeneity in HIV prevalence. We evaluated designs where we first estimate disease prevalence and then apply efficient group testing algorithms using these estimates. Specifically, we evaluate prevalence using individual testing on a fixed-size subset of each cluster and use these prevalence estimates to choose group sizes that minimize the corresponding estimated average number of tests per subject. We compare designs where we estimate cluster-specific prevalences as well as a common prevalence across clusters, use different group testing algorithms, construct groups from individuals within and in different clusters, and consider misclassification. For diseases with low prevalence, our results suggest that accounting for clustering is unnecessary. However, for diseases with higher prevalence and sizeable between-cluster heterogeneity, accounting for clustering in study design and implementation improves efficiency. We consider the practical aspects of our design recommendations with two examples with strong clustering effects: (1) Identification of HIV carriers in the US population and (2) Laboratory screening of anti-cancer compounds using cell lines.
Collapse
Affiliation(s)
- Ana F. Best
- Biostatistics Branch, Biometrics Research Program, Division of Cancer Treatment and Diagnosis, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - Yaakov Malinovsky
- Department of Mathematics and Statistics, University of Maryland Baltimore County, Baltimore, MD, USA
| | - Paul S. Albert
- Biostatistics Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| |
Collapse
|
5
|
Tsirtsis S, De A, Lorch L, Gomez-Rodriguez M. Pooled testing of traced contacts under superspreading dynamics. PLoS Comput Biol 2022; 18:e1010008. [PMID: 35344547 PMCID: PMC8989305 DOI: 10.1371/journal.pcbi.1010008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2021] [Revised: 04/07/2022] [Accepted: 03/10/2022] [Indexed: 11/18/2022] Open
Abstract
Testing is recommended for all close contacts of confirmed COVID-19 patients. However, existing pooled testing methods are oblivious to the circumstances of contagion provided by contact tracing. Here, we build upon a well-known semi-adaptive pooled testing method, Dorfman's method with imperfect tests, and derive a simple pooled testing method based on dynamic programming that is specifically designed to use information provided by contact tracing. Experiments using a variety of reproduction numbers and dispersion levels, including those estimated in the context of the COVID-19 pandemic, show that the pools found using our method result in a significantly lower number of tests than those found using Dorfman's method. Our method provides the greatest competitive advantage when the number of contacts of an infected individual is small, or the distribution of secondary infections is highly overdispersed. Moreover, it maintains this competitive advantage under imperfect contact tracing and significant levels of dilution.
Collapse
Affiliation(s)
- Stratis Tsirtsis
- Μax Planck Institute for Software Systems, Kaiserslautern, Germany
| | | | | | | |
Collapse
|
6
|
Bilder CR, Tebbs JM, McMahan CS. Discussion on "Is group testing ready for prime-time in disease identification". Stat Med 2021; 40:3881-3886. [PMID: 34251038 PMCID: PMC8441930 DOI: 10.1002/sim.8988] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2021] [Accepted: 03/25/2021] [Indexed: 11/08/2022]
Affiliation(s)
- Christopher R Bilder
- Department of Statistics, University of Nebraska-Lincoln, Lincoln, Nebraska, USA
| | - Joshua M Tebbs
- Department of Statistics, University of South Carolina, Columbia, South Carolina, USA
| | - Christopher S McMahan
- School of Mathematical and Statistical Sciences, Clemson University, Clemson, South Carolina, USA
| |
Collapse
|
7
|
Haber G, Malinovsky Y, Albert PS. Is group testing ready for prime-time in disease identification? Stat Med 2021; 40:3865-3880. [PMID: 33913183 DOI: 10.1002/sim.9003] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2020] [Revised: 02/19/2021] [Accepted: 03/22/2021] [Indexed: 12/18/2022]
Abstract
Large-scale disease screening is a complicated process in which high costs must be balanced against pressing public health needs. When the goal is screening for infectious disease, one approach is group testing in which samples are initially tested in pools and individual samples are retested only if the initial pooled test was positive. Intuitively, if the prevalence of infection is small, this could result in a large reduction of the total number of tests required. Despite this, the use of group testing in medical studies has been limited, largely due to skepticism about the impact of pooling on the accuracy of a given assay. While there is a large body of research addressing the issue of testing errors in group testing studies, it is customary to assume that the misclassification parameters are known from an external population and/or that the values do not change with the group size. Both of these assumptions are highly questionable for many medical practitioners considering group testing in their study design. In this article, we explore how the failure of these assumptions might impact the efficacy of a group testing design and, consequently, whether group testing is currently feasible for medical screening. Specifically, we look at how incorrect assumptions about the sensitivity function at the design stage can lead to poor estimation of a procedure's overall sensitivity and expected number of tests. Furthermore, if a validation study is used to estimate the pooled misclassification parameters of a given assay, we show that the sample sizes required are so large as to be prohibitive in all but the largest screening programs.
Collapse
Affiliation(s)
- Gregory Haber
- Biostatistics Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, Maryland, USA
| | - Yaakov Malinovsky
- Department of Mathematics and Statistics, University of Maryland, Baltimore County, Baltimore, Maryland, USA
| | - Paul S Albert
- Biostatistics Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, Maryland, USA
| |
Collapse
|
8
|
Malinovsky Y, Haber G, Albert PS. An optimal design for hierarchical generalized group testing. J R Stat Soc Ser C Appl Stat 2020. [DOI: 10.1111/rssc.12409] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
|
9
|
Hitt BD, Bilder CR, Tebbs JM, McMahan CS. The objective function controversy for group testing: Much ado about nothing? Stat Med 2019; 38:4912-4923. [PMID: 31469188 DOI: 10.1002/sim.8341] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2018] [Revised: 06/06/2019] [Accepted: 07/19/2019] [Indexed: 11/09/2022]
Abstract
Group testing is an indispensable tool for laboratories when testing high volumes of clinical specimens for infectious diseases. An important decision that needs to be made prior to implementation is determining what group sizes to use. In best practice, an objective function is chosen and then minimized to determine an optimal set of these group sizes, known as the optimal testing configuration (OTC). There are a few options for objective functions, and they differ based on how the expected number of tests, assay characteristics, and testing constraints are taken into account. These varied options have led to a recent controversy in the literature regarding which of two different objective functions is better. In our paper, we examine these objective functions over a number of realistic situations for infectious disease testing. We show that this controversy may be much ado about nothing because the OTCs and corresponding results (eg, number of tests and accuracy) are largely the same for standard testing algorithms in a wide variety of situations.
Collapse
Affiliation(s)
- Brianna D Hitt
- Department of Statistics, University of Nebraska-Lincoln, Lincoln, Nebraska
| | | | - Joshua M Tebbs
- Department of Statistics, University of South Carolina, Columbia, South Carolina
| | - Christopher S McMahan
- School of Mathematical and Statistical Sciences, Clemson University, Clemson, South Carolina
| |
Collapse
|
10
|
|
11
|
Malinovsky Y, Albert PS. Revisiting Nested Group Testing Procedures: New Results, Comparisons, and Robustness. AM STAT 2018; 73:117-125. [PMID: 31814627 DOI: 10.1080/00031305.2017.1366367] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Abstract
Group testing has its origin in the identification of syphilis in the U.S. army during World War II. Much of the theoretical framework of group testing was developed starting in the late 1950s, with continued work into the 1990s. Recently, with the advent of new laboratory and genetic technologies, there has been an increasing interest in group testing designs for cost saving purposes. In this article, we compare different nested designs, including Dorfman, Sterrett and an optimal nested procedure obtained through dynamic programming. To elucidate these comparisons, we develop closed-form expressions for the optimal Sterrett procedure and provide a concise review of the prior literature for other commonly used procedures. We consider designs where the prevalence of disease is known as well as investigate the robustness of these procedures, when it is incorrectly assumed. This article provides a technical presentation that will be of interest to researchers as well as from a pedagogical perspective. Supplementary material for this article available online.
Collapse
Affiliation(s)
- Yaakov Malinovsky
- Department of Mathematics and Statistics University of Maryland, Baltimore County, Baltimore, MD
| | - Paul S Albert
- Biostatistics Branch, Division of Cancer Epidemiology and Genetics National Cancer Institute, Rockville, MD
| |
Collapse
|
12
|
Affiliation(s)
- Gregory Haber
- Department of Mathematics and Statistics, University of Maryland, Baltimore County, Baltimore, Maryland, USA
| | - Yaakov Malinovsky
- Department of Mathematics and Statistics, University of Maryland, Baltimore County, Baltimore, Maryland, USA
| | - Paul S. Albert
- Biostatistics Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, Maryland, USA
| |
Collapse
|
13
|
Wang D, McMahan CS, Tebbs JM, Bilder CR. Group testing case identification with biomarker information. Comput Stat Data Anal 2018; 122:156-166. [PMID: 29977101 DOI: 10.1016/j.csda.2018.01.005] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
Abstract
Screening procedures for infectious diseases, such as HIV, often involve pooling individual specimens together and testing the pools. For diseases with low prevalence, group testing (or pooled testing) can be used to classify individuals as diseased or not while providing considerable cost savings when compared to testing specimens individually. The pooling literature is replete with group testing case identification algorithms including Dorfman testing, higher-stage hierarchical procedures, and array testing. Although these algorithms are usually evaluated on the basis of the expected number of tests and classification accuracy, most evaluations in the literature do not account for the continuous nature of the testing responses and thus invoke potentially restrictive assumptions to characterize an algorithm's performance. Commonly used case identification algorithms in group testing are considered and are evaluated by taking a different approach. Instead of treating testing responses as binary random variables (i.e., diseased/not), evaluations are made by exploiting an assay's underlying continuous biomarker distributions for positive and negative individuals. In doing so, a general framework to describe the operating characteristics of group testing case identification algorithms is provided when these distributions are known. The methodology is illustrated using two HIV testing examples taken from the pooling literature.
Collapse
Affiliation(s)
- Dewei Wang
- Department of Statistics, University of South Carolina, Columbia, SC 29208, USA
| | | | - Joshua M Tebbs
- Department of Statistics, University of South Carolina, Columbia, SC 29208, USA
| | - Christopher R Bilder
- Department of Statistics, University of Nebraska-Lincoln, Lincoln, NE 68583, USA
| |
Collapse
|
14
|
|
15
|
Hou P, Tebbs JM, Bilder CR, McMahan CS. Hierarchical group testing for multiple infections. Biometrics 2016; 73:656-665. [PMID: 27657666 DOI: 10.1111/biom.12589] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2015] [Revised: 06/01/2016] [Accepted: 08/01/2016] [Indexed: 11/30/2022]
Abstract
Group testing, where individuals are tested initially in pools, is widely used to screen a large number of individuals for rare diseases. Triggered by the recent development of assays that detect multiple infections at once, screening programs now involve testing individuals in pools for multiple infections simultaneously. Tebbs, McMahan, and Bilder (2013, Biometrics) recently evaluated the performance of a two-stage hierarchical algorithm used to screen for chlamydia and gonorrhea as part of the Infertility Prevention Project in the United States. In this article, we generalize this work to accommodate a larger number of stages. To derive the operating characteristics of higher-stage hierarchical algorithms with more than one infection, we view the pool decoding process as a time-inhomogeneous, finite-state Markov chain. Taking this conceptualization enables us to derive closed-form expressions for the expected number of tests and classification accuracy rates in terms of transition probability matrices. When applied to chlamydia and gonorrhea testing data from four states (Region X of the United States Department of Health and Human Services), higher-stage hierarchical algorithms provide, on average, an estimated 11% reduction in the number of tests when compared to two-stage algorithms. For applications with rarer infections, we show theoretically that this percentage reduction can be much larger.
Collapse
Affiliation(s)
- Peijie Hou
- Department of Statistics, University of South Carolina, Columbia, South Carolina 29208, U.S.A
| | - Joshua M Tebbs
- Department of Statistics, University of South Carolina, Columbia, South Carolina 29208, U.S.A
| | - Christopher R Bilder
- Department of Statistics, University of Nebraska-Lincoln, Lincoln, Nebraska 68583, U.S.A
| | - Christopher S McMahan
- Department of Mathematical Sciences, Clemson University, Clemson, South Carolina 29634, U.S.A
| |
Collapse
|