1
|
Nambiar A, Pan C, Rana V, Cheraghchi M, Ribeiro J, Maslov S, Milenkovic O. Semi-quantitative group testing for efficient and accurate qPCR screening of pathogens with a wide range of loads. BMC Bioinformatics 2024; 25:195. [PMID: 38760692 PMCID: PMC11100062 DOI: 10.1186/s12859-024-05798-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2023] [Accepted: 04/26/2024] [Indexed: 05/19/2024] Open
Abstract
BACKGROUND Pathogenic infections pose a significant threat to global health, affecting millions of people every year and presenting substantial challenges to healthcare systems worldwide. Efficient and timely testing plays a critical role in disease control and transmission prevention. Group testing is a well-established method for reducing the number of tests needed to screen large populations when the disease prevalence is low. However, it does not fully utilize the quantitative information provided by qPCR methods, nor is it able to accommodate a wide range of pathogen loads. RESULTS To address these issues, we introduce a novel adaptive semi-quantitative group testing (SQGT) scheme to efficiently screen populations via two-stage qPCR testing. The SQGT method quantizes cycle threshold (Ct) values into multiple bins, leveraging the information from the first stage of screening to improve the detection sensitivity. Dynamic Ct threshold adjustments mitigate dilution effects and enhance test accuracy. Comparisons with traditional binary outcome GT methods show that SQGT reduces the number of tests by 24% on the only complete real-world qPCR group testing dataset from Israel, while maintaining a negligible false negative rate. CONCLUSION In conclusion, our adaptive SQGT approach, utilizing qPCR data and dynamic threshold adjustments, offers a promising solution for efficient population screening. With a reduction in the number of tests and minimal false negatives, SQGT holds potential to enhance disease control and testing strategies on a global scale.
Collapse
Affiliation(s)
- Ananthan Nambiar
- Department of Bioengineering, University of Illinois Urbana-Champaign, Urbana, IL, USA.
| | - Chao Pan
- Department of Electrical and Computer Engineering, University of Illinois Urbana-Champaign, Urbana, IL, USA
- Center for Artificial Intelligence and Modeling, Carl R. Woese Institute for Genomic Biology, University of Illinois Urbana-Champaign, Urbana, IL, USA
| | - Vishal Rana
- Department of Electrical and Computer Engineering, University of Illinois Urbana-Champaign, Urbana, IL, USA
| | - Mahdi Cheraghchi
- Department of Electrical Engineering and Computer Science, University of Michigan, Ann Arbor, MI, USA
| | - João Ribeiro
- NOVA LINCS and NOVA School of Science and Technology, Caparica, Portugal
| | - Sergei Maslov
- Department of Bioengineering, University of Illinois Urbana-Champaign, Urbana, IL, USA.
- Center for Artificial Intelligence and Modeling, Carl R. Woese Institute for Genomic Biology, University of Illinois Urbana-Champaign, Urbana, IL, USA.
| | - Olgica Milenkovic
- Department of Electrical and Computer Engineering, University of Illinois Urbana-Champaign, Urbana, IL, USA.
- Center for Artificial Intelligence and Modeling, Carl R. Woese Institute for Genomic Biology, University of Illinois Urbana-Champaign, Urbana, IL, USA.
| |
Collapse
|
2
|
Roy S, Adhya S, Rana S. Estimation of odds ratio from group testing data with misclassified exposure. Biom J 2024; 66:e2200254. [PMID: 38285402 DOI: 10.1002/bimj.202200254] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2022] [Revised: 08/21/2023] [Accepted: 09/17/2023] [Indexed: 01/30/2024]
Abstract
For low prevalence disease, we consider estimation of the odds ratio for two specified groups of individuals using group testing data. Broadly the two groups may be classified as "the exposed" and "the unexposed." Often in observational studies, the exposure status is not correctly recorded. In addition, diagnostic tests are rarely completely accurate. The proposed model accounts for imperfect sensitivity and specificity of diagnostic tests along with the misclassification in the exposure status. For model identifiability, we make use of internal validation data, where a subsample of reasonably small size is selected from the original sample by simple random sampling without replacement. Pseudo-maximum likelihood method is employed for the estimation of the model parameters. The performance of group testing methodology is compared with individual testing for different parametric configurations. A limited data study related to COVID-19 prevalence is performed to illustrate the methodology.
Collapse
Affiliation(s)
- Surupa Roy
- Department of Statistics, St Xavier's College (Autonomous), Kolkata, West Bengal, India
| | - Sumanta Adhya
- Department of Statistics, West Bengal State University, Kolkata, West Bengal, India
| | - Subrata Rana
- Department of Statistics, Krishnagar Government College, Kolkata, West Bengal, India
| |
Collapse
|
3
|
Tatsuoka C, Chen W, Lu X. Bayesian group testing with dilution effects. Biostatistics 2023; 24:885-900. [PMID: 35403204 PMCID: PMC10583721 DOI: 10.1093/biostatistics/kxac004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2021] [Revised: 01/18/2022] [Accepted: 01/26/2022] [Indexed: 10/19/2023] Open
Abstract
A Bayesian framework for group testing under dilution effects has been developed, using lattice-based models. This work has particular relevance given the pressing public health need to enhance testing capacity for coronavirus disease 2019 and future pandemics, and the need for wide-scale and repeated testing for surveillance under constantly varying conditions. The proposed Bayesian approach allows for dilution effects in group testing and for general test response distributions beyond just binary outcomes. It is shown that even under strong dilution effects, an intuitive group testing selection rule that relies on the model order structure, referred to as the Bayesian halving algorithm, has attractive optimal convergence properties. Analogous look-ahead rules that can reduce the number of stages in classification by selecting several pooled tests at a time are proposed and evaluated as well. Group testing is demonstrated to provide great savings over individual testing in the number of tests needed, even for moderately high prevalence levels. However, there is a trade-off with higher number of testing stages, and increased variability. A web-based calculator is introduced to assist in weighing these factors and to guide decisions on when and how to pool under various conditions. High-performance distributed computing methods have also been implemented for considering larger pool sizes, when savings from group testing can be even more dramatic.
Collapse
Affiliation(s)
- Curtis Tatsuoka
- Department of Population and Quantitative Health Sciences, CaseWestern Reserve University, Cleveland, OH, 44106, USA
| | - Weicong Chen
- Department of Computer and Data Science, CaseWestern Reserve University, Cleveland, OH, USA
| | - Xiaoyi Lu
- Department of Computer Science and Engineering, University of California Merced, Merced, CA, 95343, USA
| |
Collapse
|
4
|
Warasi S, Tebbs JM, McMahan CS, Bilder CR. Estimating the prevalence of two or more diseases using outcomes from multiplex group testing. Biom J 2023; 65:e2200270. [PMID: 37192524 PMCID: PMC11099910 DOI: 10.1002/bimj.202200270] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2022] [Revised: 02/26/2023] [Accepted: 03/01/2023] [Indexed: 05/18/2023]
Abstract
When screening a population for infectious diseases, pooling individual specimens (e.g., blood, swabs, urine, etc.) can provide enormous cost savings when compared to testing specimens individually. In the biostatistics literature, testing pools of specimens is commonly known as group testing or pooled testing. Although estimating a population-level prevalence with group testing data has received a large amount of attention, most of this work has focused on applications involving a single disease, such as human immunodeficiency virus. Modern methods of screening now involve testing pools and individuals for multiple diseases simultaneously through the use of multiplex assays. Hou et al. (2017, Biometrics, 73, 656-665) and Hou et al. (2020, Biostatistics, 21, 417-431) recently proposed group testing protocols for multiplex assays and derived relevant case identification characteristics, including the expected number of tests and those which quantify classification accuracy. In this article, we describe Bayesian methods to estimate population-level disease probabilities from implementing these protocols or any other multiplex group testing protocol which might be carried out in practice. Our estimation methods can be used with multiplex assays for two or more diseases while incorporating the possibility of test misclassification for each disease. We use chlamydia and gonorrhea testing data collected at the State Hygienic Laboratory at the University of Iowa to illustrate our work. We also provide an online R resource practitioners can use to implement the methods in this article.
Collapse
Affiliation(s)
- S. Warasi
- Department of Mathematics and Statistics, Radford University, Radford, VA 24142, USA
| | - Joshua M. Tebbs
- Department of Statistics, University of South Carolina, Columbia, SC 29208, USA
| | - Christopher S. McMahan
- School of Mathematical and Statistical Sciences, Clemson University, Clemson, SC 29634, USA
| | | |
Collapse
|
5
|
Ebert TA, Shawer D, Brlansky RH, Rogers ME. Seasonal Patterns in the Frequency of Candidatus Liberibacter Asiaticus in Populations of Diaphorina citri (Hemiptera: Psyllidae) in Florida. INSECTS 2023; 14:756. [PMID: 37754724 PMCID: PMC10532026 DOI: 10.3390/insects14090756] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/25/2023] [Revised: 09/04/2023] [Accepted: 09/06/2023] [Indexed: 09/28/2023]
Abstract
Candidatus Liberibacter asiaticus (CLas) is one of the putative causal agents of huanglongbing, which is a serious disease in citrus production. The pathogen is transmitted by Diaphorina citri Kuwayama (Hemiptera: Psyllidae). As an observational study, six groves in central Florida and one grove at the southern tip of Florida were sampled monthly from January 2008 through February 2012 (50 months). The collected psyllids were sorted by sex and abdominal color. Disease prevalence in adults peaked in November, with a minor peak in February. Gray/brown females had the highest prevalence, and blue/green individuals of either sex had the lowest prevalence. CLas prevalence in blue/green females was highly correlated with the prevalence in other sexes and colors. Thus, the underlying causes for seasonal fluctuations in prevalence operated in a similar fashion for all psyllids. The pattern was caused by larger nymphs displacing smaller ones from the optimal feeding sites and immunological robustness in different sex-color morphotypes. Alternative hypotheses were also considered. Improving our understanding of biological interactions and how to sample them will improve management decisions. We agree with other authors that psyllid management is critical year-round.
Collapse
Affiliation(s)
- Timothy A. Ebert
- Citrus Research and Education Center, University of Florida, 700 Experiment Station Rd., Lake Alfred, FL 33850, USA; (R.H.B.); (M.E.R.)
| | - Dalia Shawer
- Department of Economic Entomology, Faculty of Agriculture, Kafr Elsheikh University, Kafr Elsheikh 33516, Egypt;
| | - Ron H. Brlansky
- Citrus Research and Education Center, University of Florida, 700 Experiment Station Rd., Lake Alfred, FL 33850, USA; (R.H.B.); (M.E.R.)
| | - Michael E. Rogers
- Citrus Research and Education Center, University of Florida, 700 Experiment Station Rd., Lake Alfred, FL 33850, USA; (R.H.B.); (M.E.R.)
| |
Collapse
|
6
|
Cabrera ODLC, Alsehibani R. Statistical modeling and evaluation of the impact of multiplicity classification thresholds on the COVID-19 pool testing accuracy. PLoS One 2023; 18:e0283874. [PMID: 37494364 PMCID: PMC10370739 DOI: 10.1371/journal.pone.0283874] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2022] [Accepted: 03/17/2023] [Indexed: 07/28/2023] Open
Abstract
Prior research on pool testing focus on developing testing methods with the main objective of reducing the total number of tests. However, pool testing can also be used to improve the accuracy of the testing process. The objective of this paper is to improve the accuracy of pool testing using the same number of tests as that of individual testing taking into consideration the probability of testing errors and pool multiplicity classification thresholds. Statistical models are developed to evaluate the impact of pool multiplicity classiffcation thresholds on pool testing accuracy using the receiver operating characteristic (ROC) curve and the area under the curve (AUC). The findings indicate that under certain conditions, pool testing multiplicity yields superior testing accuracy compared to individual testing without additional cost. The results reveal that selecting the multiplicity classification threshold is a critical factor in improving the pool testing accuracy and show that the lower the prevalence level the higher the gains in accuracy using multiplicity pool testing. The findings also indicate that performance can be improved using a batch size that is inversely proportional to the prevalence level. Furthermore, the results indicate that multiplicity pool testing not only improves the testing accuracy but also reduces the total cost of the testing process. Based on the findings, the manufacturer's test sensitivity has more significant impact on the accuracy of multiplicity pool testing compared to that of manufacturer's test specificity.
Collapse
Affiliation(s)
- Omar De La Cruz Cabrera
- Department of Mathematical Sciences, Kent State University, Kent, OH, United States of America
| | - Razan Alsehibani
- Department of Mathematical Sciences, Kent State University, Kent, OH, United States of America
| |
Collapse
|
7
|
Brust D, Brust JJ. Effective matrix designs for COVID-19 group testing. BMC Bioinformatics 2023; 24:26. [PMID: 36694117 PMCID: PMC9872308 DOI: 10.1186/s12859-023-05145-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2022] [Accepted: 01/10/2023] [Indexed: 01/25/2023] Open
Abstract
BACKGROUND Grouping samples with low prevalence of positives into pools and testing these pools can achieve considerable savings in testing resources compared with individual testing in the context of COVID-19. We review published pooling matrices, which encode the assignment of samples into pools and describe decoding algorithms, which decode individual samples from pools. Based on the findings we propose new one-round pooling designs with high compression that can efficiently be decoded by combinatorial algorithms. This expands the admissible parameter space for the construction of pooling matrices compared to current methods. RESULTS By arranging samples in a grid and using polynomials to construct pools, we develop direct formulas for an Algorithm (Polynomial Pools (PP)) to generate assignments of samples into pools. Designs from PP guarantee to correctly decode all samples with up to a specified number of positive samples. PP includes recent combinatorial methods for COVID-19, and enables new constructions that can result in more effective designs. CONCLUSION For low prevalences of COVID-19, group tests can save resources when compared to individual testing. Constructions from the recent literature on combinatorial methods have gaps with respect to the designs that are available. We develop a method (PP), which generalizes previous constructions and enables new designs that can be advantageous in various situations.
Collapse
Affiliation(s)
- David Brust
- Institute of Future Fuels, German Aerospace Center (DLR), Jülich, Germany
| | - Johannes J. Brust
- Department of Mathematics, University of California, San Diego, San Diego, USA
| |
Collapse
|
8
|
Shah MP, Chebore W, Lyles RH, Otieno K, Zhou Z, Plucinski M, Waller LA, Odongo W, Lindblade KA, Kariuki S, Samuels AM, Desai M, Mitchell RM, Shi YP. Novel application of one-step pooled molecular testing and maximum likelihood approaches to estimate the prevalence of malaria parasitaemia among rapid diagnostic test negative samples in western Kenya. Malar J 2022; 21:319. [PMID: 36336700 PMCID: PMC9638440 DOI: 10.1186/s12936-022-04323-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2022] [Accepted: 10/07/2022] [Indexed: 11/08/2022] Open
Abstract
Abstract
Background
Detection of malaria parasitaemia in samples that are negative by rapid diagnostic tests (RDTs) requires resource-intensive molecular tools. While pooled testing using a two-step strategy provides a cost-saving alternative to the gold standard of individual sample testing, statistical adjustments are needed to improve accuracy of prevalence estimates for a single step pooled testing strategy.
Methods
A random sample of 4670 malaria RDT negative dried blood spot samples were selected from a mass testing and treatment trial in Asembo, Gem, and Karemo, western Kenya. Samples were tested for malaria individually and in pools of five, 934 pools, by one-step quantitative polymerase chain reaction (qPCR). Maximum likelihood approaches were used to estimate subpatent parasitaemia (RDT-negative, qPCR-positive) prevalence by pooling, assuming poolwise sensitivity and specificity was either 100% (strategy A) or imperfect (strategy B). To improve and illustrate the practicality of this estimation approach, a validation study was constructed from pools allocated at random into main (734 pools) and validation (200 pools) subsets. Prevalence was estimated using strategies A and B and an inverse-variance weighted estimator and estimates were weighted to account for differential sampling rates by area.
Results
The prevalence of subpatent parasitaemia was 14.5% (95% CI 13.6–15.3%) by individual qPCR, 9.5% (95% CI (8.5–10.5%) by strategy A, and 13.9% (95% CI 12.6–15.2%) by strategy B. In the validation study, the prevalence by individual qPCR was 13.5% (95% CI 12.4–14.7%) in the main subset, 8.9% (95% CI 7.9–9.9%) by strategy A, 11.4% (95% CI 9.9–12.9%) by strategy B, and 12.8% (95% CI 11.2–14.3%) using inverse-variance weighted estimator from poolwise validation. Pooling, including a 20% validation subset, reduced costs by 52% compared to individual testing.
Conclusions
Compared to individual testing, a one-step pooled testing strategy with an internal validation subset can provide accurate prevalence estimates of PCR-positivity among RDT-negatives at a lower cost.
Collapse
|
9
|
Self S, McMahan C, Mokalled S. Capturing the pool dilution effect in group testing regression: A Bayesian approach. Stat Med 2022; 41:4682-4696. [PMID: 35879887 PMCID: PMC9489666 DOI: 10.1002/sim.9532] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2021] [Revised: 07/01/2022] [Accepted: 07/05/2022] [Indexed: 01/07/2023]
Abstract
Group (pooled) testing is becoming a popular strategy for screening large populations for infectious diseases. This popularity is owed to the cost savings that can be realized through implementing group testing methods. These methods involve physically combining biomaterial (eg, saliva, blood, urine) collected on individuals into pooled specimens which are tested for an infection of interest. Through testing these pooled specimens, group testing methods reduce the cost of diagnosing all individuals under study by reducing the number of tests performed. Even though group testing offers substantial cost reductions, some practitioners are hesitant to adopt group testing methods due to the so-called dilution effect. The dilution effect describes the phenomenon in which biomaterial from negative individuals dilute the contributions from positive individuals to such a degree that a pool is incorrectly classified. Ignoring the dilution effect can reduce classification accuracy and lead to bias in parameter estimates and inaccurate inference. To circumvent these issues, we propose a Bayesian regression methodology which directly acknowledges the dilution effect while accommodating data that arises from any group testing protocol. As a part of our estimation strategy, we are able to identify pool specific optimal classification thresholds which are aimed at maximizing the classification accuracy of the group testing protocol being implemented. These two features working in concert effectively alleviate the primary concerns raised by practitioners regarding group testing. The performance of our methodology is illustrated via an extensive simulation study and by being applied to Hepatitis B data collected on Irish prisoners.
Collapse
Affiliation(s)
- Stella Self
- Department of Epidemiology and Biostatistics, Arnold School of Public HealthUniversity of South CarolinaColumbiaSouth CarolinaUSA
| | - Christopher McMahan
- School of Mathematical and Statistical SciencesClemson UniversityClemsonSouth CarolinaUSA
| | - Stefani Mokalled
- School of Mathematical and Statistical SciencesClemson UniversityClemsonSouth CarolinaUSA
| |
Collapse
|
10
|
Xiong W, Ding J, Zhang W, Liu A, Li Q. Nested Group Testing Procedure. COMMUNICATIONS IN MATHEMATICS AND STATISTICS 2022; 11:1-31. [PMID: 36213843 PMCID: PMC9525165 DOI: 10.1007/s40304-021-00269-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/09/2021] [Revised: 06/27/2021] [Accepted: 10/13/2021] [Indexed: 06/16/2023]
Abstract
We investigated the false-negative, true-negative, false-positive, and true-positive predictive values from a general group testing procedure for a heterogeneous population. We show that its false (true)-negative predictive value of a specimen is larger (smaller), and the false (true)-positive predictive value is smaller (larger) than that from individual testing procedure, where the former is in aversion. Then we propose a nested group testing procedure, and show that it can keep the sterling characteristics and also improve the false-negative predictive values for a specimen, not larger than that from individual testing. These characteristics are studied from both theoretical and numerical points of view. The nested group testing procedure is better than individual testing on both false-positive and false-negative predictive values, while retains the efficiency as a basic characteristic of a group testing procedure. Applications to Dorfman's, Halving and Sterrett procedures are discussed. Results from extensive simulation studies and an application to malaria infection in microscopy-negative Malawian women exemplify the findings.
Collapse
Affiliation(s)
- Wenjun Xiong
- School of Mathematics and Statistics, Guangxi Normal University, Guilin, 541004 People’s Republic of China
| | - Juan Ding
- Department of Information and Computing Science, College of Sciences, Hohai University, Nanjing, 210098 People’s Republic of China
| | - Wei Zhang
- LSC, NCMIS, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, 100190 People’s Republic of China
| | - Aiyi Liu
- Biostatisics and Bioinformatics Branch, Eunice Kennedy Shriver National Institute of Child Health, Bethesda, 20817 USA
| | - Qizhai Li
- LSC, NCMIS, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, 100190 People’s Republic of China
- University of Chinese Academy of Sciences, Beijing, 100049 People’s Republic of China
| |
Collapse
|
11
|
Warasi MS, Hungerford LL, Lahmers K. Optimizing Pooled Testing for Estimating the Prevalence of Multiple Diseases. JOURNAL OF AGRICULTURAL, BIOLOGICAL AND ENVIRONMENTAL STATISTICS 2022; 27:713-727. [PMID: 35975123 PMCID: PMC9373899 DOI: 10.1007/s13253-022-00511-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/01/2022] [Revised: 05/27/2022] [Accepted: 07/27/2022] [Indexed: 11/25/2022]
Abstract
Pooled testing can enhance the efficiency of diagnosing individuals with diseases of low prevalence. Often, pooling is implemented using standard groupings (2, 5, 10, etc.). On the other hand, optimization theory can provide specific guidelines in finding the ideal pool size and pooling strategy. This article focuses on optimizing the precision of disease prevalence estimators calculated from multiplex pooled testing data. In the context of a surveillance application of animal diseases, we study the estimation efficiency (i.e., precision) and cost efficiency of the estimators with adjustments for the number of expended tests. This enables us to determine the pooling strategies that offer the highest benefits when jointly estimating the prevalence of multiple diseases, such as theileriosis and anaplasmosis. The outcomes of our work can be used in designing pooled testing protocols, not only in simple pooling scenarios but also in more complex scenarios where individual retesting is performed in order to identify positive cases. A software application using the shiny package in R is provided with this article to facilitate implementation of our methods. Supplementary materials accompanying this paper appear online.
Collapse
Affiliation(s)
- Md S. Warasi
- Department of Mathematics and Statistics, Radford University, Whitt Hall 224, Radford, VA 24142 USA
| | - Laura L. Hungerford
- Virginia-Maryland College of Veterinary Medicine, Virginia Tech, Blacksburg, VA 24061 USA
| | - Kevin Lahmers
- Virginia-Maryland College of Veterinary Medicine, Virginia Tech, Blacksburg, VA 24061 USA
| |
Collapse
|
12
|
Smith J, Sun Y, Hijano DR, Hoffman JM, Hakim H, Webby RJ, Hayden RT, Gaur AH, Armstrong GT, Mori M, Tang L. covidscreen: a web app and R Package for assessing asymptomatic COVID-19 testing strategies. BMC Public Health 2022; 22:1361. [PMID: 35840948 PMCID: PMC9284969 DOI: 10.1186/s12889-022-13718-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2022] [Accepted: 06/29/2022] [Indexed: 11/29/2022] Open
Abstract
Background COVID-19 has caused over 305 million infections and nearly 5.5 million deaths globally. With complete eradication unlikely, organizations will need to evaluate their risk and the benefits of mitigation strategies, including the effects of regular asymptomatic testing. We developed a web application and R package that provides estimates and visualizations to aid the assessment of organizational infection risk and testing benefits to facilitate decision-making, which combines internal and community information with malleable assumptions. Results Our web application, covidscreen, presents estimated values of risk metrics in an intuitive graphical format. It shows the current expected number of active, primarily community-acquired infections among employees in an organization. It calculates and explains the absolute and relative risk reduction of an intervention, relative to the baseline scenario, and shows the value of testing vaccinated and unvaccinated employees. In addition, the web interface allows users to profile risk over a chosen range of input values. The performance and output are illustrated using simulations and a real-world example from the employee testing program of a pediatric oncology specialty hospital. Conclusions As the COVID-19 pandemic continues to evolve, covidscreen can assist organizations in making informed decisions about whether to incorporate covid test based screening as part of their on-campus risk-mitigation strategy. The web application, R package, and source code are freely available online (see “Availability of data and materials”). Supplementary Information The online version contains supplementary material available at 10.1186/s12889-022-13718-4.
Collapse
Affiliation(s)
- Jesse Smith
- Department of Biostatistics, St. Jude Children's Research Hospital, Mail Stop 768, 262 Danny Thomas Place, TN, 38105, Memphis, USA
| | - Yilun Sun
- Department of Biostatistics, St. Jude Children's Research Hospital, Mail Stop 768, 262 Danny Thomas Place, TN, 38105, Memphis, USA
| | - Diego R Hijano
- Department of Infectious Diseases, St. Jude Children's Research Hospital, Mail Stop 768, 262 Danny Thomas Place, TN, 38105, Memphis, USA
| | - James M Hoffman
- Department of Pharmacy and Pharmaceutical Sciences, St. Jude Children's Research Hospital, Mail Stop 768, 262 Danny Thomas Place, TN, 38105, Memphis, USA.,Department of the Office of Quality and Patient Safety, St. Jude Children's Research Hospital, Mail Stop 768, 262 Danny Thomas Place, TN, 38105, Memphis, USA
| | - Hana Hakim
- Department of Infectious Diseases, St. Jude Children's Research Hospital, Mail Stop 768, 262 Danny Thomas Place, TN, 38105, Memphis, USA
| | - Richard J Webby
- Department of Infectious Diseases, St. Jude Children's Research Hospital, Mail Stop 768, 262 Danny Thomas Place, TN, 38105, Memphis, USA
| | - Randall T Hayden
- Department of Pathology, St. Jude Children's Research Hospital, Mail Stop 768, 262 Danny Thomas Place, TN, 38105, Memphis, USA
| | - Aditya H Gaur
- Department of Infectious Diseases, St. Jude Children's Research Hospital, Mail Stop 768, 262 Danny Thomas Place, TN, 38105, Memphis, USA
| | - Gregory T Armstrong
- Department of Epidemiology and Cancer Control, St. Jude Children's Research Hospital, Mail Stop 768, 262 Danny Thomas Place, TN, 38105, Memphis, USA
| | - Motomi Mori
- Department of Biostatistics, St. Jude Children's Research Hospital, Mail Stop 768, 262 Danny Thomas Place, TN, 38105, Memphis, USA
| | - Li Tang
- Department of Biostatistics, St. Jude Children's Research Hospital, Mail Stop 768, 262 Danny Thomas Place, TN, 38105, Memphis, USA.
| |
Collapse
|
13
|
Comess S, Wang H, Holmes S, Donnat C. Statistical Modeling for Practical Pooled Testing During the COVID-19 Pandemic. Stat Sci 2022. [DOI: 10.1214/22-sts857] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Affiliation(s)
- Saskia Comess
- Saskia Comess is a PhD student, Emmett Interdisciplinary Program in Environment and Resources, Stanford University, Stanford, California
| | - Hannah Wang
- Hannah Wang is a resident physician, Department of Anatomic and Clinical Pathology, Stanford University School of Medicine, Stanford, California
| | - Susan Holmes
- Susan Holmes is a Professor, Department of Statistics, Stanford University, Stanford, California
| | - Claire Donnat
- Claire Donnat is an Assistant Professor, Department of Statistics, The University of Chicago, Chicago, Illinois
| |
Collapse
|
14
|
Hong D, Dey R, Lin X, Cleary B, Dobriban E. Group testing via hypergraph factorization applied to COVID-19. Nat Commun 2022; 13:1837. [PMID: 35383149 PMCID: PMC8983763 DOI: 10.1038/s41467-022-29389-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2021] [Accepted: 03/14/2022] [Indexed: 11/09/2022] Open
Abstract
Large scale screening is a critical tool in the life sciences, but is often limited by reagents, samples, or cost. An important recent example is the challenge of achieving widespread COVID-19 testing in the face of substantial resource constraints. To tackle this challenge, screening methods must efficiently use testing resources. However, given the global nature of the pandemic, they must also be simple (to aid implementation) and flexible (to be tailored for each setting). Here we propose HYPER, a group testing method based on hypergraph factorization. We provide theoretical characterizations under a general statistical model, and carefully evaluate HYPER with alternatives proposed for COVID-19 under realistic simulations of epidemic spread and viral kinetics. We find that HYPER matches or outperforms the alternatives across a broad range of testing-constrained environments, while also being simpler and more flexible. We provide an online tool to aid lab implementation: http://hyper.covid19-analysis.org .
Collapse
Affiliation(s)
- David Hong
- Department of Statistics and Data Science, The Wharton School, University of Pennsylvania, Philadelphia, PA, USA
| | - Rounak Dey
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Xihong Lin
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA.
- Department of Statistics, Harvard University, Cambridge, MA, USA.
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA.
| | - Brian Cleary
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA.
| | - Edgar Dobriban
- Department of Statistics and Data Science, The Wharton School, University of Pennsylvania, Philadelphia, PA, USA.
| |
Collapse
|
15
|
Real-world Evaluation of a Sample Pooling Strategy for Large-Scale Rapid COVID-19 Testing. J Clin Virol 2022; 149:105133. [PMID: 35338949 PMCID: PMC8923011 DOI: 10.1016/j.jcv.2022.105133] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2022] [Revised: 03/08/2022] [Accepted: 03/13/2022] [Indexed: 12/23/2022]
Abstract
Background The worldwide outbreak of COVID-19 has become a public health crisis of unprecedented proportions. The fast spread of emerging variants increases the needs of rapid diagnostic and screening testing. Sample pooling efficiently expands the testing capacity under limited resources. Objectives We evaluated the performance of sample pooling on the Point-of-Care (POC) Liat® and cobas® 6800 systems and provided real-world experiences for implementing these systems in large-scale screenings. Methods Positive nasopharyngeal (NP) specimens with Ct values < 25, 25∼30 or > 30 were tested individually and in pools to optimize the POC Liat® and cobas® 6800 systems, which were then implemented in community screenings. Results The 5-sample pooling strategy did not affect the positive detection rates on Liat® or cobas® 6800 in samples with Ct values <25 or 25∼30. However, in samples with low viral loads (Ct values >30), five-sample pooling has a higher positive detection rate on POC Liat® (20/20; 100%), compared to cobas® 6800 (9/20; 45%). Five-sample pooled on POC Liat® and two-sample pooled on cobas® 6800 appear to be appropriate for SARS-CoV-2 detection. By implementing the pooling strategies in two large-scale community screenings, 7,606 NP specimens was tested within 36 h; the average turn-around time was 4.8 h for cobas® 6800 and 1.3 h for POC Liat®. Eight positive specimens (0.11%; 8/7,606) were identified, with Ct values ranging from 18.85 to 37.68. Conclusion The performance of sample pooling on POC Liat® was demonstrated to be an effective, accurate, and economical approach for large-scale community screenings for COVID-19.
Collapse
|
16
|
El Hajj H, Bish DR, Bish EK, Aprahamian H. Screening multi-dimensional heterogeneous populations for infectious diseases under scarce testing resources, with application to COVID-19. NAVAL RESEARCH LOGISTICS 2022; 69:3-20. [PMID: 38607835 PMCID: PMC8251476 DOI: 10.1002/nav.21985] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/27/2020] [Revised: 01/22/2021] [Accepted: 01/25/2021] [Indexed: 05/12/2023]
Abstract
Testing provides essential information for managing infectious disease outbreaks, such as the COVID-19 pandemic. When testing resources are scarce, an important managerial decision is who to test. This decision is compounded by the fact that potential testing subjects are heterogeneous in multiple dimensions that are important to consider, including their likelihood of being disease-positive, and how much potential harm would be averted through testing and the subsequent interventions. To increase testing coverage, pooled testing can be utilized, but this comes at a cost of increased false-negatives when the test is imperfect. Then, the decision problem is to partition the heterogeneous testing population into three mutually exclusive sets: those to be individually tested, those to be pool tested, and those not to be tested. Additionally, the subjects to be pool tested must be further partitioned into testing pools, potentially containing different numbers of subjects. The objectives include the minimization of harm (through detection and mitigation) or maximization of testing coverage. We develop data-driven optimization models and algorithms to design pooled testing strategies, and show, via a COVID-19 contact tracing case study, that the proposed testing strategies can substantially outperform the current practice used for COVID-19 contact tracing (individually testing those contacts with symptoms). Our results demonstrate the substantial benefits of optimizing the testing design, while considering the multiple dimensions of population heterogeneity and the limited testing capacity.
Collapse
Affiliation(s)
- Hussein El Hajj
- Department of Industrial and Systems EngineeringVirginia TechBlacksburgVirginiaUSA
| | - Douglas R. Bish
- Department of Information Systems, Statistics, and Management ScienceUniversity of AlabamaTuscaloosaAlabamaUSA
| | - Ebru K. Bish
- Department of Information Systems, Statistics, and Management ScienceUniversity of AlabamaTuscaloosaAlabamaUSA
| | - Hrayer Aprahamian
- Department of Industrial and Systems EngineeringTexas A&MCollege StationTexasUSA
| |
Collapse
|
17
|
Augenblick N, Kolstad J, Obermeyer Z, Wang A. Pooled testing efficiency increases with test frequency. Proc Natl Acad Sci U S A 2022; 119:e2105180119. [PMID: 34983870 PMCID: PMC8764680 DOI: 10.1073/pnas.2105180119] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/04/2021] [Indexed: 12/02/2022] Open
Abstract
Pooled testing increases efficiency by grouping individual samples and testing the combined sample, such that many individuals can be cleared with one negative test. This short paper demonstrates that pooled testing is particularly advantageous in the setting of pandemics, given repeated testing, rapid spread, and uncertain risk. Repeated testing mechanically lowers the infection probability at the time of the next test by removing positives from the population. This effect alone means that increasing frequency by x times only increases expected tests by around [Formula: see text] However, this calculation omits a further benefit of frequent testing: Removing infections from the population lowers intragroup transmission, which lowers infection probability and generates further efficiency. For this reason, increasing testing frequency can paradoxically reduce total testing cost. Our calculations are based on the assumption that infection rates are known, but predicting these rates is challenging in a fast-moving pandemic. However, given that frequent testing naturally suppresses the mean and variance of infection rates, we show that our results are very robust to uncertainty and misprediction. Finally, we note that efficiency further increases given natural sampling pools (e.g., workplaces, classrooms) that induce correlated risk via local transmission. We conclude that frequent pooled testing using natural groupings is a cost-effective way to provide consistent testing of a population to suppress infection risk in a pandemic.
Collapse
Affiliation(s)
- Ned Augenblick
- Haas School of Business, University of California, Berkeley, CA 94720;
| | - Jonathan Kolstad
- Haas School of Business, University of California, Berkeley, CA 94720
- Department of Economics, University of California, Berkeley, CA 94720
| | - Ziad Obermeyer
- School of Public Health, University of California, Berkeley, CA 94704
| | - Ao Wang
- Department of Economics, University of California, Berkeley, CA 94720
| |
Collapse
|
18
|
Tatsuoka C, Chen W, Lu X. BAYESIAN GROUP TESTING WITH DILUTION EFFECTS. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2021:2021.01.15.21249894. [PMID: 33501464 PMCID: PMC7836136 DOI: 10.1101/2021.01.15.21249894] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
A Bayesian framework for group testing under dilution effects has been developed, using lattice-based models. This work has particular relevance given the pressing public health need to enhance testing capacity for COVID-19 and future pandemics, and the need for wide-scale and repeated testing for surveillance under constantly varying conditions. The proposed Bayesian approach allows for dilution effects in group testing and for general test response distributions beyond just binary outcomes. It is shown that even under strong dilution effects, an intuitive group testing selection rule that relies on the model order structure, referred to as the Bayesian halving algorithm, has attractive optimal convergence properties. Analogous look-ahead rules that can reduce the number of stages in classification by selecting several pooled tests at a time are proposed and evaluated as well. Group testing is demonstrated to provide great savings over individual testing in the number of tests needed, even for moderately high prevalence levels. However, there is a trade-off with higher number of testing stages, and increased variability. A web-based calculator is introduced to assist in weighing these factors and to guide decisions on when and how to pool under various conditions. High performance distributed computing methods have also been implemented for considering larger pool sizes, when savings from group testing can be even more dramatic.
Collapse
|
19
|
Warasi MS. groupTesting: an R package for group testing estimation. COMMUN STAT-SIMUL C 2021. [DOI: 10.1080/03610918.2021.2009867] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
Affiliation(s)
- Md S. Warasi
- Department of Mathematics and Statistics, Radford University, Radford, VA, USA
| |
Collapse
|
20
|
Daon Y, Huppert A, Obolski U. An accurate model for SARS-CoV-2 pooled RT-PCR test errors. ROYAL SOCIETY OPEN SCIENCE 2021; 8:210704. [PMID: 34737873 PMCID: PMC8564620 DOI: 10.1098/rsos.210704] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/21/2021] [Accepted: 10/05/2021] [Indexed: 06/13/2023]
Abstract
Pooling is a method of simultaneously testing multiple samples for the presence of pathogens. Pooling of SARS-CoV-2 tests is increasing in popularity, due to its high testing throughput. A popular pooling scheme is Dorfman pooling: test N individuals simultaneously, if the test is positive, each individual is then tested separately; otherwise, all are declared negative. Most analyses of the error rates of pooling schemes assume that including more than a single infected sample in a pooled test does not increase the probability of a positive outcome. We challenge this assumption with experimental data and suggest a novel and parsimonious probabilistic model for the outcomes of pooled tests. As an application, we analyse the false-negative rate (i.e. the probability of a negative result for an infected individual) of Dorfman pooling. We show that the false-negative rates under Dorfman pooling increase when the prevalence of infection decreases. However, low infection prevalence is exactly the condition when Dorfman pooling achieves highest throughput efficiency. We therefore urge the cautious use of pooling and development of pooling schemes that consider correctly accounting for tests' error rates.
Collapse
Affiliation(s)
- Yair Daon
- School of Public Health, The Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
- Porter School of the Environment and Earth Sciences, The Faculty of Exact Sciences, Tel Aviv University, Tel Aviv, Israel
| | - Amit Huppert
- School of Public Health, The Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
- The Biostatistics and Biomathematics Unit, The Gertner Institute for Epidemiology and Health Policy Research, Sheba Medical Center, Tel Hashomer, 52621 Ramat Gan, Israel
| | - Uri Obolski
- School of Public Health, The Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
- The Biostatistics and Biomathematics Unit, The Gertner Institute for Epidemiology and Health Policy Research, Sheba Medical Center, Tel Hashomer, 52621 Ramat Gan, Israel
| |
Collapse
|
21
|
Liu Y, McMahan CS, Tebbs JM, Gallagher CM, Bilder CR. Generalized additive regression for group testing data. Biostatistics 2021; 22:873-889. [PMID: 32061081 PMCID: PMC8511943 DOI: 10.1093/biostatistics/kxaa003] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2019] [Revised: 01/04/2020] [Accepted: 01/13/2020] [Indexed: 11/13/2022] Open
Abstract
In screening applications involving low-prevalence diseases, pooling specimens (e.g., urine, blood, swabs, etc.) through group testing can be far more cost effective than testing specimens individually. Estimation is a common goal in such applications and typically involves modeling the probability of disease as a function of available covariates. In recent years, several authors have developed regression methods to accommodate the complex structure of group testing data but often under the assumption that covariate effects are linear. Although linearity is a reasonable assumption in some applications, it can lead to model misspecification and biased inference in others. To offer a more flexible framework, we propose a Bayesian generalized additive regression approach to model the individual-level probability of disease with potentially misclassified group testing data. Our approach can be used to analyze data arising from any group testing protocol with the goal of estimating multiple unknown smooth functions of covariates, standard linear effects for other covariates, and assay classification accuracy probabilities. We illustrate the methods in this article using group testing data on chlamydia infection in Iowa.
Collapse
Affiliation(s)
- Yan Liu
- School of Community Health Sciences, University of Nevada, Reno, 1664 N. Virginia St, Reno, NV 89557, USA
| | - Christopher S McMahan
- School of Mathematical and Statistical Sciences, Clemson University, O-110 Martin Hall, Box 340975, Clemson, SC 29634, USA
| | - Joshua M Tebbs
- Department of Statistics, University of South Carolina, 1523 Greene St, Columbia, SC 29208, USA
| | - Colin M Gallagher
- School of Mathematical and Statistical Sciences, Clemson University, O-110 Martin Hall, Box 340975, Clemson, SC 29634, USA
| | - Christopher R Bilder
- Department of Statistics, University of Nebraska-Lincoln, 340 Hardin Hall North, Lincoln, NE 68583, USA
| |
Collapse
|
22
|
Armendáriz I, Ferrari PA, Fraiman D, Martínez JM, Menzella HG, Ponce Dawson S. Nested pool testing strategy for the diagnosis of infectious diseases. Sci Rep 2021; 11:18108. [PMID: 34518603 PMCID: PMC8438083 DOI: 10.1038/s41598-021-97534-7] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2021] [Accepted: 08/17/2021] [Indexed: 11/18/2022] Open
Abstract
The progress of the SARS-CoV-2 pandemic requires the design of large-scale, cost-effective testing programs. Pooling samples provides a solution if the tests are sensitive enough. In this regard, the use of the gold standard, RT-qPCR, raises some concerns. Recently, droplet digital PCR (ddPCR) was shown to be 10-100 times more sensitive than RT-qPCR, making it more suitable for pooling. Furthermore, ddPCR quantifies the RNA content directly, a feature that, as we show, can be used to identify nonviable samples in pools. Cost-effective strategies require the definition of efficient deconvolution and re-testing procedures. In this paper we analyze the practical implementation of an efficient hierarchical pooling strategy for which we have recently derived the optimal, determining the best ways to proceed when there are impediments for the use of the absolute optimum or when multiple pools are tested simultaneously and there are restrictions on the throughput time. We also show how the ddPCR RNA quantification and the nested nature of the strategy can be combined to perform self-consistency tests for a better identification of infected individuals and nonviable samples. The studies are useful to those considering pool testing for the identification of infected individuals.
Collapse
Affiliation(s)
- Inés Armendáriz
- Departamento de Matemática & IMAS, FCEN-UBA & CONICET, C1428EGA, Buenos Aires, Argentina
| | - Pablo A Ferrari
- Departamento de Matemática & IMAS, FCEN-UBA & CONICET, C1428EGA, Buenos Aires, Argentina
| | - Daniel Fraiman
- Departamento de Matemática y Ciencias, Universidad de San Andrés & CONICET, B1644BID, Victoria, Argentina
| | - José M Martínez
- Department of Applied Mathematics, IMECC-UNICAMP, Campinas, 13083-859, Brazil
| | - Hugo G Menzella
- Departamento de Tecnología & IPROByQ, FBIOyF-UNR & CONICET, S2002LRK, Rosario, Argentina
| | - Silvina Ponce Dawson
- Departamento de Física & IFIBA, FCEN-UBA & CONICET, C1428EGA, Buenos Aires, Argentina.
| |
Collapse
|
23
|
Brand A, May S, Hughes JP, Nakigozi G, Reynolds SJ, Gabriel EE. Prediction-driven pooled testing methods: Application to HIV treatment monitoring in Rakai, Uganda. Stat Med 2021; 40:4185-4199. [PMID: 34046930 PMCID: PMC8487918 DOI: 10.1002/sim.9022] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2020] [Revised: 04/19/2021] [Accepted: 04/21/2021] [Indexed: 11/11/2022]
Abstract
Chronic medical conditions often necessitate regular testing for proper treatment. Regular testing of all afflicted individuals may not be feasible due to limited resources, as is true with HIV monitoring in resource-limited settings. Pooled testing methods have been developed in order to allow regular testing for all while reducing resource burden. However, the most commonly used methods do not make use of covariate information predictive of treatment failure, which could improve performance. We propose and evaluate four prediction-driven pooled testing methods that incorporate covariate information to improve pooled testing performance. We then compare these methods in the HIV treatment management setting to current methods with respect to testing efficiency, sensitivity, and number of testing rounds using simulated data and data collected in Rakai, Uganda. Results show that the prediction-driven methods increase efficiency by up to 20% compared with current methods while maintaining equivalent sensitivity and reducing number of testing rounds by up to 70%. When predictions were incorrect, the performance of prediction-based matrix methods remained robust. The best performing method using our motivating data from Rakai was a prediction-driven hybrid method, maintaining sensitivity over 96% and efficiency over 75% in likely scenarios. If these methods perform similarly in the field, they may contribute to improving mortality and reducing transmission in resource-limited settings. Although we evaluate our proposed pooling methods in the HIV treatment setting, they can be applied to any setting that necessitates testing of a quantitative biomarker for a threshold-based decision.
Collapse
Affiliation(s)
- Adam Brand
- Department of Medical Epidemiology and Biostatistics, Karolinska Institute, Sweden
| | - Susanne May
- Department of Biostatistics, University of Washington, Seattle, Washington, USA
| | - James P. Hughes
- Department of Biostatistics, University of Washington, Seattle, Washington, USA
| | | | - Steven J. Reynolds
- Johns Hopkins University, School of Medicine, Baltimore, MD, USA
- Division of Intramural Research, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD, USA
| | - Erin E. Gabriel
- Department of Medical Epidemiology and Biostatistics, Karolinska Institute, Sweden
| |
Collapse
|
24
|
Verdun CM, Fuchs T, Harar P, Elbrächter D, Fischer DS, Berner J, Grohs P, Theis FJ, Krahmer F. Group Testing for SARS-CoV-2 Allows for Up to 10-Fold Efficiency Increase Across Realistic Scenarios and Testing Strategies. Front Public Health 2021; 9:583377. [PMID: 34490172 PMCID: PMC8416485 DOI: 10.3389/fpubh.2021.583377] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2020] [Accepted: 07/26/2021] [Indexed: 11/24/2022] Open
Abstract
Background: Due to the ongoing COVID-19 pandemic, demand for diagnostic testing has increased drastically, resulting in shortages of necessary materials to conduct the tests and overwhelming the capacity of testing laboratories. The supply scarcity and capacity limits affect test administration: priority must be given to hospitalized patients and symptomatic individuals, which can prevent the identification of asymptomatic and presymptomatic individuals and hence effective tracking and tracing policies. We describe optimized group testing strategies applicable to SARS-CoV-2 tests in scenarios tailored to the current COVID-19 pandemic and assess significant gains compared to individual testing. Methods: We account for biochemically realistic scenarios in the context of dilution effects on SARS-CoV-2 samples and consider evidence on specificity and sensitivity of PCR-based tests for the novel coronavirus. Because of the current uncertainty and the temporal and spatial changes in the prevalence regime, we provide analysis for several realistic scenarios and propose fast and reliable strategies for massive testing procedures. Key Findings: We find significant efficiency gaps between different group testing strategies in realistic scenarios for SARS-CoV-2 testing, highlighting the need for an informed decision of the pooling protocol depending on estimated prevalence, target specificity, and high- vs. low-risk population. For example, using one of the presented methods, all 1.47 million inhabitants of Munich, Germany, could be tested using only around 141 thousand tests if the infection rate is below 0.4% is assumed. Using 1 million tests, the 6.69 million inhabitants from the city of Rio de Janeiro, Brazil, could be tested as long as the infection rate does not exceed 1%. Moreover, we provide an interactive web application, available at www.grouptexting.com, for visualizing the different strategies and designing pooling schemes according to specific prevalence scenarios and test configurations. Interpretation: Altogether, this work may help provide a basis for an efficient upscaling of current testing procedures, which takes the population heterogeneity into account and is fine-grained towards the desired study populations, e.g., mild/asymptomatic individuals vs. symptomatic ones but also mixtures thereof. Funding: German Science Foundation (DFG), German Federal Ministry of Education and Research (BMBF), Chan Zuckerberg Initiative DAF, and Austrian Science Fund (FWF).
Collapse
Affiliation(s)
- Claudio M. Verdun
- Department of Mathematics, Technical University of Munich, Garching, Germany
- Department of Electrical and Computer Engineering, Technical University of Munich, Munich, Germany
| | - Tim Fuchs
- Department of Mathematics, Technical University of Munich, Garching, Germany
| | - Pavol Harar
- Research Network Data Science, University of Vienna, Vienna, Austria
- Department of Telecommunications, Brno University of Technology, Brno, Czechia
| | | | - David S. Fischer
- Institute of Computational Biology, Helmholtz Zentrum München, Munich, Germany
| | - Julius Berner
- Faculty of Mathematics, University of Vienna, Vienna, Austria
| | - Philipp Grohs
- Research Network Data Science, University of Vienna, Vienna, Austria
- Faculty of Mathematics, University of Vienna, Vienna, Austria
- Johann Radon Institute for Computational and Applied Mathematics, Austrian Academy of Sciences, Linz, Austria
| | - Fabian J. Theis
- Department of Mathematics, Technical University of Munich, Garching, Germany
- Institute of Computational Biology, Helmholtz Zentrum München, Munich, Germany
| | - Felix Krahmer
- Department of Mathematics, Technical University of Munich, Garching, Germany
- Munich Data Science Institute, Technical University of Munich, Garching, Germany
| |
Collapse
|
25
|
Bilder CR, Tebbs JM, McMahan CS. Discussion on "Is group testing ready for prime-time in disease identification". Stat Med 2021; 40:3881-3886. [PMID: 34251038 PMCID: PMC8441930 DOI: 10.1002/sim.8988] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2021] [Accepted: 03/25/2021] [Indexed: 11/08/2022]
Affiliation(s)
- Christopher R Bilder
- Department of Statistics, University of Nebraska-Lincoln, Lincoln, Nebraska, USA
| | - Joshua M Tebbs
- Department of Statistics, University of South Carolina, Columbia, South Carolina, USA
| | - Christopher S McMahan
- School of Mathematical and Statistical Sciences, Clemson University, Clemson, South Carolina, USA
| |
Collapse
|
26
|
Daon Y, Huppert A, Obolski U. DOPE: D-Optimal Pooling Experimental design with application for SARS-CoV-2 screening. J Am Med Inform Assoc 2021; 28:2562-2570. [PMID: 34343285 PMCID: PMC8436773 DOI: 10.1093/jamia/ocab169] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2021] [Revised: 06/16/2021] [Accepted: 07/31/2021] [Indexed: 11/13/2022] Open
Abstract
OBJECTIVE Testing individuals for the presence of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the pathogen causing the coronavirus disease 2019 (COVID-19), is crucial for curtailing transmission chains. Moreover, rapidly testing many potentially infected individuals is often a limiting factor in controlling COVID-19 outbreaks. Hence, pooling strategies, wherein individuals are grouped and tested simultaneously, are employed. Here, we present a novel pooling strategy that builds on the Bayesian D-optimal experimental design criterion. MATERIALS AND METHODS Our strategy, called DOPE (D-Optimal Pooling Experimental design), is built on a novel Bayesian formulation of pooling. DOPE defines optimal pooled tests as those maximizing the mutual information between data and infection states. We estimate said mutual information via Monte-Carlo sampling and employ a discrete optimization heuristic to maximize it. RESULTS We compare DOPE to other, commonly used pooling strategies, as well as to individual testing. DOPE dominates the other strategies as it yields lower error rates while utilizing fewer tests. We show that DOPE maintains this dominance for a variety of infection prevalence values. DISCUSSION DOPE has several additional advantages over common pooling strategies: it provides posterior distributions of the probability of infection, rather than only binary classification outcomes; it naturally incorporates prior information of infection probabilities and test error rates; and finally, it can be easily extended to include other, newly discovered information regarding COVID-19. CONCLUSION DOPE can substantially improve accuracy and throughput over current pooling strategies. Hence, DOPE can facilitate rapid testing and aid the efforts of combating COVID-19 and other future pandemics.
Collapse
Affiliation(s)
- Yair Daon
- School of Public Health, Tel Aviv University, Tel Aviv, Israel.,Porter School of the Environment and Earth Sciences, Tel Aviv University, Tel Aviv, Israel
| | - Amit Huppert
- School of Public Health, Tel Aviv University, Tel Aviv, Israel.,The Gertner Institute for Epidemiology and Health Policy Research, Tel Hashomer, Israel
| | - Uri Obolski
- School of Public Health, Tel Aviv University, Tel Aviv, Israel.,Porter School of the Environment and Earth Sciences, Tel Aviv University, Tel Aviv, Israel
| |
Collapse
|
27
|
Hou P, Tebbs JM, Wang D, McMahan CS, Bilder CR. Array testing for multiplex assays. Biostatistics 2021; 21:417-431. [PMID: 30371749 DOI: 10.1093/biostatistics/kxy058] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2018] [Revised: 08/31/2018] [Accepted: 09/09/2018] [Indexed: 11/13/2022] Open
Abstract
Group testing involves pooling individual specimens (e.g., blood, urine, swabs, etc.) and testing the pools for the presence of disease. When the proportion of diseased individuals is small, group testing can greatly reduce the number of tests needed to screen a population. Statistical research in group testing has traditionally focused on applications for a single disease. However, blood service organizations and large-scale disease surveillance programs are increasingly moving towards the use of multiplex assays, which measure multiple disease biomarkers at once. Tebbs and others (2013, Two-stage hierarchical group testing for multiple infections with application to the Infertility Prevention Project. Biometrics69, 1064-1073) and Hou and others (2017, Hierarchical group testing for multiple infections. Biometrics73, 656-665) were the first to examine hierarchical group testing case identification procedures for multiple diseases. In this article, we propose new non-hierarchical procedures which utilize two-dimensional arrays. We derive closed-form expressions for the expected number of tests per individual and classification accuracy probabilities and show that array testing can be more efficient than hierarchical procedures when screening individuals for multiple diseases at once. We illustrate the potential of using array testing in the detection of chlamydia and gonorrhea for a statewide screening program in Iowa. Finally, we describe an R/Shiny application that will help practitioners identify the best multiple-disease case identification algorithm.
Collapse
Affiliation(s)
- Peijie Hou
- Statistical and Quantitative Sciences, Takeda Pharmaceutical Inc., 300 Massachusetts Avenue, Cambridge, MA, USA
| | - Joshua M Tebbs
- Department of Statistics, University of South Carolina, 1523 Greene St, Columbia, SC, USA
| | - Dewei Wang
- Department of Statistics, University of South Carolina, 1523 Greene St, Columbia, SC, USA
| | - Christopher S McMahan
- School of Mathematical and Statistical Sciences, Clemson University, O-110 Martin Hall, Clemson, SC, USA
| | - Christopher R Bilder
- Department of Statistics, University of Nebraska-Lincoln, 340 Hardin Hall North, Lincoln, NE, USA
| |
Collapse
|
28
|
Haber G, Malinovsky Y, Albert PS. Is group testing ready for prime-time in disease identification? Stat Med 2021; 40:3865-3880. [PMID: 33913183 DOI: 10.1002/sim.9003] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2020] [Revised: 02/19/2021] [Accepted: 03/22/2021] [Indexed: 12/18/2022]
Abstract
Large-scale disease screening is a complicated process in which high costs must be balanced against pressing public health needs. When the goal is screening for infectious disease, one approach is group testing in which samples are initially tested in pools and individual samples are retested only if the initial pooled test was positive. Intuitively, if the prevalence of infection is small, this could result in a large reduction of the total number of tests required. Despite this, the use of group testing in medical studies has been limited, largely due to skepticism about the impact of pooling on the accuracy of a given assay. While there is a large body of research addressing the issue of testing errors in group testing studies, it is customary to assume that the misclassification parameters are known from an external population and/or that the values do not change with the group size. Both of these assumptions are highly questionable for many medical practitioners considering group testing in their study design. In this article, we explore how the failure of these assumptions might impact the efficacy of a group testing design and, consequently, whether group testing is currently feasible for medical screening. Specifically, we look at how incorrect assumptions about the sensitivity function at the design stage can lead to poor estimation of a procedure's overall sensitivity and expected number of tests. Furthermore, if a validation study is used to estimate the pooled misclassification parameters of a given assay, we show that the sample sizes required are so large as to be prohibitive in all but the largest screening programs.
Collapse
Affiliation(s)
- Gregory Haber
- Biostatistics Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, Maryland, USA
| | - Yaakov Malinovsky
- Department of Mathematics and Statistics, University of Maryland, Baltimore County, Baltimore, Maryland, USA
| | - Paul S Albert
- Biostatistics Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, Maryland, USA
| |
Collapse
|
29
|
Hu Y, Walter SD, Hepworth G. Estimating proportions by group retesting with unequal group sizes at each stage. COMMUN STAT-THEOR M 2021. [DOI: 10.1080/03610926.2021.1900253] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
Affiliation(s)
- Yusang Hu
- Mathematics and Statistics, McMaster University, Hamilton, ON, Canada
| | - S. D. Walter
- Health Research Methods, Evidence, and Impact, McMaster University, Hamilton, ON, Canada
| | - Graham Hepworth
- Mathematics & Statistics, The University of Melbourne, Melbourne, VIC, Australia
| |
Collapse
|
30
|
Bilder CR, Tebbs JM, McMahan CS. Informative array testing with multiplex assays. Stat Med 2021; 40:3021-3034. [PMID: 33763901 DOI: 10.1002/sim.8954] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2020] [Revised: 02/12/2021] [Accepted: 03/01/2021] [Indexed: 11/07/2022]
Abstract
High-volume testing of clinical specimens for sexually transmitted diseases is performed frequently by a process known as group testing. This algorithmic process involves testing portions of specimens from separate individuals together as one unit (or "group") to detect diseases. Retesting is performed on groups that test positively in order to differentiate between positive and negative individual specimens. The overall goal is to use the least number of tests possible across all individuals without sacrificing diagnostic accuracy. One of the most efficient group testing algorithms is array testing. In its simplest form, specimens are arranged into a grid-like structure so that row and column groups can be formed. Positive-testing rows/columns indicate which specimens to retest. With the growing use of multiplex assays, the increasing number of diseases tested by these assays, and the availability of subject-specific risk information, opportunities exist to make this testing process even more efficient. We propose specific specimen arrangements within an array that can reduce the number of retests needed when compared with other array testing algorithms. We examine how to calculate operating characteristics, including the expected number of tests and the SD for the number of tests, and then subsequently find a best arrangement. Our methods are illustrated for chlamydia and gonorrhea detection with the Aptima Combo 2 Assay. We also provide R functions to make our research accessible to laboratories.
Collapse
Affiliation(s)
- Christopher R Bilder
- Department of Statistics, University of Nebraska-Lincoln, Lincoln, Nebraska, USA
| | - Joshua M Tebbs
- Department of Statistics, University of South Carolina, Columbia, South Carolina, USA
| | - Christopher S McMahan
- School of Mathematical and Statistical Sciences, Clemson University, Clemson, South Carolina, USA
| |
Collapse
|
31
|
Bilder CR, Iwen PC, Abdalhamid B. Pool Size Selection When Testing for Severe Acute Respiratory Syndrome Coronavirus 2. Clin Infect Dis 2021; 72:1104-1105. [PMID: 32544948 PMCID: PMC7337646 DOI: 10.1093/cid/ciaa774] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Affiliation(s)
- Christopher R Bilder
- Department of Statistics, University of Nebraska-Lincoln, Lincoln, Nebraska, USA
| | - Peter C Iwen
- Department of Pathology and Microbiology, University of Nebraska Medical Center, Omaha, Nebraska, USA.,Nebraska Public Health Laboratory, University of Nebraska Medical Center, Omaha, Nebraska, USA
| | - Baha Abdalhamid
- Department of Pathology and Microbiology, University of Nebraska Medical Center, Omaha, Nebraska, USA.,Nebraska Public Health Laboratory, University of Nebraska Medical Center, Omaha, Nebraska, USA
| |
Collapse
|
32
|
Mokalled SC, McMahan CS, Tebbs JM, Andrew Brown D, Bilder CR. Incorporating the dilution effect in group testing regression. Stat Med 2021; 40:2540-2555. [PMID: 33598950 DOI: 10.1002/sim.8916] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2020] [Revised: 11/25/2020] [Accepted: 02/03/2021] [Indexed: 11/10/2022]
Abstract
When screening for infectious diseases, group testing has proven to be a cost efficient alternative to individual level testing. Cost savings are realized by testing pools of individual specimens (eg, blood, urine, saliva, and so on) rather than by testing the specimens separately. However, a common concern that arises in group testing is the so-called "dilution effect." This occurs if the signal from a positive individual's specimen is diluted past an assay's threshold of detection when it is pooled with multiple negative specimens. In this article, we propose a new statistical framework for group testing data that merges estimation and case identification, which are often treated separately in the literature. Our approach considers analyzing continuous biomarker levels (eg, antibody levels, antigen concentrations, and so on) from pooled samples to estimate both a binary regression model for the probability of disease and the biomarker distributions for cases and controls. To increase case identification accuracy, we then show how estimates of the biomarker distributions can be used to select diagnostic thresholds on a pool-by-pool basis. Our proposals are evaluated through numerical studies and are illustrated using hepatitis B virus data collected on a prison population in Ireland.
Collapse
Affiliation(s)
- Stefani C Mokalled
- School of Mathematical and Statistical Sciences, Clemson University, Clemson, South Carolina, USA
| | - Christopher S McMahan
- School of Mathematical and Statistical Sciences, Clemson University, Clemson, South Carolina, USA
| | - Joshua M Tebbs
- Department of Statistics, University of South Carolina, Columbia, South Carolina, USA
| | - Derek Andrew Brown
- School of Mathematical and Statistical Sciences, Clemson University, Clemson, South Carolina, USA
| | - Christopher R Bilder
- Department of Statistics, University of Nebraska-Lincoln, Lincoln, Nebraska, USA
| |
Collapse
|
33
|
Ambrosis N, Martin Aispuro P, Belhart K, Bottero D, Crisp RL, Dansey MV, Gabrielli M, Filevich O, Genoud V, Giordano A, Lin MC, Lodeiro A, Marceca F, Pregi N, Lenicov FR, Rocha-Viegas L, Rudi E, Solovey G, Zurita E, Pecci A, Etchenique R, Hozbor D. Active Surveillance of Asymptomatic, Presymptomatic, and Oligosymptomatic SARS-CoV-2-Infected Individuals in Communities Inhabiting Closed or Semi-closed Institutions. Front Med (Lausanne) 2021; 8:640688. [PMID: 33614689 PMCID: PMC7889965 DOI: 10.3389/fmed.2021.640688] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2020] [Accepted: 01/11/2021] [Indexed: 12/23/2022] Open
Abstract
Background: The high COVID-19 dissemination rate demands active surveillance to identify asymptomatic, presymptomatic, and oligosymptomatic (APO) SARS-CoV-2-infected individuals. This is of special importance in communities inhabiting closed or semi-closed institutions such as residential care homes, prisons, neuropsychiatric hospitals, etc., where risk people are in close contact. Thus, a pooling approach-where samples are mixed and tested as single pools-is an attractive strategy to rapidly detect APO-infected in these epidemiological scenarios. Materials and Methods: This study was done at different pandemic periods between May 28 and August 31 2020 in 153 closed or semi-closed institutions in the Province of Buenos Aires (Argentina). We setup pooling strategy in two stages: first a pool-testing followed by selective individual-testing according to pool results. Samples included in negative pools were presumed as negative, while samples from positive pools were re-tested individually for positives identification. Results: Sensitivity in 5-sample or 10-sample pools was adequate since only 2 Ct values were increased with regard to single tests on average. Concordance between 5-sample or 10-sample pools and individual-testing was 100% in the Ct ≤ 36. We tested 4,936 APO clinical samples in 822 pools, requiring 86-50% fewer tests in low-to-moderate prevalence settings compared to individual testing. Conclusions: By this strategy we detected three COVID-19 outbreaks at early stages in these institutions, helping to their containment and increasing the likelihood of saving lives in such places where risk groups are concentrated.
Collapse
Affiliation(s)
- Nicolás Ambrosis
- Laboratorio VacSal, Facultad de Ciencias Exactas, Instituto de Biotecnología y Biología Molecular, Universidad Nacional de La Plata y CCT La Plata-CONICET, La Plata, Argentina
| | - Pablo Martin Aispuro
- Laboratorio VacSal, Facultad de Ciencias Exactas, Instituto de Biotecnología y Biología Molecular, Universidad Nacional de La Plata y CCT La Plata-CONICET, La Plata, Argentina
| | - Keila Belhart
- Laboratorio VacSal, Facultad de Ciencias Exactas, Instituto de Biotecnología y Biología Molecular, Universidad Nacional de La Plata y CCT La Plata-CONICET, La Plata, Argentina
| | - Daniela Bottero
- Laboratorio VacSal, Facultad de Ciencias Exactas, Instituto de Biotecnología y Biología Molecular, Universidad Nacional de La Plata y CCT La Plata-CONICET, La Plata, Argentina
| | - Renée Leonor Crisp
- Universidad de Buenos Aires, Facultad de Ciencias Exactas y Naturales, Departamento de Química Biológica, Buenos Aires, Argentina
| | - María Virginia Dansey
- Universidad de Buenos Aires, Facultad de Ciencias Exactas y Naturales, Departamento de Química Biológica, Buenos Aires, Argentina
- Universidad de Buenos Aires, Facultad de Ciencias Exactas y Naturales, Unidad de Microanálisis y Métodos Físicos en Química Orgánica, CONICET, Buenos Aires, Argentina
| | - Magali Gabrielli
- Laboratorio VacSal, Facultad de Ciencias Exactas, Instituto de Biotecnología y Biología Molecular, Universidad Nacional de La Plata y CCT La Plata-CONICET, La Plata, Argentina
| | - Oscar Filevich
- Escuela de Ciencia y Tecnología de la Universidad Nacional de San Martín, CONICET, Buenos Aires, Argentina
| | - Valeria Genoud
- Universidad de Buenos Aires, Facultad de Ciencias Exactas y Naturales, Departamento de Química Biológica, Buenos Aires, Argentina
| | - Alejandra Giordano
- Laboratorio VacSal, Facultad de Ciencias Exactas, Instituto de Biotecnología y Biología Molecular, Universidad Nacional de La Plata y CCT La Plata-CONICET, La Plata, Argentina
| | - Min Chih Lin
- Universidad de Buenos Aires, Facultad de Ciencias Exactas y Naturales, Departamento de Química Biológica, Buenos Aires, Argentina
- Universidad de Buenos Aires, Facultad de Ciencias Exactas y Naturales, Instituto del Cálculo, CONICET, Buenos Aires, Argentina
| | - Anibal Lodeiro
- Laboratorio VacSal, Facultad de Ciencias Exactas, Instituto de Biotecnología y Biología Molecular, Universidad Nacional de La Plata y CCT La Plata-CONICET, La Plata, Argentina
- Laboratorio de Genética, Facultad de Ciencias Agrarias y Forestales, Universidad Nacional de La Plata, La Plata, Argentina
| | - Felipe Marceca
- Universidad de Buenos Aires, Facultad de Ciencias Exactas y Naturales, Departamento de Química Biológica, Buenos Aires, Argentina
- Universidad de Buenos Aires, Facultad de Ciencias Exactas y Naturales, Departamento de Matemática, Instituto de Investigaciones Matemáticas “Luis Santaló”, CONICET, Buenos Aires, Argentina
| | - Nicolás Pregi
- Universidad de Buenos Aires, Facultad de Ciencias Exactas y Naturales, Departamento de Química Biológica, Buenos Aires, Argentina
- Universidad de Buenos Aires, Facultad de Ciencias Exactas y Naturales, Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales, CONICET, Buenos Aires, Argentina
| | - Federico Remes Lenicov
- Universidad de Buenos Aires, Instituto de Investigaciones Biomédicas en Retrovirus y SIDA, CONICET, Buenos Aires, Argentina
| | - Luciana Rocha-Viegas
- Universidad de Buenos Aires, Facultad de Ciencias Exactas y Naturales, Departamento de Química Biológica, Buenos Aires, Argentina
- Universidad de Buenos Aires, Facultad de Ciencias Exactas y Naturales, Instituto de Fisiología, Biología Molecular y Neurociencias, CONICET, Buenos Aires, Argentina
| | - Erika Rudi
- Laboratorio VacSal, Facultad de Ciencias Exactas, Instituto de Biotecnología y Biología Molecular, Universidad Nacional de La Plata y CCT La Plata-CONICET, La Plata, Argentina
| | - Guillermo Solovey
- Universidad de Buenos Aires, Facultad de Ciencias Exactas y Naturales, Departamento de Química Biológica, Buenos Aires, Argentina
- Universidad de Buenos Aires, Facultad de Ciencias Exactas y Naturales, Instituto del Cálculo, CONICET, Buenos Aires, Argentina
| | - Eugenia Zurita
- Laboratorio VacSal, Facultad de Ciencias Exactas, Instituto de Biotecnología y Biología Molecular, Universidad Nacional de La Plata y CCT La Plata-CONICET, La Plata, Argentina
| | - Adali Pecci
- Universidad de Buenos Aires, Facultad de Ciencias Exactas y Naturales, Departamento de Química Biológica, Buenos Aires, Argentina
- Universidad de Buenos Aires, Facultad de Ciencias Exactas y Naturales, Instituto de Fisiología, Biología Molecular y Neurociencias, CONICET, Buenos Aires, Argentina
| | - Roberto Etchenique
- Universidad de Buenos Aires, Facultad de Ciencias Exactas y Naturales, Departamento de Química Biológica, Buenos Aires, Argentina
- Universidad de Buenos Aires, Facultad de Ciencias Exactas y Naturales, Departamento de Química Inorgánica, Analítica y Química Física, Instituto de Química Física de los Materiales, Medio Ambiente y Energía, CONICET, Buenos Aires, Argentina
| | - Daniela Hozbor
- Laboratorio VacSal, Facultad de Ciencias Exactas, Instituto de Biotecnología y Biología Molecular, Universidad Nacional de La Plata y CCT La Plata-CONICET, La Plata, Argentina
| |
Collapse
|
34
|
Schulte PA, Weissman DN, Luckhaupt SE, de Perio MA, Beezhold D, Piacentino JD, Radonovich LJ, Hearl FJ, Howard J. Considerations for Pooled Testing of Employees for SARS-CoV-2. J Occup Environ Med 2021; 63:1-9. [PMID: 33378322 PMCID: PMC7773162 DOI: 10.1097/jom.0000000000002049] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
OBJECTIVES To identify important background information on pooled tested of employees that employers workers, and health authorities should consider. METHODS This paper is a commentary based on the review by the authors of pertinent literature generally from preprints in medrixiv.org prior to August 2020. RESULTS/CONCLUSIONS Pooled testing may be particularly useful to employers in communities with low prevalence of COVID-19. It can be used to reduce the number of tests and associated financial costs. For effective and efficient pooled testing employers should consider it as part of a broader, more comprehensive workplace COVID-19 prevention and control program. Pooled testing of asymptomatic employees can prevent transmission of SARS-CoV-2 and help assure employers and customers that employees are not infectious.
Collapse
Affiliation(s)
- Paul A Schulte
- National Institute for Occupational Safety and Health, Centers for Disease Control and Prevention, 1090 Tusculum Avenue, Cincinnati, Ohio (Dr Schulte); National Institute for Occupational Safety and Health, Centers for Disease Control and Prevention, 1095 Willowdale Road, Morgantown, West Virginia (Dr Weissman, Dr Beezhold, Dr Radonovich); National Institute for Occupational Safety and Health, Centers for Disease Control and Prevention, 5555 Ridge Avenue, Cincinnati, Ohio (Dr Luckhaupt, Dr de Perio); National Institute for Occupational Safety and Health, Centers for Disease Control and Prevention, 395 E Street SW, Washington, DC 20024 (Dr Piacentino, Hearl, Dr Howard)
| | | | | | | | | | | | | | | | | |
Collapse
|
35
|
Hanel R, Thurner S. Boosting test-efficiency by pooled testing for SARS-CoV-2-Formula for optimal pool size. PLoS One 2020; 15:e0240652. [PMID: 33147228 PMCID: PMC7641378 DOI: 10.1371/journal.pone.0240652] [Citation(s) in RCA: 33] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2020] [Accepted: 09/30/2020] [Indexed: 11/19/2022] Open
Abstract
In the current COVID19 crisis many national healthcare systems are confronted with an acute shortage of tests for confirming SARS-CoV-2 infections. For low overall infection levels in the population the pooling of samples can drastically amplify the testing capacity. Here we present a formula to estimate the optimal group-size for pooling, the efficiency gain (tested persons per test), and the expected upper bound of missed infections in pooled testing, all as a function of the population-wide infection levels and the false negative/positive rates of the currently used PCR tests. Assuming an infection level of 0.1% and a false negative rate of 2%, the optimal pool-size is about 34, and an efficiency gain of about 15 tested persons per test is possible. For an infection level of 1% the optimal pool-size is 11, the efficiency gain is 5.1 tested persons per test. For an infection level of 10% the optimal pool-size reduces to about 4, the efficiency gain is about 1.7 tested persons per test. For infection levels of 30% and higher there is no more benefit from pooling. To see to what extent replicates of the pooled tests improve the estimate of the maximal number of missed infections, we present results for 1 to 5 replicates.
Collapse
Affiliation(s)
- Rudolf Hanel
- Section for Science of Complex Systems, Medical University of Vienna, Vienna, Austria
- Complexity Science Hub Vienna, Vienna, Austria
| | - Stefan Thurner
- Section for Science of Complex Systems, Medical University of Vienna, Vienna, Austria
- Complexity Science Hub Vienna, Vienna, Austria
- Santa Fe Institute, Santa Fe, NM, United States of America
- IIASA, Laxenburg, Austria
| |
Collapse
|
36
|
Joyner CN, McMahan CS, Tebbs JM, Bilder CR. From mixed effects modeling to spike and slab variable selection: A Bayesian regression model for group testing data. Biometrics 2020; 76:913-923. [PMID: 31729015 PMCID: PMC7944974 DOI: 10.1111/biom.13176] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2019] [Revised: 10/22/2019] [Accepted: 10/29/2019] [Indexed: 12/20/2022]
Abstract
Due to reductions in both time and cost, group testing is a popular alternative to individual-level testing for disease screening. These reductions are obtained by testing pooled biospecimens (eg, blood, urine, swabs, etc.) for the presence of an infectious agent. However, these reductions come at the expense of data complexity, making the task of conducting disease surveillance more tenuous when compared to using individual-level data. This is because an individual's disease status may be obscured by a group testing protocol and the effect of imperfect testing. Furthermore, unlike individual-level testing, a given participant could be involved in multiple testing outcomes and/or may never be tested individually. To circumvent these complexities and to incorporate all available information, we propose a Bayesian generalized linear mixed model that accommodates data arising from any group testing protocol, estimates unknown assay accuracy probabilities and accounts for potential heterogeneity in the covariate effects across population subgroups (eg, clinic sites, etc.); this latter feature is of key interest to practitioners tasked with conducting disease surveillance. To achieve model selection, our proposal uses spike and slab priors for both fixed and random effects. The methodology is illustrated through numerical studies and is applied to chlamydia surveillance data collected in Iowa.
Collapse
Affiliation(s)
- Chase N. Joyner
- School of Mathematical and Statistical Sciences, Clemson University, Clemson, SC 29634, U.S.A
| | - Christopher S. McMahan
- School of Mathematical and Statistical Sciences, Clemson University, Clemson, SC 29634, U.S.A
| | - Joshua M. Tebbs
- Department of Statistics, University of South Carolina, Columbia, SC 29208, U.S.A
| | | |
Collapse
|
37
|
|
38
|
Zhang W, Liu A, Li Q, Albert PS. Incorporating retesting outcomes for estimation of disease prevalence. Stat Med 2019; 39:687-697. [PMID: 31758594 DOI: 10.1002/sim.8439] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2019] [Revised: 10/31/2019] [Accepted: 11/03/2019] [Indexed: 11/12/2022]
Abstract
Group testing has been widely used as a cost-effective strategy to screen for and estimate the prevalence of a rare disease. While it is well-recognized that retesting is necessary for identifying infected subjects, it is not required for estimating the prevalence. For a test without misclassification, gains in statistical efficiency are expected from incorporating retesting results in the estimation of the prevalence. However, when the test is subject to misclassification, it is not clear how much gain should be expected. There are a number of theoretical challenges in addressing this issue, including (1) enumerating the potential test results from retesting individual subjects in a group, (2) the dependence among these test results and the test result from testing at the group level, and (3) differential misclassification due to pooling of biospecimens. Overcoming some of these challenges, we show that retesting subjects in either positive or negative groups can substantially improve the efficiency of the estimation and that retesting positive groups yields higher efficiency than retesting a same number or proportion of negative groups.
Collapse
Affiliation(s)
- Wei Zhang
- LSC, NCMIS, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, China.,Division of Intramural Population Health Research, Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, Maryland
| | - Aiyi Liu
- Division of Intramural Population Health Research, Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, Maryland
| | - Qizhai Li
- LSC, NCMIS, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, China
| | - Paul S Albert
- Biostatistics Branch, Division of Cancer Epidemiology and Genetics, National Institutes of Health, Rockville, Maryland
| |
Collapse
|
39
|
Hitt BD, Bilder CR, Tebbs JM, McMahan CS. The objective function controversy for group testing: Much ado about nothing? Stat Med 2019; 38:4912-4923. [PMID: 31469188 DOI: 10.1002/sim.8341] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2018] [Revised: 06/06/2019] [Accepted: 07/19/2019] [Indexed: 11/09/2022]
Abstract
Group testing is an indispensable tool for laboratories when testing high volumes of clinical specimens for infectious diseases. An important decision that needs to be made prior to implementation is determining what group sizes to use. In best practice, an objective function is chosen and then minimized to determine an optimal set of these group sizes, known as the optimal testing configuration (OTC). There are a few options for objective functions, and they differ based on how the expected number of tests, assay characteristics, and testing constraints are taken into account. These varied options have led to a recent controversy in the literature regarding which of two different objective functions is better. In our paper, we examine these objective functions over a number of realistic situations for infectious disease testing. We show that this controversy may be much ado about nothing because the OTCs and corresponding results (eg, number of tests and accuracy) are largely the same for standard testing algorithms in a wide variety of situations.
Collapse
Affiliation(s)
- Brianna D Hitt
- Department of Statistics, University of Nebraska-Lincoln, Lincoln, Nebraska
| | | | - Joshua M Tebbs
- Department of Statistics, University of South Carolina, Columbia, South Carolina
| | - Christopher S McMahan
- School of Mathematical and Statistical Sciences, Clemson University, Clemson, South Carolina
| |
Collapse
|
40
|
Lin J, Wang D, Zheng Q. Regression analysis and variable selection for two-stage multiple-infection group testing data. Stat Med 2019; 38:4519-4533. [PMID: 31297869 DOI: 10.1002/sim.8311] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2018] [Revised: 03/03/2019] [Accepted: 06/14/2019] [Indexed: 12/17/2022]
Abstract
Group testing, as a cost-effective strategy, has been widely used to perform large-scale screening for rare infections. Recently, the use of multiplex assays has transformed the goal of group testing from detecting a single disease to diagnosing multiple infections simultaneously. Existing research on multiple-infection group testing data either exclude individual covariate information or ignore possible retests on suspicious individuals. To incorporate both, we propose a new regression model. This new model allows us to perform a regression analysis for each infection using multiple-infection group testing data. Furthermore, we introduce an efficient variable selection method to reveal truly relevant risk factors for each disease. Our methodology also allows for the estimation of the assay sensitivity and specificity when they are unknown. We examine the finite sample performance of our method through extensive simulation studies and apply it to a chlamydia and gonorrhea screening data set to illustrate its practical usefulness.
Collapse
Affiliation(s)
- Juexin Lin
- Department of Statistics, University of South Carolina, South Carolina
| | - Dewei Wang
- Department of Statistics, University of South Carolina, South Carolina
| | - Qi Zheng
- Department of Bioinformatics and Biostatistics, University of Louisville, Kentucky
| |
Collapse
|
41
|
Gregory KB, Wang D, McMahan CS. Adaptive elastic net for group testing. Biometrics 2019; 75:13-23. [PMID: 30267535 PMCID: PMC7938860 DOI: 10.1111/biom.12973] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2017] [Accepted: 09/14/2018] [Indexed: 11/28/2022]
Abstract
For disease screening, group (pooled) testing can be a cost-saving alternative to one-at-a-time testing, with savings realized through assaying pooled biospecimen (eg, urine, blood, saliva). In many group testing settings, practitioners are faced with the task of conducting disease surveillance. That is, it is often of interest to relate individuals' true disease statuses to covariate information via binary regression. Several authors have developed regression methods for group testing data, which is challenging due to the effects of imperfect testing. That is, all testing outcomes (on pools and individuals) are subject to misclassification, and individuals' true statuses are never observed. To further complicate matters, individuals may be involved in several testing outcomes. For analyzing such data, we provide a novel regression methodology which generalizes and extends the aforementioned regression techniques and which incorporates regularization. Specifically, for model fitting and variable selection, we propose an adaptive elastic net estimator under the logistic regression model which can be used to analyze data from any group testing strategy. We provide an efficient algorithm for computing the estimator along with guidance on tuning parameter selection. Moreover, we establish the asymptotic properties of the proposed estimator and show that it possesses "oracle" properties. We evaluate the performance of the estimator through Monte Carlo studies and illustrate the methodology on a chlamydia data set from the State Hygienic Laboratory in Iowa City.
Collapse
Affiliation(s)
- Karl B. Gregory
- Department of Statistics, University of South Carolina, Columbia, SC 29208, U.S.A
| | - Dewei Wang
- Department of Statistics, University of South Carolina, Columbia, SC 29208, U.S.A
| | | |
Collapse
|
42
|
Roy S, Banerjee T. Estimation of log-odds ratio from group testing data using Firth correction. Biom J 2019; 61:714-728. [PMID: 30645765 DOI: 10.1002/bimj.201800125] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2018] [Revised: 11/07/2018] [Accepted: 11/09/2018] [Indexed: 11/10/2022]
Abstract
We consider the estimation of the prevalence of a rare disease, and the log-odds ratio for two specified groups of individuals from group testing data. For a low-prevalence disease, the maximum likelihood estimate of the log-odds ratio is severely biased. However, Firth correction to the score function leads to a considerable improvement of the estimator. Also, for a low-prevalence disease, if the diagnostic test is imperfect, the group testing is found to yield more precise estimate of the log-odds ratio than the individual testing.
Collapse
Affiliation(s)
- Surupa Roy
- Department of Statistics, St Xavier's College, Kolkata, India
| | | |
Collapse
|
43
|
Malinovsky Y, Albert PS. Revisiting Nested Group Testing Procedures: New Results, Comparisons, and Robustness. AM STAT 2018; 73:117-125. [PMID: 31814627 DOI: 10.1080/00031305.2017.1366367] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Abstract
Group testing has its origin in the identification of syphilis in the U.S. army during World War II. Much of the theoretical framework of group testing was developed starting in the late 1950s, with continued work into the 1990s. Recently, with the advent of new laboratory and genetic technologies, there has been an increasing interest in group testing designs for cost saving purposes. In this article, we compare different nested designs, including Dorfman, Sterrett and an optimal nested procedure obtained through dynamic programming. To elucidate these comparisons, we develop closed-form expressions for the optimal Sterrett procedure and provide a concise review of the prior literature for other commonly used procedures. We consider designs where the prevalence of disease is known as well as investigate the robustness of these procedures, when it is incorrectly assumed. This article provides a technical presentation that will be of interest to researchers as well as from a pedagogical perspective. Supplementary material for this article available online.
Collapse
Affiliation(s)
- Yaakov Malinovsky
- Department of Mathematics and Statistics University of Maryland, Baltimore County, Baltimore, MD
| | - Paul S Albert
- Biostatistics Branch, Division of Cancer Epidemiology and Genetics National Cancer Institute, Rockville, MD
| |
Collapse
|
44
|
Wang D, McMahan CS, Tebbs JM, Bilder CR. Group testing case identification with biomarker information. Comput Stat Data Anal 2018; 122:156-166. [PMID: 29977101 DOI: 10.1016/j.csda.2018.01.005] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
Abstract
Screening procedures for infectious diseases, such as HIV, often involve pooling individual specimens together and testing the pools. For diseases with low prevalence, group testing (or pooled testing) can be used to classify individuals as diseased or not while providing considerable cost savings when compared to testing specimens individually. The pooling literature is replete with group testing case identification algorithms including Dorfman testing, higher-stage hierarchical procedures, and array testing. Although these algorithms are usually evaluated on the basis of the expected number of tests and classification accuracy, most evaluations in the literature do not account for the continuous nature of the testing responses and thus invoke potentially restrictive assumptions to characterize an algorithm's performance. Commonly used case identification algorithms in group testing are considered and are evaluated by taking a different approach. Instead of treating testing responses as binary random variables (i.e., diseased/not), evaluations are made by exploiting an assay's underlying continuous biomarker distributions for positive and negative individuals. In doing so, a general framework to describe the operating characteristics of group testing case identification algorithms is provided when these distributions are known. The methodology is illustrated using two HIV testing examples taken from the pooling literature.
Collapse
Affiliation(s)
- Dewei Wang
- Department of Statistics, University of South Carolina, Columbia, SC 29208, USA
| | | | - Joshua M Tebbs
- Department of Statistics, University of South Carolina, Columbia, SC 29208, USA
| | - Christopher R Bilder
- Department of Statistics, University of Nebraska-Lincoln, Lincoln, NE 68583, USA
| |
Collapse
|
45
|
McMahan CS, Tebbs JM, Hanson TE, Bilder CR. Bayesian regression for group testing data. Biometrics 2017; 73:1443-1452. [PMID: 28405965 PMCID: PMC5638690 DOI: 10.1111/biom.12704] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2016] [Revised: 03/01/2017] [Accepted: 03/01/2017] [Indexed: 01/10/2023]
Abstract
Group testing involves pooling individual specimens (e.g., blood, urine, swabs, etc.) and testing the pools for the presence of a disease. When individual covariate information is available (e.g., age, gender, number of sexual partners, etc.), a common goal is to relate an individual's true disease status to the covariates in a regression model. Estimating this relationship is a nonstandard problem in group testing because true individual statuses are not observed and all testing responses (on pools and on individuals) are subject to misclassification arising from assay error. Previous regression methods for group testing data can be inefficient because they are restricted to using only initial pool responses and/or they make potentially unrealistic assumptions regarding the assay accuracy probabilities. To overcome these limitations, we propose a general Bayesian regression framework for modeling group testing data. The novelty of our approach is that it can be easily implemented with data from any group testing protocol. Furthermore, our approach will simultaneously estimate assay accuracy probabilities (along with the covariate effects) and can even be applied in screening situations where multiple assays are used. We apply our methods to group testing data collected in Iowa as part of statewide screening efforts for chlamydia, and we make user-friendly R code available to practitioners.
Collapse
Affiliation(s)
| | - Joshua M. Tebbs
- Department of Statistics, University of South Carolina, Columbia, SC 29208, U.S.A
| | - Timothy E. Hanson
- Department of Statistics, University of South Carolina, Columbia, SC 29208, U.S.A
| | | |
Collapse
|
46
|
Improved HIV-1 Viral Load Monitoring Capacity Using Pooled Testing With Marker-Assisted Deconvolution. J Acquir Immune Defic Syndr 2017; 75:580-587. [PMID: 28489730 DOI: 10.1097/qai.0000000000001424] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
OBJECTIVE Improve pooled viral load (VL) testing to increase HIV treatment monitoring capacity, particularly relevant for resource-limited settings. DESIGN We developed marker-assisted mini-pooling with algorithm (mMPA), a new VL pooling deconvolution strategy that uses information from low-cost, routinely collected clinical markers to determine an efficient order of sequential individual VL testing and dictates when the sequential testing can be stopped. METHODS We simulated the use of pooled testing to ascertain virological failure status on 918 participants from 3 studies conducted at the Academic Model Providing Access to Healthcare in Eldoret, Kenya, and estimated the number of assays needed when using mMPA and other pooling methods. We also evaluated the impact of practical factors, such as specific markers used, prevalence of virological failure, pool size, VL measurement error, and assay detection cutoffs on mMPA, other pooling methods, and single testing. RESULTS Using CD4 count as a marker to assist deconvolution, mMPA significantly reduces the number of VL assays by 52% [confidence interval (CI): 48% to 57%], 40% (CI: 38% to 42%), and 19% (CI: 15% to 22%) compared with individual testing, simple mini-pooling, and mini-pooling with algorithm, respectively. mMPA has higher sensitivity and negative/positive predictive values than mini-pooling with algorithm, and comparable high specificity. Further improvement is achieved with additional clinical markers, such as age and time on therapy, with or without CD4 values. mMPA performance depends on prevalence of virological failure and pool size but is insensitive to VL measurement error and VL assay detection cutoffs. CONCLUSIONS mMPA can substantially increase the capacity of VL monitoring.
Collapse
|
47
|
Warasi MS, McMahan CS, Tebbs JM, Bilder CR. Group testing regression models with dilution submodels. Stat Med 2017; 36:4860-4872. [PMID: 28856774 DOI: 10.1002/sim.7455] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2017] [Revised: 05/27/2017] [Accepted: 08/11/2017] [Indexed: 12/21/2022]
Abstract
Group testing, where specimens are tested initially in pools, is widely used to screen individuals for sexually transmitted diseases. However, a common problem encountered in practice is that group testing can increase the number of false negative test results. This occurs primarily when positive individual specimens within a pool are diluted by negative ones, resulting in positive pools testing negatively. If the goal is to estimate a population-level regression model relating individual disease status to observed covariates, severe bias can result if an adjustment for dilution is not made. Recognizing this as a critical issue, recent binary regression approaches in group testing have utilized continuous biomarker information to acknowledge the effect of dilution. In this paper, we have the same overall goal but take a different approach. We augment existing group testing regression models (that assume no dilution) with a parametric dilution submodel for pool-level sensitivity and estimate all parameters using maximum likelihood. An advantage of our approach is that it does not rely on external biomarker test data, which may not be available in surveillance studies. Furthermore, unlike previous approaches, our framework allows one to formally test whether dilution is present based on the observed group testing data. We use simulation to illustrate the performance of our estimation and inference methods, and we apply these methods to 2 infectious disease data sets.
Collapse
Affiliation(s)
- Md S Warasi
- Department of Mathematics and Statistics, Radford University, Radford, VA 24142, USA
| | | | - Joshua M Tebbs
- Department of Statistics, University of South Carolina, Columbia, SC 29208, USA
| | - Christopher R Bilder
- Department of Statistics, University of Nebraska-Lincoln, Lincoln, NE 68583, NE, USA
| |
Collapse
|
48
|
Abstract
Pooled testing is useful to identify positive specimens for large-scale screening. Matrix pooling is one of the commonly used algorithms. In this work, we investigate the properties of matrix pooling and reveal that the efficiency of matrix pooling is related with the magnitude of overlapping among groups. Based on this property, we develop a new design to further improve the efficiency while taking into account of testing error. The efficiency, pooling sensitivity and specificity of this algorithm are explicitly derived and verified through plasmode simulation of detecting acute human immunodeficiency virus among patients who were suspected to have malaria in rural Ugandan. We show that the new design outperforms matrix pooling in efficiency while retain the pooling sensitivity and specificity.
Collapse
Affiliation(s)
- Wenjun Xiong
- 1 School of Mathematics and Statistics, Guangxi Normal University, Guilin, China.,2 Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, China
| | - Juan Ding
- 1 School of Mathematics and Statistics, Guangxi Normal University, Guilin, China.,3 Department of Medicine, Vanderbilt University School of Medicine, Nashville, USA
| | - Yuanzhen He
- 4 School of Mathematical Sciences, Beijing Normal University, Beijing, China
| | - Qizhai Li
- 2 Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, China
| |
Collapse
|
49
|
Huang X, Sarker Warasi MS. Maximum Likelihood Estimators in Regression Models for Error‐prone Group Testing Data. Scand Stat Theory Appl 2017. [DOI: 10.1111/sjos.12282] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Xianzheng Huang
- Department of Statistics, College of Arts & Sciences University of South Carolina
| | | |
Collapse
|
50
|
Hou P, Tebbs JM, Bilder CR, McMahan CS. Hierarchical group testing for multiple infections. Biometrics 2016; 73:656-665. [PMID: 27657666 DOI: 10.1111/biom.12589] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2015] [Revised: 06/01/2016] [Accepted: 08/01/2016] [Indexed: 11/30/2022]
Abstract
Group testing, where individuals are tested initially in pools, is widely used to screen a large number of individuals for rare diseases. Triggered by the recent development of assays that detect multiple infections at once, screening programs now involve testing individuals in pools for multiple infections simultaneously. Tebbs, McMahan, and Bilder (2013, Biometrics) recently evaluated the performance of a two-stage hierarchical algorithm used to screen for chlamydia and gonorrhea as part of the Infertility Prevention Project in the United States. In this article, we generalize this work to accommodate a larger number of stages. To derive the operating characteristics of higher-stage hierarchical algorithms with more than one infection, we view the pool decoding process as a time-inhomogeneous, finite-state Markov chain. Taking this conceptualization enables us to derive closed-form expressions for the expected number of tests and classification accuracy rates in terms of transition probability matrices. When applied to chlamydia and gonorrhea testing data from four states (Region X of the United States Department of Health and Human Services), higher-stage hierarchical algorithms provide, on average, an estimated 11% reduction in the number of tests when compared to two-stage algorithms. For applications with rarer infections, we show theoretically that this percentage reduction can be much larger.
Collapse
Affiliation(s)
- Peijie Hou
- Department of Statistics, University of South Carolina, Columbia, South Carolina 29208, U.S.A
| | - Joshua M Tebbs
- Department of Statistics, University of South Carolina, Columbia, South Carolina 29208, U.S.A
| | - Christopher R Bilder
- Department of Statistics, University of Nebraska-Lincoln, Lincoln, Nebraska 68583, U.S.A
| | - Christopher S McMahan
- Department of Mathematical Sciences, Clemson University, Clemson, South Carolina 29634, U.S.A
| |
Collapse
|