1
|
Yang J, Liu A, Perkins N, Chen Z. Youden index estimation based on group-tested data. Stat Methods Med Res 2025; 34:45-54. [PMID: 39659139 DOI: 10.1177/09622802241295319] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2024]
Abstract
Youden index, a linear function of sensitivity and specificity, provides a direct measurement of the highest diagnostic accuracy achievable by a biomarker. It is maximized at the cut-off point that optimizes the biomarker's overall classification rate while assigning equal weight to sensitivity and specificity. In this paper, we consider the problem of estimating the Youden index when only group-tested data are available. The unavailability of individual disease statuses poses a challenge, especially when there is differential false positives and negatives in disease screening. We propose both parametric and nonparametric procedures for estimation of the Youden index, and exemplify our methods by utilizing data from the National Health and Nutrition Examination Survey (NHANES) to evaluate the diagnostic ability of monocyte for predicting chlamydia.
Collapse
Affiliation(s)
- Jin Yang
- National Institute of Child Health and Human Development, Bethesda, Maryland, United States
| | - Aiyi Liu
- National Institute of Child Health and Human Development, Bethesda, Maryland, United States
| | - Neil Perkins
- National Institute of Child Health and Human Development, Bethesda, Maryland, United States
| | - Zhen Chen
- National Institute of Child Health and Human Development, Bethesda, Maryland, United States
| |
Collapse
|
2
|
Yang J, Zhang W, Albert PS, Liu A, Chen Z. Combining Biomarkers to Improve Diagnostic Accuracy in Detecting Diseases With Group-Tested Data. Stat Med 2024; 43:5182-5192. [PMID: 39375883 PMCID: PMC11583953 DOI: 10.1002/sim.10230] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2023] [Revised: 08/22/2024] [Accepted: 09/11/2024] [Indexed: 10/09/2024]
Abstract
We consider the problem of combining multiple biomarkers to improve the diagnostic accuracy of detecting a disease when only group-tested data on the disease status are available. There are several challenges in addressing this problem, including unavailable individual disease statuses, differential misclassification depending on group size and number of diseased individuals in the group, and extensive computation due to a large number of possible combinations of multiple biomarkers. To tackle these issues, we propose a pairwise model fitting approach to estimating the distribution of the optimal linear combination of biomarkers and its diagnostic accuracy under the assumption of a multivariate normal distribution. The approach is evaluated in simulation studies and applied to data on chlamydia detection and COVID-19 diagnosis.
Collapse
Affiliation(s)
- Jin Yang
- Biostatistics and Bioinformatics Branch, Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, Maryland, USA
| | - Wei Zhang
- Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, China
| | - Paul S Albert
- Biostatistics Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Bethesda, Maryland, USA
| | - Aiyi Liu
- Biostatistics and Bioinformatics Branch, Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, Maryland, USA
| | - Zhen Chen
- Biostatistics and Bioinformatics Branch, Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, Maryland, USA
| |
Collapse
|
3
|
Heath B, Villar SS, Robertson DS. How could a pooled testing policy have performed in managing the early stages of the COVID-19 pandemic? Results from a simulation study. Stat Med 2024; 43:2239-2262. [PMID: 38545961 DOI: 10.1002/sim.10062] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2023] [Revised: 12/08/2023] [Accepted: 02/26/2024] [Indexed: 05/18/2024]
Abstract
A coordinated testing policy is an essential tool for responding to emerging epidemics, as was seen with COVID-19. However, it is very difficult to agree on the best policy when there are multiple conflicting objectives. A key objective is minimizing cost, which is why pooled testing (a method that involves pooling samples taken from multiple individuals and analyzing this with a single diagnostic test) has been suggested. In this article, we present results from an extensive and realistic simulation study comparing testing policies based on individually testing subjects with symptoms (a policy resembling the UK strategy at the start of the COVID-19 pandemic), individually testing subjects at random or pools of subjects randomly combined and tested. To compare these testing methods, a dynamic model compromised of a relationship network and an extended SEIR model is used. In contrast to most existing literature, testing capacity is considered as fixed and limited rather than unbounded. This article then explores the impact of the proportion of symptomatic infections on the expected performance of testing policies. Symptomatic testing performs better than pooled testing unless a low proportion of infections are symptomatic. Additionally, we include the novel feature for testing of non-compliance and perform a sensitivity analysis for different compliance assumptions. Our results suggest for the pooled testing scheme to be superior to testing symptomatic people individually, only a small proportion of the population (> 10 % $$ >10\% $$ ) needs to not comply with the testing procedure.
Collapse
Affiliation(s)
- Bethany Heath
- MRC Biostatistics Unit, University of Cambridge, Cambridge, UK
| | - Sofía S Villar
- MRC Biostatistics Unit, University of Cambridge, Cambridge, UK
| | | |
Collapse
|
4
|
Cabrera ODLC, Alsehibani R. Statistical modeling and evaluation of the impact of multiplicity classification thresholds on the COVID-19 pool testing accuracy. PLoS One 2023; 18:e0283874. [PMID: 37494364 PMCID: PMC10370739 DOI: 10.1371/journal.pone.0283874] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2022] [Accepted: 03/17/2023] [Indexed: 07/28/2023] Open
Abstract
Prior research on pool testing focus on developing testing methods with the main objective of reducing the total number of tests. However, pool testing can also be used to improve the accuracy of the testing process. The objective of this paper is to improve the accuracy of pool testing using the same number of tests as that of individual testing taking into consideration the probability of testing errors and pool multiplicity classification thresholds. Statistical models are developed to evaluate the impact of pool multiplicity classiffcation thresholds on pool testing accuracy using the receiver operating characteristic (ROC) curve and the area under the curve (AUC). The findings indicate that under certain conditions, pool testing multiplicity yields superior testing accuracy compared to individual testing without additional cost. The results reveal that selecting the multiplicity classification threshold is a critical factor in improving the pool testing accuracy and show that the lower the prevalence level the higher the gains in accuracy using multiplicity pool testing. The findings also indicate that performance can be improved using a batch size that is inversely proportional to the prevalence level. Furthermore, the results indicate that multiplicity pool testing not only improves the testing accuracy but also reduces the total cost of the testing process. Based on the findings, the manufacturer's test sensitivity has more significant impact on the accuracy of multiplicity pool testing compared to that of manufacturer's test specificity.
Collapse
Affiliation(s)
- Omar De La Cruz Cabrera
- Department of Mathematical Sciences, Kent State University, Kent, OH, United States of America
| | - Razan Alsehibani
- Department of Mathematical Sciences, Kent State University, Kent, OH, United States of America
| |
Collapse
|
5
|
Tsirtsis S, De A, Lorch L, Gomez-Rodriguez M. Pooled testing of traced contacts under superspreading dynamics. PLoS Comput Biol 2022; 18:e1010008. [PMID: 35344547 PMCID: PMC8989305 DOI: 10.1371/journal.pcbi.1010008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2021] [Revised: 04/07/2022] [Accepted: 03/10/2022] [Indexed: 11/18/2022] Open
Abstract
Testing is recommended for all close contacts of confirmed COVID-19 patients. However, existing pooled testing methods are oblivious to the circumstances of contagion provided by contact tracing. Here, we build upon a well-known semi-adaptive pooled testing method, Dorfman's method with imperfect tests, and derive a simple pooled testing method based on dynamic programming that is specifically designed to use information provided by contact tracing. Experiments using a variety of reproduction numbers and dispersion levels, including those estimated in the context of the COVID-19 pandemic, show that the pools found using our method result in a significantly lower number of tests than those found using Dorfman's method. Our method provides the greatest competitive advantage when the number of contacts of an infected individual is small, or the distribution of secondary infections is highly overdispersed. Moreover, it maintains this competitive advantage under imperfect contact tracing and significant levels of dilution.
Collapse
Affiliation(s)
- Stratis Tsirtsis
- Μax Planck Institute for Software Systems, Kaiserslautern, Germany
| | | | | | | |
Collapse
|
6
|
Verdun CM, Fuchs T, Harar P, Elbrächter D, Fischer DS, Berner J, Grohs P, Theis FJ, Krahmer F. Group Testing for SARS-CoV-2 Allows for Up to 10-Fold Efficiency Increase Across Realistic Scenarios and Testing Strategies. Front Public Health 2021; 9:583377. [PMID: 34490172 PMCID: PMC8416485 DOI: 10.3389/fpubh.2021.583377] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2020] [Accepted: 07/26/2021] [Indexed: 11/24/2022] Open
Abstract
Background: Due to the ongoing COVID-19 pandemic, demand for diagnostic testing has increased drastically, resulting in shortages of necessary materials to conduct the tests and overwhelming the capacity of testing laboratories. The supply scarcity and capacity limits affect test administration: priority must be given to hospitalized patients and symptomatic individuals, which can prevent the identification of asymptomatic and presymptomatic individuals and hence effective tracking and tracing policies. We describe optimized group testing strategies applicable to SARS-CoV-2 tests in scenarios tailored to the current COVID-19 pandemic and assess significant gains compared to individual testing. Methods: We account for biochemically realistic scenarios in the context of dilution effects on SARS-CoV-2 samples and consider evidence on specificity and sensitivity of PCR-based tests for the novel coronavirus. Because of the current uncertainty and the temporal and spatial changes in the prevalence regime, we provide analysis for several realistic scenarios and propose fast and reliable strategies for massive testing procedures. Key Findings: We find significant efficiency gaps between different group testing strategies in realistic scenarios for SARS-CoV-2 testing, highlighting the need for an informed decision of the pooling protocol depending on estimated prevalence, target specificity, and high- vs. low-risk population. For example, using one of the presented methods, all 1.47 million inhabitants of Munich, Germany, could be tested using only around 141 thousand tests if the infection rate is below 0.4% is assumed. Using 1 million tests, the 6.69 million inhabitants from the city of Rio de Janeiro, Brazil, could be tested as long as the infection rate does not exceed 1%. Moreover, we provide an interactive web application, available at www.grouptexting.com, for visualizing the different strategies and designing pooling schemes according to specific prevalence scenarios and test configurations. Interpretation: Altogether, this work may help provide a basis for an efficient upscaling of current testing procedures, which takes the population heterogeneity into account and is fine-grained towards the desired study populations, e.g., mild/asymptomatic individuals vs. symptomatic ones but also mixtures thereof. Funding: German Science Foundation (DFG), German Federal Ministry of Education and Research (BMBF), Chan Zuckerberg Initiative DAF, and Austrian Science Fund (FWF).
Collapse
Affiliation(s)
- Claudio M. Verdun
- Department of Mathematics, Technical University of Munich, Garching, Germany
- Department of Electrical and Computer Engineering, Technical University of Munich, Munich, Germany
| | - Tim Fuchs
- Department of Mathematics, Technical University of Munich, Garching, Germany
| | - Pavol Harar
- Research Network Data Science, University of Vienna, Vienna, Austria
- Department of Telecommunications, Brno University of Technology, Brno, Czechia
| | | | - David S. Fischer
- Institute of Computational Biology, Helmholtz Zentrum München, Munich, Germany
| | - Julius Berner
- Faculty of Mathematics, University of Vienna, Vienna, Austria
| | - Philipp Grohs
- Research Network Data Science, University of Vienna, Vienna, Austria
- Faculty of Mathematics, University of Vienna, Vienna, Austria
- Johann Radon Institute for Computational and Applied Mathematics, Austrian Academy of Sciences, Linz, Austria
| | - Fabian J. Theis
- Department of Mathematics, Technical University of Munich, Garching, Germany
- Institute of Computational Biology, Helmholtz Zentrum München, Munich, Germany
| | - Felix Krahmer
- Department of Mathematics, Technical University of Munich, Garching, Germany
- Munich Data Science Institute, Technical University of Munich, Garching, Germany
| |
Collapse
|