Ewens WJ, Wilf HS. Computing the distribution of the maximum in balls-and-boxes problems with application to clusters of disease cases. Proc Natl Acad Sci U S A 2007;104:11189-91. [PMID: 17592129 PMCID: PMC2040874 DOI: 10.1073/pnas.0704691104]
[Received: 05/18/2007]
Abstract
We present a rapid method for the exact calculation of the cumulative distribution function of the maximum of multinomially distributed random variables. The method runs in time O(mn), where m is the desired maximum and n is the number of variables. We apply the method to the analysis of two situations in which an apparent clustering of cases of a disease in some locality has raised epidemiological concerns, and these concerns have been discussed in the recent literature. We conclude that one of these clusters may be explained on purely random grounds, namely the leukemia cluster in Niles, IL, in 1956-1960; whereas the other, a leukemia cluster in Fallon, NV, in 1999-2001, may not.
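The quantity described in the abstract can be illustrated with a small sketch. This is not the authors' O(mn) algorithm, which the abstract does not spell out; it is a plain polynomial-power computation based on the standard identity that, for N balls dropped independently and uniformly into n boxes, P(max <= m) equals N!/n^N times the coefficient of x^N in (1 + x + x^2/2! + ... + x^m/m!)^n. The function name `max_cdf`, its parameter names, and the equal-probability assumption are illustrative choices, not taken from the paper.

```python
from fractions import Fraction
from math import factorial


def max_cdf(balls, boxes, m):
    """Exact P(largest box count <= m) for `balls` balls thrown
    independently and uniformly into `boxes` boxes (assumed equiprobable)."""
    # Truncated exponential series: coefficient 1/j! for j = 0..m.
    # Terms above degree `balls` can never contribute, so cap there.
    base = [Fraction(1, factorial(j)) for j in range(min(m, balls) + 1)]

    # poly holds the coefficients of base(x)**k, truncated at degree `balls`.
    poly = [Fraction(1)] + [Fraction(0)] * balls
    for _ in range(boxes):
        nxt = [Fraction(0)] * (balls + 1)
        for i, a in enumerate(poly):
            if a == 0:
                continue
            for j, b in enumerate(base):
                if i + j > balls:
                    break
                nxt[i + j] += a * b
        poly = nxt

    # Scale the coefficient of x**balls by balls! / boxes**balls.
    return poly[balls] * factorial(balls) / Fraction(boxes) ** balls
```

Exact rational arithmetic keeps the result free of rounding error, at the cost of speed; this naive version takes roughly boxes x balls x m coefficient multiplications, so it is only suitable for small inputs. For example, `max_cdf(3, 3, 1)` is the probability that three balls land in three different boxes.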