Jensen AJ, Kelly RP, Anderson EC, Satterthwaite WH, Shelton AO, Ward EJ. Introducing zoid: A mixture model and R package for modeling proportional data with zeros and ones in ecology.
Ecology 2022;
103:e3804. [PMID:
35804486 DOI:
10.1002/ecy.3804]
[Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/24/2022] [Revised: 04/26/2022] [Accepted: 04/29/2022] [Indexed: 11/08/2022]
Abstract
Many ecological datasets are proportional, representing mixtures of constituent elements such as species, populations, or strains. Analyses of proportional data are challenged by categories with zero observations (zeros), all the observations (ones), and overdispersion. In lieu of ad-hoc data adjustments, we describe and evaluate a zero-and-one inflated Dirichlet regression model, with its corresponding R package (zoid), capable of handling observed data (x consisting of three possible categories: zeros, proportions, or ones. Instead of fitting the model to observations of single biological units (e.g., individual organisms) within a sample, we sum proportional contributions across units and estimate mixture proportions using one aggregated observation per sample. Optional estimation of overdispersion and covariate influences expand model applications. We evaluate model performance, as implemented in Stan, using simulations and two ecological case studies. We show that zoid successfully estimates mixture proportions using simulated data with varying sample sizes and is robust to overdispersion and covariate structure. In the empirical case studies, we estimate composition of a mixed-stock Chinook salmon (Oncorhynchus tshawytscha) fishery and analyze stomach contents of Atlantic cod (Gadus morhua). Our implementation of the model as an R package facilitates its application to varied ecological datasets composed of proportional observations.
Collapse