A comparison of observation-level random effect and Beta-Binomial models for modelling overdispersion in Binomial data in ecology & evolution

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

Overdispersion is a common feature of models of biological data, but researchers often fail to model the excess variation driving the overdispersion, resulting in biased parameter estimates and standard errors. Quantifying and modeling overdispersion when it is present is therefore critical for robust biological inference. One means to account for overdispersion is to add an observation-level random effect (OLRE) to a model, where each data point receives a unique level of a random effect that can absorb the extra-parametric variation in the data. Although some studies have investigated the utility of OLRE to model overdispersion in Poisson count data, studies doing so for Binomial proportion data are scarce. Here I use a simulation approach to investigate the ability of both OLRE models and Beta-Binomial models to recover unbiased parameter estimates in mixed effects models of Binomial data under various degrees of overdispersion. In addition, as ecologists often fit random intercept terms to models when the random effect sample size is low (<5 levels), I investigate the performance of both model types under a range of random effect sample sizes when overdispersion is present. Simulation results revealed that the efficacy of OLRE depends on the process that generated the overdispersion; OLRE failed to cope with overdispersion generated from a Beta-Binomial mixture model, leading to biased slope and intercept estimates, but performed well for overdispersion generated by adding random noise to the linear predictor. Comparison of parameter estimates from an OLRE model with those from its corresponding Beta-Binomial model readily identified when OLRE were performing poorly due to disagreement between effect sizes, and this strategy should be employed whenever OLRE are used for Binomial data to assess their reliability. Beta-Binomial models performed well across all contexts, but showed a tendency to underestimate effect sizes when modelling non-Beta-Binomial data. Finally, both OLRE and Beta-Binomial models performed poorly when models contained <5 levels of the random intercept term, especially for estimating variance components, and this effect appeared independent of total sample size. These results suggest that OLRE are a useful tool for modelling overdispersion in Binomial data, but that they do not perform well in all circumstances and researchers should take care to verify the robustness of parameter estimates of OLRE models.

Related collections

Most cited references 27

Record: found
Abstract: not found
Book: not found

Negative Binomial Regression

Joseph Hilbe (2011)

0 comments Cited 368 times – based on 0 reviews

Bookmark

Record: found
Abstract: found
Article: found

Is Open Access

Using observation-level random effects to model overdispersion in count data in ecology and evolution

Xavier A. Harrison, Gregorio Mentaberre (2014)

Overdispersion is common in models of count data in ecology and evolutionary biology, and can occur due to missing covariates, non-independent (aggregated) data, or an excess frequency of zeroes (zero-inflation). Accounting for overdispersion in such models is vital, as failing to do so can lead to biased parameter estimates, and false conclusions regarding hypotheses of interest. Observation-level random effects (OLRE), where each data point receives a unique level of a random effect that models the extra-Poisson variation present in the data, are commonly employed to cope with overdispersion in count data. However studies investigating the efficacy of observation-level random effects as a means to deal with overdispersion are scarce. Here I use simulations to show that in cases where overdispersion is caused by random extra-Poisson noise, or aggregation in the count data, observation-level random effects yield more accurate parameter estimates compared to when overdispersion is simply ignored. Conversely, OLRE fail to reduce bias in zero-inflated data, and in some cases increase bias at high levels of overdispersion. There was a positive relationship between the magnitude of overdispersion and the degree of bias in parameter estimates. Critically, the simulations reveal that failing to account for overdispersion in mixed models can erroneously inflate measures of explained variance (r 2), which may lead to researchers overestimating the predictive power of variables of interest. This work suggests use of observation-level random effects provides a simple and robust means to account for overdispersion in count data, but also that their ability to minimise bias is not uniform across all types of overdispersion and must be applied judiciously.

0 comments Cited 335 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: not found
Article: not found

R: language and environment for statistical computing

RC Team, R Rteam, R Team … (2014)

0 comments Cited 233 times – based on 0 reviews      Review now

Bookmark

All references

Author and article information

Contributors

Xavier A. Harrison

Journal

Journal ID (nlm-ta): PeerJ

Journal ID (iso-abbrev): PeerJ

Journal ID (pmc): PeerJ

Journal ID (publisher-id): PeerJ

Title: PeerJ

Publisher: PeerJ Inc. (San Francisco, USA )

ISSN (Electronic): 2167-8359

Publication date (Electronic): 21 July 2015

Publication date Collection: 2015

Volume: 3

Electronic Location Identifier: e1114

Affiliations

[-1]Institute of Zoology, Zoological Society of London , UK

Article

Publisher ID: 1114

DOI: 10.7717/peerj.1114

PMC ID: 4517959

PubMed ID: 26244118

SO-VID: 67ea8de9-6fb7-4086-9438-62ff52993a13

License:

This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.

History

Date received : 26 March 2015

Date accepted : 29 June 2015

Funding

Funded by: Zoological Society of London

Funded by: BES Research Grant

Award ID: 4720/5758

This work was funded by a Research Fellowship awarded to XH by the Zoological Society of London, and a BES Research Grant (Grant Number 4720/5758). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

A comparison of observation-level random effect and Beta-Binomial models for modelling overdispersion in Binomial data in ecology & evolution

Read this article at

Abstract

Related collections

Crocodylian evolution

Most cited references 27

Negative Binomial Regression

Using observation-level random effects to model overdispersion in count data in ecology and evolution

R: language and environment for statistical computing

Author and article information

Contributors

Journal

Affiliations

Article

History

Funding

Categories

Comments

Comment on this article

Similar content 236

Cited by 114

Most referenced authors 306