Extending the interpretation and utility of mixed effects logistic regression models

Edward R Atwill, Hussni O. Mohammed, Janet M. Scarlett, Charles E. McCulloch

Research output: Contribution to journal (Article)

15 Citations (Scopus)

Abstract

The veterinary research community has begun to use mixed effects logistic regression (MELR) for analyzing disease data obtained from groups of animals. In this article we discuss the issues of how to analyze these models and how to interpret MELR risk estimates and random effect variances (single and nested). We provide empirical evidence for their use and present equations for interpreting the results and comparing ordinary logistic regression (OLR) and MELR. These equations allow for a deeper interpretation of what random effects signify within the MELR model and help reveal the relationship between marginal (OLR) coefficients and conditional (MELR) coefficients. We used three veterinary data sets to illustrate our aims. The data sets contained data on vesicular stomatitis virus infection in cattle, Mycoplasma gallisepticum infection in chicken flocks, and three infectious conditions in puppies (respiratory, intestinal illness, and internal parasites). The chicken data had nested random effects such that 357 flocks were housed on 104 different farms operated by 45 different owners. Significant random effects were detected for all but intestinal illness in puppies and the nested farm random effect in the chicken data. The intra-group correlation coefficients on the logit scale, calculated from the random effect variances, were 0.47 and 0.55 for the cow and chicken data, respectively. This indicated that about 50% of the total variance on the logit scale for the probability of disease was attributable to unmeasured or unmeasurable group-level factors. Since the farm random effect was not significant once the owner random effect was controlled for in the chicken data, the unknown factor(s) inducing the intra-group correlation was operating at the owner level or higher. These data sets were also used to illustrate why predicted probabilities from the MELR model should not be presented as point estimates. 
For example, the predicted OLR probability of testing seropositive to vesicular stomatitis virus New Jersey serotype (VSV-NJ) for 5-year-old Bos taurus cattle living at an elevation of 0-500 m with a mean annual rainfall of 0-2 m was 74%. Given that significant herd random effects were present, however, the true probability of testing seropositive to VSV-NJ for such a cow should be formulated as a range of herd-specific probabilities: the probability varied from 48 to 97% for the central 65% of the herds and from 14 to 99% for the central 95% of the herds. We have also shown why marginal OLR coefficients are biased downward as estimates of conditional MELR coefficients owing to the intra-group correlation and non-linearity of the logistic regression model.
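
The quantities named in the abstract can be illustrated with a short sketch. The abstract does not reproduce the article's equations, so the formulas below are assumptions: the common latent-variable definition of the intra-group correlation on the logit scale (random effect variance divided by that variance plus pi^2/3), a normally distributed random intercept for the herd-specific probability range, and the widely used approximation that shrinks a conditional coefficient by 1/sqrt(1 + c^2 * sigma^2) with c = 16*sqrt(3)/(15*pi). All function names and the linear predictor value are hypothetical.

```python
import math

# Variance of the standard logistic distribution (residual variance on the logit scale).
LOGISTIC_VAR = math.pi ** 2 / 3


def expit(x):
    """Inverse logit: map a linear predictor to a probability."""
    return 1.0 / (1.0 + math.exp(-x))


def icc_logit(sigma2):
    """Intra-group correlation on the logit scale (assumed latent-variable definition)."""
    return sigma2 / (sigma2 + LOGISTIC_VAR)


def sigma2_from_icc(icc):
    """Back out the random effect variance implied by a given logit-scale ICC."""
    return icc * LOGISTIC_VAR / (1.0 - icc)


def herd_probability_range(eta, sigma2, z):
    """Group-specific probabilities at eta -/+ z random-effect standard deviations.

    Assumes a normally distributed random intercept; eta is the fixed-effect
    linear predictor for the covariate pattern of interest.
    """
    sd = math.sqrt(sigma2)
    return expit(eta - z * sd), expit(eta + z * sd)


def marginal_attenuation(sigma2):
    """Approximate ratio of a marginal to a conditional coefficient (assumed approximation)."""
    c = 16.0 * math.sqrt(3.0) / (15.0 * math.pi)
    return 1.0 / math.sqrt(1.0 + c ** 2 * sigma2)


if __name__ == "__main__":
    sigma2 = sigma2_from_icc(0.47)      # ICC reported for the cattle data
    eta = 1.5                           # hypothetical linear predictor, not from the article
    print("implied variance:", round(sigma2, 3))
    print("central 95% range:", herd_probability_range(eta, sigma2, 1.96))
    print("attenuation factor:", round(marginal_attenuation(sigma2), 3))
```

The attenuation factor is below 1 whenever the random effect variance is positive, which is one way to see why marginal coefficients sit closer to zero than conditional ones. The printed range depends entirely on the hypothetical linear predictor and is not a reproduction of the article's results.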

Original language: English (US)
Pages (from-to): 187-201
Number of pages: 15
Journal: Preventive Veterinary Medicine
Volume: 24
Issue number: 3
DOI: 10.1016/0167-5877(95)92833-J
State: Published - 1995
Externally published: Yes

Fingerprint

Logistic Models
chickens
Vesicular stomatitis New Jersey virus
herds
puppies
farms
Chickens
flocks
serotypes
Bos
Vesiculovirus
cows
Mycoplasma gallisepticum
risk estimate
cattle
infection
testing
rain
parasites
Mycoplasma Infections

Keywords

  • Intra-group correlation
  • Logistic regression
  • Nested designs
  • Random effects

ASJC Scopus subject areas

  • Animal Science and Zoology
  • Food Animals

Cite this

Extending the interpretation and utility of mixed effects logistic regression models. / Atwill, Edward R; Mohammed, Hussni O.; Scarlett, Janet M.; McCulloch, Charles E.

In: Preventive Veterinary Medicine, Vol. 24, No. 3, 1995, p. 187-201.

Atwill, Edward R ; Mohammed, Hussni O. ; Scarlett, Janet M. ; McCulloch, Charles E. / Extending the interpretation and utility of mixed effects logistic regression models. In: Preventive Veterinary Medicine. 1995 ; Vol. 24, No. 3. pp. 187-201.
@article{b3034e7c73844d1a87cf8fc62efb55fe,
title = "Extending the interpretation and utility of mixed effects logistic regression models",
abstract = "The veterinary research community has begun to use mixed effects logistic regression (MELR) for analyzing disease data obtained from groups of animals. In this article we discuss the issues of how to analyze these models and how to interpret MELR risk estimates and random effect variances (single and nested). We provide empirical evidence for their use and present equations for interpreting the results and comparing ordinary logistic regression (OLR) and MELR. These equations allow for a deeper interpretation of what random effects signify within the MELR model and help reveal the relationship between marginal (OLR) coefficients and conditional (MELR) coefficients. We used three veterinary data sets to illustrate our aims. The data sets contained data on vesicular stomatitis virus infection in cattle, Mycoplasma gallisepticum infection in chicken flocks, and three infectious conditions in puppies (respiratory, intestinal illness, and internal parasites). The chicken data had nested random effects such that 357 flocks were housed on 104 different farms operated by 45 different owners. Significant random effects were detected for all but intestinal illness in puppies and the nested farm random effect in the chicken data. The intra-group correlation coefficients on the logit scale, calculated from the random effect variances, were 0.47 and 0.55 for the cow and chicken data, respectively. This indicated that about 50{\%} of the total variance on the logit scale for the probability of disease was attributable to unmeasured or unmeasurable group-level factors. Since the farm random effect was not significant once the owner random effect was controlled for in the chicken data, the unknown factor(s) inducing the intra-group correlation was operating at the owner level or higher. These data sets were also used to illustrate why predicted probabilities from the MELR model should not be presented as point estimates. 
For example, the predicted OLR probability of testing seropositive to vesicular stomatitis virus New Jersey serotype (VSV-NJ) for 5-year-old Bos taurus cattle living at an elevation of 0-500 m with a mean annual rainfall of 0-2 m was 74{\%}. Given that significant herd random effects were present, however, the true probability of testing seropositive to VSV-NJ for such a cow should be formulated as a range of herd-specific probabilities: the probability varied from 48 to 97{\%} for the central 65{\%} of the herds and from 14 to 99{\%} for the central 95{\%} of the herds. We have also shown why marginal OLR coefficients are biased downward as estimates of conditional MELR coefficients owing to the intra-group correlation and non-linearity of the logistic regression model.",
keywords = "Intra-group correlation, Logistic regression, Nested designs, Random effects",
author = "Atwill, {Edward R} and Mohammed, {Hussni O.} and Scarlett, {Janet M.} and McCulloch, {Charles E.}",
year = "1995",
doi = "10.1016/0167-5877(95)92833-J",
language = "English (US)",
volume = "24",
pages = "187--201",
journal = "Preventive Veterinary Medicine",
issn = "0167-5877",
publisher = "Elsevier",
number = "3",

}

TY - JOUR

T1 - Extending the interpretation and utility of mixed effects logistic regression models

AU - Atwill, Edward R

AU - Mohammed, Hussni O.

AU - Scarlett, Janet M.

AU - McCulloch, Charles E.

PY - 1995

Y1 - 1995

AB - The veterinary research community has begun to use mixed effects logistic regression (MELR) for analyzing disease data obtained from groups of animals. In this article we discuss the issues of how to analyze these models and how to interpret MELR risk estimates and random effect variances (single and nested). We provide empirical evidence for their use and present equations for interpreting the results and comparing ordinary logistic regression (OLR) and MELR. These equations allow for a deeper interpretation of what random effects signify within the MELR model and help reveal the relationship between marginal (OLR) coefficients and conditional (MELR) coefficients. We used three veterinary data sets to illustrate our aims. The data sets contained data on vesicular stomatitis virus infection in cattle, Mycoplasma gallisepticum infection in chicken flocks, and three infectious conditions in puppies (respiratory, intestinal illness, and internal parasites). The chicken data had nested random effects such that 357 flocks were housed on 104 different farms operated by 45 different owners. Significant random effects were detected for all but intestinal illness in puppies and the nested farm random effect in the chicken data. The intra-group correlation coefficients on the logit scale, calculated from the random effect variances, were 0.47 and 0.55 for the cow and chicken data, respectively. This indicated that about 50% of the total variance on the logit scale for the probability of disease was attributable to unmeasured or unmeasurable group-level factors. Since the farm random effect was not significant once the owner random effect was controlled for in the chicken data, the unknown factor(s) inducing the intra-group correlation was operating at the owner level or higher. These data sets were also used to illustrate why predicted probabilities from the MELR model should not be presented as point estimates. 
For example, the predicted OLR probability of testing seropositive to vesicular stomatitis virus New Jersey serotype (VSV-NJ) for 5-year-old Bos taurus cattle living at an elevation of 0-500 m with a mean annual rainfall of 0-2 m was 74%. Given that significant herd random effects were present, however, the true probability of testing seropositive to VSV-NJ for such a cow should be formulated as a range of herd-specific probabilities: the probability varied from 48 to 97% for the central 65% of the herds and from 14 to 99% for the central 95% of the herds. We have also shown why marginal OLR coefficients are biased downward as estimates of conditional MELR coefficients owing to the intra-group correlation and non-linearity of the logistic regression model.

KW - Intra-group correlation

KW - Logistic regression

KW - Nested designs

KW - Random effects

UR - http://www.scopus.com/inward/record.url?scp=0039175138&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0039175138&partnerID=8YFLogxK

U2 - 10.1016/0167-5877(95)92833-J

DO - 10.1016/0167-5877(95)92833-J

M3 - Article

AN - SCOPUS:0039175138

VL - 24

SP - 187

EP - 201

JO - Preventive Veterinary Medicine

JF - Preventive Veterinary Medicine

SN - 0167-5877

IS - 3

ER -