Epidemiologic studies are often designed to serve several purposes. The complex sampling plans necessary to ensure an adequate number of cases, a similar age distribution among cases and controls, or other important design constraints for comparative studies may make the additional goal of estimating prevalence in the population or in important subgroups difficult to attain with existing computer software, which typically assumes simple random sample selection. We consider here various methods for estimating overall and subgroup prevalence from complex samples, including crude prevalences, direct standardization of prevalences, and standardization using logistic regression to smooth the sampling group prevalences. We illustrate these methods using a complex sample to estimate the prevalence of Alzheimer's disease in an urban community. A simulation study under various models in this setting is also described. We conclude that the use of logistic regression to smooth sampling group prevalences before standardization is an effective method for estimation of overall prevalence, provided that the adequacy of fit of a logistic model is carefully checked.
- Clinical epidemiologic research
- Statistical models
ASJC Scopus subject areas
- Public Health, Environmental and Occupational Health