A model for repeated clustered data with informative cluster sizes

Ana-Maria Iosif, AllanR Sampson

Research output: Contribution to journalArticle

1 Citation (Scopus)

Abstract

Many chronic diseases or health conditions manifest with recurring episodes, each of which can be characterized by a measure of intensity or severity. Both the number of episodes and the severity of each episode can depend on the latent severity of an individual's underlying condition. Data such as this are commonly gathered repeatedly at fixed follow-up intervals. An example is a study of the association between stressful life events and the onset of depression. Stress exposure is assessed through the frequency and intensity of stressful life events occurring each month. Both the number of events and the intensity of each event at each measurement occasion are informative about the underlying severity of stress over time. One might hypothesize that people that approach the onset of a depressive episode have worse stress profiles than the controls, reflected by both more frequent and more intense stressors. We propose models to analyze data collected repeatedly on both the frequency of an event and its severity when both of these are informative about the underlying latent severity. Maximum likelihood estimators are developed, and simulations with small to moderate sample sizes show that the estimators also have good finite sample properties, and they are robust against misspecification of the model. This method is applied to a psychiatric data set.

Original languageEnglish (US)
Pages (from-to)738-759
Number of pages22
JournalStatistics in Medicine
Volume33
Issue number5
DOIs
StatePublished - Feb 28 2014

Fingerprint

Clustered Data
Sample Size
Psychiatry
Chronic Disease
Health
Model
Misspecification
Maximum Likelihood Estimator
Estimator
Interval
Simulation
Datasets

Keywords

  • Clustered data
  • Informative cluster size
  • Joint modeling
  • Recurring episodes
  • Repeated measures

ASJC Scopus subject areas

  • Epidemiology
  • Statistics and Probability
  • Medicine(all)

Cite this

A model for repeated clustered data with informative cluster sizes. / Iosif, Ana-Maria; Sampson, AllanR.

In: Statistics in Medicine, Vol. 33, No. 5, 28.02.2014, p. 738-759.

Research output: Contribution to journalArticle

@article{1f10dd48d453411cbf4fc99073bd22c9,
title = "A model for repeated clustered data with informative cluster sizes",
abstract = "Many chronic diseases or health conditions manifest with recurring episodes, each of which can be characterized by a measure of intensity or severity. Both the number of episodes and the severity of each episode can depend on the latent severity of an individual's underlying condition. Data such as this are commonly gathered repeatedly at fixed follow-up intervals. An example is a study of the association between stressful life events and the onset of depression. Stress exposure is assessed through the frequency and intensity of stressful life events occurring each month. Both the number of events and the intensity of each event at each measurement occasion are informative about the underlying severity of stress over time. One might hypothesize that people that approach the onset of a depressive episode have worse stress profiles than the controls, reflected by both more frequent and more intense stressors. We propose models to analyze data collected repeatedly on both the frequency of an event and its severity when both of these are informative about the underlying latent severity. Maximum likelihood estimators are developed, and simulations with small to moderate sample sizes show that the estimators also have good finite sample properties, and they are robust against misspecification of the model. This method is applied to a psychiatric data set.",
keywords = "Clustered data, Informative cluster size, Joint modeling, Recurring episodes, Repeated measures",
author = "Ana-Maria Iosif and AllanR Sampson",
year = "2014",
month = "2",
day = "28",
doi = "10.1002/sim.5988",
language = "English (US)",
volume = "33",
pages = "738--759",
journal = "Statistics in Medicine",
issn = "0277-6715",
publisher = "John Wiley and Sons Ltd",
number = "5",

}

TY - JOUR

T1 - A model for repeated clustered data with informative cluster sizes

AU - Iosif, Ana-Maria

AU - Sampson, AllanR

PY - 2014/2/28

Y1 - 2014/2/28

N2 - Many chronic diseases or health conditions manifest with recurring episodes, each of which can be characterized by a measure of intensity or severity. Both the number of episodes and the severity of each episode can depend on the latent severity of an individual's underlying condition. Data such as this are commonly gathered repeatedly at fixed follow-up intervals. An example is a study of the association between stressful life events and the onset of depression. Stress exposure is assessed through the frequency and intensity of stressful life events occurring each month. Both the number of events and the intensity of each event at each measurement occasion are informative about the underlying severity of stress over time. One might hypothesize that people that approach the onset of a depressive episode have worse stress profiles than the controls, reflected by both more frequent and more intense stressors. We propose models to analyze data collected repeatedly on both the frequency of an event and its severity when both of these are informative about the underlying latent severity. Maximum likelihood estimators are developed, and simulations with small to moderate sample sizes show that the estimators also have good finite sample properties, and they are robust against misspecification of the model. This method is applied to a psychiatric data set.

AB - Many chronic diseases or health conditions manifest with recurring episodes, each of which can be characterized by a measure of intensity or severity. Both the number of episodes and the severity of each episode can depend on the latent severity of an individual's underlying condition. Data such as this are commonly gathered repeatedly at fixed follow-up intervals. An example is a study of the association between stressful life events and the onset of depression. Stress exposure is assessed through the frequency and intensity of stressful life events occurring each month. Both the number of events and the intensity of each event at each measurement occasion are informative about the underlying severity of stress over time. One might hypothesize that people that approach the onset of a depressive episode have worse stress profiles than the controls, reflected by both more frequent and more intense stressors. We propose models to analyze data collected repeatedly on both the frequency of an event and its severity when both of these are informative about the underlying latent severity. Maximum likelihood estimators are developed, and simulations with small to moderate sample sizes show that the estimators also have good finite sample properties, and they are robust against misspecification of the model. This method is applied to a psychiatric data set.

KW - Clustered data

KW - Informative cluster size

KW - Joint modeling

KW - Recurring episodes

KW - Repeated measures

UR - http://www.scopus.com/inward/record.url?scp=84893812195&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84893812195&partnerID=8YFLogxK

U2 - 10.1002/sim.5988

DO - 10.1002/sim.5988

M3 - Article

C2 - 24123049

AN - SCOPUS:84893812195

VL - 33

SP - 738

EP - 759

JO - Statistics in Medicine

JF - Statistics in Medicine

SN - 0277-6715

IS - 5

ER -