Академический Документы
Профессиональный Документы
Культура Документы
Background
Lifetime prevalences (i.e. the proportion of the population fulfilling a
diagnostic at any time in life) encompass highly valuable information in
epidemiologic studies about mental disorders, their relationships, duration
and severity, and environmental factors. Such information is capital for
describing and explaining the distribution of mental pathologies during
development, as well as for studying sequences of disorder co-occurrence
(multimorbodity) or the beginning of symptoms with shared etiopathogeny.
Longitudinal studies are the ideal design for establishing these relationships,
as they would allow survival-analytic approaches of the onset of mental
disorders. However, longitudinal designs are difficult to implement and
challenging when seeking a representative sampling.
Given these problems, most epidemiologic information on mental heath is
estimated from cross-sectional samples, which is an efficient data gathering
design when trying to achieve a population representative sample. However,
this design makes impossible to follow an individual until the appearance of a
disorder. A common compromise consists in collecting disorder onset
information retrospectively, but the moment of first exposure varies from
individual to individual, as it starts at different times. As a result, for certain
observations, a part of the exposure period might occur after the interview,
so that the observed disorder onset is right-censored.
Pij yij 1
ij
( j )
ij
y ij
1 Pij
( 1 y ij )
L Pij
i 1 j 1
y ij
1 Pij
( 1 y ij )
Along with the event presence at C, we also code Tij , the age of the
individual i at the moment of occurrence of the event j, and Ai the age of the
individual at C. In this fashion, the exact moment when the event occurs is
not observed, but it is known to be in one of the m j discrete age intervals of
t jl 1 ,t jl ,...,
t jm 1,t jm . As
exposure of equal length units u, t jo ,t j 1 , t j 1 ,t j 2 ,...,
regards the exposure period, the initial observation time tj0 is the age when
individuals are first considered to be at risk of the event j, and tjm is the age
when there is no further risk of the event. The moments tj0 and tjm define the
duration of the exposure time, and can be decided on theoretical
considerations or based on sample information. The individual is not
considered to be at risk outside the exposure period, so that Tij <tjm and Dj
tjm-tj0 u mj.
Whenever the event occurs during the period of exposure, it is either up to
the age of interview, Tij [tj(l-1), tjl), Tij < Ai , or it is right-censored (Tij>Ai). For
each subject i we either we observe the event during the initial
Rij ( Ai t j 0 ) / u 1 age intervals of nj, which span from [tj0, tjm], 1< Rij < mj
or it is right censored at Ai. For the i th individual, let Oij ,Rij 1 if the event
occurs within the interval 1,Rij , and Oij ,Rij 0 if it is censored.
As eq. assumes that for any i,j , Oij ,Rij 1, without taking into account into
account the existence of Oij ,Rij 0 cases, the marginal probabilities of the
j are considered free from censorship bias:
events (i.e. lifetime prevalences)
j y j 1 N 1 i 1 P yij f 1 d
N
N 1 i 1 P y ij 1, yij ' f 1 d
NC
j y j 1 N 1
P yiC j f 1 d
i C 1
NI
y
i I 1
i I j f 1 d
ratio
NI
approaches 1. This ratio involves a nuisance parameter depending on
N I NC
the distribution of event onset times of the events as well as on the age distribution of
the individuals and the proportion of negative events at C. The effect of these
parameters is more critical as the number of joint events considered in the
prevalence estimates increases, because outcome probabilities become zero
whenever the information for any j is incomplete.
Correcting censorship bias with a pseudo partial likelihood approach
0,Rij
ij ,Rij
is
contrary, cases
t j 0 , A i
ij ,Rij
ij ,Rij
A ,t
i
jm
. In this
ij
Thus, it is necessary to correct the exact likelihood function in order to take into
account that the probability of negative cases at the C is greater or equal than that at
the end of the exposure time.
Cox (1975) introduced a method for correcting the standard maximum likelihood when
estimating time-to-event models affected by nuisance parameters (Cox, 1975, Binder,
1992,
Lin,
1994;
Lin
And
Ying,
2003).
Under
Cox
regression
model
l ( ) i T z i log( g(T% T%
i )exp( z i )
i
which proceeds by estimating for each case and then maximizing the loglikelihood equation.
Eq. describes the log-likelihood as function of the data, the parameters and
the failure status of the individual at Ci, thus yielding an unbiased estimating
equation. Such estimating functions have also received the name of
composite-likelihoods or quasi-likelihoods (Besag,1977; Prentice, 1986;
Suzuki, 1985), and are a special case of the pseudo-maximum likelihood
estimators. In this family of estimators, a deconvolution model is built where
a parameter subset is estimated by some technique other than the standard
ML, and then replaced in the standard likelihood function to be estimated
via ML.
ij (Oij ,Rij ,Rij ,mij ,S j ) Oij ,Rij Rij ,mij ,S j ( 1 Oij ,Rij ).
Notice that
( Ri j ,m j ,S j )
Rij
mj
(1 S j )
where the observed proportion of the exposure period (Rij/mj) weights Sj, the real
proportion of negative cases at the end of the exposure period. As the ratio
NI
is
NI NC
not directly observable, its influence is indirectly taken into account by means of Thus,
Sj is a nuisance parameter that can be estimated using standard survival techniques,
by projecting the prevalence of the event up to a certain arbitrary time tm=Tmax by means
of a survival function.
Under this framework, we propose the following Partial Likelihood (PL)
function:
N
L Pij
y ij ij ( b )
i 1 j 1
1 Pij
( 1 y ij ) ij ( b )
S j ( t m ) P(T t m ) P(T t k | T t k 1 )
k 1
This S j estimate is obtained using the non parametric life-table method (Klein &
Moeschberger, 2003). Estimation assumes that failures and lost cases occur, on
average, at the midpoint of the interval, so that they were at risk within the first half of
10
and the
l log ij L
i 1 j 1
N
( ),
i 1 j 1
ij
l w i log Lij
i 1 j 1
w ( ).
i 1 j 1
ij
Notice that the individual sampling weights have equal effect over the J
events, but theij weights varies across the events. Parameter estimation is
achieved by solving , and it can be obtained by solving the weighted loglikelihood equation l 0 using a Newton-Raphson method. The Partial
Log-likelihood equation is a Destimator (Kulich et al., 2012), as the
weights are dependent on failure status. The first order derivatives of the
11
2
2
The second order derivatives are l ''( ) w i ij ( ) .
i 1 j 1
12
In the second step we censored the lifetime data at the age of the
observation. Finally, the censored data was analysed according to the
aforementioned methods.
Complete lifetime data was simulated from an IRT two parameter model
with four binary indicators. As in applied settings prevalences are usually
low, we set the indicators events to have low frequency. In order to do so,
two of the events (Y1 and Y3) had their intercept fixed at =-2.0, while the
remaining two (Y2 and Y4) were fixed at =-0.5. Slopes were set according to
two different scenarios, low and highly correlated indicators, as indicated by
slopes 0.25 or 0.75. For the sake of simplicity, all indicators in each
scenario were considered to have equal slopes.
The generated binary variables indicated the absence or presence of the
event during the full observable age period nj. The observed period was
created by generating the age Ai of the individual at the time of the
interview, according to a normal age distribution (mean=46.72; sd=17.87)
as found in the ESEMeD dataset (Alonso, Angermeyer, & Bernert, 2004).
When the latent trait model generated a positive realization for an
individual, the age of onset Tij was generated by assuming that the onsets
could only occur at ages Tij =(25,35,50). Whenever the model yielded yij=1,
the onset was assigned to one of the three ages with two distributions: a)
'early' disorder: realization probabilities at each onset age were (.75, .25, .
00), resulting in an average age of onset at 27.5 years; and b) 'late'
disorder: realization probabilities were (.25, .50 , .25), with an average age
on onset of 36.25 years. Three setting conditions were simulated: 1) Early
onset, using four early disorder events Y1-Y4; 2) late onset, where Y1-Y4
13
using four late onset variables, and 2) Mixed onset: where events Y 1 and Y2
had early onset distributions while Y3 and Y4 had late onset distributions.
Complete lifetime data was transformed into a cross-sectional sample
with censored binary indicators. To determine whether the variable would be
observed, we set an exposure period of 50 years from the initial
observation for the late onset conditions and 35 year for the early onset
conditions. Cases where the model yielded yij 0 or yij1 and Tij < Ai < Tmax
the observed event at interview coincides with the true lifetime realization.
Cases where the model yielded yij 1 and Tij >Ai were assigned yij 0, and
thus censored at Ai . In each case, observation weights for individual
contribution to likelihood were computed using equation . The censored
samples were analysed using 2 different methods:
a) Raw estimation with Marginal Maximum Likelihood (MML method): IRT
modelling of censored data with using standard MML estimation.
b) Censored Model with Partial Likelihood (PL method): IRT modelling of
censored data using the Partially Weighted-Likelihood Method.
We used the population model for parameter estimation, so that the
simulation produced results for the best possible estimation scenario, where
the population model is known and correctly specified. Simulations were
conducted using 500 replicates of 5000 observations for each conditions of
the 3 x 2 x 2 simulation design (time of onset slope estimation method)
using a common seed for data generation.
The complete lifetime data generation and subsequent IRT analyses were
conducted using LatentGold 4.5 (Vermunt and Magidson, 2005) with a a
logistic link function. Outcomes were included as binomial and exposure
14
was ij (Oij ,Rij ,Rij ,mij ,S j ) . In so doing, the standard loglikelihood becomes the
loglikelihood of a multivariate Bernouilli as in with case weights generated
with . This solution could also be implemented with other parameterizations
and link functions. Estimation of the Sj weights were conducted using
actuarial survival method as implemented in SAS v9.2 PROC lifetest method.
3.2 Convergence and parameter estimates
First, we studied the convergence rate, defined as the percentage of
replications per condition that achieved convergence with Latent Gold
defaults but excluding improper solutions. As in other studies (Flora &
Curran, 2004, Forero & Maydeu-Olivares, 2009), a solution was deemed
improper whenever at least one estimated parameter was greater than or
equal to 22.35 (corresponding to a factor loading larger than 0.999) in
absolute value. Improper and non-convergent solutions were removed from
subsequent analyses.
_________________________________
INSERT TABLE 1 ABOUT HERE
__________________________________
As can be seen in table 1, PL yielded higher convergence rates in all
conditions than the direct MML method. The highest difference in
convergence rates between methods was found when = 0.75 and late
onset variables, a setting where the PL yielded 5 times more convergent and
valid replicates than the MML. In the most realistic setting, the Mixed Onset
setting, the PL achieved 1.20 times more convergence than the MML.
Recovery of model parameters was studied using parameter relative bias
defined as:
15
= 100
Re lative Bias()
( )
1/ 2
-2, and the opposite for -0.5, but it is noticeable that this was the
condition with lowest convergence rates. As for RMSE, PL showed, on the
overall, better results than MML.
16
18
to -10.92% of MML. PL also showed lower RMSEs than MML across all
conditions.
4. Empirical illustration: Lifetime Prevalence of Mental Health disorders
We illustrate the Partial Likelihood method in data from the ESEMeD
Project. ESEMeD consisted of a series of cross-sectional surveys in adult
population in six european countries to estimate the prevalence of mental
health disorders as assessed with the Composite International Diagnostic
Interview (CIDI) version 3.0 (Kessler & Ustun, 2004) and according to the
DSM-IV criteria (American Psychiatric Association, 2000). In the current
analysis, an ESEMeD subsample was used (N=8,796). Individuals were
weighted to adjust for their population representativeness. Details on
methods and the participants of the ESEMeD project can be found elsewhere
(Alonso et al., 2004c; Alonso et al., 2004a).
In this empirical study, the main outcome analysed was the presence of a
DSM-IV disorder at any time previous to the interview. Included disorders
were: major depression episode (mde), dysthymia (dys), general anxiety
disorder (gad), posttraumatic stress disorder (ptsd), agoraphobia with or
without panic (ago), specific phobia (sp), social phobia (so), panic disorder
(pd). We modelled these disorders as resulting from the expression of a
unidimensional lifetime internalising vulnerability factor (Krueger, 1999;
Krueger & Finger, 2001; Clark & Watson, 1991; Clark, 2005). The model was
scaled so that lower scores indicate higher diathesis levels (and greater
vulnerability to disorders). As the ESEMeD Surveys collected retrospective
information on the age of onset of each disorder, it was possible to
implement the PL estimation, choosing the 99 th-percentile of disorder age of
as Tmaxj. The internalising model was also fitted with observed data, but
ignoring right-censorship in no-disorder cases (MML estimation). The Survival
19
20
the average age of onset (r=0.94). As for the bivariate prevalences, the PL
yielded estimates that were on average 2.08 times above the observed
values and 1.85 times above the MML estimates. The PL tended to yield
larger prevalence bias corrections for the comorbid conditions where one or
both the disorders had a late onset, while combinations where both disorders
appear early in life showed low or moderate prevalence bias corrections.
Conversely, and as expected, the MML estimates almost did not correct the
raw prevalences, yielding very similar estimates with both the univariate
(APR=1.01) and the bivariate prevalences (APR= 1.12) very close to 1
between the MML and raw prevalence estimates.
5. Discussion
Modelling the lifetime presence of disorders is of chief interest in the field
of psychiatric epidemiology. Diathesis models of psychopathology,
developmental studies of symptom appearance, and general etiopathogenic
models can benefit from latent trait modelling. However, the limitations of
longitudinal designs for obtaining representative samples and the cost of
long-term prospective data gathering, greatly limit the scope of these
studies. The more frequent cross-sectional mixed-aged samples designs are
limited by an inevitable right-censorship causing downward bias in lifetime
prevalence estimates. Even though the usual survival modelling framework
allows correcting this bias, it impedes estimating the variables jointly with
underlying factors.
In this paper, we propose estimating IRT models of lifetime, censored
outcomes arising from cross-sectional survey designs using a partial
likelihood estimation method that corrects censorship bias. The PL
methodology is the result of combining a non-parametric survival approach
for estimating an unbiased prevalence parameter (Kessler et al, 2005) with
21
ML latent trait estimation on censored data (Muthn, 1989; Shi & Lee, 1997)
to construct a pseudo maximum likelihood estimating function (Gong and
Smaniego, 1981). On a first step, the nuisance parameter (prevalence at the
interview time) is estimated and held fixed; the second step involves ML
estimation of a model underlying the observed variables. The simulation
results and the applied example in ESEMeD data indicate that the PL
methodology yields higher prevalences than other methods. This is a
warning for applied researchers in the area of psychiatric epidemiology:
current lifetime prevalences of psychiatric disorders are probably
understated. More importantly, lifetime comorbidity in cross sectional
designs is most likely underestimated. Such problem has been identified in
the literature, (Kraemer, Wilson and Hayward, 2006; Eaton, 2006; Cerda,
Sagdeo and Galea, 2008), even though its magnitude has been seldom
quantified (Kessler, Berglund et al. 2003; Kessler & Merikangas, 2004).
Theories of lifetime development of psychiatric disorders should take this
into account.
The method that we have set forth here is similar to the pseudo partial
maximum likelihood methodology used in a survival analysis framework with
observed variables (Andersen et al. 1993; Breslow & Wellner, 2007; Borgan,
Langholz, Samuelsen, Goldstein, & Pogoda, 2000). The main difference
between the survival analysis estimates and our proposal is the status of the
explanatory variables; whereas Cox method proposed a regression survival
function based on a single observed variables, in our case the observed
variables depend on a latent trait, as given by . To our knowledge, this is the
first time that such methodology is applied to models with unobserved
factors to simultaneously estimate lifetime prevalences and modelling the
relationships between the observations.
22
23
case, PL yielded closer estimates of real frequencies. In both cases, the worst
estimation scenario was the case of high likelihood of censorship of
infrequent events with low intercorrelations, a case which is already
described in the literature (Forero & Maydeu-Olivares, 2009). In this scenario,
both methods failed when recovering slopes, but the PL still yielded good
recovery of intercepts. Even in this case, PL recovered population
frequencies more precisely, however model estimates zero slopes for all
outcomes and thus, the substantive interpretation would be that the
measures are totally independent. However, MML would greatly overestimate
the relationships among the events.
When we used the PL methodology on the ESEMeD data, it showed
substantial corrections in the univariate and bivariate frequencies of mental
disorders. As expected, bivariate corrections for censorship tended to be, on
average, larger than the univariate frequencies, as in the comorbid
conditions a case is censored whenever either of the disorders is censored.
The PL estimates also yielded corrections for the survival projections of
prevalence for the univariate frequencies. The PL bias corrections show a
picture where lifetime presence of disorders is even more frequent than
reported in the literature even in the case well-known disorders, such as
Major Depression or Generalized Anxiety. This is an issue of great
importance, as to our knowledge no longitudinal cohort has retrieved lifetime
prevalence estimates for the entire individuals life expectancy. Moffit, Caspi
et al. (2010) found in the Dunedin cohort that lifetime comorbid prevalence
up to 32 years old doubles the estimates in retrospective studies up to the
same age. Our estimates coincide in that lifetime comorbidity has across
disorders about twice the prevalence than raw estimates. Even though the
retrospective data gathering design does not allow establishing causality as
24
25
26