Академический Документы
Профессиональный Документы
Культура Документы
http://informahealthcare.com/ahb
ISSN: 0301-4460 (print), 1464-5033 (electronic)
Ann Hum Biol, 2015; 42(4): 368378
! 2015 Informa UK Ltd. DOI: 10.3109/03014460.2015.1045430
RESEARCH PAPER
Abstract
Keywords
Background: Multivariate ordinal categorical data have figured prominently in the age
estimation literature. Unfortunately, the osteological and dental age estimation literature is
often disconnected from the statistical literature that provides the underpinnings for rationale
analyses.
Aim: The aim of the study is to provide an analytical basis for age estimation using multiple
ordinal categorical traits.
Subjects and methods: Data on ectocranial suture closure from 1152 individuals are analysed in a
multivariate cumulative probit model fit using a Markov Chain Monte Carlo (MCMC) method.
Results: Twenty-six parameters in a five variable analysis are estimated, including the 10 unique
elements of the five five correlation matrix. The correlation matrix differs substantially from
the identity matrix one would assume under conditional independence among the sutures.
Conclusion: While the assumption of conditional independence between traits greatly simplifies
the use of parametric models in age estimation, this assumption is not a necessary step.
Further, in the analysis discussed here there are considerable residual correlations between
ectocranial suture closure scores even after regressing out the effect of age.
Introduction
Following the publication of the Rostock Volume on
paleodemography (Hoppa & Vaupel, 2002) and particularly
the contained chapter on transition analysis (Boldsen
et al., 2002), there has been considerable interest in applying
parametric models to ordinal categorical data (Cardoso et al.,
2010; DiGangi et al., 2009; Franklin & Flavel, 2015; Godde
& Hens, 2012, 2015; Gregersen et al., 2006; Kimmerle et al.,
2008; Konigsberg et al., 2008; Langley-Shirley & Jantz,
2010; Milner & Boldsen, 2012; Passalacqua, 2013; Prince
et al., 2008; Ros et al., 2008; Shirley & Jantz, 2011;
Thevissen et al., 2010). There had certainly been considerable historical precedent for such age estimation models
(Fanning, 1961; Holman and Jones, 1998; Konigsberg &
Holman, 1999; Kuykendall et al., 1992; Moorrees et al.,
1963a,b), but the new millennium brought with it the
realisation that the analysis of ordinal categorical age
indicators could become routine. Now that transition
analysis has become more common it will be useful to
review the method with a particular eye to applications
involving multiple indicators.
The following makes extensive use of ectocranial suture
closure data to demonstrate some of the properties of age
History
Received 22 April 2015
Accepted 23 April 2015
Published online 20 July 2015
The sample
The following analysis makes use of ectocranial suture
closure data from 1152 individuals. Of these individuals,
363 are males from McKern & Stewarts (1957) Korean War
dead study and the data consist of observations extracted from
Keysort Card No. 1. The remaining 789 individuals are
males and females from the Robert J. Terry Anatomical
Collection (Hunt & Albanese, 2005). For the McKern and
Stewart data ages-at-death were obtained by converting the
DOI: 10.3109/03014460.2015.1045430
birth and death dates to decimal years (Pearl & Miner, 1932;
McVarish, 1962; Toops, 1922) and subtracting the birth from
the death date. For the Terry Collection the recorded
chronological age in years was used as the age-at-death.
Assuming a Gompertz mortality model (Finch & Pike, 1996;
Pletcher, 1999; Wilson, 1993) starting at age 15 years, the a3
and b3 parameters for the combined Korean War dead and
Terry samples are 0.025 and 0.018, respectively. The
numbering of the two parameters is in reference to the third
(senescent) component of the Siler mortality model (Gage,
1991; Gage & Dyke, 1986; Siler, 1979; Wood et al., 2002).
The analyses in this paper make use of five suture sites,
these being the mid-lambdoid suture, the sagittal/lambdoid
suture at lambda, the anterior part of the sagittal suture, the
coronal/sagittal suture at bregma and the sphenofrontal
suture. The McKern and Stewart data was originally coded
by them as 0 for open, 1 for one-quarter closed, 2 for
half closed, 3 for three-quarters closed and 4 for closed.
The Terry sample data were collected using Meindl and
Lovejoys (1985) scale of open, minimal, significant
and closed. Given the definitions of stages in McKern and
Stewart and in Meindl and Lovejoy, the latters open
corresponds to McKern and Stewarts 0, minimal corresponds to McKern and Stewarts 1 and 2, significant
corresponds to McKern and Stewarts 3 and closed
corresponds to their 4. For the first four sutures a further
reduction was made so that these sutures were scored as
open, minimal and significant through closed. The
fifth suture was maintained as four separate stages. The
collapsed scoring for the first four sutures was based on
goodness-of-fit tests for the cumulative probit model
(described below in the section on Does the cumulative
probit model fit?).
369
Count
Mean
SD
Median
Range
313
150
116
162
169
242
26.5
33.7
41.1
47.0
49.0
56.8
12.1
15.7
17.3
19.7
18.2
17.6
22.0
28.0
36.0
45.0
48.0
55.5
1495
17.786
19102
17101
17.695
21.795
277
151
125
160
166
273
33.3
37.9
41.4
43.4
46.3
50.0
12.5
11.8
12.7
12.3
12.6
11.1
31.1
37.8
41.2
43.3
45.8
50.7
15.173.6
16.368.7
15.673.7
15.475.5
16.382.5
19.377
L. W. Konigsberg
1.0
0.8
0.6
Probability
0.4
0.2
0.0
50
50
100
150
Age
0.012
0.010
12
23
0.008
0.006
0.304
Density
Py 1j x 1:037 0:031 x
0.004
Py 2j x 1:754 0:031 x
0.002
0.000
370
50
50
100
150
Age
DOI: 10.3109/03014460.2015.1045430
0.020
0.010
0.005
Density
0.015
12
23
0.000
0
4-stages
3-stages
pseudo R2
mid-lambdoid
Lambda
anterior sagittal
Bregma
sphenofrontal
50.0001
50.0001
50.0001
0.0006
0.4394
0.5618
0.2190
0.1088
0.8213
0.290
0.294
0.218
0.327
0.346
35.0
20
40
60
80
100
Age
Suture
19.2
10.5
371
Figure 3. A re-drawing of the model depicted in Figure 2 but using lognormal transitions rather than normal transitions. The modal age for the
first transition (19.2) is indicated, as are the left and right boundaries for
the 50% highest posterior density region.
372
L. W. Konigsberg
Table 3. Slopes (zero-intercept model) and thresholds for the multivariate cumulative probit model using five sutures. T1 is the threshold
value between open and minimal, T2 is the threshold value between
minimal and significant and T3 is the threshold value between
significant and closed. For the first four sutures T2 and T3 are
shown as equal because significant and closed were not distinguished.
Suture
Slope
T1
T2
T3
mid-lambdoid (S1)
lambda (S2)
anterior sagittal (S3)
bregma (S4)
sphenofrontal (S5)
1.457
1.414
1.142
1.556
1.734
5.045
4.721
3.555
5.442
6.369
5.775
5.325
4.086
6.117
6.713
5.775
5.325
4.086
6.117
7.191
S2
S3
S4
S5
S1
S2
S3
S4
0.837
0.699
0.656
0.466
0.785
0.708
0.441
0.825
0.506
0.564
DOI: 10.3109/03014460.2015.1045430
(b)
(a)
373
significant
minimal
open
Latent trait
Latent trait
significant
minimal
open
10
20
30
40
50
60
10
20
Age
30
40
50
60
Age
Figure 4. (a) Latent trait representation of the model from Figure 3. The curved line represents the regression of the mean latent trait value on the
logarithm of age. This line has an intercept of 0 on the log scale (1 year on the raw scale). (b) A re-drawing of (a) that uses zero for the first threshold
and consequently has a non-zero intercept for the regression line.
0.020
closed
S1 = 3, S2 = 3, S3 = 3, S4 = 3, S5 = 1
independent
S5
dependent
Density
S4
0.005
open
0.010
0.015
S
M
6
5
4
Sphenofrontal suture
S2
S1
open
significant
minimal
0.000
S3
Midlambdoid suture
20
40
60
80
100
120
Age
374
L. W. Konigsberg
DOI: 10.3109/03014460.2015.1045430
375
Table 5. Realised coverage under an informative (Gompertz) prior and two uniform priors for age (15120 years and 1585 years). The p
values are from the exact binomial test.
Informative prior
Expected count
Realised count
p Value
Realised count
p Value
Realised count
p Value
50
100
150
200
250
300
350
400
450
36
85
146
205
251
303
345
382
437
0.037
0.105
0.773
0.648
0.964
0.820
0.626
0.050
0.062
38
70
120
168
207
253
301
376
460
0.074
0.001
0.003
0.003
50.001
50.001
50.001
0.009
0.156
35
71
99
167
210
284
252
408
464
0.025
0.001
50.001
0.003
50.001
0.144
0.884
0.402
0.037
Discussion
This paper has shown that it is indeed feasible to analyse
multivariate ordinal categorical age indicator data while
accounting for the potentially non-zero residual correlations
between indicators. Although direct optimisation of the
likelihood in this setting is a perilous and slow process, the
optimisation can be replaced using Markov Chain Monte
Carlo methods. In the current example, the first principal
50
Age
60
70
80
40
30
10%
20%
30%
40%
50%
60%
70%
80%
90%
20
Coverage
100
200
300
400
500
Index
Figure 7. Fifty per cent highest posterior densities (HPD) of age for the
500 cases in the simulation sample. The heavy line shows the (sorted)
ages for the 500 individuals. The vertical dotted lines are instances where
the 50% HPDs do not include the age while the solid lines are instances
where the age is included. Vertical lines are shown only for every other
case. This figure uses an informative Gompertz prior for age.
376
L. W. Konigsberg
50
20
30
40
Age
60
70
80
100
200
300
400
500
Acknowledgements
I thank the co-editors of this issue for their invitation to participate and
the anonymous reviewers for their helpful comments on a previous draft.
80
Declaration of interest
50
References
30
40
Age
60
70
20
Index
As an example of the former, we can consider any publications that use a reference sample with a non-uniform age
distribution to provide summary statistics for age distributions
within stages. Unless the relationship between age and the
indicators is strong, such studies will contain an inherent
bias from the reference sample age distribution as BocquetAppel & Masset (1982) previously showed. As an example of
the latter problem of providing programs without access to
raw data, we can consider the ADBOU program. Although the
ADBOU software (Milner & Boldsen, 2012) is now readily
available (it can be downloaded from http://math.mercyhurst.edu/~sousley/Software/), the reference sample data is not
available with the software. This is unfortunate as there are
alternative methods of analysis that researchers may want to
consider in the future. Ultimately, to be of greatest utility,
researchers need to have access to source code for compiled
programs or scripts for interpreted languages. They also need
to have access to the raw reference sample data. It is for this
reason that all of the code and data used in this paper are
available for download from tinyurl.com/n7eagfp.
100
200
300
400
500
Index
Much effort has been devoted in the past to collecting reference sample information and in defining new
scoring systems and methods for age estimation.
Unfortunately, considerably less effort has been expended in
developing the statistical and probabilistic basis for estimation
of individual ages and estimation of hazard parameters for
osteological or dental samples. Equally unfortunate is that
there has been a general disconnect between the studies
that focus on osteological or dental scoring methods and those
that focus on the appropriate methods for analysing the
resultant data. This disconnect is also apparent in the
publication of summary statistics that cannot be appropriately
applied in actual analyses and in the production of software
that does not allow access to the raw reference sample data.
Acharya AB, Bhowmik B, Naikmasur VG. 2014. Accuracy of identifying juvenile/adult status from third molar development using prediction probabilities derived from logistic regression analysis. J Forens
Sci 59:665670.
Aitchison J, Silvey SD. 1957. The generalization of probit analysis to the
case of multiple reponses. Biometrika 44:131140.
Albert JH, Chib S. 1993. Bayesian analysis of binary and polychotomous
response data. J Am Stat Assoc 88:669679.
Anderson MF, Anderson DT, Wescott DJ. 2010. Estimation of adult
skeletal age-at-death using the Sugeno fuzzy integral. Am J Phys
Anthropol 142:3041.
Becker WE, Kennedy PE. 1992. A graphical exposition of the ordered
probit. Econometr Theory 8:127131.
Bedford ME, Russell KF, Lovejoy CO, Meindl RS, Simpson SW,
Stuart-Macadam PL. 1993. Test of the multifactorial aging method
using skeletons with known ages- at-death from the Grant collection.
Am J Phys Anthropol 91:287297.
Bocquet-Appel J-P, Masset C. 1982. Farewell to paleodemography.
J Hum Evol 11:321333.
Bocquet-Appel J-P, Masset C. 1985. Paleodemography: resurrection or
ghost? J Hum Evol 14:107111.
Bocquet-Appel JP, Masset C. 1996. Paleodemography: expectancy and
false hope. Am J Phys Anthropol 99:571583.
Boldsen JL, Milner GR, Konigsberg LW, Wood JM. 2002. Transition
analysis: a new method for estimating age from skeletons. In: Hoppa
RD, Vaupel JW, editors. Paleodemography: age distributions from
skeletal samples. New York, NY: Cambridge University Press.
p 73106.
Buckberry JL, Chamberlain AT. 2002. Age estimation from the auricular
surface of the ilium: a revised method. Am J Phys Anthropol 119:
231239.
Cardoso HF, Heuze Y, Julio P. 2010. Secular change in the timing of
dental root maturation in Portuguese boys and girls. Am J Hum Biol
22:791800.
DOI: 10.3109/03014460.2015.1045430
377
378
L. W. Konigsberg