Академический Документы
Профессиональный Документы
Культура Документы
Abstract
Objective: The purpose of this study was to assess the relative accuracies of the Conners Brief Rating Scale, Parent Version,
the Conners Continuous Performance Test II (CPT II), and a novel interactive game called Groundskeeper to discriminate
child psychiatric patients with and without attention-deficit/hyperactivity disorder (ADHD).
Methods: We administered the three assessments to 113 clinically referred ADHD and non-ADHD patients who had been
diagnosed with the Kiddie-Schedule of Affective Disorders and Schizophrenia- Present and Lifetime (K-SADS-PL), Version 19.
Results: As measured by the area under the curve (AUC) statistic from receiver operating characteristic (ROC) analysis, the
diagnostic accuracy of Groundskeeper (0.79) was as high as the accuracy of the Conners parent rating of inattention (0.76)
and better than the CPT II percent correct (0.62). Combining the three tests produced an AUC of 0.87. Correlations among the
three measures were small and, mostly, not significant.
Conclusions: Our finding of similar diagnostic accuracies between Groundskeeper and the Conners inattention scale is
especially remarkable given that the Conners inattention scale shares method variance with the diagnostic process. Although
our work is preliminary, it suggests that computer games may be useful in the diagnostic process. This provides an important
direction for research, given the objectivity of such measures and the fact that computer games are well tolerated by youth.
Keywords: ADHD, gaming, decision-making, diagnosis, biomarker
Introduction
ttention-deficit/hyperactivity disorder (ADHD) is diagnosed by evaluating symptoms of hyperactivity, impulsivity, and inattention, and impaired functioning across settings. The
diagnosis of ADHD shows considerable levels of concurrent and
predictive validity in its clinical features, course, neurobiology, and
treatment response (Faraone et al. 2000; Faraone 2005). The diagnosis has high diagnostic reliability, one of the highest in the
Diagnostic and Statistical Manual of Mental Disorders, 5th ed.
(DSM-5) (American Psychiatric Association 2013; Regier et al.
2013). Nevertheless, concerns about diagnostic accuracy persist.
Some suggest that the use of subjective diagnostic procedures may
lead to the overdiagnosis of ADHD in the community (Bruchmuller
et al. 2012; Visser et al. 2014) (although see Sciutto and Eisenberg
[2007] for a contrary view). The diagnosis has been called sub-
Departments of Psychiatry and of Neuroscience and Physiology, SUNY Upstate Medical University, Syracuse, New York.
K.G. Jebsen Center for Research on Neuropsychiatric Disorders, Department of Biomedicine, University of Bergen, Bergen, Norway.
3
Icahn School of Medicine at Mount Sinai, New York, New York.
4
Department of Psychology, Syracuse University, Syracuse, New York.
5
Department of Psychiatry and Child and Adolescent Psychiatry, New York University Langone School of Medicine, New York, New York.
6
CogCubed, Minneapolis, Minnesota.
Funding: Dr. Faraone is supported by the K.G. Jebsen Center for Research on Neuropsychiatric Disorders, University of Bergen, Bergen, Norway, the
European Unions Seventh Framework Program for research, technological development, and demonstration under grant agreement no 602805.
2
672
673
n in control
group
n in ADHD
group
45
36
8
2
10
33
27
11
6
23
674
FARAONE ET AL.
FIG. 1.
675
Factor 8
Factor 10
Movement, incorrect
Omissions
Reaction time, correct
Reaction rate, correct
Reaction rate, correct, 27
Reaction rate, correct, 811
Reaction rate, Incorrect, 27
Incorrect, S, V, A distractors
Performance
Reaction time, correct, 27
Reaction time, Correct, 811
Reaction time, Incorrect, 811
-0.3002
0.2579
0.3403
0.1249
0.7368
0.0490
0.3729
0.2566
-0.2755
0.7544
0.2150
-0.0418
0.0229
-0.0475
0.1154
0.2645
0.1026
0.7961
-0.0655
-0.0722
-0.4649
0.1804
0.6856
0.3437
676
To clarify the degree to which the three logistic models based on
the Groundskeeper, Conners, and CPT identified the same cases,
we used each of the corresponding logistic regression models to
compute the probability of each participant having K-SADSdiagnosed ADHD. We took a median split of these predicted
probabilities and classified participants above the median as ADHD
and those below the median as not ADHD. Only 14% of participants were predicted to have ADHD by the three methods. In this
group, 79% were diagnosed with ADHD. Only 18% of participants
were predicted to not have ADHD by all three models. In this
group, 16% were diagnosed with ADHD. The j coefficients of
agreement were 0.15 for Groundskeeper versus Conners (z = 1.6,
p = 0.06), 0.18 for Groundskeeper versus CPT (z = 1.9, p = 0.9), and
0.3 for Conners versus CPT (z = 3.2, p = 0.0007).
Discussion
We found that the Groundskeeper game can significantly discriminate ADHD patients from other psychiatric patients. As
measured by the AUC statistic from ROC analysis, the diagnostic
accuracy of Groundskeeper (0.79) was as high as the accuracy of
the Conners parent rating scale (0.76), which is used as a screening
or adjunct diagnostic tool for the diagnosis of ADHD.
The Conners and Groundskeeper models had similar levels of
diagnostic accuracy. Both predicted ADHD diagnoses more accurately than the CPT. The similar diagnostic accuracies between
Groundskeeper and the Conners is remarkable, given that the
questions asked of parents during the diagnostic interview are
similar to the questions asked of the parent by the Conners form.
Both require subjective reports of ADHD symptoms and share
method variance. In contrast, Groundskeeper is performed by the
child with no parent involvement. Therefore, it is encouraging that
an objective measure (Groundskeeper) is as accurate as subjective
assessments of symptom criteria.
Although the AUCs for Groundskeeper and the Conners rating
scales were of similar magnitude, Figure 2 shows that their tradeoffs
between sensitivity and specificity differ in clinically important
ways. Groundskeeper maintains a false positive rate of zero for a
sensitivity of 37%, a positive predictive power of 100%, and a
negative predictive power of 53%. For the Conners to achieve a
sensitivity of 37%, the false positive rate would rise to 9.1%, the
positive predictive power would be 85%, and the negative predictive
power would be 51%. In Figure 2, this difference is seen as the
Groundskeeper ROC being skewed toward the left side of the graph,
whereas the Conners ROC is spread more evenly throughout the
graph. This means that the Conners will be a better test for ADHD if
a higher false positive rate can be tolerated. For example, the graphs
show that, at a false positive rate of 25%, the Conners has a sensitivity of 75%, a positive predictive power of 81%, and a negative
predictive power of 69%; Groundskeeper has a sensitivity of 65%, a
positive predictive power of 75%, and a negative predictive power
of 65%. For screening tests, a high false positive rate can be tolerated if the costs of a second stage confirmation test are low, but if the
costs are high, then the low false positive rate of Groundskeeper is
preferred. A low false positive rate is essential for clinicians seeking
to confirm an ADHD diagnosis about which they are uncertain,
suggesting that Groundskeeper could be used for diagnostic confirmation. For example, in settings such as college health clinics,
where the misuse and diversion of ADHD medications is a major
concern, having a means of eliminating false positives would be
very important. These examples are only illustrative. It would be
premature to suggest specific cut-points for clinical practice.
FARAONE ET AL.
Groundskeeper should not be used to diagnose ADHD independently from a clinical diagnosis.
The correlations among the Groundskeeper, Conners, and CPT
scores were mostly low and not significant. Consistent with this, we
found low kappa coefficients of agreement between each models
predictions of ADHD diagnoses. This lack of shared variance is
probably because of the unique method variance for each method,
the imperfect reliability of each method, and the possibility that
each is sensitive to different components of the ADHD syndrome.
Consistent with this latter interpretation, each score domain remained significant when all were included in the same model.
Future work should address the possibility that multimodal
assessments of ADHD are needed to create a highly accurate objective diagnostic tool.
Our work has limitations. We used a medicated, psychiatric
control group, which we presumed would be relatively difficult to
differentiate from ADHD compared with healthy controls. Use of
the latter would likely lead to better diagnostic accuracy statistics.
Because this was a preliminary study, we used a wide age range so
that we could examine age effects. Although we found no effects of
age on diagnostic accuracy, our sample was too small to detect
small effects. The sample was also small relative to the number of
variables analyzed. This required us to use factor analysis to limit
our statistical tests. Because most research participants were taking
medications of many different types, we cannot rule out medication
effects completely and cannot be certain that our results will generalize to samples with a different profile of medication use. We
also excluded youth with conduct disorder and tics, which further
reduces the generalizability of our findings. Because the CPT and
Groundskeeper tests were not counterbalanced, results could have
been biased by sequence effects. Moreover, Groundskeeper benefited from the use of multiple variables derived from the test
whereas for the CPT, we only used the percent correct, a commonly
used index for the version we used. Our results may not generalize
to other CPTs or to more complex analyses of CPT data. For these
reasons, our results must be considered tentative until cross validated in an independent sample. We do not know if our results will
generalize to adults with ADHD or if Groundskeeper will be useful
for studying treatment effects over time. Future work must address
these issues.
Conclusions
Despite these limitations, our results are encouraging. We
demonstrated good discriminative ability when comparing ADHD
patients with other psychiatric patients. The discriminative ability
of Groundskeeper will likely be much better for discriminating
ADHD patients from subjects not having psychiatric disorders.
Future work should examine this possibility and should also determine if a revision to Groundskeeper or the application of different algorithms will be able to improve its diagnostic accuracy.
Clinical Significance
Although Groundskeeper is not ready to be used as a diagnostic
tool, this work foreshadows to clinicians a new genre of clinical
tool that is in development. For a diagnostic tool to be useful, it
must be accepted by patients and must be engaging enough to
assure that valid data are obtained. Clinicians also need to be aware
that the ability of a tool to be successful depends on the diagnostic
context. We have shown here that contrasting ADHD with other
disorders is a difficult task, which suggests that future uses of
Groundskeeper address a different diagnostic issue.
677
678
FARAONE ET AL.
Sumner CR, Haynes VS, Teicher MH, Newcorn JH: Does placebo response
differ between objective and subjective measures in children with
attention-deficit/hyperactivity disorder? Postgrad Med 122:5261, 2010.
Swets JA: Form of empirical ROCs in discrimination and diagnostic
tasks: Implications for theory and measurement of performance.
Psychol Bull 99:181198, 1986a.
Swets JA: Indices of discrimination or diagnostic accuracy: Their
ROCs and implied models. Psychol Bull 99:110117, 1986b.
Swets JA: Sensitivities and specificities of diagnostic tests. JAMA
248:548549, 1982.
Swets JA, Pickett RM. Evaluation of Diagnostic Systems: Methods
From Signal Detection Theory. New York: Academic Press; 1982.
Thome J, Ehlis AC, Fallgatter AJ, Krauel K, Lange KW, Riederer P,
Romanos M, Taurines R, Tucha O, Uzbekov M, Gerlach M: Biomarkers
for attention-deficit/hyperactivity disorder (ADHD). A consensus report
of the WFSBP task force on biological markers and the World Federation of ADHD. World J Biol Psychiatry 13:379400, 2012.
Toplak ME, Dockstader C, Tannock R: Temporal information processing in ADHD: Findings to date and new methods. J Neurosci
Methods 151:1529, 2006.
Van der Oord S, Ponsioen AJ, Geurts HM, Brink EL, Prins PJ: A pilot
study of the efficacy of a computerized executive functioning remediation training with game elements for children with ADHD in
an outpatient setting: Outcome on parent- and teacher-rated executive functioning and ADHD behavior. J Atten Disord 18:699712,
2014.
Visser SN, Danielson ML, Bitsko RH, Holbrook JR, Kogan MD,
Ghandour RM, Perou R, Blumberg SJ: Trends in the parent-report
of health care provider-diagnosed and medicated attention-deficit/
hyperactivity disorder: United States, 20032011. J Am Acad Child
Adolesc Psychiatry 53:3446 e32, 2014.
Appendix A
This script is read to each participant prior to game administration.
Today you will be playing a game called Groundskeeper. It will
be used to measure your attention. The goal is to move the mallet
cube to the left or right side of the cube showing an image of a
gopher. When you have a correct hit, you will hear a boink noise.
There will be no noise for an incorrect hit. Keep in mind you only
want to hit the gopher! Do not let other images, like birds or rabbits,
or noises distract you from this task.
There will be a short pause between each level. The first level
that you play is for practice and will not be counted against your
final score. After each level, a voice will instruct you to move the
cubes to colored dots on the game board. Note that the mallet cube
should not be moved to these dots.
Be sure to use two hands and hold the cubes together until
you hear a sound. If you do not hear a boink sound, you do not
get a point. Move as quickly as possible!
Appendix B
Summary of Groundskeeper Sessions
Each session is 1.5 minutes with 20 seconds between sessions. There will be a 10 second countdown on the screen
prior to each session. Program will automatically progress to
the next session without intervention by person conducting the test. Total run time is 24 minutes (plus 5 minutes for
pauses).
Session 1: Screen shot of gopher, groundskeeper, or grass
(neutral) presented for 1, 2, or 3 seconds at a random frequency.
Go/no go task.
679
680
NA
correct
incorrect correct
incorrect
incorrect correct
error omission
incorrect correct
eval_id
mCRR
correct_reaction
incorrect_reaction
All
correctdisplay
100
correct correctdisplay
error_omission
offtilt_reaction
All
incorrect
100
imagedraws correctdisplay
incorrect
Reaction time
Movement
All
Reaction time
Omission
Incorrect
Correct
Movement
Counter
Movement
Movement
Movement
All
All
All
All
correct
100
correctdisplay
TiltX+TiltY+TiltZ
NA
All
correct
Movement
VirtualTicks
f (TiltZ)
m = TiltFlip, n = TiltFlipBack
All
TiltY = TiltUp+TiltDown
TiltY
TiltZ
All
Omission
All
TiltX = TiltLeft+TiltRight
Incorrect
Identifier
Correct
Category
All
All
All
Level
TiltX
mEOM
mIRR
Formula
Variable
(continued)
Description
681
correct_reaction_
movement
incorrect_reaction_
movement
omission_movement
mNR2_7
mCT2_7
mCR2_7
mICRRate2_7
mCRRate2_7
incorrect_speed_variance
correct_speed_variance
Formula
neighbor_reaction
ontilt_reaction
Variable
Incorrect
27
27
27
Movement
Reaction time
Reaction time
Correct
27
27
Movement
Movement
All
All
Movement
All
Movement
All
Movement
Reaction time
All
All
Movement
Category
All
Level
Appendix C. (Continued)
(continued)
Description
682
mICR2_7
mOTR2_7
mONTR2_7
mMMO2_7
mAOM2_7
mCRRate8_11
mNR8_11
mCT8_11
mCR8_11
mICR8_11
mOTR8_11
mONTR8_11
mMMO8_11
mAOM8_11
mICRRate8_11
Formula
Variable
811
811
811
811
811
811
811
Movement
Movement
Movement
Movement
Reaction time
Movement
Reaction time
Reaction time
Incorrect
811
811
Movement
Movement
Correct
Movement
27
27
27
811
Movement
Reaction time
Category
27
27
Level
Appendix C. (Continued)
Average of ontilt_reaction
Average of offtilt_reaction
Average of ontilt_reaction
Average of offtilt_reaction
Description
(continued)
683
mEOM4=mEOM5
mCRR4=mCRR5
ASC_AFC_O
45
45
mCRR4=mCRR5
mIRR4=mIRR5
ASC_AFC
Distraction
23
Distraction
Distraction
Distraction
Movement
1215
23
VSC_VFC
mMRATIO12_15
Movement
Movement
Movement
Movement
1215
1215
1315
1215
mEOM2=mEOM3
mCRR2=mCRR3
mONTR12_15
Movement
1215
VSC_VFC_O
mOTR12_15
Reaction time
Movement
Reaction time
Reaction time
1215
TiltX+TiltY+TiltZ
TiltX+TiltY+TiltZ
mMMO13 15
mAOM13 15
mMMO12 15
mAOM12 15
mCRR2=mCRR3
mIRR2=mIRR3
mICR12_15
1215
1215
mMMO12_15
mAOM12_15
mMRATIO13_15
TiltX+TiltY+TiltZ
+m < VirtualTicks < n f (VirtualTicks)
n
n Responsecorrect m NewImageld
mCT12_15
mCR12_15
1215
Incorrect
1215
Correct
1215
correct
incorrect correct
incorrect
incorrect correct
Category
Level
Formula
mNR12_15
mICRRate12_15
mCRRate12_15
Variable
Appendix C. (Continued)
(continued)
Average of ontilt_reaction
Average of offtilt_reaction
Description
684
2,3,6,7
2,3,6,7
((mCRR2=mCRR6)=(mIRR2=mIRR6
((mCRR3=mCRR7)=(mIRR3=mIRR7
((mCRR2=mCRR6)=(mEOM2=mEOM6))
((mCRR3=mCRR7)=(mEOM3=mEOM7))
((mCRR2=mCRR6)=(mCRR3=mCRR7))
((mIRR2=mIRR6)=(mIRR3=mIRR7))
((mEOM2=mEOM6)=(mEOM3=mEOM7))
((mCRR2=mCRR6)=(mCRR3=mCRR7))
((mCRR4=mCRR6)=(mCRR5=mCRR7))
((mIRR4=mIRR6)=(mIRR5=mIRR7))
((mEOM4=mEOM6)=(mEOM5=mEOM7))
((mCRR4=mCRR6)=(mCRR5=mCRR7))
LA2TSC_LA2TFC_
HA2TSC_HA2TFC
LA2TSC_LA2TFC_
HA2TSC_HA2TFC_O
CVSC_CVFC
CVSC_CVFC_O
CASC_CAFC
CASC_CAFC_O
LearningCurve_O
LearningCurve
CSSC_O
CSSC
SSC_SFC_ICR
(mEOM11=mEOM15)
(mCRR11=mCRR15)
mCRR16
1 100
mCRR1
mEOM16
1 100
mEOM1
Learning curve
Learning curve
1,16
1,16
Distraction
Distraction
Distraction
11,15
11,15
6,11
Distraction
6,11
SSC_SFC_CR
Distraction
6,11
mNR6
mNR11
mCR6
mCR11
mICR6
mICR11
(mCRR11=mCRR15)
(mIRR11=mIRR15)
SSC_SFC_NR
Distraction
6,11
(mEOM6=mEOM11)
(mCRR6=mCRR11)
Distraction
SSC_SFC_O
6,11
(mCRR6=mCRR11)
(mIRR6=mIRR11)
Distraction
Distraction
Distraction
Distraction
Distraction
Distraction
Category
SSC_SFC
4,5,6,7
4,5,6,7
2,3,6,7
2,3,6,7
Level
Formula
Variable
Appendix C. (Continued)
(continued)
Description
685
K
NA
NA
NA
NA
NA
NA
NA
NA
Gender
Age
Inattention
Hyperactivity
Combined
Depression
Autism
Anxiety
1 or 0
cubepressed
doublehit
1 or 0
TiltX+TiltY+TiltZ
TiltX+TiltY+TiltZ
K
Formula
mCT16
1 100
mCT1
mNR16
1 100
mNR1
mCR16
1 100
mCR11
mICR16
1 100
mICR1
topbottomhit
mallet_movement_only
all_other_movement
subsecond_response_total
LearningCurve_ICR
LearningCurve_CR
LearningCurve_NR
LearningCurve_CT
Variable
NA
NA
NA
NA
NA
NA
NA
NA
All
All
All
All
All
All
1,16
1,16
Demographic
Demographic
Condition
Condition
Condition
Condition
Condition
Condition
Auxiliary movement
Auxiliary movement
Auxiliary movement
Movement
Movement
Reaction time
Learning curve
Learning curve
Learning curve
Learning curve
1,16
1,16
Category
Level
Appendix C. (Continued)
Description