Академический Документы
Профессиональный Документы
Культура Документы
Your use of the JSTOR archive indicates your acceptance of JSTOR's Terms and Conditions of Use, available at
http://www.jstor.org/page/info/about/policies/terms.jsp. JSTOR's Terms and Conditions of Use provides, in part, that unless
you have obtained prior permission, you may not download an entire issue of a journal or multiple copies of articles, and you
may use content in the JSTOR archive only for your personal, non-commercial use.
Please contact the publisher regarding any further use of this work. Publisher contact information may be obtained at
http://www.jstor.org/action/showPublisher?publisherCode=springer.
Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed
page of such transmission.
JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of
content in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms
of scholarship. For more information about JSTOR, please contact support@jstor.org.
Springer is collaborating with JSTOR to digitize, preserve and extend access to Educational Technology
Research and Development.
http://www.jstor.org
A Five-DimensionalFramework for
Authentic Assessment
JudithT.M.Gulikers
TheoJ. Bastioens
PaulA. Kirschner
authentic authentic
instructon assessment
perceptionof
authenticity
winthina
student authenticlearning transfer sueS
winthina
student
authentic
achievement
1997),but ratherthat we see it as a specific sub- ment were distinguished: (a) the assessment
set within a specific field of endeavor, namely task, (b) the physical context, (c) the social con-
becoming an academic. In this we concur with text, (d) the assessment result or form, and (e)
Brown,Collinsand Duguid (1989)who, too, saw the assessment criteria.These dimensions can
authenticachievementto be morethanauthentic vary in their level of authenticity(i.e., they are
academicachievement. continuums).It is a misconceptionto think that
The following section discusses five dimen- something is either authentic or not authentic
sions (a theoreticalframework)that can vary in (Cronin, 1993; Newmann & Wehlage, 1993),
their degree of authenticityin determiningthe because the degree of authenticityis not solely a
authenticity of an assessment. The purpose of characteristicof the assessmentchosen;it needs
this frameworkis to shed light on in the concept to be defined in relationto the criterionsituaiton
of assessmentauthenticityand to provide guide- derivedfromprofessionalparctice.Forexample:
lines for implementing authenticity elements carryingout an assessmentin a teamis authentic
into competency-basedassessment. onlyif the chosen assessmenttask is also carried
out in a team in real life. The main point of the
framework is that each of the five dimensions
TOWARD
A FIVE-DIMENSIONAL can resemblethe criterionsituationto a varying
FRAMEWORKFORAUTHENTIC degree, thereby increasing or decreasing the
ASSESSMENT authenticityof the assessment.
Because authentic assessment should be
To define authenticassessment,we carriedout a aligned to authentic instruction (Biggs, 1996;
review of literatureon authenticassessment,on Van Merrienboer,1997),the five dimensionsof a
authenticity and assessmentin general,and on framework for authentic assessment are also
student perceptions of (authentic) assessment applicableto authenticinstruction.Even though
elements. Five dimensions of authenticassess- the focus of this article is on authentic assess-
A FIVE-DIMENSIONAL
FRAMEWORK
FORAUTHENTIC
ASSESSMENT 71
g
0)
u~ c
0
u,
r, C
O)
o-~*~g
'-(-'5o
r
)
0O a 0
0-
0> O
~~E5
(I 0P' OcL'u
0-
00
0
0)~E
V -
))
+o
a! 2 0)
L
mQ)[n
L
cnQ,
t+r 0 CL oB
c k
o
co O3~
+e ~ Q:
(b
I+ +' ;5 a, g~e~y
o
r~9
~cEl~
CeL
t E "
80
04"-
0.-I
K ~ *c
IO2
QO_
ix::
0 " Co
4>7J
Oc.-
~00
Cu'-
0
C0
m' a C
-0a E
~C
0,
0)'
7
0.0
cLo
0,
0 -~
0
~C:b
L
o0i
I CO)
.c =
O>a0 3
CL
EC.
C.). o
CE~o 4-- CL
0
0) u
S>0..-
00~
i>Cu'-'0
>0)0
2E
A FIVE-DIMENSIONAL
FRAMEWORK
FORAUTHENTIC
ASSESSMENT 73
E ~c
O
t... q)5C
C' Q~I L X ,
C.,Q 0 Uv G
E '
a c'
tO4 0)O- CpQ
2. U2 O
74 Vol. 52, No. 3
ETR&D,
problems, the assessment task should resemble contain relevant as well as irrelevantinforma-
the complexityand ownershiplevels of the real- tion (Herrington& Oliver),should resemblethe
life criterionsituation. resourcesavailablein the criterionsituation.For
Up to this point, task authenticityappearsto example,Resnick(1987)arguedthatmost school
be a fairly objectivedimension. This objectivity tests involve memorywork, while out-of-school
is confounded by Sambell, McDowell, and activitiesareoftenintimatelyengaged with tools
Brown (1997),who showed that it is crucialthat and resources (calculators,tables, standards),
studentsperceive a task as relevant,that (a) they making such school tests less authentic.Segers
see the link to a situation in the real world or et al. (1999)argued that it would be inauthentic
working situation;or (b)they regardit as a valu- to deprivestudentsof resources,becauseprofes-
able transferable skill. McDowell (1995) also sionals do rely on resources.Anotherimportant
stressedthat students should see a link between characteristiccrucialfor providing an authentic
the assessment task and their personalinterests physical contextis the timestudents are given to
before they perceive the task as meaningful. perform the assessment task (Wiggins, 1989).
Clearly,perceived relevanceor meaningfulness Tests are normally administeredin a restricted
will differfrom student to student and will pos- period of time, for example two hours, com-
sibly even change as students become more pletely devoted to the test. In real life, profes-
experienced. sional activities often involve more time
scatteredover days or, on the contrary,require
fast and immediate reaction in a split second.
Physicalcontext. Where we are, often if not Wiggins (1989) said that an authentic assess-
always, determineshow we do something,and ment should not rely on unrealisticand arbitrary
often the realplace is dirtier(literallyand figura-
time constraints.In sum, the level of authenticity
tively) than safe learning environments.Think, of the physical context is defined by the resem-
for example, of an assessmentfor auto mechan- blance of these elements to the criterionsitua-
ics for the military.The capacity of a soldier to tion.
find the problemin a nonfunctioningjeep canbe
assessed in a clean garage,with all the conceiv-
ably needed equipment available,but a future Socialcontext. Not only the physicalcontext,but
physical environment may possibly involve a also the social context,influencesthe authentic-
war zone, inclement weather conditions, less ity of the assessment. In real life, working
space,and less equipment.Eventhough the task together is often the rule ratherthan the excep-
itself is authentic,it can be questioned whether tion, and Resnick(1987)emphasized that learn-
assessing students in a clean and safe environ- ing and performingout of school mostly takes
ment really assesses their ability to wisely use place in a social system. Therefore,a model for
theircompetenciesin real-lifesituations. authenticassessmentshould considersocialpro-
The physical context of an authenticassess- cesses that arepresentin real-lifecontexts.What
ment should reflect the way knowledge, skills, is really importantin an authenticassessmentis
and attitudes will be used in professionalprac- that the social processes of the assessment
tice (Brown et al., 1989; Herrington & Oliver, resemble the social processes in an equivalent
2000).Fidelityis often used in the contextof com- situationin reality.At this point, this framework
puter simulations,which describehow closely a disagrees with literature on authentic assess-
simulation imitates reality (Alessi, 1988). ment that defines collaborationas a characteris-
Authentic assessment often deals with high- tic of authenticity (e.g., Herrington &
fidelity contexts. The presentation of material Herrington,1998).Ourframeworkarguesthat if
and the amount of detail presented in the con- the real situation demands collaboration,the
text are importantaspects of the degree of fidel- assessment should also involve collaboration,
ity. Likewise, an important element of the but if the situationis normallyhandled individ-
authenticity of the physical context is that the ually, the assessment should be individual.
number and kinds of resources available When the assessment requires collaboration,
(Segers,Dochy, & De Corte,1999),which mostly processes such as social interaction, positive
A FIVE-DIMENSIONAL
FRAMEWORK
FORAUTHENTIC
ASSESSMENT 75
tive overload and, in turn, have a negative the authenticity dimensions differed between
impact on learning (Sweller,Van Merrienboer, students and teachers and between students
& Paas, 1998). As a result, a criterionsituation with different amounts of practicaland educa-
will often need to be an abstractionof real pro- tional experience.The differencesand similari-
fessional practice in order to be attainable for ties along a limited number of dimensions can
students at a certaineducationallevel. The ques- give insight into what is crucialfor defining and
tion that immediatelycomes to mind in this con- designing authenticassessments.
text is How do you create an authentic
assessmentfor studentswho arenot preparedto
function as beginning professionals? The METHOD
answer is that the authenticityof an assessment
should be defined by its degree of resemblance Participants
to the criterionsituation(i.e.,an abstractionfrom
professionalpractice)and not necessarilyto real Students and teachers from a nursing college
professional practice. Van Merrianboer(1997) took part in this study. One session of the study
argued that an abstractionof real professional involved only teachers, one session involved
practice (i.e., the criterionsituation)can still be
sophomorestudents (secondyear), and one ses-
authentic as long as the abstracted situation
sion involved senior students (fourthyear). The
requiresstudents to performthe whole compe- student groups could be furtherdivided into a
tency as an integratedwhole of constituentcom-
group of students studying nursing in a voca-
petencies. The abstraction results from tional training program (VTP)where they are
simplifying contextual factors that complicate primarily in school and make use of short
the performanceof the whole competency.
internships,and a group that studied nursing in
A third consideration also sheds a light on a block release program (BRP)where learning
the question stated in the previous sections, and working are integrated on an almost daily
namely the subjectivityof authenticity.The per- basis. This resulted in five groups of partici-
ception of what authenticityis may change as a pants: (a) 8 sophomore VTPstudents (M age =
resultof educationallevel, personalinterest,age, 18.5 years), (b) 8 sophomore BRP students (M
or amount of practicalexperiencewith profes- age = 20.9 years), (c) 8 senior VTP students (M
sional practice (Honebein et al., 1993). This age = 19.7 years), (d) 4 senior BRPstudents (M
implies that the five dimensionsthat are argued age = 31.4 years), and (e) 11 teachers (M age =
in the framework for authenticassessment are 42.8 years). The numberof participantsper ses-
not absolute but, rather,variable.It is possible sion was limited because of the practicalpossi-
that assessing professional competence of stu- bilities of the group support system used in this
dents in their final year of study, when they study.
have often served internshipsand have a better
idea of professional practice, requires more
authenticityof the physical context than when Materials
assessing first year students, who usually or
often have little practicalexperience.Designers An electronicgroup supportsystem (GSS)at the
must take changing student perspectives into
Open Universityof the Netherlandswas used as
accountwhen designing authenticassessment. researchtool. A GSSis a computer-basedinfor-
The qualitativestudy describedin the rest of mation processing system designed to facilitate
this articlehas two main goals. First,it explores group decision making. It is centered on group
whether our five-dimensionalframeworkcom- productivity through idea generation, prefer-
pletely describesauthenticityor whetherimpor- ence, and opinion exchange of people involved
tant elements may be missing. Second, it in a common task in a sharedenvironment.The
explores the relative importance of the five GSS allows collaborativeand individual activi-
dimensions. A subgoal of this study was to ties such as brainstorming,idea generation,sort-
explore if the perceptionof (the importanceof) ing, rating, and clustering via computer
A FIVE-DIMENSIONAL
FRAMEWORK
FORAUTHENTIC
ASSESSMENT 77
paired comparisons to determine the relative The paired comparison data of the five
importance of the dimensions. Activity three dimensions, that is, the number of times that a
consisted of 10 paired comparisonsof the five dimension in the paired comparisonswas rated
dimensions (5 x 4/2). Participantshad to choose as more important than another dimension,
the dimensions of the frameworkthat they per- were talliedper participantgroup. The absolute
ceived to be more important for authentic scores were then translatedinto rankings. The
assessment.The fourth activitywas the same as paired comparisons of the case descriptions
the activity at the beginning of the experiment: were analyzedin the same way.
The participantswere again required to carry
out pairedcomparisonsof case descriptionsthat
varied in theiramountof authenticityaccording RESULTS
to the five-dimensionalframework.Eachgroup
receivedthe counterbalancedset of case descrip- In general, the task, the result or form, and the
tions to those compared at the beginning of the criteria were rated as most important for the
experiment. authenticity of the assessment. The social con-
text was clearlyconsideredto be least important
for authenticity,and the importanceof the phys-
Analysis
ical contextwas stronglydiscussed.
A characteristicof the GSS is that the answers,
statements, choices, and so forth, of each indi-
vidual participantare anonymous. This means The RelativeImportance of the Five
that scores per participantwere not available, Dimensions:PairedComparisons
which precluded the possibility of carryingout
statisticaltests. On the otherhand, the anonym- Thepairedcomparisonsof the dimensionsand of
ity inhibitedsocially acceptedansweringbehav- the case descriptionsgave insightinto the relative
ior, and has been shown to stimulateresponsein importanceof the five dimensionsfor designing
idea generation and increase the reliability of authenticassessments.The comparisonsof the
answers.The data, thus, were qualitativelyana- dimensionsresultedin five rankings(sophomore
lyzed. The tapes of the discussions were tran- VTPstudents,sophomoreBRPstudents,teachers,
scribed. Both discussion statements and the senior VTP students, and senior BRP students)
statements keyed in during the brainstorms from 1 to 5. The pairedcomparisonsof the case
were analyzed to discern which of the five descriptionswere analyzedfor the same groups,
dimensions of the framework they fit. State- but were measuredin pre- and posttests,which
ments that did not fit were classifiedas other. resultedin ten rankingsfrom1 to 4.
Table1 shows rankingsper group of the five dimensions was perceived as most authentic
dimensions, based on their perceived impor- (score 1) by all, except the senior BRPstudents
tancein providing authenticityto an assessment on the posttest (score2.5). The other three kinds
(1 = most important, 5 = least important).Table 1 of cases showed an interestingpattern.The case
shows that all groups perceived the task as that was authenticexcept for the task received
important (score 1 or 2), and all groups except mostly a score of 2, which meant that this case
the senior VTP students (score 3.5), perceived was perceived as relatively authentic,which in
the social contextas the leastimportant.Further- turn meant that the task (which was not authen-
more, the result or form and criteriondimen- tic in this case) was notperceivedas very impor-
sions received more than average importance, tant in designing an authenticassessment.This
whereas all groups perceived the physical con- is contraryto the findings of the paired compar-
text as relativelyunimportant(scoreabout4). In isons of the dimensions in which the task was
short, independent of the group (see totals in perceived as very important in providing
Table 1), the task was perceived as most impor- authenticityto an assessment.Finally,the partic-
tant,followed by the resultor formand criterion ipant groups disagreedaboutthe authenticityof
dimensions;the physical context and especially the remaining two kinds of cases. All
the social contextlagged farbehind.
sophomore students ranked the case that was
The results of the paired comparisonsof the authenticexpectfor the resultas 4, meaningthat
case descriptions,in pre-and posttests,also gave they perceivedthis case to be the least authentic.
insight into the relative importance of the In otherwords, they perceivedthe resultor form
dimensions. Table2 shows rankingsper group dimension as most importantfor designing an
of the four case descriptions. authentic assessment. Teachers, on the other
A 1 meant that this case was perceived as the hand, rankedthe case that was authenticexcept
most authenticcase descriptionand a 4 referred for physical context as the least authenticcase
to the least authentic case description. An (score 4), which meant that teachers perceived
importantfinding, for the framework,was that physical context to be most important in
the case that described a completely authentic designing an authentic assessment. Senior stu-
assessment based on the presence of all five dents did not appear to differentiate,meaning
that they perceived the cases with no authentic should be real professionalpracticeor a simula-
physical context or with no authenticresult or tion in school.
form as equally inauthentic(score3.5). To sum, A closerlook at the contentof the brainstorm
the findings of the paired comparisonsof the
statements gave the impression that teachers
case descriptionsindicatedthat when all of the
and seniors agreed more with each other and
dimensions in the frameworkare present in a
with the idea of the framework,than with the
case, the case was unequivocally seen as most
sophomorestudents,especiallywhen it came to
authentic.In addition,thereappearto be contra- task and result or form dimensions. Teachers
dictory results with respect to task authenticity and seniors agreed with the frameworkthat an
compared to the results of the paired compari- authentictask requiredan integrationof profes-
sons of the dimensions. Finally, when evaluat-
sional knowledge, skills, and attitudes,and they
ing assessment cases, teachers and students
acknowledged that the task should resemble
appear to differ with respect to the importance real-life complexity. On the other hand,
of the authenticity of physical context versus
sophomore students were preoccupied with
resultauthenticity.
knowledge testing,they had problemspicturing
the idea of integratedtesting, and were primar-
ily concerned with making assessment easier
Completeness and Relative and clearer (e.g., "assignments should be less
Importance:What Do ParticipantsSay? vague, not more than one answershould be pos-
sible") instead of simulating real-world com-
Table 3 shows that all dimensions received plexity.In the resultor formdimension,teachers
attention in the brainstormand discussion ses- and seniors agreed that more assessment
sions. Furthermore,these results corroborated moments and methods should be combinedfor
the earlier findings, in that social context a fairerand more authenticpictureof students'
receivedthe least attentionin all groups.Besides professional competence. Sophomores did not
the five dimensions, almost all subelements of discuss the resultor form dimensionmuch;they
the dimensions, described in the framework, only mentioned that reshaping currenttests in
were reviewed. the form of cases would make them more realis-
Based on the number of statementsand the tic. In other words, every kind of assessment
could be made more authenticby adding realis-
ratios of the statementscomparedto each other,
tic information.
as shown in Table3, sophomoresplace primary
interest on task, followed by physical context. A specificationof the otherstatements (see
Seniors and teachers place equal emphasis on Table 4) showed, first, that all groups made
task and result. Teachers differ from all stu- statementsemphasizingthe alignmentbetween
dents, regardless of level, with respect to the instructionand assessment,and between school
emphasis on physical context.Teachersdevoted and real-lifepractice.This is in agreementwith
a lot of time to discussing the requiredfidelity the theoreticalideas behind the frameworkfor
level of the physical context in an effective authentic assessment. Second, Table 4 shows
authentic assessment. Especially emphasized that issues concerningthe assessorof an authen-
was the questionof whetherthe physicalcontext tic assessment, and organizational or pre-
Sophomore students 24 19 6 7 13 45
Senior students 34 21 9 36 12 26
Teachers 16 39 5 19 21 56
A FIVE-DIMENSIONAL
FRAMEWORK
FORAUTHENTIC
ASSESSMENT 81
students Seniorstudents
Sophomore Teachers
conditional issues, should be taken into account A combination of the results of the GSS activ-
in a framework for authentic assessment. Issues ities led to the conclusion that task, result or
related to the assessor dealt with the realization form, and criteria were perceived as very impor-
that people from professional practice should be tant for authentic assessment. Physical context
involved in defining and using criteria and stan- was most important in the eyes of teachers.
dards. Organizational issues involved state- Social context was perceived as the least impor-
ments about conditions that should be met before tant dimension.
authentic assessment can be implemented in Furthermore, not all groups perceived the
school. For example, teachers talked about plac- dimensions and elements in the same way.
ing students in professional practice sooner and Teachers and seniors mostly agreed with each
more often for the purpose of assessing them in other and with the theoretical framework; how-
this professional context. Finally, Table 4 shows ever, sophomores often deviated from the other
that sophomores took the opportunity to talk groups. There were no differences between VTP
and complain about the instruction. Although and BRP students.
instruction was not being evaluated (i.e., it was
about assessment), 28 statements dealt with
what was taught and not with what was
DISCUSSION
assessed. Seniors were more focused, and
teacher statements were spread over different
other variables and the 26 statement of the not To reiterate: The two questions with which we
defined variable included mostly jokes or ques- began were (a) Is the framework complete? (b)
tions they asked each other. Do students differ from teachers with respect to
what they perceive as important for authentic-
ity? Both of these questions shed light on possi-
ble guidelines for designing authentic
CONCLUSION assessments.
The answer to Question 1 appears to be yes.
Overall, the five-dimensional framework gave a The five dimensions appear to adequately
good description of what dimensions and ele- define authenticity, as demonstrated by both the
ments should be taken into account in an brainstorming and the high ranking of those
authentic assessment; the participants discussed cases that were authentic on all five dimensions.
all dimensions and almost all elements The adequacy of the framework is corroborated
described in the framework. However, elements by the finding that during the brainstorming,
concerning the assessor and organization issues most subelements of the dimensions as de-
should be added to complete the framework, as scribed by the framework were seen as impor-
these elements turned out to be important to all tant when designing authentic assessment. The
participant groups. paired comparisons showed some subtle differ-
82 Vol. 52, No. 3
ETR&D,
ences in the importanceof the five dimensions their perception of authenticity,some interest-
for providing authenticity.While the task, the ing findings came to light. The most differences
result or form, and the criterion dimensions were found between sophomores and teachers,
turnedout to be very importantfor authenticity, while seniors agreed with teachersmore often.
the physical context and especially the social Moreover, the perceptions of teachers and
context dimensions were perceived as less seniors agreed more with the ideas of the theo-
important.Social context is unequivocallyper- retical framework.Possibly, the perceptions of
ceived as the least important dimension of older students have changed during their col-
authenticity. All groups stressed the need for lege careeras a result of having had experience
individual testing, although both students and with professional practice; the perceptions of
teachersstressed that most nursing activitiesin sophomores-who have less practical experi-
real life are collaborative.Teachers explained ence-seemed to be based primarily on their
that "assessing in groups is a soft spot, we just previous experiences with assessment, which
don't know how to assess students together, explainedthe focus on knowledge and in-school
because at the end we want to be sure that every testing. In other words, it appears that
individual student is competent."It should not sophomore students have differentconceptions
be concluded, based on these findings, that and possibly misconceptionsof realprofessional
social context is not important for authentic practiceand, thus, authenticityof assessment.
assessment, but if choices have to be made in Furthermore, the brainstorming and the
designing an authentic assessment, social con- paired comparisons of the case descriptions
text is probablythe first dimensionto leave out. showed differencesbetween teachers and stu-
The findings on importanceof task are some- dents in the perception of physical context.
times contradictory.Although the brainstorm- Teachersfocused on the importanceof increas-
ing and the paired comparisons of the ing the authenticityof physical contextby plac-
dimensions show that task was perceived as ing the assessment in professional practice,
very importantby all, the pairedcomparisonsof whereas students, especially sophomores,
the cases made task seem less important.It is mostly focused on in-school testing with, for
possible, thus, that although the respondents example,simulatedpatientsand realisticequip-
consider task (as an abstractedconcept) to be ment.
most important,they arenot ableto identify(i.e., Finally, all groups agreed on the relative
they do not perceive)an authentictask. A possi- unimportanceof the social context and on the
ble explanationfor this is that the all-authentic- importance of using criteriathat resemble the
except-for-the-task case resembles current criteriaused in realprofessionalpractice.Teach-
assessment practices. Because previous experi- ers and students agree that, at this point, the cri-
ences are found to strongly influence percep- teriaused in school differtoo much fromcriteria
tions (Birenbaum,2003),the familiarityof these used in professionalinstitutes, and that school
cases may have influenced the paired compari- criteriaare often unknown or misinterpretedby
sons of the cases. If this is the case, the paired assessorsat the professionalinstitutes.
comparisonsof the five dimensionswere proba-
bly a more objectivemeasureof the importance
of the five dimensions. FutureImplications
Finally, it might be the case that assessor- The findings of the study allow for some critical
related issues would complete the framework.
questions and guidelines concerningthe design
Thiscould be done by adding a sixth dimensions of authentic assessment. First, student percep-
called "the assessor," or by adding the issues tions should be consideredin designingeffective
concerning who should use and develop authenticassessments.The qualitativeresults of
authenticcriteriaand standardsas subelements
this study showed that students, especially at
to the criteriondimension.
the beginning of theirstudy and with little prac-
With respect to Question 2, concerning the tical experience, have different conceptions
differences between students and teachers in (possibly misconceptions)of what authenticity
FRAMEWORK
A FIVE-DIMENSIONAL FORAUTHENTIC
ASSESSMENT 83
See Appendix,
overleaf
86 Vol.52, No. 3
ETR&D,