VALIDITY AND RELIABILITY
Measurement and Evaluation in Education
Graduate School- LSU Ozamis
Presenter: Euprime B. Regalado
Validity
• The degree to which a test measures
what it is supposed to measure
• The most central and essential
quality in the development,
interpretation, and use of
educational measures.
Evidence
• Is used to determine whether a
test is measuring what it is
supposed to measure
• Grouped into three categories:
– Construct-related evidence
– Content-related evidence
– Criterion-related evidence
Construct-Related Evidence
• Establishes a link between
the underlying
psychological construct
we wish to measure and the
visible performance we
choose to observe
• In measuring learned
knowledge, bear in mind the
question:
Reliability
• The consistency of
measurements
A RELIABLE TEST
Produces similar scores
across various conditions
and situations, including
different evaluators and
testing environments.
How do we account for an individual who
does not get exactly the same test score
every time he or she takes the test?
TARGET BEHAVIOR
A specific behavior the
observer is looking to record
ALTERNATE FORMS RELIABILITY
• Also known as equivalent forms reliability or
parallel forms reliability
• Obtained by administering two equivalent
tests to the same group of examinees
• Items are matched for difficulty on each test
• It is necessary that the time frame between
giving the two forms be as short as possible
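As an illustration of the procedure above, alternate forms reliability is commonly estimated as the correlation between the scores the same examinees earn on the two forms. The sketch below is only a minimal example under that assumption: the function name `pearson_r` and the score lists `form_a` and `form_b` are hypothetical and do not come from the presentation.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation between two equal-length lists of scores."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two score lists of equal length (at least 2)")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sum of cross-products and sums of squares of the deviations.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


# Hypothetical scores of the same five examinees on two equivalent forms.
form_a = [10, 12, 15, 18, 20]
form_b = [11, 13, 14, 19, 21]

reliability = pearson_r(form_a, form_b)
print(round(reliability, 2))
```

A coefficient close to 1 would suggest that the two forms rank the examinees in nearly the same way.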
OBTAINED SCORE
• The score you get when you administer a test
• Consists of two parts: the true score and the
error score
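The two-part idea can be sketched as obtained score = true score + error score. The simulation below is a hedged illustration, not part of the presentation: it assumes the error score is random with a mean of zero (drawn from a normal distribution), and the function name `simulate_obtained_scores` and all numbers are hypothetical.

```python
import random


def simulate_obtained_scores(true_score, error_sd, n_administrations, seed=0):
    """Return obtained scores, each equal to true score + a random error score.

    Assumption: errors are normally distributed around zero.
    """
    rng = random.Random(seed)
    return [true_score + rng.gauss(0, error_sd) for _ in range(n_administrations)]


# One hypothetical examinee with a true score of 80, tested many times.
scores = simulate_obtained_scores(true_score=80, error_sd=3, n_administrations=10000)
mean_score = sum(scores) / len(scores)

print(min(scores) != max(scores))  # individual obtained scores differ
print(round(mean_score))           # but they average out near the true score
```

Under these assumptions, no single administration returns exactly the true score, yet the average of many obtained scores settles close to it.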
STANDARD ERROR OF MEASUREMENT (SEM)
Gives the margin of error that you should
expect in an individual test score because of
imperfect reliability of the test
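A common textbook formula, not stated on the slide, computes the SEM from the standard deviation of the test scores and the reliability coefficient: SEM = SD x sqrt(1 - reliability). The sketch below assumes that formula; the function names and the numbers are hypothetical.

```python
from math import sqrt


def standard_error_of_measurement(score_sd, reliability):
    """SEM = SD * sqrt(1 - reliability), with reliability between 0 and 1."""
    if not 0.0 <= reliability <= 1.0:
        raise ValueError("reliability must be between 0 and 1")
    return score_sd * sqrt(1.0 - reliability)


def score_band(obtained_score, sem, n_sems=1.0):
    """Range of scores within n_sems standard errors of the obtained score."""
    return (obtained_score - n_sems * sem, obtained_score + n_sems * sem)


# Hypothetical test: score SD of 10 and a reliability coefficient of 0.91.
sem = standard_error_of_measurement(score_sd=10, reliability=0.91)
low, high = score_band(obtained_score=75, sem=sem)
print(round(sem, 2), round(low, 2), round(high, 2))
```

The higher the reliability, the smaller the SEM and the narrower the band around an individual's obtained score.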
FACTORS AFFECTING RELIABILITY
1. Test length
2. Test-retest interval
3. Variability of scores
4. Guessing
5. Variation within the test
situation
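To illustrate the first factor, test length, one standard tool is the Spearman-Brown prophecy formula, which predicts the reliability of a test lengthened by a factor k with comparable items. The formula is not named in the presentation, so treat this as a sketch under that assumption; the function name and values are hypothetical.

```python
def spearman_brown(reliability, length_factor):
    """Predicted reliability when the test is made length_factor times as long.

    Assumption: the added items are comparable to the existing ones.
    """
    if length_factor <= 0:
        raise ValueError("length_factor must be positive")
    k = length_factor
    return (k * reliability) / (1 + (k - 1) * reliability)


# Hypothetical test with reliability 0.60, doubled and then halved in length.
doubled = spearman_brown(0.60, 2)
halved = spearman_brown(0.60, 0.5)
print(round(doubled, 2), round(halved, 2))
```

Under this formula, a longer test is predicted to be more reliable and a shorter one less reliable, other things being equal.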
Let the target (bull's eye) be the content objective
Let the darts be the test scores
Let the distance of the darts to the target be the measure of validity
Let the distance of the darts to each other be the measure of reliability
[Figure: two dart targets, both labeled RELIABLE; only one is also labeled VALID]
THUS, A RELIABLE TEST IS NOT ALWAYS VALID
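The dart analogy can be sketched numerically. In the example below, how tightly the darts cluster around their own center stands in for reliability, and how far they land from the bull's eye stands in for validity. The coordinates and function names are hypothetical and serve only to illustrate the idea.

```python
from math import hypot


def mean_distance_to_target(darts, target=(0.0, 0.0)):
    """Average distance from each dart to the bull's eye (smaller = more valid)."""
    return sum(hypot(x - target[0], y - target[1]) for x, y in darts) / len(darts)


def mean_distance_to_centroid(darts):
    """Average distance from each dart to the darts' own center (smaller = more reliable)."""
    cx = sum(x for x, _ in darts) / len(darts)
    cy = sum(y for _, y in darts) / len(darts)
    return sum(hypot(x - cx, y - cy) for x, y in darts) / len(darts)


# Hypothetical throws: both sets are tightly grouped, but only one is on target.
tight_off_target = [(5.0, 5.0), (5.1, 5.0), (5.0, 5.1), (4.9, 5.0), (5.0, 4.9)]
tight_on_target = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (-0.1, 0.0), (0.0, -0.1)]

for name, darts in [("off target", tight_off_target), ("on target", tight_on_target)]:
    spread = mean_distance_to_centroid(darts)
    miss = mean_distance_to_target(darts)
    print(name, round(spread, 2), round(miss, 2))
```

Both sets show the same small spread, so both are consistent, but only the second set lands near the bull's eye. Consistency alone does not guarantee that the darts hit the target.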
Thank you for listening!