Академический Документы
Профессиональный Документы
Культура Документы
A Little History
Over a century ago, Sir Francis Galton noticed that very tall men didn’t
ordinarily have sons as tall as themselves. The sons of these tall
fathers were nearer the population mean in height. To be sure, most
of the sons were taller than average, but they were not typically as tall
as their fathers. Compared with their fathers, their stature had
regressed toward the mean.
The mean deviation score (that is, the mean of the x scores) for the
fathers is +12 cm (190 cm - 178 cm). Similarly, the mean deviation
score for the sons is +6 cm (184 cm - 178 cm).
Half of the fathers' deviance was manifested in the sons. That is, half
of the fathers' mean deviation of +12 cm was maintained in the sons'
mean deviation of +6 cm. This .5 represents the regression
coefficient or Pearson r.
1
The title of Galton's paper was, "Regression Toward Mediocracy in Hereditary Stature."
His wording seems unfortunate. First, the word "mediocracy" is disparaging in tone; the
synonymous phrase, "the mean" serves better. Second, an ardent hereditarian, Sir
Francis unnecessarily went out on a speculative limb in using the word "hereditary."
Actually, no assumption concerning nature-nurture contribution or interaction is
necessary for considering and understanding the regression effect.
2
For human height examples, gender of parent and child must be specified because of sex
differences. We could, of course, just as well have used examples of father-daughter,
mother-son, or mother-daughter height. For subsequent examples, such as parent-
child IQ, gender need not be specified.
3
This assumes that the population mean and standard deviation remain constant across
the two generations. If the population were getting taller or if its variability were
changing, then we would have to adjust for this in our calculations. But to keep the
examples simple, let's assume that height, IQ, and other traits do not change from one
generation to the next.
Student’s Companion Website for Assessment for Effective Teaching
Clearing up Confusion
No, the regression effect over several generations does not cause the
population to converge upon the mean. What has been seen is that tall
men don’t produce enough tall sons to replenish the ranks of the tall.
Likewise, short men don’t sire enough short sons to replace their
numbers in the next generation. But others do! The key point is that
the next generation of tall men doesn’t come exclusively from the
ranks of tall fathers; although tall fathers have more than their share
of the tall sons, some average fathers and even a few short fathers
have tall sons. Likewise, the next generation of short men doesn’t
derive solely from short fathers.
4
Note again that we are making the simplifying assumption that the standard deviation
remains the same in the two populations (generations in this case).
Student’s Companion Website for Assessment for Effective Teaching
5
This procedure would not be appropriate for identifying gifted students. Important
decisions should never be made on the basis of one datum alone, and rigid cutoff points
shouldn’t be used in defining giftedness (or most anything else). Moreover, in most
jurisdictions, classification would legally require use of an individually administered
test of scholastic aptitude.
6
From the reader’s point of view, this .5 is a given, not something that one should be
able to figure out. By the way, it is only coincidental that it equals the approximate
fraction of genes shared by parent-child pairs.
Student’s Companion Website for Assessment for Effective Teaching
Knowing that r = .5, what is the most likely mean IQ of the children of
this deviant sample? To obtain the predicted deviation score of the
children, the deviation score of the deviant sample of parents is
multiplied by the r. Thus, +40 (140 - 100) times .5 equals +20. This
expected mean deviation score of +20 is then added to the population
mean of 100 to obtain the answer of 120.
By this point, some readers will have reasoned that prediction can be
improved if the IQs of both parents are used as predictors. Indeed it
can. Studies of the relationship between mean of parents' IQs and
those of adolescent offsprings report correlations of about .6.
The regression effect is bad news for the gifted. Gifted individuals
typically have children who are less bright than themselves. Along the
same lines, most famous athletes don’t have children as successful as
themselves. Celebrated composers, philosophers, and politicians are
likewise unlikely to produce children as prominent as themselves.
There are exceptions, but regression toward the mean tilts the odds
against it.
7
Equal weighting of the predictors is appropriate because parents have approximately
equal genetic and environmental impacts. In some problems, however, maximum
validity is attained by very unequal weighting of the predictors.
Student’s Companion Website for Assessment for Effective Teaching
The regression effect is good news for those who work with persons
having intellectual disabilities. The example of Jenny and her brother
could just as well have been Jenny and her child. Knowledge of this can
increase realistic optimism in some kinds of professional work.
For the next example, think of your brightest friend with whose family
you are acquainted. Chances are that your friend will have children
who are less bright, will have siblings who are less bright, and will have
parents who are less bright.
To show the scope and limits of the regression effect, let’s now
consider some very extreme examples.
That illustrates one limiting case. When there is zero correlation, then
regression will be to the mean. In the usual case where there is some
(either positive or negative) correlation, then regression is only
toward the mean.)
the person whose Wrong Score is at the class mean, you have also
identified the person whose Right Score is at the class mean. Of, if
you selected the person whose Right Score was second from the
bottom, you can be certain that this is the person whose Wrong score
is second from the top.
Any research design for a study of this type should take the
regression effect into account. How much drift upwards toward the
mean would be predicted on the basis of the regression effect alone?
That is, if the intervention were worthless, what mean IQ would be
expected at age 4 for this sample selected because of negative
deviation from the mean? The answer is found by multiplying the .8 r
by the deviation score of -30 (70 - 100) to get the expected
Student’s Companion Website for Assessment for Effective Teaching
Improvement in Reading
8
Accounting for the other 4 points of gain is more problematic. Since we know the
investigator was so unsophisticated as to fail to account for the regression effect, we
might worry that there may have been some tilting of the intervention toward the test--
some teaching to the test. This could easily account for the other 4 points of gain.
Another possibility, of course, is that the treatment really does have merit and that it
did contribute to the gain. Another is that the outcome was, in part, a fluke of chance;
this is especially likely, of course, with small samples. Yet another viable hypothesis is
a practice effect that raised the scores on the second occasion. Although all of this is
speculative, it is not speculative to say that the researcher lacked competence and that it
is therefore hazardous to assume that all is well with all aspects of the research design.
Student’s Companion Website for Assessment for Effective Teaching
The test manual reports that for a typical sample, the parallel-form
correlation over a one-semester interval was .84. Now the amount of
"free ride" from the regression effect can be calculated. How much
apparent gain is to be expected from the regression effect? Try to
compute it before reading on.
The -2.50 mean z-score of the deviant group is multiplied by the .84 r
to yield an expected mean z-score of -2.10 if there is no real gain.
That is, the regression effect alone should raise the scores about .4 z-
score units. The interpretation of results should "discount" the
findings by this amount.
The regression has now been “flushed out” before the comparison of
pretest and posttest scores. If the researcher refrains from any
further selection (or de-selection) on the basis of the second test's
results (i.e., the results of the pretest), then the difference between
the pretest and the posttest is not contaminated by the regression
effect. That is, if there is no further selection between pretest and
posttest, then there will be no further regression between them.
Suppose a teacher administers the first social studies test of the year
to a tenth-grade class. The performance of 5 of the 25 students is
reported below.
• Abe received the highest score.
• Bertha obtained a high-average score.
• Maxie got an average score.
• Olivia received a low-average score.
• Rob got the lowest score.
Now which of these kinds of things are likely to recur at the time of
the next test? Superior knowledge, but not luck. Abe is likely to be
well above average in his knowledge in the next unit. But the good luck
he may have had isn’t any more likely to revisit him than to visit
anyone else. Since he is the most likely to have had good luck, he is
the one with the greatest chance of declining on the next test.
Of the five, Bertha is the next most likely to decline. The best bet is
that Rob would improve the most and that Olivia would improve the
Student’s Companion Website for Assessment for Effective Teaching
next most. The only person for whom we would predict stability would
be Maxie; since she did not deviate from the mean, there can be no
tendency to move toward it.
It has been well over a century since Sir Francis Galton described the
regression effect, and it seems reasonable to expect competent
researchers to be cognizant of the phenomenon and to allow for it in
designing their investigations. Otherwise, the artifact can seriously
contaminate research findings and interpretations.
Summary
2 Jane is shorter than any other female on either side of her family.
Her daughters are most apt to be
A shorter than she.
B about equal to her stature.
C taller than she, but shorter than most female members of her
family.
D as tall as most female members of her family.
E taller than most female members of her family.
3 Suppose that the first two tests in a course are correlated +0.7.
Jason receives a z-score of -2.0 on a the first test. What z-
scores would we best predict that he would make on the next
quiz? (Ignore affective factors.)
A +.0
B .0
C -.7
D -1.0
E -1.4