Академический Документы
Профессиональный Документы
Культура Документы
Dionissi Aliprantis
Federal R eserve Bank of Cleveland
This paper studies causal effects informative for deciding the age when children
should start kindergarten. I present evidence from the Early Childhood Longitu-
dinal Study, Kindergarten Class of 1998–99 ðECLS-KÞ that standard instrumental
variable strategies do not identify effects of delaying kindergarten entry for any
subpopulation of interest. I propose and implement a new strategy for identifying
individual-level education production function parameters. Estimates indicate that
there can be decreasing and even negative returns to relative age: For the oldest
children in a cohort, educational achievement in third grade decreases as their age
relative to that of their classmates increases.
I. Introduction
Investing in early childhood education appears to be good policy from
several perspectives. Empirical evidence from studies such as the Perry
Preschool Experiment and the Abecedarian Project indicate that invest-
ments in early childhood can significantly improve outcomes over the life
cycle ðCurrie 2001; Heckman et al. 2010bÞ. Even if one ignores the nor-
mative arguments that such interventions could help move our society
toward greater equality of opportunity, there are still compelling argu-
ments for investing in early childhood education. There is reason to
believe that educational investments made at early ages yield higher
returns than similar ones made later in the life cycle ðCunha et al.
2005Þ, and researchers have calculated the returns to society from such
investments to be extremely high ðRolnick and Grunewald 2003; Heck-
man et al. 2010aÞ. Policy makers have responded to this research by
increasing investments in early childhood education during recent years
I thank Becka Maynard, Francisca G.-C. Richter, Ken Wolpin, Petra Todd, Fabienne
Doucet, Guhan Venkatu, Russ Cole, Rob Grunewald, Kimber Bogard, Tim Dunne, Julie
Cullen, Isaac Ehrlich, Ed Vytlacil, participants at the Penn Institute of Education Sciences
Seminar and Penn Empirical Micro Club, and two anonymous referees for helpful com-
ments. The research reported here was supported by the Institute of Education Sciences,
US Department of Education, through grant R305C050041-05 to the University of Pennsyl-
vania. The opinions expressed are those of the author and do not represent views of the
Federal Reserve Bank of Cleveland, the Board of Governors of the Federal Reserve System, or
the US Department of Education.
481
ðKirp 2007Þ, and these investments figure only to increase after the
Obama adminstration’s recent proposal to provide preschool for all low-
and moderate-income 4-year-old children ðWhite House 2013Þ.
The coordination of related policy will likely determine the effective-
ness of new investments in early childhood education ðBogard and
Takanishi 2005Þ. One related policy widely believed to have long-run
effects, and one that has changed considerably in recent years, is the age
at which children start kindergarten. In the United States, children are
eligible to enroll in kindergarten if they turn 5 before a specific date set
by their state of residence, known as an entrance cutoff date. In recent
decades many state governments have chosen to move entrance cutoff
dates earlier, increasing the average age at which children start kindergar-
ten in the United States.1 Parents and schools have also helped to increase
the average entrance age during recent decades by “redshirting” children,
or choosing to delay their entry into kindergarten ðWeil 2007; Elder and
Lubotsky 2009Þ.
This paper presents empirical evidence related to three causal effects
that are crucial for determining the age when children should start school:
ðiÞ the effect of delaying a child’s kindergarten entry by 1 year, also known
as the effect of redshirting a child; ðiiÞ the effect of a state changing its
entrance cutoff date; and ðiiiÞ the effect of a child’s birth ðand conceptionÞ
being moved to an earlier date. While effect i is relevant for parents’ and
schools’ decisions, effect ii is relevant for entrance age policy makers, and
effects ii and iii are relevant for gaining a general understanding of
children’s development process. The empirical analysis uses data from
the Early Childhood Longitudinal Study, Kindergarten Class of 1998–99
ðECLS-KÞ to understand these effects on educational achievement. The
strength of this data set is that it contains an unusually rich set of individual-
level variables, allowing for the identification of effects unidentified by
state-level data. The weakness of the ECLS-K, however, is that it allows us
to study only short-term effects.
The paper makes four key contributions in the process of investigating
the aforementioned effects. The first is to show that the effects of delaying
entry identified in the literature may be less informative than is currently
appreciated. I empirically test and reject the hypothesis of treatment effect
homogeneity for academic achievement. When combined with the failure
of the instrument monotonicity condition known to occur in this context,
such evidence of essential heterogeneity of treatment effects suggests that
most instrumental variable ðIVÞ strategies do not identify the effect of
choosing to delay a child’s entry for any subpopulation of interest ðBarua
and Lang 2009; Aliprantis 2012Þ. Thus the best evidence on the effects of
redshirting is likely to come from regression discontinuity designs, which
1
For example, 22 states moved their cutoff dates to an earlier point in the school year
between 1975 and 2000 ðStipek 2002Þ. See fig. 1 in Elder and Lubotsky ð2009Þ for a summary
of the evolution of these laws during the past 40 years.
2
This counterfactual contrasts with what could be estimated with a state-level identifica-
tion strategy and data. Such a counterfactual would represent a combination of this effect
and that from redshirting for various subpopulations.
Dhuey 2006; Datar 2006Þ, grade progression ðMcEwan and Shapiro 2008;
Elder and Lubotsky 2009; Dobkin and Ferreira 2010Þ, and attainment
ðFredriksson and Öckert 2005; Puhani and Weber 2005Þ to imply that
redshirting the youngest children in a cohort is optimal whenever possible
ð“Redshirting” episode on 60 Minutes, CBS News, March 4, 2012Þ.3
The central result of this paper, that initial age advantages need not
translate into superior development trajectories, is more consistent with
the view that normal development trajectories often cross each other,
being heterogeneous over individuals ðShonkoff et al. 2000, 28–29Þ, age
ð29–30Þ, and initial achievement ðWillett 1988Þ. More specifically, the
results in this paper are consistent with the literature finding negligible
effects from increased entrance age on IQ and earnings ðBlack, Devereux,
and Salvanes 2011Þ and negligible effects from relative age on achieve-
ment ðCascio and Schanzenbach 2007Þ or adult outcomes such as employ-
ment, wages, and home ownership ðDobkin and Ferreira 2010Þ. Most
consistent with the results in this paper, Elder and Lubotsky ð2009Þ also
find that being relatively younger within a cohort improves academic
achievement.4
The paper finishes by discussing several challenges to determining
optimal entrance age policy that are unlikely to be overcome. Most im-
portant is that causal effects estimated in the literature, including in this
analysis, are unlikely to represent inherent or unaltering relationships
between age and achievement. Thus strong assumptions are necessary in
order to predict effects from future changes to entrance and relative ages
using effects identified in past data ðAliprantis 2014bÞ.
Viewing the empirical results together with these challenges suggests
that attention should be given to designing a broad class of policies that
can accommodate the significant heterogeneity of entrance and relative
ages in any cohort of children. An obvious and existing policy focus would
be to improve the environment before entering kindergarten, which
should reduce the disparities in what kids know and can do upon entering
school ðShonkoff et al. 2000, 386–88Þ. Another obvious proposal would be
to focus on supporting children’s development once they arrive at school,
making sure that each classroom is ready for its children rather than vice
versa. This might be implemented through mixed-age classrooms, per-
haps following Montessori or similar models ðLillard 2008; Khan 2012Þ.
This proposal might also be implemented in just the opposite way, by al-
locating young children to homogeneous classrooms using demographic
characteristics such as age ðinterview with Malcolm Gladwell, 60 Minutes,
March 4, 2012Þ and/or gender. Further research into the effects of various
3
In addition to the literature on the effects entrance age has on academic outcomes, there
is also a literature documenting the important role of relative age in competitive sports
ðMuscha and Grondin 2001Þ and leadership ðDhuey and Lipscomb 2008Þ. Appendix A
discusses the background of entrance age policy in greater detail.
4
Elder and Lubotsky ð2009Þ additionally find evidence that age-related differences in
academic performance dissipate as children advance in school, attributing most of the initial
differences to the accumulation of skills before children enter kindergarten.
entrance age policies should be helpful for designing policies that can
support skills more efficiently begetting skills ðCunha and Heckman 2007Þ
without advantage necessarily begetting advantage.
The remainder of the paper is organized as follows. Section II presents
empirical evidence that redshirting choices are made under essential
heterogeneity and discusses the implications of this evidence. Both the
ECLS-K data set used in the analysis and some variable definitions are
introduced in this section. Section III presents the strategy for identifying
EPF parameters, with Section III.B first describing the selection of the
sample used in estimation. Section III.C specifies the EPFs to be estimated
and explicitly states the identifying assumptions necessary to identify their
parameters. Section III.D discusses the variation in the data used to
identify the EPF parameters, which also merits attention. Estimation re-
sults are presented in Section III.E, and Sections IV and V use these
estimated EPFs to determine the effects of counterfactual entrance age
policies and individual choices. Section VI discusses broad policy implica-
tions, and Section VII presents conclusions.
←
Throughout the analysis I use “5” to denote statistical equations and “5 ”
to denote structural equations to avoid the confusion discussed in Chen
and Pearl ð2012Þ. Statistical equations convey information about observed
changes in the outcome and treatment variables, while structural equa-
tions convey information about controlled changes in these variables. This
distinction is similar to the one between setting and conditioning discussed
in Heckman ð2005Þ, and further discussion can be found in Chen and
Pearl ð2012Þ and Aliprantis ð2014bÞ. I am interested in identifying fea-
tures of the joint distribution of treatment Di and the treatment effect
bi ; Yi ð1Þ 2 Yi ð0Þ, and in this analysis the outcome of interest is educa-
tional achievement.
Suppose further that individuals select into treatment on the basis of
a latent index:
TEH. U1 2 U0
jj U .
D
7
Although assuming X 0 5 X 1 is standard throughout the literature, this is a nontrivial
assumption. See Aliprantis ð2014bÞ for an analysis of selection into covariates in response to
treatment assignment.
EH. CovðU1 2 U0 ; UD Þ ≠ 0.
C. Data
The analysis presented in this paper utilizes data from the Early Child-
hood Longitudinal Study, Kindergarten Class of 1998–99 ðECLS-KÞ data
set, which is a nationally representative sample of over 22,000 children
enrolled in over 1,200 schools who started kindergarten in the fall of
1998. Data were collected during the fall and the spring of kindergarten
ð1998–99Þ, the fall and spring of first grade ð1999–2000Þ, the spring of
third grade ð2002Þ, fifth grade ð2004Þ, and eighth grade ð2007Þ from the
children, their parents/guardians, teachers, and school administrators.8
Since the survey in round 3 ðfall of first gradeÞ was designed to gather
data from only a subsample of about 30 percent, this round will not be
considered for the analysis. To account for the sampling scheme used
to collect the ECLS-K, weights will be utilized in all estimation.
Variables.—We define monthly groups for observed relative age as the
month relative to the entrance cutoff date ðof the year when entering
kindergartenÞ during which a child turns 5. For the cohort of children
we observe entering school in the fall of 1998, we will observe children
entering who are too young to be eligible but who enter anyway, chil-
dren who enter when first eligible, and children who were eligible to
begin school in the fall of 1997 but whose parents decided to enter them
in kindergarten in the fall of 1998. We define the youngest of these
children to be in the relative age group M –12 and the oldest to be in the
relative age group M24. These relative age groups can be seen in figure 1.
In order to assign children in the ECLS-K to these relative age groups, I
first construct their observed entrance and relative ages. Let TðCalendar
DateÞ denote a function of dates from the calendar to a time line in which
day 1 is January 1, 1990. I define each child’s observed entrance age EA
to be her age in months over 5 years at the start of the school year when
she began school, and I implement this definition empirically as the child’s
age over 5 on September 1, 1998. I also define a child’s observed relative
age RA for the entire sample to be the amount of time before the en-
trance cutoff date she turned 5 years of age:
EA ; T ðEntrance Date; Year of EntryÞ 2 T ð5th BDayÞ
ð8Þ
; T ðSept: 1; 1998Þ 2 T ð5th BDayÞ;
Definitions ð8Þ and ð9Þ help to illustrate that since redshirting changes
the year of a child’s entry by 1 year, counterfactual changes to year of entry
change both entrance and relative ages. With regard to relative age, for
children who enter on time when first eligible, the youngest children
will have a relative age of 0, and the oldest children will have a relative
8
Eighth grade will be the last round of data collection because of sample attrition.
Figure 1.—Identifying variation in entrance and relative ages: A, children facing a Septem-
ber 1, 1998, cutoff date; B, children facing a November 1, 1998, cutoff date. Date/location a : a
child turning 5 on January 1, 1998, facing a September 1, 1998, cutoff will have EA 5 8 and
R A 5 8. Date/location b : a child turning 5 on March 1, 1998, facing a September 1, 1998,
cutoff will have E A 5 6 and R A 5 6. Date/location c : a child turning 5 on January 1, 1998,
facing a November 1, 1998, cutoff will have E A 5 8 and R A 5 10. Date/location d : a child
turning 5 on March 1, 1998, facing a November 1, 1998, cutoff will have EA 5 6 and R A 5 8.
Figure 3.—Private school students in the sample: differences in school- and state-level
entrance cutoff dates ðin monthsÞ.
large differences in the entrance cutoff date measures are found among
this population.
The outcome measures used in this analysis are math and reading item
response theory ðIRTÞ test scores.
After partitioning the propensity scores into these intervals, I then esti-
mate a regression of test scores on the predicted propensity score p^ðx; zÞ
as well as the covariates in X with separate slopes for p^ðz; xÞ ∈ ½p ; p A and
p^ðz; xÞ ∈ ½p ; p B . The test of H0IV is empirically implemented as a Wald
A
B
test of the hypothesis that these slope coefficients are equal over the
different intervals of p^ðx; zÞ.
Tables 1 and 2 report the results of these tests together with the esti-
mates of the two coefficients at each test date. It is most reasonable to
view these as four tests of the hypothesis that selection into redshirting
is characterized by treatment effect homogeneity with respect to math
test scores and four separate tests of the hypothesis that selection into
redshirting is characterized by effect homogeneity with respect to read-
ing test scores. The final column presents corrected critical values to con-
trol the familywise error rate arising from the multiple comparisons made
here. These corrected critical values are displayed so that the ranked
p-values are compared to
a a a
; ; :::;
n n21 1
in a sequentially rejective Bonferroni test, as developed in Holm ð1979Þ,
where n is the number of tests performed and a is the desired size.
Tables 1 and 2 present strong evidence against treatment effect ho-
mogeneity: All tests reject the null of effect homogeneity. This evidence
TABLE 1
Testing H0IV for Math: bIV;½p ;p A 5 bIV;½p ;p B
A B
TABLE 2
Testing H0IV for Reading: bIV;½p ; pA 5 bIV;½p ; pB
A B
10
This need not be true when the benefit of delaying is cast in terms of the counterfactual
outcomes related to educational achievement at an older age, competitive sports ðMuscha and
Grondin 2001Þ, or leadership ðDhuey and Lipscomb 2008Þ. Also, this statement about coun-
terfactual outcomes is compatible with the recent finding that cognitive ability measured
before children enter school is not predictive of redshirting ðBassock and Reardon 2013Þ.
TABLE 3
Entering Rates by Month in the ECLS-K ð%Þ
Month before Cutoff Turned 5
Entering 12 11 10 9 8 7 6 5 4 3 2 1
Early 6 2 1 0 0 0 0 0 0 0 0 0
On time 93 97 98 98 97 97 96 93 90 87 81 74
Late 1 1 1 1 3 3 4 6 11 14 19 26
TABLE 4
Entering Rates by Quarter in the ECLS-K ð%Þ
Quarter before Cutoff Turned 5
Entering 4 3 2 1
Early 3 0 0 0
On time 96 97 93 80
Late 1 3 7 20
The data presented in table 3 and figure 6 guide the selection of a sam-
ple relatively unaffected by redshirting. Note that over a quarter of chil-
dren who turned 5 within 1 month of their school’s cutoff date, 26 per-
cent, were redshirted. While this rate does decline as children become
relatively older, it is still 11 percent for children who turned 5 in the
fourth month before their school’s cutoff date. Also note that a negli-
gible number of children enter early, except for 6 percent in the oldest
assigned relative age group. Finally, turning our attention to children
who turned 5 between 6 and 11 months before their school’s entrance
cutoff date, over 96 percent of each cohort entered kindergarten when
first eligible.
The sample used for the remaining analysis in this paper is composed
of those kindergartners in the observed relative age groups M 6 through
TABLE 5
Entering Rates by Demographic Characteristics in the ECLS-K
Percent
Entering Late
Overall 8
Race:
Black 5
White 10
Hispanic 6
Asian 3
Gender:
Male 10
Female 5
Home environment:
Many books ð≥100Þ 9
Few books ð<30Þ 7
Father in household 8
No father in household 6
Foreign language spoken at home 4
No foreign language spoken at home 8
Socioeconomic status:
High SES 8
Low SES 7
Mother:
High school dropout 7
BA holder 9
Employed 7
Not employed 8
Figure 6.—Initial enrollment decisions ðby assigned relative age cohorts BjÞ
M11 because they are the least affected by redshirting. Furthermore, the
sample is restricted to children living in states that set an entrance cut-
off date between August 31 and January 1. Children from states with local
education authority options ðColorado, Massachusetts, New Hampshire,
New Jersey, New York, Pennsylvania, and VermontÞ, from the three states
with entrance cutoff dates before August 31 ðAlaska, Indiana, and Mis-
souriÞ, and from states with fewer than 30 respondents ðArkansas, Dela-
ware, District of Columbia, Hawaii, Idaho, Montana, Nebraska, Nevada,
North Dakota, Oregon, South Carolina, South Dakota, West Virginia, and
WyomingÞ are omitted from our sample. The 27 states included in our
sample are Alabama, Arizona, California, Connecticut, Florida, Georgia,
Illinois, Iowa, Kansas, Kentucky, Louisiana, Maine, Maryland, Michi-
gan, Minnesota, Mississippi, New Mexico, North Carolina, Ohio, Okla-
homa, Rhode Island, Tennessee, Texas, Utah, Virginia, Washington, and
Wisconsin.
This analysis will not consider outcomes after the third grade because
of the severe attrition in the ECLS-K data set. Tables 6 and 7 show the
sample sizes for math and reading IRT test scores by survey round. The
tables reveal heavy attrition in grades 5 and 8 in the ECLS-K. In our
sample, 41 percent of initial respondents do not have reading IRT scores
by the fifth grade, and 51 percent attrite by the eighth grade. The prob-
lem is even worse for math IRT scores.
Figures 7 and 8 show the distributions of math and reading IRT test
scores of children in the sample. These figures show how these distribu-
tions changed for all children between the fall of kindergarten and the
TABLE 6
Attrition in the ECLS-K: Math IRT Score Present
Sample All ECLS-K
Round Attrition ð%Þ N Attrition ð%Þ N
Fall kindergarten 0 3,950 5 18,640
Spring kindergarten 3 3,850 0 19,650
Fall 1st grade ... ... ... ...
Spring 1st grade 17 3,270 15 16,630
Spring 3rd grade 28 2,830 27 14,370
Spring 5th grade 44 2,230 43 11,270
Spring 8th grade 54 1,830 53 9,290
Note.—Fall 1st grade is not used in the analysis since this wave sampled only about 30 per-
cent of the entire sample.
spring of third grade. Math scores are more smoothly distributed than
reading scores.
where aðtÞ represents the age of the youngest first-time eligible child in
the sample at time t, EAi is child i’s entrance age, and RAi is child i’s rela-
tive age as defined in Section II.C. Note that while aðtÞ, which can be
thought to represent the age at test, increases over time, EAi and RAi stay
fixed.
I now specify a generalization of equation ð3Þ from Todd and Wolpin
ð2003Þ that allows for more general age effects. Assume that the test score
TABLE 7
Attrition in the ECLS-K: Reading IRT Score Present
Sample All ECLS-K
Round Attrition ð%Þ N Attrition ð%Þ N
Fall kindergarten 0 3,740 7 17,620
Spring kindergarten 1 3,710 0 18,940
Fall 1st grade ... ... ... ...
Spring 1st grade 14 3,230 14 16,340
Spring 3rd grade 25 2,820 25 14,280
Spring 5th grade 41 2,220 41 11,270
Spring 8th grade 51 1,820 51 9,230
←
Yijt 5 YAðtÞ fFij ½Ai ðtÞ; Sij ½Ai ðtÞ; mij0 ; eijt g: ð10Þ
11
Alternatively, it might be the case that if children cannot keep pace, such inputs might
also leave children behind, decreasing their test scores.
MA2. Family and school inputs are unchanging over time, so that in-
put measures at the fall of kindergarten capture the entire history of in-
puts. The specific implementation of MA2 estimated here can be stated
as assuming Fij½AiðtÞ 5 Fij for all t > 1 and that school inputs are en-
trance age, relative age, and a vector of school dummies:
S ij ½Ai ðtÞ ; ½EAi ; RAi ; EAi RAi ; Sij4 ; : : : ; Sijk :
12
Note that here the coefficient on entrance age has changed to a z , indicating that effects
from entrance age at this point in time represent effects from the biological development
process in isolation from any school effects.
of the input variables will vary little, assumption MA2 is not considered
to be a strong assumption.
As implemented, assumption MA2 interprets entrance age and rela-
tive ages as school inputs. The expression aaðtÞ;0 1 baðtÞ;0 sets the level for
the cohort at each point in time based on the overall age of the cohort,
aðtÞ. The deviation from that average level based on the additional en-
trance age of the child is determined by baðtÞ;1, and the deviation due to
the relative age of the child’s classmates is determined by baðtÞ;2. Thus aaðtÞ;0
1 baðtÞ;0 and baðtÞ;1 represent the average growth in achievement due sim-
ply to the biological development process. The combination of baðtÞ;1 , baðtÞ;2 ,
and baðtÞ;3 determines the precise way the child’s own biological develop-
ment process is influenced by that of other children through direct so-
cial interactions between children, as well as indirectly through social
interactions with teachers and the broader school environment. That is,
EAi and RAi can be interpreted as school inputs because they determine
the age at which one is exposed to educational instruction in school and
also help to determine the type of classroom and teacher interactions
students will have as a result of their own development and their develop-
ment relative to that of their classmates. Finally, it is also worth noting
that EAi will determine the amount of time spent in the home environ-
ment rather than school, so its effect parameters are likely to be sum-
maries representative of heterogeneous individual circumstances.
Two further assumptions are maintained in order to overcome prob-
lem i and obtain estimates of ability mij0. With ability measures in our pos-
session, equation ð13Þ should satisfy the standard OLS assumptions and
therefore be straightforward to estimate. To be precise, the additional
maintained assumptions are as follows.
MA3. Define a set of race/ethnicity dummies Ri ; ðRi1 ; Ri2 ; Ri3 Þ, where
Ri is a binary indicator for being black, Ri2 is an indicator for being His-
1
panic, and Ri3 is an indicator for being Asian. It is assumed that race/
ethnicity is correlated with family inputs: E½Rik Fij ≠ 0 for at least one k
for each element Fij of Fij.
MA4. For each subject, unobserved factors are identically and in-
dependently distributed, eijt ∼ iidN ð0; j2et Þ, with ability being normally dis-
tributed ðmij0 ∼ N ð0; j2m ÞÞ independently of race: mij0 j j R k for all k.
i
Estimates of ability mij0 are obtained by recalling that under MA1, fall
of kindergarten ðt 5 1Þ school inputs have not yet affected the produc-
tion of educational achievement:
←
Yij1 5 F ij ½Ai ð1Þaað1Þ 1 z að1Þ EAi 1 dað1Þ mij0 1 eij1 : ð14Þ
←
Yij1 5 F ij aað1Þ 1 z að1Þ EAi 1 hij1 ; ð15Þ
where
hij1 ; dað1Þ mij0 1 eij1 :
Equation ð15Þ can be estimated via two-stage least squares ð2SLSÞ using
the three race dummies in R i as instruments for the three endogenous
variables in Fij. The residuals obtained from this estimation are
←
hij1 5 dað1Þ mij0 1 eij1 : ð16Þ
If Rik is a race dummy, then MA3 and MA4 imply the standard IV
assumptions.
MA3. E½Rik Fij ≠ 0 for at least one k for each element Fij of Fij.
MA4*. E½Rik hij1 5 0 for all k.
13
Although the extensive debate over assumption MA4 is best left for other studies
such as Goldberger and Manski ð1995Þ, here it is at least worth mentioning two points
about MA4. One is that first-order mechanisms such as the history of racial discrimina-
tion ðBlackmon 2008; Aliprantis and Carroll 2013Þ and secular changes in the labor mar-
ket ðWilson 1987Þ can explain important between-group differences in outcomes under
MA4. Another is that the violation of MA4 is an idea that has been used to justify malicious
behavior and policies ðsee Zuberi ½2001 or Washington ½2006, chap. 6 for important
historical examplesÞ as well as policy fallacies ðManski 2011Þ.
The OLS estimation of equation ð17Þ brings two issues to mind. First,
the random assignment of entrance age ðEAÞ and relative age ðRAÞ does
not ensure that other inputs are not chosen in response to these vari-
ables. Aliprantis ð2014bÞ shows that ðb^OLS ; b ^OLS ; b ^OLS Þ do identify the EPF
aðtÞ;1 aðtÞ;2 aðtÞ;3
parameters of equation ð17Þ as long as any other input chosen in response
to entrance or relative ages is included as an observed input in estima-
tion.14 Second, attenuation bias is a concern because our measure of ability
includes measurement error ð^hij1 5 daðtÞ m ^ij0 1 ^eij1 Þ. This is a general prob-
lem when using identification strategies that rely on noisy measures. Since
this issue is only beginning to receive attention for value-added models
ðKoedel, Leatherman, and Parsons 2012Þ, fully accommodating this issue
is left for future research.
←
14
The structural equation notation ð5Þ is used to clarify that the EPF parameters of
eq. ð17Þ correspond to the direct effects discussed in Aliprantis ð2014bÞ.
TABLE 8
Entrance and Relative Ages as a Function
of Birth Date and State Cutoff
Birth Date/State Entrance Relative
Combination Age Age
a 8 8
b 6 6
c 8 10
d 6 8
to the sample selection conducted to ensure that these ages vary ran-
domly, we observe only children between the relative ages of 5 and
11 months. It must be remembered that the subsequent analysis can
speak only to this sample of children and is uninformative about age
effects on children starting kindergarten at other relative or absolute ages.
If the identification scheme just described is to provide unbiased esti-
mates of the entrance and relative age effects using the variation in ages
shown in figure 10, date of birth must be random conditional on observ-
ables ðfamily inputs, school inputs, school fixed effects, and state fixed
effectsÞ. However, considerable evidence indicates that date of birth is
not random but rather is the product of parents’ choices, biological fac-
tors, or both. Seasonal birth patterns have been demonstrated to be re-
lated to maternal characteristics such as age, marital status, education,
and birth order ðBobak and Gjonca 2001; Buckles and Hungerman 2008Þ;
Figure 9.—Variation in entrance age and relative age in the entire ECLS-K
TABLE 9
p -Values of F -Tests of Equality of Characteristic Means by Quarter and Month
All Children Sample
Characteristic Quarter Month Quarter Month
Race:
Black .98 .07 .00 .00
White .41 .42 .00 .00
Hispanic .44 .79 .00 .00
Asian .38 .91 .03 .22
Gender:
Female .71 .18 .89 .92
Mom’s education level:
Mom’s highest grade completed <12 .03 .31 .73 .09
Mom’s highest degree received
5 high school diploma .42 .61 .33 .64
Mom’s highest degree received ≥ BA .81 .82 .63 .40
Home characteristics:
Number of books at home .30 .31 .99 .52
Live with father .65 .91 .00 .04
Birth characteristics:
Parents married at birth .18 .37 .62 .26
Birth weight .30 .07 .11 .12
Socioeconomic status:
SES 5 1 .03 .08 .47 .11
SES 5 2 .54 .69 .75 .15
SES 5 3 .96 .73 .39 .48
SES 5 4 .64 .91 .78 .19
SES 5 5 .14 .59 .08 .11
TABLE 10
Seasonality of Characteristics: Means and Differences
from Reference Group by Quarter
All Children
Quarter 1 Quarter 2 Quarter 3 Quarter 4
Characteristic ðMeanÞ ðDiff.Þ ðDiff.Þ ðDiff.Þ
Race:
Black .16 2.00 2.00 2.00
White .56 .02 .00 .01
Hispanic .20 2.01 .00 2.01
Asian .04 2.00 2.01 .00
Gender:
Female .49 2.01 2.01 2.01
Mom’s education level:
Mom’s highest grade completed <12 .14 2.00 .01 .02
Mom’s highest degree received 5
high school diploma .64 .00 2.00 2.01
Mom’s highest degree received ≥ BA .22 2.00 2.01 2.00
Home characteristics:
Number of books at home 71.82 1.94 .89 2.26
Live with father .77 .01 .01 .01
Birth characteristics:
Parents married at birth .68 .02 .02 .01
Birth weight ðpoundsÞ 7.35 .02 .05 .05
Socioeconomic status:
SES 5 1 .20 2.01 .01 .01
SES 5 2 .20 .01 2.01 2.00
SES 5 3 .20 2.00 .00 .00
SES 5 4 .20 2.00 2.01 .00
SES 5 5 .20 .01 2.00 2.01
characteristics such as the state poverty rate, male labor force participa-
tion rate, and high school graduation rate are small and statistically
insignificant.
E. Estimation Results
Before estimating the EPF in equation ð17Þ, I first obtain estimates of
math and reading ability by estimating the fall of kindergarten EPF in
equation ð15Þ. I assume that there are four inputs into the production
of achievement in the fall of kindergarten: age, mother’s educational
attainment, household income, and prenatal inputs as measured by birth
weight. Assuming that race is independent of ability but correlated with
the fall test scores through the family inputs, I use race indicators as in-
struments for family inputs. The specification imposes that the error term
from the 2SLS estimation of equation ð15Þ is composed only of ability
and measurement error.
The ^hij1 estimates from equation ð15Þ are shown in figures 15, 16, 17,
and 18. Overall, the ability estimates appear not only to have the same
mean as imposed by the IV estimation ðMA4*Þ but also to have similar
TABLE 12
Seasonality of Characteristics: Means and Differences from
Reference Group by Quarter
Sample
Quarter 1 Quarter 2 Quarter 3 Quarter 4
Characteristic ðMeanÞ ðDiff.Þ ðDiff.Þ ðDiff.Þ
Race:
Black .17 2.07 2.13 .00
White .58 2.03 2.01 .05
Hispanic .19 .10 .03 2.04
Asian .03 .02 2.00 2.01
Gender:
Female .49 2.01 .07 .00
Mom’s education level:
Mom’s highest grade completed <12 .13 .01 2.01 .01
Mom’s highest degree received 5
high school diploma .66 2.03 2.03 2.03
Mom’s highest degree received ≥ BA .22 .02 .04 .02
Home characteristics:
Number of books at home 72.56 .66 2.57 2.38
Live with father .78 .06 .16 .01
Birth characteristics:
Parents married at birth .70 .01 .09 .02
Birth weight 7.29 .15 .38 .05
Socioeconomic status:
SES 5 1 .17 .01 2.10 .01
SES 5 2 .20 .00 .03 .01
SES 5 3 .21 2.03 2.06 2.02
SES 5 4 .23 2.02 2.03 2.01
SES 5 5 .20 .03 .17 2.00
Sample
Jan. Feb. March April May June July Aug. Sept. Oct. Nov. Dec.
Characteristic ðMeanÞ ðDiff.Þ ðDiff.Þ ðDiff.Þ ðDiff.Þ ðDiff.Þ ðDiff.Þ ðDiff.Þ ðDiff.Þ ðDiff.Þ ðDiff.Þ ðDiff.Þ
Race:
Black .17 .03 2.02 2.05 2.08 2.10 2.12 .00 2.17 2.01 2.01 .01
White .58 2.02 .01 .06 2.14 2.12 .02 .00 2.58 .02 .06 .05
Hispanic .19 2.01 .00 2.01 .19 .23 2.01 .00 .61 2.02 2.04 2.06
Asian .04 2.01 2.01 .00 .02 .01 2.02 .00 .16 2.01 2.01 2.02
Gender:
Female .48 .01 .00 .01 .01 2.05 .05 .00 .52 2.02 .01 .02
Mom’s education level:
Mom’s highest grade completed <12 .15 2.04 2.03 2.04 .02 2.02 2.07 .00 .65 2.02 2.00 2.02
514
Mom’s highest degree received 5
high school diploma .67 2.00 2.02 2.01 2.07 2.06 2.02 .00 2.47 2.04 2.04 2.03
Mom’s highest degree received ≥ BA .18 .05 .06 .04 .05 .08 .09 .00 2.18 .05 .05 .06
Home characteristics:
Number of books at home 68.51 5.27 6.55 8.06 22.34 4.16 5.52 .00 234.91 1.06 4.79 4.24
Live with father .79 .01 2.02 .04 .06 .08 .15 .00 .21 .03 2.00 2.01
Birth characteristics:
Parents married at birth .67 .05 .03 .05 .03 2.01 .15 .00 2.47 .01 .08 .03
Birth weight 7.41 2.19 2.16 .05 2.07 .10 .35 .00 21.41 2.15 2.00 2.08
Socioeconomic status:
SES 5 1 .21 2.06 2.05 2.03 .00 2.05 2.13 .00 2.21 2.05 2.02 2.03
SES 5 2 .20 2.02 .01 .03 2.05 2.03 2.02 .00 .80 2.00 .03 2.00
SES 5 3 .21 .02 2.02 2.02 2.05 2.01 2.06 .00 2.21 2.03 2.03 2.00
SES 5 4 .21 .01 .03 2.04 .07 .01 2.01 .00 2.21 .04 2.01 .00
SES 5 5 .17 .05 .04 .06 .02 .08 .22 .00 2.17 .04 .03 .03
Figure 12.—State male labor force participation and 1998 kindergarten entrance cutoff
dates. State-level data are from the US Census/NHGIS, and entrance cutoff dates are taken
from Elder and Lubotsky ð2009Þ.
515
Figure 14.—State male high school graduation rate and 1998 kindergarten entrance cutoff
dates. State-level data are from the US Census/NHGIS, and entrance cutoff dates are taken
from Elder and Lubotsky ð2009Þ.
516
Figure 16.—Math ^
h estimates by race
517
Figure 18.—Reading ^
h estimates by race
TABLE 14
Math Test Score Parameter Estimates for All Children
Spring
Kindergarten Spring 1st Grade Spring 3rd Grade
Raw j Raw j Raw j
^aðtÞ;1 Þ
Entrance age ðb 2.10 .18 5.99 .33 3.91 .16
ð.5Þ** ð.78Þ** ð1.36Þ**
^aðtÞ;2 Þ
Relative age ðb 2.50 2.04 24.77 2.26 .58 .02
ð.38Þ ð.83Þ** ð1.14Þ
^aðtÞ;3 Þ
Interaction ðb 2.05 2.00 2.01 2.00 2.26 2.01
ð.05Þ ð.1Þ ð.14Þ
Observations 2,800 2,730 2,350
Note.— Outcomes are IRT test scores. Home inputs included in estimation are income,
mother’s education, whether a foreign language is spoken at home, whether enrolled in a
prekindergarten program, mother’s employment between birth and kindergarten, whether
the father was living in the household, and whether the child ever received WIC benefits.
School inputs include entrance age, relative age, the interaction of the two ages, school and
state fixed effects, gender, and race. Ability estimates as described in Secs. III.C and III.E are
also included in estimation.
** p < .01.
All else equal, in this time period, 1 month of extra entrance age increases
math IRT scores between 0.16 and 0.33 standard deviations and reading
IRT scores between 0.19 and 0.28 standard deviations. In the spring of
third grade the entrance age coefficient is under 0:2j for both math and
reading test scores.
A second impression from tables 14 and 15 is that the parameters
pertaining to being relatively older than one’s classmates are quite
TABLE 15
Reading Test Score Parameter Estimates for All Children
Spring
Kindergarten Spring 1st Grade Spring 3rd Grade
Raw j Raw j Raw j
^aðtÞ;1 Þ
Entrance age ðb 3.85 .28 5.05 .22 5.06 .19
ð.56Þ** ð1.18Þ** ð1.94Þ**
^aðtÞ;2 Þ
Relative age ðb 23.10 2.23 22.89 2.12 22.43 2.09
ð.4Þ** ð1.15Þ* ð1.55Þ
^aðtÞ;3 Þ
Interaction ðb 2.007 2.01 2.06 2.00 2.05 2.00
ð.06Þ ð.12Þ ð.19Þ
Observations 2,710 2,690 2,350
Note.—Outcomes are IRT test scores. Home inputs included in estimation are income,
mother’s education, whether a foreign language is spoken at home, whether enrolled in a
prekindergarten program, mother’s employment between birth and kindergarten, whether
the father was living in the household, and whether the child ever received WIC benefits.
School inputs include entrance age, relative age, the interaction of the two ages, school and
state fixed effects, gender, and race. Ability estimates as described in Secs. III.C and III.E are
also included in estimation.
* p < .05.
** p < .01.
different from the parameters for entering at an older age. All else equal,
being 1 month relatively older tends to decrease IRT scores. For reading
these effects are large but fade over time, although for math these effects
are sometimes large and sometimes near zero. A final impression from
tables 14 and 15 is that the interaction effects are very small.
In light of the evidence of essential heterogeneity provided in Sec-
tion II, equation ð17Þ is also estimated by subgroup. Tables 16, 17, 18, and
19 show separate effect estimates for boys and girls. Tables 20, 21, 22, and
23 show similar estimates for children with different home environments
as measured by the number of children’s books they have at home.
The estimates indicate that entrance age effects are extremely different
for boys and girls. By the spring of third grade the entrance age parameter
for math scores is approximately 0:1j for girls, but for boys it is 0:5j!
Although the trends in earlier grades are different for reading, we see an
almost identical pattern in the entrance age effect by the spring of third
grade: increasing entrance age has very large and beneficial effects on
boys but relatively small impacts on girls.
Relative age effects tend to be negative or zero for both boys and girls.
Although many of these effects are not statistically different from zero, by
the spring of third grade, relative age effects are negative and large for
boys.
Finally, I investigate heterogeneity in effects by one measure of the
home environment, the number of children’s books the child has at home.
I define children to have few books if they have fewer than 35 books,
which is the 33rd percentile of the sample distribution, and I define chil-
dren to have many books at home if they have more than 100 books, the
66th percentile of the sample distribution.
Entrance age and relative age have huge effects on children with few
books at home. All else constant, for every month older they are when they
enter school, these children score 0.28 and 0.47 standard deviations worse
on their third grade math and reading tests, respectively. Relative age
effects have the opposite sign and in the spring of third grade have an
TABLE 16
Math Test Score Parameter Estimates for Boys
Spring
Kindergarten Spring 1st Grade Spring 3rd Grade
Raw j Raw j Raw j
^aðtÞ;1 Þ
Entrance age ðb 1.24 .10 4.10 .23 12.27 .50
ð.76Þ ð1.32Þ** ð2.39Þ**
^aðtÞ;2 Þ
Relative age ðb 1.65 .14 21.30 2.07 25.03 2.21
ð.74Þ* ð1.97Þ ð2.31Þ*
^aðtÞ;3 Þ
Interaction ðb 2.14 2.01 2.13 2.01 2.41 2.02
ð.08Þ ð.18Þ ð.26Þ
Observations 1,400 1,360 1,170
* p < .05.
** p < .01.
TABLE 18
Reading Test Score Parameter Estimates for Boys
Spring
Kindergarten Spring 1st Grade Spring 3rd Grade
Raw j Raw j Raw j
^aðtÞ;1 Þ
Entrance age ðb 1.74 .13 21.03 2.04 12.94 .48
ð.95Þ ð1.93Þ ð2.87Þ**
^aðtÞ;2 Þ
Relative age ðb .04 .00 5.54 .24 26.24 2.23
ð.75Þ ð2.35Þ* ð3.56Þ
^aðtÞ;3 Þ
Interaction ðb 2.07 2.01 2.19 2.01 2.29 2.01
ð.09Þ ð.20Þ ð.37Þ
Observations 1,360 1,350 1,170
* p < .05.
** p < .01.
TABLE 19
Reading Test Score Parameter Estimates for Girls
Spring Spring 3rd
Kindergarten Spring 1st Grade Grade
Raw j Raw j Raw j
^aðtÞ;1 Þ
Entrance age ðb 3.00 .22 5.16 .22 4.08 .15
ð1.10Þ** ð2.00Þ* ð3.51Þ
^aðtÞ;2 Þ
Relative age ðb 23.52 2.26 25.18 2.22 22.44 2.09
ð.97Þ** ð1.91Þ** ð2.77Þ
^aðtÞ;3 Þ
Interaction ðb .07 .01 .06 .00 2.02 2.00
ð.10Þ ð.21Þ ð.31Þ
Observations 1,350 1,350 1,180
* p < .05.
** p < .01.
TABLE 21
Math Parameter Estimates with Many Books ð≥100Þ at Home
Spring
Kindergarten Spring 1st Grade Spring 3rd Grade
Raw j Raw j Raw j
^aðtÞ;1 Þ
Entrance age ðb 2.34 .20 8.47 .47 6.51 .27
ð1.09Þ* ð1.59Þ** ð2.47Þ**
^aðtÞ;2 Þ
Relative age ðb 21.60 2.13 25.35 2.29 22.90 2.12
ð.75Þ* ð1.37Þ** ð2.14Þ
^aðtÞ;3 Þ
Interaction ðb .009 .00 2.17 2.01 2.21 2.01
ð.10Þ ð.18Þ ð.28Þ
Observations 1,070 1,040 910
* p < .05.
** p < .01.
TABLE 22
Reading Parameter Estimates with Few Books ð≤35Þ at Home
Spring Spring 1st
Kindergarten Grade Spring 3rd Grade
Raw j Raw j Raw j
^aðtÞ;1 Þ
Entrance age ðb 213.00 2.96 21.22 2.05 212.70 2.47
ð2.15Þ** ð3.53Þ ð5.68Þ*
^aðtÞ;2 Þ
Relative age ðb 16.06 1.18 3.68 .16 16.34 .60
ð1.44Þ** ð2.74Þ ð4.54Þ**
^aðtÞ;3 Þ
Interaction ðb 2.14 2.01 2.10 2.00 2.05 2.00
ð.15Þ ð.36Þ ð.50Þ
Observations 730 760 660
* p < .05.
** p < .01.
TABLE 23
Reading Parameter Estimates with Many Books ð≥100Þ at Home
Spring Spring 3rd
Kindergarten Spring 1st Grade Grade
Raw j Raw j Raw j
^aðtÞ;1 Þ
Entrance age ðb .65 .05 212.45 2.53 4.05 .15
ð.95Þ ð3.58Þ** ð3.70Þ
^aðtÞ;2 Þ
Relative age ðb 2.75 2.06 12.24 .52 21.39 2.05
ð.79Þ ð2.47Þ** ð3.72Þ
^aðtÞ;3 Þ
Interaction ðb .03 .00 .06 .00 2.09 2.00
ð.10Þ ð.27Þ ð.36Þ
Observations 1,070 1,040 910
** p < .01.
even larger magnitude for these children, at 0.34 and 0.60 standard
deviations for math and reading test scores.
Entrance age and relative age have smaller parameters for children with
many books at home, and these parameters tend to have the opposite sign
of those for children with few books at home. Entering 1 month older
increases math test scores in the spring of third grade, on average, by 0.27
standard deviations for children with many books, but entering relatively
older decreases test scores at this time by 0.12 standard deviations. The
effects on reading test scores are similar but of smaller magnitudes; the
entrance age parameter in spring of third grade is 0:15j and the relative
age parameter is 20:05j.
yYijt
2 5 2baðtÞ;2 2 baðtÞ;3 EA: ð18Þ
yRA
The relative age parameter baðtÞ;2 from the EPF determines the level of the
effect from changing the entrance cutoff date, and the interaction param-
eter bt3 determines how much this varies at different entrance ages.
When interpreting these results, it is useful to recall the mechanisms
discussed in Section III.C.1 through which relative age functions as a
school input. One such mechanism is that as a child becomes older
relative to other children in his classroom, the child may receive additional
inputs at school, perhaps in the form of additional attention or encour-
agement from his teacher or from tracking with a higher-achieving peer
group. Stimulated from the extra input, the child’s achievement goes up.
This mechanism could also work in reverse for children receiving fewer
inputs as a result of being relatively younger than their classmates. If this
mechanism were to dominate, we would expect relatively older children to
perform better on achievement tests or for there to be a negative effect
from decreasing a child’s relative age.
The second mechanism through which relative age might serve as a
school input is also related to peer effects and social interactions. If
teachers teach to the average level of achievement in their classroom,
school inputs will be tailored toward the average age in a classroom.
Inputs aimed at higher-achieving peers could cause children to accelerate
their learning to keep pace with their peers, raising their test scores.15 In
this case, we would expect the effect of decreasing relative age to have a
positive sign.
Table 24 shows the implied effects in the spring of third grade of an
earlier entrance cutoff date based on the EPF parameter estimates at
15
Alternatively, it might be the case that if children cannot keep pace, such inputs might
also leave children behind, decreasing their test scores.
TABLE 24
Counterfactual 1: Effects from Changing to an Earlier Entrance
Cutoff Date Spring of Third Grade
Math Reading
Sample Raw j Raw j
All children 1.21 .05 2.76 .11
ð.70Þ ð.94Þ
Gender:
Boys 7.94 .33 8.26 .34
ð1.47Þ ð1.74Þ
Girls .85 .03 2.61 .11
ð1.40Þ ð2.11Þ
Home environment:
Few books 28.08 2.33 216.00 2.66
ð2.54Þ ð3.61Þ
Many books 4.36 .18 2.05 .08
ð.90Þ ð2.62Þ
^aðtÞ;2 2 b
Note.— 2yYijt =yRA 5 2b ^aðtÞ;3 EAj
EA57 .
5 years and 7.00 months. The effects on all children in the sample of
changing the entrance cutoff date are large, especially considering that
this date typically varies across a 4-month interval. The estimates imply
that the average child would experience effects of 0.20 and 0.44 stan-
dard deviations on math and reading test scores if the state moved from
the latest to the earliest cutoff date in the sample. However, the effects
of 1-month changes are muted and are no larger than 0:11j for both
math and reading.
It is perhaps surprising that the effects of an earlier cutoff date are
positive because this indicates that children perform better when relatively
younger than their classmates. This positive effect indicates that the sec-
ond mechanism through which relatively younger children speed up their
learning to match the level of their peers dominates the harm done by
the first mechanism. And while it might be surprising that children have
higher achievement when relatively younger, similar results have also been
reported in the literature. Elder and Lubotsky ð2009Þ document that rel-
atively younger children have higher achievement, and Dobkin and Fer-
reira ð2010Þ find that younger children have higher academic attainment.
The heterogeneity in effects by gender and home environment is
remarkably strong, and this heterogeneity is likely to be the most impor-
tant finding for policy makers. The effects on boys are much larger than
the effects on girls. If it is surprising that children perform better when
relatively younger than their classmates, it might be even more surprising
to see this result for boys. Together with the well-documented practice of
redshirting boys more often than girls ðDeming and Dynarski 2008; Ali-
prantis 2012Þ, this heterogeneity suggests that a topic for future research
to investigate is whether early childhood trends might be contributing to
the gender gap in attainment ðGoldin, Katz, and Kuziemko 2006Þ.
Children with few books at home perform much worse when relatively
younger than their classmates. Thinking about the two mechanisms dis-
cussed earlier, one hypothesis is that such children’s family inputs work in
favor of the first mechanism dominating the second mechanism. Children
with low family inputs who are also relatively young may simply be too far
behind to catch up with their peers. Policy makers moving their state’s
entrance cutoff date to an earlier point hurts children with few home
inputs as measured by books at home.
TABLE 25
Counterfactual 2: Effects from an Earlier Birth Date Spring of Third Grade
Math Reading
Sample Raw j Raw j
All children 8.33 .34 3.34 .14
ð4.17Þ ð5.89Þ
Gender:
Boys 13.46 .55 11.05 .45
ð7.73Þ ð11.27Þ
Girls 7.59 .31 2.01 .08
ð6.50Þ ð9.38Þ
Home environment:
Few books 1.97 .08 4.38 .18
ð10.01Þ ð15.04Þ
Many books 6.72 .28 4.08 .17
ð8.55Þ ð10.80Þ
^aðtÞ;1 1 b
Note.— 2yYt =yBDay 5 b ^aðtÞ;2 2 b
^aðtÞ;3 ðEA 1 RAÞj
EA1RA515 .
that entrance age parameters typically have the opposite sign and a larger
magnitude than their corresponding relative age parameters.
VII. Conclusion
This paper studied causal effects informative for determining the age
when children should start kindergarten. The first results were a set of
hypothesis tests rejecting the homogeneity of treatment effects from
delaying kindergarten entry. Because choices to delay entry create viola-
tions of the instrument monotonicity condition, this evidence suggests
that standard IV strategies do not identify the effect of choosing to delay a
child’s entry for any subpopulation of interest ðBarua and Lang 2009;
Aliprantis 2012Þ.
I presented an identification strategy combining features of Angrist and
Krueger ð1991Þ and Todd and Wolpin ð2003Þ modified to identify educa-
tion production function parameters in spite of these methodological
difficulties. The identified EPF parameters allowed for the construction
of outcomes under counterfactual policies and parental choices. The
central result of the paper was that increasing the relative age of students’
classmates would actually increase their achievement in the spring of third
grade, at least for the older children in a class. That is, relative age has
negative returns for older children. The finding that initial age advantages
need not translate into superior development trajectories is evidence that
relative advantages at early ages need not determine subsequent develop-
ment trajectories.
The heterogeneity of effects documented in this study points early
childhood education policy toward early childhood environments and
schools that can accommodate the relatively large differences in ages and
skills at initial enrollment. Effects identified in the analysis were remark-
ably heterogeneous by gender and by home environment as measured by
the number of books at home. Relative age effects on boys were much
larger than the effects on girls. Children with few books at home per-
formed much worse when entering at an older entrance age. If a goal of
policy is to foster skills more efficiently begetting skills without advantages
necessarily begetting advantages, future research could contribute to this
goal by further investigating the trade-offs alternative early childhood
education policies represent for various subpopulations.
Appendix A
Policy Background
Appendix B
with
←
D* 5 mD ðX ; Z Þ 2 V
←
ðB2Þ
5 mðX Þ 1 gZ 2 V :
Assumption EH1 holds in this model because of the restricted subsample
selected in Section II.D. Consider the full sample of all children indexed by their
fifth birthday. Specifically, consider the children born on dates e, f, and g in fig-
ure B1. Suppose that a child whose fifth birthday was on date g was counterfactu-
ally moved to date f. The child faces two possibilities. If he enrolls when first eli-
gible, then this counterfactual change in birth date would result in his entering at
a younger age. If he were redshirted, or delayed entry, he would in contrast enter
later as a result of this counterfactual change in birth date. This is the violation of
monotonicity discussed in Barua and Lang ð2009Þ and Aliprantis ð2012Þ.
This violation of monotonicity is not of concern for the model in Section II
because of the restricted nature of the treatment considered and because of the
restricted subsample chosen in Section II.D. In terms of treatment, children enter
either within a 3-month window or else 1 year later. The model abstracts from
variation in entrance or relative ages within either of these 3-month windows and
instead attributes all age effects to the variation in entrance and relative ages
generated by moving between these 3-month windows, or by redshirting.
16
Other estimates of the prevalence of redshirting range between 4 and 5.5 percent ðfor
first-time kindergartners; Bassock and Reardon 2013Þ to 16 percent ðDeming and Dynarski
2008Þ in the United States and 15 percent in Australia ðEdwards, Fiorini, and Taylor 2011Þ.
The data used to construct this estimate are discussed in depth in Secs. II.C and III.
Figure B1.—Sample selection for the effect of redshirting model: A, all children facing a
September 1, 1998, cutoff date; B, the entering cohort ðonlyÞ facing a September 1, 1998,
cutoff date. Date e: a child turning 5 on July 1, 1998. Date f : a child turning 5 on August 1,
1998. Date g: a child turning 5 on October 1, 1998.
In terms of the sample, the only children used in estimation of the model were
children who turned 5 within 3 months before the entrance cutoff date. Thus the
only possible counterfactual manipulations to fifth birth dates is between dates
such as e and f, not between e and g or f and g. Owing to the binary nature of
treatment, counterfactual manipulations of fifth birth dates between dates e and f
leave treatment unaffected at D 5 0 as long as the child enters when first eligible.
The birth date instrument affects outcomes only through its affect on redshirting
by making children more likely to receive the redshirting treatment and therefore
to switch from D 5 0 to D 5 1. Had the sample been less restricted and included
children turning 5 on date g, this simple model of treatment would not apply, and
monotonicity would be violated.
Appendix C
Under what assumptions would the OLS estimation of equation ðC1Þ identify
the parameters of equation ð10Þ? Recall that according to Todd and Wolpin
ð2003Þ, the empirical implementation of equation ð10Þ has floundered on two
basic problems.
←
Yijt 2 gYijt21 5 X ij ½Ai ðtÞa1 1 X ij ½Ai ðt 2 1Þða2 2 ga1 Þ 1
ðC3Þ
1 X ij ½Ai ð0Þðaa 2 gaa21 Þ 1 ½daðtÞ 2 gdaðt21Þ mij0 1 eijt 2 geijt21 :
Todd and Wolpin ð2003Þ derive the assumptions under which equation ðC3Þ re-
duces to equation ðC1Þ so that input coefficients can be consistently estimated via
OLS estimation of ðC1Þ. They show that in addition to MAðiÞ, the following as-
sumptions must also be made.
equation ð10Þ. Empirically, Rothstein ð2010Þ and Andrabi et al. ð2011Þ find that
value-added specifications do not overcome problem i because of the correlation
of ability with inputs, and Todd and Wolpin ð2007Þ find that in the NLSY79
children sample, there is “some support for the pattern of declining coefficients
with age; however, statistical tests reject the strict value-added formulation” ð100Þ.
Although value-added specifications may perform well in certain contexts ðTodd
and Wolpin 2007; Kinsler 2012Þ, there is no reason to believe that assumptions
MAðiÞ–MAðivÞ impose weaker restrictions on the underlying technology in equa-
tion ð10Þ than the assumptions adopted in this analysis, MA1–MA4.
References
Aliprantis, D. 2012. “Redshirting, Compulsory Schooling Laws, and Educational
Attainment.” J. Educ. and Behavioral Statis. 37 ð2Þ: 316–38.
———. 2014a. “Assessing the Evidence on Neighborhood Effects from Moving to
Opportunity.” Working Paper no. 12-33r, Fed. Reserve Bank Cleveland.
———. 2014b. “Covariates and Causal Effects: The Problem of Context.” Working
Paper no. 13 -10r, Fed. Reserve Bank Cleveland.
Aliprantis, D., and D. Carroll. 2013. “Neighborhood Dynamics and the Distribu-
tion of Opportunity.” Working Paper no. 12-12r, Fed. Reserve Bank Cleveland.
Almond, D., and J. Currie. 2010. “Human Capital Development before Age Five.”
Working Paper no. 15827, NBER, Cambridge, MA.
Andrabi, T., J. Das, A. I. Khwaja, and T. Zajonc. 2011. “Do Value-Added Estimates
Add Value? Accounting for Learning Dynamics.” American Econ. J.: Appl. Econ. 3
ð3Þ: 29–54.
Angrist, J. D., and A. B. Krueger. 1991. “Does Compulsory School Attendance
Affect Schooling and Earnings?” Q.J.E. 106 ð4Þ: 979 –1014.
Barua, R., and K. Lang. 2009. “School Entry, Educational Attainment and Quarter
of Birth: A Cautionary Tale of LATE.” Working Paper no. 15236, NBER, Cam-
bridge, MA.
Bassock, D., and S. F. Reardon. 2013. “ ‘Academic Redshirting’ in Kindergarten:
Prevalence, Patterns, and Implications.” Educ. Evaluation and Policy Analysis 35
ð3Þ: 283 –97.
Bedard, K., and E. Dhuey. 2006. “The Persistence of Early Childhood Maturity:
International Evidence of Long-Run Age Effects.” Q.J.E. 121 ð4Þ: 1437–72.
Black, S. E., P. J. Devereux, and K. G. Salvanes. 2011. “Too Young to Leave the Nest?
The Effects of School Starting Age.” Rev. Econ. and Statis. 93 ð2Þ: 455– 67.
Blackmon, D. A. 2008. Slavery by Another Name: The Re-enslavement of Black People in
America from the Civil War to World War II. New York: Doubleday.
Bobak, M., and A. Gjonca. 2001. “The Seasonality of Live Birth Is Strongly Influ-
enced by Socio-demographic Factors.” Human Reproduction 16 ð7Þ: 1512–17.
Bogard, K., and R. Takanishi. 2005. “PK-3: An Aligned and Coordinated Approach
to Education for Children 3 to 8 Years Old.” Soc. Policy Report 19 ð3Þ: 3–23.
Bound, J., and D. A. Jaeger. 2000. “Do Compulsory School Attendance Laws Alone
Explain the Association between Quarter of Birth and Earnings?” In Worker Well-
Being, edited by S. W. Polachek, 83–108. Research in Labor Economics, vol. 19.
New York: JAI.
Buckles, K., and D. M. Hungerman. 2008. “Season of Birth and Later Outcomes:
Old Questions, New Answers.” Working Paper no. 14573, NBER, Cambridge,
MA.
Cameron, S. V., and J. J. Heckman. 2001. “The Dynamics of Educational Attain-
ment for Black, Hispanic, and White Males.” J.P.E. 109 ð3Þ: 455–99.
Cascio, E. U. 2009. “Maternal Labor Supply and the Introduction of Kindergartens
into American Public Schools.” J. Human Resources 44 ð1Þ: 140 –70.
Cascio, E. U., and E. G. Lewis. 2006. “Schooling and the Armed Forces Qualifying
Test.” J. Human Resources 41 ð2Þ: 294–318.
Cascio, E. U., and D. W. Schanzenbach. 2007. “First in the Class? Age and the Ed-
ucation Production Function.” Working Paper no. 13663, NBER, Cambridge, MA.
Chen, B., and J. Pearl. 2012. “Regression and Causation: A Critical Examination of
Econometrics Textbooks.” Manuscript, Cognitive Systems Lab., Univ. California,
Los Angeles.
Cook, P. J., and S. Kang. 2013. “Birthdays, Schooling, and Crime: New Evidence
on the Dropout-Crime Nexus.” Working Paper no. 18791, NBER, Cambridge,
MA.
Cunha, F., and J. J. Heckman. 2007. “The Technology of Skill Formation.” A.E.R.
97 ð2Þ: 31– 47.
Cunha, F., J. J. Heckman, L. Lochner, and D. V. Masterov. 2005. “Interpreting the
Evidence on Life Cycle Skill Formation.” IZA Discussion Paper no. 1675, Inst.
Study Labor, Bonn.
Currie, J. 2001. “Early Childhood Education Programs.” J. Econ. Perspectives 15 ð2Þ:
213–38.
Datar, A. 2006. “Does Delaying Kindergarten Entrance Give Children a Head
Start?” Econ. Educ. Rev. 25:43–62.
Dawid, H., and G. Muehlheusser. 2012. “Repeated Selection with Heterogeneous
Individuals and Relative Age Effects.” IZA Discussion Paper no. 6478, Inst. Study
Labor, Bonn.
Deming, D., and S. Dynarski. 2008. “The Lengthening of Childhood.” J. Econ.
Perspectives 22 ð3Þ: 71–92.
Dhuey, E., and S. Lipscomb. 2008. “What Makes a Leader? Relative Age and High
School Leadership.” Econ. Educ. Rev. 27 ð2Þ: 173 –83.
Dickert-Conlin, S., and A. Chandra. 1999. “Taxes and the Timing of Births.” J.P.E.
107 ð1Þ: 161–77.
Dickert-Conlin, S., and T. Elder. 2009. “Suburban Legend: School Cutoff Dates
and the Timing of Births.” Manuscript, Michigan State Univ.
Dobkin, C., and F. Ferreira. 2010. “Do School Entry Laws Affect Educational
Attainment and Labor Market Outcomes?” Econ. Educ. Rev. 29 ð1Þ: 40–54.
Eckstein, Z., and K. I. Wolpin. 1999. “Why Youths Drop Out of High School: The
Impact of Preferences, Opportunities, and Abilities.” Econometrica 67 ð6Þ: 1295–
1339.
Edwards, B., M. Fiorini, and M. Taylor. 2011. “Who Gets the ‘Gift of Time’ in
Australia? Exploring Delayed Primary School Entry.” Australian Rev. Public Affairs
10 ð1Þ: 41– 60.
Elder, T. E., and D. H. Lubotsky. 2009. “Kindergarten Entrance Age and Children’s
Achievement: Impacts of State Policies, Family Background, and Peers.” J.
Human Resources 44 ð3Þ: 562–98.
Fiorini, M., and M. Keane. 2011. “How the Allocation of Children’s Time Affects
Cognitive and Non- cognitive Development.” Manuscript, Univ. Technology
Sydney.
Fitzpatrick, M. D. 2010. “Preschoolers Enrolled and Mothers at Work? The Effects
of Universal Prekindergarten.” J. Labor Econ. 28 ð1Þ: 51–85.
Fredriksson, P., and B. Öckert. 2005. “Is Early Learning Really More Productive?
The Effect of School Starting Age on School and Labor Market Performance.”
IZA Discussion Paper no. 1659, Inst. Study Labor, Bonn.
Goldberger, A. S., and C. F. Manski. 1995. Review of The Bell Curve by Herrnstein
and Murray. J. Econ. Literature 33 ð2Þ: 762–76.
Goldin, C., L. F. Katz, and I. Kuziemko. 2006. “The Homecoming of American
College Women: The Reversal of the College Gender Gap.” J. Econ. Perspectives
20 ð4Þ: 133 –56.
Graue, M. E. 1993. Ready for What? Constructing Meanings of Readiness for Kinder-
garten. Albany: State Univ. New York Press.
Graue, M. E., and J. DiPierna. 2000. “Redshirting and Early Retention: Who Gets
the ‘Gift of Time’ and What Are Its Outcomes?” American Educ. Res. J. 37 ð2Þ:
509–34.
Hahn, J., P. Todd, and W. van der Klaauw. 2001. “Identification and Estimation of
Treatment Effects with a Regression-Discontinuity Design.” Econometrica 69 ð1Þ:
201– 9.
Heckman, J. J. 2005. “The Scientific Model of Causality.” Sociological Methodology 35
ð1Þ: 1–97.
Heckman, J. J., H. Ichimura, J. Smith, and P. Todd. 1998. “Characterizing Selection
Bias Using Experimental Data.” Econometrica 66 ð5Þ: 1017–98.
Heckman, J. J., S. H. Moon, R. Pinto, P. Savelyev, and A. Yavitz. 2010a. “Analyzing
Social Experiments as Implemented: A Reexamination of the Evidence from the
HighScope Perry Preschool Program.” Quantitative Econ. 1 ð1Þ: 1–46.
———. 2010b. “The Rate of Return to the HighScope Perry Preschool Program.”
J. Public Econ. 94 ð1–2Þ: 114–28.
Heckman, J. J., D. Schmierer, and S. Urzúa. 2010. “Testing the Correlated Random
Coefficient Model.” J. Econometrics 158 ð2Þ: 177–203.
Heckman, J. J., S. Urzúa, and E. Vytlacil. 2006. “Understanding Instrumental
Variables in Models with Essential Heterogeneity.” Rev. Econ. and Statis. 88 ð3Þ:
389–432.
Heckman, J. J., and E. Vytlacil. 2005. “Structural Equations, Treatment Effects, and
Econometric Policy Evaluation.” Econometrica 73 ð3Þ: 669 –738.
Holm, S. 1979. “A Simple Sequentially Rejective Multiple Test Procedure.” Scandi-
navian J. Statis. 6 ð2Þ: 65–70.
Imbens, G. W., and J. D. Angrist. 1994. “Identification and Estimation of Local
Average Treatment Effects.” Econometrica 62 ð2Þ: 467–75.
Keane, M. P., and K. I. Wolpin. 1997. “The Career Decisions of Young Men.” J.P.E.
105 ð3Þ: 473–522.
———. 2000. “Eliminating Race Differences in School Attainment and Labor
Market Success.” J. Labor Econ. 18 ð4Þ: 614–52.
Khan, S. 2012. The One World Schoolhouse: Education Reimagined. New York: Twelve.
Kinsler, J. 2012. “Assessing Rothstein’s Critique of Teacher Value-Added Models.”
Quantitative Econ. 3 ð2Þ: 333–62.
Kirp, D. L. 2007. The Sandbox Investment: The Preschool Movement and Kids-First Politics.
Cambridge, MA: Harvard Univ. Press.
Koedel, C., R. Leatherman, and E. Parsons. 2012. “Test Measurement Error and
Inference from Value-Added Models.” B.E. J. Econ. Analysis and Policy 12 ð1Þ: art.
52.
Lam, D. A., and J. A. Miron. 1996. “The Effects of Temperature on Human
Fertility.” Demography 33 ð3Þ: 291–305.
Lam, D. A., J. A. Miron, and A. Riley. 1994. “Modeling Seasonality in Fecundability,
Conceptions, and Births.” Demography 31 ð2Þ: 321–46.
Lillard, A. S. 2008. Montessori: The Science behind the Genius. Oxford: Oxford Univ.
Press.
Manski, C. F. 2011. “Genes, Eyeglasses, and Social Policy.” J. Econ. Perspectives 25 ð4Þ:
83–94.
McCrary, J., and H. N. Royer. 2011. “The Effect of Maternal Education on Fertility
and Infant Health: Evidence from School Entry Policies Using Exact Date of
Birth.” A.E.R. 101 ð1Þ: 158–95.
McEwan, P. J., and J. S. Shapiro. 2008. “The Benefits of Delayed Primary School
Enrollment: Discontinuity Estimates Using Exact Birth Dates.” J. Human Re-
sources 43 ð1Þ: 1–29.